TY - JOUR
T1 - ClusPath
T2 - a temporal-driven clustering to infer typical evolution paths
AU - Rizoiu, Marian Andrei
AU - Velcin, Julien
AU - Bonnevay, Stéphane
AU - Lallich, Stéphane
N1 - Publisher Copyright:
© 2015, The Author(s).
PY - 2016/9/1
Y1 - 2016/9/1
N2 - We propose ClusPath, a novel algorithm for detecting general evolution tendencies in a population of entities. We show how abstract notions, such as the Swedish socio-economical model (in a political dataset) or the companies fiscal optimization (in an economical dataset) can be inferred from low-level descriptive features. Such high-level regularities in the evolution of entities are detected by combining spatial and temporal features into a spatio-temporal dissimilarity measure and using semi-supervised clustering techniques. The relations between the evolution phases are modeled using a graph structure, inferred simultaneously with the partition, by using a “slow changing world” assumption. The idea is to ensure a smooth passage for entities along their evolution paths, which catches the long-term trends in the dataset. Additionally, we also provide a method, based on an evolutionary algorithm, to tune the parameters of ClusPath to new, unseen datasets. This method assesses the fitness of a solution using four opposed quality measures and proposes a balanced compromise.
AB - We propose ClusPath, a novel algorithm for detecting general evolution tendencies in a population of entities. We show how abstract notions, such as the Swedish socio-economical model (in a political dataset) or the companies fiscal optimization (in an economical dataset) can be inferred from low-level descriptive features. Such high-level regularities in the evolution of entities are detected by combining spatial and temporal features into a spatio-temporal dissimilarity measure and using semi-supervised clustering techniques. The relations between the evolution phases are modeled using a graph structure, inferred simultaneously with the partition, by using a “slow changing world” assumption. The idea is to ensure a smooth passage for entities along their evolution paths, which catches the long-term trends in the dataset. Additionally, we also provide a method, based on an evolutionary algorithm, to tune the parameters of ClusPath to new, unseen datasets. This method assesses the fitness of a solution using four opposed quality measures and proposes a balanced compromise.
KW - Detection of long-term trends
KW - Evolutionary clustering
KW - Pareto front estimation
KW - Semi-supervised clustering
KW - Temporal cluster graph
KW - Temporal clustering
UR - http://www.scopus.com/inward/record.url?scp=84952040626&partnerID=8YFLogxK
U2 - 10.1007/s10618-015-0445-7
DO - 10.1007/s10618-015-0445-7
M3 - Article
SN - 1384-5810
VL - 30
SP - 1324
EP - 1349
JO - Data Mining and Knowledge Discovery
JF - Data Mining and Knowledge Discovery
IS - 5
ER -