From Hierarchical Clustering to Phylogenetic CSPs
Evangelos Chatziafratis
Assistant Professor of Computer Science & Engineering, University of California Santa Cruz
Abstract: Hierarchical Clustering (HC) is a widely studied problem in unsupervised learning and exploratory data analysis, usually tackled by simple agglomerative procedures like average-linkage, single-linkage or complete-linkage. Applications of HC include reasoning about text documents, understanding the Evolution of species and the Tree of life, decomposing social networks like Facebook, or even organizing large data centers efficiently. Surprisingly, despite the plethora of heuristics for tackling the problem, until recently there was no optimization objective associated with it; this is in stark contrast with flat clustering objectives like k-means, k-median and k-center. In this talk, we will give an overview of the optimization objectives for Hierarchical Clustering, we will discuss connections to Phylogenetic and Triplet Reconstruction methods, we will see some simple algorithms to find approximate solutions, and finally we will discuss some recent hardness of approximation results and new connections to the notion of approximation resistance of CSPs.
Bio: Vaggos Chatziafratis is an Assistant Professor of Computer Science & Engineering at the University of California in Santa Cruz, where he is part of the Theoretical Computer Science group. His research lies at the intersection of approximation algorithms and machine learning, focusing on the design and analysis of approximation algorithms/hardness for clustering problems, and on understanding neural networks through the lens of dynamical systems. Vaggos completed his MS and PhD at Stanford University in the Computer Science Department, where he was advised by Tim Roughgarden and Moses Charikar. Before Stanford, he finished with a Diploma from the ECE department of the National Technical University of Athens. Before joining UC Santa Cruz, he did a postdoc at Google Research under Vahab Mirrokni and Mohammad Mahdian. He also did a postdoc at Northwestern under Konstantin Makarychev, Aravindan Vijayaraghavan and Samir Khuller. Vaggos is the recipient of a FODSI postdoc fellowship at MIT (under Piotr Indyk) and Northeastern. His research at UCSC has been supported by a Hellman Fellowship.