Loading Now

Summary of Kernel Kmeans Clustering Splits For End-to-end Unsupervised Decision Trees, by Louis Ohl et al.


Kernel KMeans clustering splits for end-to-end unsupervised decision trees

by Louis Ohl, Pierre-Alexandre Mattei, Mickaël Leclercq, Arnaud Droit, Frédéric Precioso

First submitted to arxiv on: 19 Feb 2024

Categories

  • Main: Machine Learning (stat.ML)
  • Secondary: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
A novel end-to-end trained unsupervised binary tree for clustering, called Kauri, is introduced. Unlike previous works that focus on interpreting with trees the result of another clustering algorithm, Kauri performs greedy maximization of the kernel KMeans objective without requiring centroid definition. The model is compared to recent unsupervised trees on multiple datasets and shows identical performance when using a linear kernel, outperforming other models for non-linear kernels.
Low GrooveSquid.com (original content) Low Difficulty Summary
Kauri is a new way to group similar things together without knowing what they are in advance. It’s like a special kind of tree that helps find patterns in data. The usual approach is to use another clustering method and then explain the results with a tree, but Kauri does it all itself. It works by trying to maximize the kernel KMeans objective in a step-by-step manner without needing any initial definitions. This new model is tested on several datasets and performs just as well as other methods when using a simple linear approach. For more complex patterns, Kauri often outperforms the combination of another clustering method and a decision tree.

Keywords

* Artificial intelligence  * Clustering  * Decision tree  * Unsupervised