IEEE VIS 2024 Content: Accelerating hyperbolic t-SNE

Accelerating hyperbolic t-SNE

Martin Skrodzki -

Hunter van Geffen -

Nicolas F. Chaves-de-Plaza -

Thomas Höllt -

Elmar Eisemann -

Klaus Hildebrandt -

Room: Bayshore V

2024-10-16T15:03:00ZGMT-0600Change your timezone on the schedule page
2024-10-16T15:03:00Z
Exemplar figure, described by caption below
An embedding of the C.Elegans data set with colored clusters on the right. Left shows an overlay of our tree acceleration structure. The red mark indicates the query point where the grid resolution is high, whereas it is low everywhere else in the embedding. This speeds up embedding computations significantly.
Fast forward
Keywords

Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML) Dimensionality reduction, t-SNE, hyperbolic embedding, acceleration structure

Abstract

The need to understand the structure of hierarchical or high-dimensional data is present in a variety of fields. Hyperbolic spaces have proven to be an important tool for embedding computations and analysis tasks as their non-linear nature lends itself well to tree or graph data. Subsequently, they have also been used in the visualization of high-dimensional data, where they exhibit increased embedding performance. However, none of the existing dimensionality reduction methods for embedding into hyperbolic spaces scale well with the size of the input data. That is because the embeddings are computed via iterative optimization schemes and the computation cost of every iteration is quadratic in the size of the input. Furthermore, due to the non-linear nature of hyperbolic spaces, Euclidean acceleration structures cannot directly be translated to the hyperbolic setting. This paper introduces the first acceleration structure for hyperbolic embeddings, building upon a polar quadtree. We compare our approach with existing methods and demonstrate that it computes embeddings of similar quality in significantly less time. Implementation and scripts for the experiments can be found at this https URL.