The Geometry That Was Never Taught (But Required for Modern ML?)
Summary
The article argues that traditional machine learning embeddings, which assume data resides in flat Euclidean space (ℝⁿ), are fundamentally mismatched for hierarchical data structures like WordNet or biological taxonomies. Euclidean space exhibits polynomial volume growth (rⁿ), whereas hierarchical data, characterized by branching, requires exponential growth in representational capacity (bᵈ nodes at depth d). This geometric mismatch leads to distortion in Euclidean embeddings, which cannot be resolved by simply adding more dimensions. Hyperbolic space, with its constant negative curvature (κ < 0), offers exponentially more room (Vol(r) ~ e^((n−1)r)), making it naturally suited for representing branching hierarchies without distortion. Research by Sarkar demonstrated that any finite weighted tree can be embedded into a 2D hyperbolic plane with arbitrarily low distortion, a significant improvement over high-dimensional Euclidean embeddings for hierarchical data.
Key takeaway
For NLP Engineers and AI Scientists working with hierarchical datasets such as ontologies, taxonomies, or organizational charts, you should consider hyperbolic embeddings instead of defaulting to Euclidean space. Your current Euclidean models may be incurring significant distortion and inefficiency, requiring far more dimensions than necessary. Adopting hyperbolic geometry can lead to dramatically better representation quality with fewer parameters, as demonstrated by 5-dimensional Poincaré embeddings outperforming 200-dimensional Euclidean ones on WordNet.
Key insights
Euclidean embeddings are geometrically ill-suited for hierarchical data, which is better represented in hyperbolic space.
Principles
- Hierarchical data requires exponential representational capacity.
- Hyperbolic space offers exponential volume growth.
- Trees are metrically similar to hyperbolic space.
Method
Hyperbolic embeddings utilize models like the hyperboloid or Poincaré ball to represent hierarchical data, optimizing within ℝⁿ with a modified distance function to capture exponential growth.
In practice
- Use Poincaré embeddings for WordNet or biological taxonomies.
- Evaluate δ-hyperbolicity for datasets with latent tree-like structures.
Topics
- Hyperbolic Embeddings
- Hierarchical Data
- Poincaré Embeddings
- Riemannian Geometry
- WordNet
Best for: NLP Engineer, AI Scientist, Research Scientist, AI Researcher, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by NLP on Medium.