July 15th, 2024

Guide to Machine Learning with Geometric, Topological, and Algebraic Structures

The paper discusses the shift in machine learning towards handling non-Euclidean data with complex structures, emphasizing the need to adapt classical methods and proposing a graphical taxonomy to unify recent advancements.

Read original articleLink Icon
Guide to Machine Learning with Geometric, Topological, and Algebraic Structures

The paper titled "Beyond Euclid: An Illustrated Guide to Modern Machine Learning with Geometric, Topological, and Algebraic Structures" explores the shift in machine learning towards handling non-Euclidean data with complex structures. Traditional machine learning has been rooted in Euclidean geometry, but the rise of data with intricate geometric, topological, and algebraic properties requires a broader mathematical approach. The authors highlight the need to adapt classical methods to unconventional data types by incorporating geometry, topology, and algebra. They propose a graphical taxonomy to unify recent advancements in a comprehensible framework. The review discusses current challenges and outlines future opportunities in this evolving field. This work aims to provide an accessible entry point into the realm of modern machine learning with non-Euclidean structures, drawing parallels to historical mathematical revolutions that led to non-Euclidean geometry.

Related

Linear Algebra 101 for AI/ML

Linear Algebra 101 for AI/ML

Introduction to Linear Algebra for AI/ML emphasizes basic concepts like scalars, vectors, matrices, vector/matrix operations, PyTorch basics, and mathematical notations. Simplified explanations aid beginners in understanding fundamental concepts efficiently.

The History of Machine Learning in Trackmania

The History of Machine Learning in Trackmania

Trackmania, a racing game, inspires machine learning projects aiming for superhuman performance. Players like Rottaca and TMRL use supervised and unsupervised learning. Innovations include track lookahead and diverse learning algorithms. Programs show progress but not human-level skills. The goal is a top-performing AI on a desktop PC, blending gaming and machine learning for innovation.

Data Structures Cheat Sheet

Data Structures Cheat Sheet

This article delves into data structures, highlighting graphs' role in real-world applications. It explains creating nodes and relationships in Memgraph efficiently. Various structures like Linked Lists, Queues, Stacks, and Trees are covered, along with traversal algorithms like BFS and DFS in Memgraph for efficient graph exploration. Readers are encouraged to explore further in Memgraph's documentation and community.

The moment we stopped understanding AI [AlexNet] [video]

The moment we stopped understanding AI [AlexNet] [video]

The video discusses high-dimensional embedding spaces in AI models like AlexNet and Chat GPT. It explains AlexNet's convolutional blocks for image analysis and Chat GPT's transformer use for responses, emphasizing AI model evolution and challenges in visualizing activations.

Leetcode but for ML

Leetcode but for ML

Deep-ML provides machine learning and linear algebra challenges categorized by difficulty and topic. Challenges include covariance matrices, Jacobi method, K-Means clustering, and sigmoid function. Users can track progress by signing up.

Link Icon 7 comments
By @dpflan - 6 months
The paper's references have some good ones for getting more acquainted with these subjects; this one being a nice dense one to start with:

- Geometric Deep Learning Grids, Groups, Graphs, Geodesics, and Gauges: https://geometricdeeplearning.com/

By @Pseudomanifold - 6 months
Great selection of works, but I am missing a lot of references from topology in ML, with the article only assuming a very cursory perspective in terms of 'topology captures connectivity and/or continuity.'

Some works from my colleagues and me go a little bit deeper (no pun intended), for instance:

- Neural Persistence Dynamics: https://arxiv.org/abs/2405.15732

- Simplicial Representation Learning with Neural $k$-Forms: https://openreview.net/forum?id=Djw0XhjHZb

- A general review on topology in machine learning: https://www.frontiersin.org/journals/artificial-intelligence...

There are more things in topology and machine learning, Horatio, than are dreamt of in your article ;-)

By @itissid - 6 months
One common theme I see in the paper(e.g. in protein folding) is:

"Identify what properties are important (geometry, algebra, topo) and which one is an useful prior and then "use" the guide to select an initial struct. This is probably harder than it sounds(unlike bayesian priors which are more forgiving for one to select, but quite like them in that they both require special assumptions)."

I wonder: could one use it to bring together certain multimodal data and a proposed network for a task? Like could one bring in sensor, map topology, urban topology, pictures which have certain properties and that help me use this guide to make a statement like : "Street data could be embedded with Sensor data to do ABC kind of inference using XYZ NNetwork structure because this paper suggests that is a reasonable thing to do"?

By @mjhay - 6 months
I am 100% convinced that these kind of approaches will be what delivers ML research from the current resource-hungry and ungeneralizable status quo. Low-dimensional Euclidean geometry is special. Higher-dimensional Euclidean spaces are less special. Most real-life data is high-dimensional, not at all smooth, and possessing a structure you cannot call Euclidean with a straight face. Look at what works with tabular data (which is probably most of what practitioners work with in the wild). It's gradient boosted trees, not neural networks.

There is a fundamental mismatch between the data we usually work with and the spaces we shove it into. Tools from algebraic topology and geometry are old hat in physics. If anything, they should be even more useful in ML.

By @funnygiraffe - 6 months
Is geometric, topological, and algebraic ML/data analysis actually used in the industry? It is certainly beautiful math. However, during grad school I met a few pure math PhD students who were saying that after finishing their PhD they will just go into industry to do topological data analysis (this was about 10 years ago and ML wasn't yet as hyped up). However, I have never heard of anybody actually having success on that plan.
By @mistrial9 - 6 months
some people on this thread are asking about jobs. The bigger picture here is that previously intractable problems are going to be solved with a new combination of math, data and compute.. there are lots of commercial cases that will change dramatically. How can individual people or small groups benefit from serious problem solving, economically?
By @OutOfHere - 6 months
Note that the GPU hardware is setup for Euclidean matrix operations. Even if you had a deep structure learner, it won't necessarily help you if you have to go back to emulating it on Euclidean hardware.