October 13th, 2024

Counterintuitive Properties of High Dimensional Space

High-dimensional geometry reveals counterintuitive properties, such as the volume of \(d\)-spheres approaching zero with increasing dimensions, concentration of surface area near the equator, and varying kissing numbers across dimensions.

Read original articleLink Icon
ConfusionIntrigueSurprise
Counterintuitive Properties of High Dimensional Space

High dimensional space exhibits properties that often defy our three-dimensional intuition. For instance, while a circle is a 1-sphere and a standard sphere is a 2-sphere, the terminology can lead to confusion as higher-dimensional spheres are simply referred to as \(d\)-spheres. An interesting phenomenon occurs when examining the relationship between cubes and spheres in various dimensions. In two and three dimensions, an inner sphere can fit entirely within a cube, but in four dimensions, the inner sphere just touches the cube's sides, and in five dimensions, it begins to extend outside. The volume of a unit \(d\)-sphere decreases to nearly zero as dimensions increase, despite initially increasing from one to five dimensions. This counterintuitive behavior is further illustrated by the concentration of measure, where in high dimensions, most of the surface area of a sphere is found near its equator. Additionally, the concept of kissing numbers, which refers to the maximum number of non-overlapping circles or spheres that can touch another, varies significantly across dimensions, with known values only for certain dimensions. The complexity and peculiarities of high-dimensional geometry challenge our understanding and highlight the limitations of our spatial intuition.

- High dimensional spheres exhibit counterintuitive properties compared to their lower-dimensional counterparts.

- The volume of a unit \(d\)-sphere approaches zero as dimensions increase.

- Most surface area of high-dimensional spheres is concentrated near the equator.

- Kissing numbers vary widely across dimensions, with exact values known only for a few.

- The study of high-dimensional geometry reveals significant challenges to our spatial intuition.

AI: What people are saying
The discussion around high-dimensional geometry reveals various insights and misconceptions.
  • Comments highlight the concept of near-orthogonality in high dimensions, which is crucial for machine learning applications.
  • There is debate over the interpretation of volume comparisons between different dimensions, with some arguing that proportions are more meaningful than absolute volumes.
  • Many find the properties of high-dimensional spaces counterintuitive, particularly regarding the volume of spheres and their distribution of surface area.
  • Some commenters express confusion about the definition of "dimension" and its implications for understanding higher-dimensional spaces.
  • References to the implications of high-dimensional geometry in fields like information theory and deep learning are noted.
Link Icon 16 comments
By @gcanyon - 7 months
One that isn't listed here, and which is critical to machine learning, is the idea of near-orthogonality. When you think of 2D or 3D space, you can only have 2 or 3 orthogonal directions, and allowing for near-orthogonality doesn't really gain you anything. But in higher dimensions, you can reasonably work with directions that are only somewhat orthogonal, and "somewhat" gets pretty silly large once you get to thousands of dimensions -- like 75 degrees is fine (I'm writing this from memory, don't quote me). And the number of orthogonal-enough dimensions you can have scales as maybe as much as 10^sqrt(dimension_count), meaning that yes, if your embeddings have 10,000 dimensions, you might be able to have literally 10^100 different orthogonal-enough dimensions. This is critical for turning embeddings + machine learning into LLMs.
By @crazygringo - 6 months
> The volume of the unit d-sphere goes to 0 as d grows! A high dimensional unit sphere encloses almost no volume!

This feels misleading to me.

Directly comparing volumes in different dimensions doesn't make any sense because the units are different. It doesn't make sense to say that a quantity in m^3 is larger or smaller than a quantity in m^4. Because it doesn't make any sense to compare the area of a circle with the volume of a sphere.

> More accurate pictorial representations of high dimensional cubes (left) and spheres (right).

The cube one is arguably accurate -- e.g. in 100 dimensions, if the distance from the center of a cube to the center of a face is 1, then the distance from the center of the cube to a corner is 10.

But the sphere one, I don't know. Every point on a 100-dimensional sphere is still the same distance away from its center. The sphere is staying spherical in an intuitive way, it's just that the corners of the enclosing cube have gotten so much further away.

So what is accurate to say is that the proportion of volume of a sphere relative to that of its bounding cube keeps decreasing. Which, rather than being supposedly "counterintuitive", makes perfect intuitive sense -- because every time you add a dimension, you can think of it as "extruding" the previous sphere into the new dimension and then shaving it round, the way a 2D circle can be extruded into a cylinder in 3D and then shaved down to make it into a sphere. Every time you add a dimension, you shave off more.

The article suggests that a 3D sphere has greater volume than a 2D circle -- with a unit radius, the sphere is 4/3π while the circle is just π. But again, they're in different units, so it's a meaningless statement. It makes much more sense to say that a 2D circle takes up (1/4)π≈0.79 of its bounding square, a 3D sphere takes of (1/6)π≈0.52 of its bounding cube, a 4D sphere takes up (π/32)π≈=0.31, and so forth. So no, the volume doesn't go up and then down -- it just goes down every time when taken as a unitless proportion (and proportions are comparable).

By @rectang - 7 months
Time to share my favorite quote from Symbols, Signals and Noise by John R. Pierce, where he discusses how Shannon achieved a breakthrough in Information Theory:

> This chapter has had another aspect. In it we have illustrated the use of a novel viewpoint and the application of a powerful field of mathematics in attacking a problem of communication theory. Equation 9.3 was arrived at by the by-no-means-obvious expedient of representing long electrical signals and the noises added to them by points in a multidimensional space. The square of the distance of a point from the origin was interpreted as the energy of the signal represented by a point.

> Thus a problem in communication theory was made to correspond to a problem in geometry, and the desired result was arrived at by geometrical arguments.

By @remcob - 7 months
The distance between two uniform random points on an n-sphere clusters around the equator. The article shows a histogram of the distribution in fig. 11. While it looks Gaussian, it is more closely related to the Beta distribution. I derived it in my notes, as (surprisingly) I could not find it easily in literature:

https://xn--2-umb.com/21/n-sphere

By @FabHK - 7 months
For high-dimensional spheres, most of the volume is in the "shell", ie near the boundary [0]. This sort of makes sense to me, but I don't know how to square that with the observation in the article that most of the surface area is near the equator. (In particular, by symmetry, it's near any equator; so, one would think, in their intersection. That is near the centre, though, not the shell.)

Anyway. Never buy a high-dimensional orange, it's mostly rind.

[0] https://www.math.wustl.edu/~feres/highdim

By @brazzy - 6 months
> The volume of the unit -sphere goes to 0 as grows! A high dimensional unit sphere encloses almost no volume! The volume increases from dimensions one to five, but begins decreasing rapidly toward 0 after dimension six.

What the absolute fuck?

That one caught me truly off guard. I don't think "counterintuitive" is a strong enough word.

By @bmitc - 7 months
Actually, the most counterintuitive is 4-dimensional space. It is rather mathematically unique, often exhibiting properties no other dimension does.
By @mattxxx - 7 months
Yea - high dimensional spaces are weird and hard to reason about... and we're working very frequently in them, especially when dealing with ML.
By @nyc111 - 6 months
I don't understand what is meant here by "dimension." Is the definition of "dimension" consistent for 3-dimensional figures and n-dimensional figures? In other words, what is the importance of the orthogonality of the axes? The axes of dimensions beyond 3-d are not orthogonal, does this change the definition of "dimension".

I cannot conceive a geometrical image of higher dimensions. Algebraically, yes, but not geometrically.

By @derbOac - 7 months
I love this stuff because it's so counterintuitive until you've worked through some of it. There was an article linked to on HN a while back about high-dimensional Gaussian distributions that was similar in message, and probably mathematically related at some level. It has so many implications for much of the work in deep learning and large data, among other things.
By @ljouhet - 6 months
Thank you! I love this article, but I couldn't find it.

I always search "Curse of dimensionality" instead of "Counterintuitive properties..."