The math that explains why bell curves are everywhere

Posted by ibobev 2 days ago

The math that explains why bell curves are everywhere(www.quantamagazine.org)

123 points | 67 commentspage 2

bluGill 9 hours ago|

100 year floods are not happening more often in most cases - it is just that the central limit therom teachs us the 10 year flood is almost as high water as the 100 or even 1000 year flood.

thaumasiotes 7 hours ago||

> it is just that the central limit therom teachs us the 10 year flood is almost as high water as the 100 or even 1000 year flood.

No, the central limit theorem specifically doesn't address that. It says that the sum of iid random variables is well approximated by a normal distribution near the mean; it doesn't tell you how well that approximation works in the tails. The rarer the event you're modeling is, the less relevant the normal approximation is.

gowld 9 hours ago||

Explain?

What are "most cases"?

DroneBetter 11 hours ago||

I hate Quanta a lot

a vast amount of fluff for less than a college statistics professor would (hopefully) be able to impart with a chalkboard in 10 minutes, when Quanta has the ability to prepare animated diagrams like 3Blue1Brown but chooses not to use it

they could go down myriad paths, like how it provides that random walks on square lattices are asymptotically isotropic, or give any other simple easy-to-understand applications (like getting an asymptotic on the expected # of rolls of an n-sided die before the first reoccurring face) or explain what a normal distribution is, but they only want to tell a story to convey a feeling

they are a blight upon this world for not using their opportunity to further public engagement in a meaningful way

andyjohnson0 7 minutes ago||

I probably don't have your mathematical sophistication - but I like and appreciate Quanta precisely because it helps people like me to understand a little bit about challenging things. This enriches my tiny life, and I hope it also makes the world a fractionally better place for us all.

Perhaps you're just not in their intended audience?

tptacek 11 hours ago|||

A lot of times on HN when a math topic comes up that isn't about 3b1b, someone will jump in to say "this isn't as good as 3b1b". Last time I saw that, I was moved to comment:

https://news.ycombinator.com/item?id=45800657

3b1b doesn't have the same goal as Quanta, or as introductory guides. It's actually not that great a teaching tool (it's truly great at what it is for, which is (a) appreciation and motivation, and (b) allowing people to signal how smart they are on message board threads by talking about how much people would get out of watching 3b1b).

This is prose writing about math. It's something you're meant to read for enjoyment. If you don't enjoy it, fine; I don't enjoy cowboy fiction. So I don't read it. I don't so much look for opportunities to yell at how much I hate "The Ballad of Easy Breezy".

bmenrigh 11 hours ago|||

I don’t fault Quanta (or 3b1b) for being the way they are. Each is serving their goal audience pretty well.

My compliant is only that there should be a dozen more just like them, each competing with each other for the best, most engaging math and science content. This would allow for more a broader audience skillevel to be reached.

As it stands, we’re lucky even to have Quanta and 3b1b.

I think there is hope though, quite a few new-ish creators on YouTube are following in Grant’s footsteps and producing very technically detailed and informative content at similar quality levels.

paulpauper 9 hours ago|||

there is no getting around that learning math requires actually having to buckle down and read and do math . A video will not suffice.

DroneBetter 8 hours ago|||

well for one who does buckle down and read and do math, the expected amount of new information brought to them by a 3B1B video as supplementary material upon a topic (with the normal distribution being one that admits a direct comparison from the article) is nonzero, by merit of it possibly having ideas to convey from outside their usual purview and formal background that may be applicable to the doing of math (as has been the case for me, someone who [does math](https://oeis.org/wiki/User:Natalia_L._Skirrow)), while for Quanta fluff pieces it's zero.

by the metric of "if this expository piece were to be taken to a time before its subject had been considered and presented to researchers, how useful would its outline be towards reproducing the theory in its totality," Quanta's writings (on both classical and research math) mostly score 0

tptacek 9 hours ago|||

Couldn't agree more, which is why I think it's odd to suggest that a pop-sci magazine article is somehow a disservice that 3b1b would correct.

KnuthIsGod 9 hours ago|||

3Blue1Brown

Seems a bit like Ted Talks. Lightweight popcorn for the simple minded.

throwaway81523 3 hours ago||

Quanta used to have tons of good stuff and not much crap. Now there's enough crap that if there's still good stuff, it gets lost in the noise.

nsnzjznzbx 7 hours ago||

So Abraham de Moivre was the worlds first quant?

jibal 7 hours ago||

https://en.wikipedia.org/wiki/Central_limit_theorem

> suppose that a large sample of observations is obtained, each observation being randomly produced in a way that does not depend on the values of the other observations, and the average (arithmetic mean) of the observed values is computed. If this procedure is performed many times, resulting in a collection of observed averages, the central limit theorem says that if the sample size is large enough, the probability distribution of these averages will closely approximate a normal distribution.

EGreg 8 hours ago||

WCSTombs 7 hours ago||

It's not a bad article, but I have to point something out:

> Laplace distilled this structure into a simple formula, the one that would later be known as the central limit theorem. No matter how irregular a random process is, even if it’s impossible to model, the average of many outcomes has the distribution that it describes. “It’s really powerful, because it means we don’t need to actually care what is the distribution of the things that got averaged,” Witten said. “All that matters is that the average itself is going to follow a normal distribution.”

This is not really true, because the central limit theorem requires a huge assumption: that the random process has finite variance. I believe that distributions that don't satisfy that assumption, which we can call heavy-tailed distributions, are much more common in the real world than this discussion suggests. Pointing out that infinities don't exist in the real world is also missing the point, since a distribution that just has a huge but finite variance will require a correspondingly huge number of samples to start behaving like a normal distribution.

Apart from the universality, the normal distribution has a pretty big advantage over others in practice, which is that it leads to mathematical models that are tractable in practice. To go into a slightly more detail, in mathematical modeling, often you define some mathematical model that approximates a real-world phenomenon, but which has some unknown parameters, and you want to determine those parameters in order to complete the model. To do that, you take measurements of the real phenomenon, and you find values for the parameters that best fit the measurements. Crucially, the measurements don't need to be exact, but the distribution of the measurement errors is important. If you assume the errors are independent and normally distributed, then you get a relatively nice optimization problem compared to most other things. This is, in my opinion, about as much responsible for the ubiquity of normal distributions in mathematical modeling as the universality from the central limit theorem.

However, as most people who solve such problems realize, sometimes we have to contend with these things called "outliers," which by another name are really samples from a heavy-tailed distribution. If you don't account for them somehow, then Bad Things(TM) are likely to happen. So either we try to detect and exclude them, or we replace the normal distribution with something that matches the real data a bit better.

Anyway, to connect this all back to the central limit theorem, it's probably fair to say measurement errors tend to be the combined result of many tiny unrelated effects, but the existence of outliers is pretty strong evidence that some of those effects are heavy-tailed and thus we can't rely on the central limit theorem giving us a normal distribution.

D-Machine 7 hours ago||

This is also right I believe, normal distributions are not ubiquitous really, just they are approximately ubiquitous (and only really if "ignoring rare outliers", and if you also close your eyes to all the things we don't actually understand at all).

The point on convergence rates re: the central limit theorem is also a major point otherwise clever people tend to miss, and which comes up in a lot of modeling contexts. Many things which make sense "in the limit" likely make no sense in real world practical contexts, because the divergence from the infinite limit in real-world sizes is often huge.

EDIT: Also from a modeling standpoint, say e.g. Bayesian, I often care about finding out something like the "range" of possible results for (1) a near-uniform prior, (2), a couple skewed distributions, with the tail in either direction (e.g. some beta distributions), and (3) a symmetric heavy-tailed distribution (e.g. Cauchy). If you have these, anything assuming normality is usually going to be "within" the range of these assumptions, and so is generally not anything I would care about.

Basically, in practical contexts, you care about tails, so assuming they don't meaningfully exist is a non-starter. Looking at non-robust stats of any kind today, without also checking some robust models or stats, just strikes me as crazy.

abetusk 5 hours ago||

The fact the article said that is a gross error. You've identified the issue head on.

The sum of independent identically distributed random variables, if they converge at all, converge to a Levy stable distribution (aka fat-tailed, heavy tailed, power law). In this sense, Levy stable distributions are more "normal" than the normal distribution. They also show up with regular frequency all over nature.

As you point out, infinite variance might be dismissed but, in practice, this just ends up getting larger and larger "outliers" as one keeps drawing from the distribution. Infinities are, in effect, a "verb" and so an infinite variance, in this context, just means the distributions spits out larger and larger numbers the more you sample from it.

throwaway81523 5 hours ago||

Now do power laws.

tsunamifury 8 hours ago|

Bell curves are everywhere because all distributions of any properties clump in some way at some level. The basics of any probability shows this. The result is you “seeing” bell curves everywhere. Aka clumps.

This is a tautology to the extreme.

abetusk 5 hours ago||

No, that's not true.

If sums of independent identically distributed random variables converge to a distribution, they converge to a Levy stable distribution [0]. Tails of the Levy stable distribution are power law, which makes them not Gaussian.

[0] https://en.wikipedia.org/wiki/Stable_distribution

D-Machine 7 hours ago|||

Yup. And in general more heavy-tailed bumps are in fact better models (assuming normality tends to lead to over-confidence). Really think the universality is strictly mathematical, and actually rare in nature.

jibal 7 hours ago||

First, every mathematical theorem is a tautology ... don't conflate "tautological" with "obvious".

Second, your "aka" is incorrect --- there is all sorts of clumping that is not a normal distribution.

thaumasiotes 6 hours ago||

As I'm sure tsunamifury would agree, it is incredibly common for people to label "bell curves" by eyeball, regardless of whether they are normal curves. To most people, "clumping" in a one-dimensional spectrum is all they mean by the phrase "bell curve".

D-Machine 5 hours ago|||

This was sort of my reading as well: I took "clumping" to mean "bump-shaped".

jibal 2 hours ago||||

This completely misses the point, which is that the central limit theorem says that it isn't just any old clumping, it's always the normal distribution. tsunamifury dismissed this strong finding as "tautology" because clumping is obvious ... but that it's always precisely a bell curve is far from obvious. Again,

> your "aka" is incorrect --- there is all sorts of clumping that is not a normal distribution.

That it's "incredibly common for people to label "bell curves" by eyeball, regardless of whether they are normal curves" is not just not relevant, it's anti-relevant ... the central limit theorem says that the distribution of the means is always a bell curve--a normal distribution--not merely a "bell curve".

Anyway, this is covered in far more detail in other comments and material elsewhere, so this is my last contribution.

tsunamifury 5 hours ago|||

[dead]