Slashdot Mirror


Gaussian Distribution being questioned

Robert Wilde writes "The Financial Times is reporting in two stories that a group of scientists have discovered that any scale-independent system does not follow the traditional Gaussian Bell Curve but a new curve. " Interesting implications for such systems. From what I can gather from the article, for those systems in which this curve is more appropriate, rare events will occur more often than predicted by the Gaussian distribution. Anyone have more comments on this?

205 comments

  1. Re:How does this relate to standard deviations? by melancholy_dane · · Score: 1

    The curve is weird looking, but still readily quantifiable. Besides the mean, and standard deviation, you can use skew (un-centeredness) and kurtosis (bulging in the middle) to describe how different a given curve is from a bell-curve.
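
    A minimal sketch of the four descriptors mentioned above, using population (divide-by-n) moments. The two sample lists are made up for illustration, and the function name describe is hypothetical.

```python
import math

def describe(values):
    """Return mean, standard deviation, skewness, and excess kurtosis.

    Uses population (divide-by-n) moments; excess kurtosis is 0 for a Gaussian.
    """
    n = len(values)
    mean = sum(values) / n
    # Central moments of order 2, 3, and 4.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    std = math.sqrt(m2)
    skew = m3 / m2 ** 1.5
    excess_kurtosis = m4 / m2 ** 2 - 3.0
    return mean, std, skew, excess_kurtosis

symmetric = [1, 2, 3, 4, 5]        # made-up data, symmetric about 3
right_tailed = [1, 1, 1, 2, 10]    # made-up data with one large value
print(describe(symmetric))
print(describe(right_tailed))
```

    The symmetric list has zero skew, while the single large value in the second list pushes the skew positive.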

    More interesting, though, is the fact that the curves shown in the ft.com article weren't properly normalized; comparing these graphs visually doesn't begin to show what the differences are, and the axes on the graphs didn't make much sense. If x = "rarity", then what does y correspond to? Typically you would show y as frequency and x as a value, and from this you would determine rarity.

    anyway, my two cents

    m.d.

    --
    -m.d.
  2. hm? by stoney · · Score: 1

    Easily. This is called the slashdot effect.

  3. Re:Interesting not exceptional. I agree!!! by apio · · Score: 1

    The journalist completely missed the point. There are already lots of other curves that you can use to fit 'non-normal' data (e.g. Weibull, Gamma, etc) that could be more appropriate than a normal curve. The link with chaos theory and the new distribution is interesting though.

    --

    'There is no intellectual exercise that is not ultimately useless' - Jorge Luis Borges
  4. Dry by heroine · · Score: 1

    This is the driest article to appear on slashdot ever. As Hemos gets closer and closer to grad school the articles get drier and drier. How about doing something interesting, like teaching lab mice how to roll over.

  5. Re:Sceptic in Slashdotia by puppet10 · · Score: 2

    I admit that popular science journalism is lacking (from the articles given here it is really impossible to form any informed opinion about the scientific validity or merit of the work reported on, and no references to that work are given). Still, I quickly checked the first name in the article (Donald Turcotte), and indeed he has been published in a number of peer-reviewed journals, including Science, on self-organized critical behavior. What I don't understand is how you can judge whether the scientists involved in this research are doing good work or quackery based on one popular press article, without trying to examine the facts before jumping to conclusions. I'm not saying that what is described in the articles is right or groundbreaking; what I am saying is that these articles alone don't give nearly enough information to form a reasonable opinion (although I don't have enough time to go and do a full literature review to form an informed opinion either).

    Don't blame the scientists for poor reporting.

    --
    -------- This space intentionally left blank --------
  6. non-gaussian distributions by Anonymous Coward · · Score: 0

    The interesting thing here, and it has long been known but often ignored, is that there are some measurable things in life for which the average value or the standard deviation cannot be calculated, because no matter how many values you average together, along comes a bigger value to throw off your average. (Another way of saying that the tails of the distribution are bigger than usual.)

    Search for Pareto or Paretian distributions and you can find a few equations that deal with the same thing.
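
    To make the "average cannot be calculated" point concrete, here is a rough sketch assuming a Pareto density f(x) = alpha * x**-(alpha + 1) on x >= 1. It evaluates how much of the mean is accumulated below a cutoff; the function name partial_mean and the chosen alpha values are illustrative assumptions.

```python
import math

def partial_mean(alpha, cutoff):
    """Integral of x * f(x) from 1 to cutoff for the Pareto density
    f(x) = alpha * x**-(alpha + 1) on x >= 1."""
    if alpha == 1:
        # The integrand is 1/x, so the integral is a logarithm.
        return math.log(cutoff)
    return alpha / (alpha - 1) * (1 - cutoff ** (1 - alpha))

for cutoff in (1e2, 1e4, 1e8):
    # alpha = 3 settles toward 1.5; alpha = 1 keeps growing with the cutoff.
    print(cutoff, partial_mean(3, cutoff), partial_mean(1, cutoff))
```

    With alpha = 3 the partial mean converges, but with alpha = 1 it grows like the logarithm of the cutoff, so ever larger values keep dragging the average upward.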

    But the article cited was kinda dumb, because in this case, they just found another equation to misuse.

  7. Re:Sceptic in Slashdotia by mjackson14609 · · Score: 1

    Turcotte has a 2-part paper on "Self-Affine Time Series" in a recent /Adv. Geophys./ that looks, by timing and title, as if it would provide the technical information those who are really interested would want.

    --
    I decided that behaving ethically was the most nihilistic thing I could do. - Paul Pavel
  8. Re:..:) by Axe · · Score: 1

    Do not suffer too much. That article is garbage. Read some books on related subjects instead. As an example, Mandelbrot is pretty funny, and many parts of his writing are accessible to a person without a strong mathematics/statistics background.

    --
    <^>_<(ô ô)>_<^>
  9. Extremely Interesting, looking at God by Wah · · Score: 2

    Not to be too crazy, but if this holds up, and others find this curve, it is exceptional. The basic curves of life, and chaos. This is the stuff that explains why a seashell and a universe have the same design. Chaos theory and quantum mechanics both show a certain unpredictability to reality. Science like this shows there is some underlying pattern. At the very least this is extremely interesting, at least for all of us that want our own universe some day.

    --
    +&x
  10. Re:Gauss turns over in his grave by Axe · · Score: 1

    That's not an observation or a fact.

    You miss, sorry. This is only true for finite-variance variables. Observables in nature are not required to have finite variance; there are plenty of cases where they do not.

    Your position is typical of those who just took some statistics classes but never bothered to check the fine print and understand what it means (no insult, please). But be careful when you make strong statements in public. They sound funny.

    Check some references on "stable distributions" in statistics. For physical examples do a search on "Levy flights"

    Also, the Gaussian distr. is not "normal" in the sense of frequent occurrence. I would claim that the scaling, or "power law" as physicists refer to it, distribution is much more common in natural phenomena.



    --
    <^>_<(ô ô)>_<^>
  11. What is the big deal? by illtyd · · Score: 1
    A couple of comments.

    Statisticians have said for ages that not all data follows the normal (a.k.a. gaussian) distribution. We even have names for the ways in which distributions differ from the normal. Skewness describes distributions where one tail is stretched out in one direction longer than the other like this, or this more extreme example.

    Kurtosis describes the "thickness" of the tails in comparison to the height of the centre of the distribution (i.e. this has more kurtosis than this).

    So, with some distributions, the chance of rare events is greater than some others.

    Secondly, in the Financial Times articles (not my usual choice of statistical literature) there seems to be little link between the "universal curve" stuff and distributions other than the normal.

    --
    ---- "First came stats, pulling habits out of rats ..." Steve Taylor - "Jung and the Restless"
    1. Re:What is the big deal? by illtyd · · Score: 1

      If you don't want to jump backwards and forwards, all the graphs are accessible from
      this link

      --
      ---- "First came stats, pulling habits out of rats ..." Steve Taylor - "Jung and the Restless"
  12. Ooh, i'm a believer (sung to music) by Wah · · Score: 2

    note: this is all dependent on whether this is real or just some disillusioned scientists. I tend to believe it, mainly because these scientists would most likely not be the type to publish normally, but until I see it from another source I won't totally believe it. That being said, let me argue like I do.

    Let's say one night you watch the results of the lottery on TV, and the numbers '1-2-3-4-5-6' come up. Is that a rare occurrence? No. That sequence is as likely to occur as your birthday and your girlfriend's birthday combined into esoteric equations.

    Example number 2: I'm with this girl one night. I say my astrological sign is Scorpio. "Really!" she exclaims, "I'm Scorpio too!" What are the probabilities of that happening? 1/144? No, just 1/12. At one point (and cryptos will be familiar with this) if you add people, it becomes a rare event that you do not find people with the same sign.
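
    The claim about adding people can be checked with a short calculation. This sketch assumes the 12 signs are equally likely and independent between people, which real birth dates only approximate; the function name prob_shared_sign is hypothetical.

```python
def prob_shared_sign(n_people, n_signs=12):
    """Probability that at least two of n_people share a sign,
    assuming signs are independent and equally likely."""
    p_all_distinct = 1.0
    for k in range(n_people):
        # The (k+1)-th person must avoid the k signs already taken.
        p_all_distinct *= max(n_signs - k, 0) / n_signs
    return 1.0 - p_all_distinct

for n in (2, 3, 5, 8, 13):
    print(n, round(prob_shared_sign(n), 3))
```

    Two people match with probability 1/12, a group of five already has better than even odds of a shared sign, and with thirteen people a match is certain.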


    Both of the examples you give here are actually rare occurrences, not the number series themselves, but the fact that you recognize them as special series. You note their occurrence as extremely rare (the water cooler talk if the lotto was 1-2-3-4-5-6!!) thus in fact making them rare.

    These guys were both looking at special curves, random in fact, that turned out to be the same. That is significant for the number of other patterns that can, or cannot, be explained. At the very least this will cause your insurance rates to go up :)

    We're 6 billion on this Earth. It's bound to happen to someone. Same thing with winning the grand prize lottery once or twice.

    That's what the story said, very rare occurrences are more likely. Check out the Drake Equation if you think that couldn't be significant.

    cold fusion
    this is different (so far) in that it was two totally separate areas of study that found the same thing, not some freaks in the desert.

    Cool stuff regardless.
    Slashdotia
    pronounced Slash-dosh-ya? :)

    --
    +&x
    1. Re:Ooh, i'm a believer (sung to music) by Enoch+Root · · Score: 1
      cold fusion

      this is different (so far) in that it was two totally separate areas of study that found the same thing, not some freaks in the desert.

      To me, the difference is that someone has made the claim that the Universe is radically different from what we know, based on a sample of data that was not peer-validated. If you remember the data on cold fusion, it made perfect sense if you adjusted the y-axis, and didn't lead to such mind-boggling conclusions.

      I'm willing to bet this is exactly the same. Inventive scientists deduce important rules based on experimental data. Rigorous scientists double-check their data before deducing important rules. What we need is inventive, rigorous scientists.

      Slashdotia

      pronounced Slash-dosh-ya? :)

      Sounds good to me. :) As we all know, Slashdotia is the Capital of Slashdom. :)

      "There is no surer way to ruin a good discussion than to contaminate it with the facts."

  13. mathematical nonsense by Trojan · · Score: 1

    This is all bollocks.

    The Gaussian distribution is 'the universal distribution' in the following sense:
    Consider a series of events that generate some value. For example, rolling a die, which generates a value from 1 to 6. Assume that these events are independent, meaning that, say, the 10th outcome will in no way influence, say, the 20th outcome. Now take the first N outcomes, add them together and divide by N. The larger you take N, the better the distribution of this average follows the Gaussian distribution. (And I should add that there are some mild conditions that have to be satisfied).
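
    The dice example can be checked exactly rather than by simulation. This sketch builds the exact distribution of the total of N fair dice (the average is just the total divided by N) and measures how far it is from a Gaussian density with the same mean and variance. The helper names dice_sum_distribution, gaussian_pdf and max_gap are illustrative, not from the discussion.

```python
import math

def dice_sum_distribution(n_dice):
    """Exact probabilities for the total of n fair six-sided dice."""
    dist = {0: 1.0}
    for _ in range(n_dice):
        new = {}
        for total, p in dist.items():
            for face in range(1, 7):
                new[total + face] = new.get(total + face, 0.0) + p / 6
        dist = new
    return dist

def gaussian_pdf(x, mean, var):
    """Gaussian density with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

def max_gap(n_dice):
    """Largest absolute difference between exact probability and Gaussian density."""
    mean = 3.5 * n_dice
    var = 35 / 12 * n_dice  # the variance of one fair die is 35/12
    dist = dice_sum_distribution(n_dice)
    return max(abs(p - gaussian_pdf(t, mean, var)) for t, p in dist.items())

for n in (1, 2, 5, 20):
    print(n, max_gap(n))
```

    The gap shrinks as N grows, which is the sense in which the average "follows the Gaussian distribution" better for larger N.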

    Now what are they saying here? That the 'rareness' of species does not follow the Gaussian distribution? How do you quantify 'rareness'? How can this satisfy any kind of independence condition (where there's one rare animal, there are bound to be more).

    What's weirdest of all is the statement that rare species are more common than expected. What a joke! If something is more common than expected, then by definition it is not as rare as you thought!

    1. Re:mathematical nonsense by Don+Sample · · Score: 1

      As you mention, the Gaussian distribution only really applies to a fairly narrowly defined type of phenomenon. One that you rarely encounter in the real world, but for a lot of cases it is close enough.

      For other things there are better curves to use. The examples they gave for cases where the Gaussian didn't work sounded like the Poisson distribution would have been a more appropriate one to use, and surprise, surprise, the Poisson skews in favour of less likely events.

    2. Re:mathematical nonsense by Trojan · · Score: 1

      Not really all that narrowly defined... but in any case a well-defined class of phenomena...

      The 'outcomes' have to be independent. Say you take 'length of human being'. If you choose the humans in a purely random way, those lengths will be independently distributed, and (sum_of_N_lengths - N * average_length) / (sqrt(N)*st.dev.) will be approximately distributed according to the standard Gaussian distribution, for N large enough.

      If you take 'rareness' of an animal, and define 'rareness' in a way like Ben Tilly describes it, or in any other way I can imagine, there is no way that your outcomes are independent.

  14. Isn't This Chi-Square Distribution??? by Anonymous Coward · · Score: 0

    Like someone already said, show me a formula for the distribution. I have to guess that if it really has anything to do with Chaos Theory the actual distribution curve is less than half the story. Here is a picture of chi-square (actually it's flip-flopped, but you get the idea) http://www.ruf.rice.edu/~lane/hyperstat/A100557.html

    1. Re:Isn't This Chi-Square Distribution??? by tim+bates · · Score: 1

      exactly: chi square or Poisson more likely (the distribution of accidents, i.e. unlikely events). this is just more chaos drivel whipped along by ignorant journalism. cheers, tim

  15. Oh, another point on Chaos Theory... by Enoch+Root · · Score: 2
    Since when is Chaos Theory ridiculed by the scientific community because it's a little wild? And how in hell could the popularity of Jurassic Park ruin the work of chaos theorists?

    They're trying to sweeten up the deal by placing the guys behind this as innovators who took on a controversial path. That's just downright silly. I took Chaos Theory grad. courses in college, and let me tell you it's so widely-used that it's like saying electricity is a controversial theory. Let me also tell you that what they're trying to say has absolutely nothing to do with Chaos Theory.

    I mean! I hope they never make a movie starring Jeff Goldblum about Newton's life, because we might end up refuting Classical Mechanics (even at non-relativistic speeds) tomorrow, wouldn't we? And those movies 'IQ' and 'Young Einstein' really ruined Relativity for me. Drat.

    "There is no surer way to ruin a good discussion than to contaminate it with the facts."

    1. Re:Oh, another point on Chaos Theory... by Anonymous Coward · · Score: 0

      aye. Chaos theory has been saying things like this for years and has lots of publications, experiments and data to back it up. Well, I guess the original theorist who came up with the big bang had the same... but years later a bunch of weenies got credit (eventually a Nobel prize) for it without even referencing him. Academia and science are strange places.

  16. Nothing new about this by jerrytcow · · Score: 2

    This is exactly what one sees when plotting exponential distributions on a log scale. If you have a reaction where A -> B at some rate, then plotting, for example, event durations you would get this distribution as long as the x-axis is log. When working with any system where there is a delta G of reaction(s) the distribution is not Gaussian and you can see this graphically.
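
    A small sketch of that effect, assuming a single exponential process with an arbitrary rate of 1. If the duration t is exponential, then y = ln(t) has density rate * exp(y - rate * exp(y)), which peaks at y = -ln(rate) and is lopsided around the peak. The function name log_time_density is hypothetical.

```python
import math

def log_time_density(y, rate=1.0):
    """Density of y = ln(t) when t is exponentially distributed with the given rate."""
    return rate * math.exp(y - rate * math.exp(y))

rate = 1.0  # assumed rate constant for A -> B, for illustration only
mode = -math.log(rate)
for offset in (-3, -2, -1, 0, 1, 2):
    # Compare equal distances on either side of the peak.
    print(offset, log_time_density(mode + offset, rate))
```

    Two units below the peak the density is still noticeable, while two units above it has almost vanished, so the curve on a log axis has one long tail and one short one.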

  17. This is well known stuff by Anonymous Coward · · Score: 0

    Researchers have known for years that natural processes typically don't follow Gaussian distributions - in fact if a process is self-similar with certain parameters you can mathematically prove that it is non-Gaussian. But it's nice to see this fact get some attention. There was some nice research on self-similar models of network traffic a few years back -- here's an experiment for you: Plot an hour's worth of pings to a distant site. It will be non-Gaussian, or I'll eat my hat.

    1. Re:This is well known stuff by Anonymous Coward · · Score: 0

      OF COURSE the distribution of ping times will not be Gaussian. There is no possible way it could be --- because the Gaussian distribution extends to infinity in both directions, and you simply cannot possibly get negative ping times.

    2. Re:This is well known stuff by Anonymous Coward · · Score: 0

      It's funny how slashdotters consider themselves so academically elite and yet when any article gets posted with some kind of mathematical content the ignorance of remarks is no less than you'd get with any sample of average people. Next thing you'll be telling me the samples of people's heights could not follow a Gaussian distribution because the heights of people do not extend to and from +/- infinity. In fact, precisely what distributions extend to and from infinity? I guess it wasn't really of any practical use then, the Gaussian distribution?
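
      How much probability a fitted Gaussian puts below zero depends on how many standard deviations the mean sits above zero. A quick sketch with made-up numbers (a height of 170 cm with a spread of 10 cm, and a ping of 100 ms with a spread of 60 ms, both assumptions for illustration):

```python
import math

def gaussian_cdf(x, mean, sd):
    """P(X <= x) for a Gaussian with the given mean and standard deviation."""
    return 0.5 * math.erfc(-(x - mean) / (sd * math.sqrt(2)))

# Assumed illustrative numbers, not measurements from the thread.
print(gaussian_cdf(0, mean=170, sd=10))   # heights in cm
print(gaussian_cdf(0, mean=100, sd=60))   # ping times in ms
```

      In the first case the impossible negative region carries a vanishingly small probability, so the infinite tails do no practical harm; in the second it is a few percent, which is where the objection starts to matter.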

  18. why scientists/engineers use the gaussian dist. by theaphila · · Score: 1

    is not because it's more correct, but because it makes the math nice. and it turns out, the things that we do with that nice math seem to have pretty accurate results. for instance, my data has sufficient outliers that an exponential distribution would probably model them better, but using a gaussian distribution means i can use easy math to get answers that agree with my data to experimental uncertainty, and i'm not going to spend any time making a more precise model that won't get me any improvement in my results (my data was all taken long ago, so the experimental uncertainty is fixed)

  19. Re:Having trouble understanding the graph... by Anonymous Coward · · Score: 0

    >{Yes, I know you can have asymmetric gaussian distributions.}

    Sorry, I was thinking three dimensions. It really wouldn't be gaussian if it were asymmetric in two.

  20. Re:Interesting not exceptional by Compuser · · Score: 1

    As you said, for central limit theorem you need a lot of
    independent rv's. I wonder if self-similarity causes
    interdependent rv's, such that they can be shown to
    converge to a specific curve, or if their research is just:
    "look at our data - it looks about the same" type stuff.
    Does anyone know if there is a formal theoretical
    basis for this work?

  21. Gaussian has been out for a while.. by Richard+Dansereau · · Score: 1

    I wonder why there is the sudden interest in this. While I'll admit that many of my colleagues still haven't figured out that the Gaussian curve is not supreme, it has been known for many decades that most things don't follow straight Gaussian randomness (or white noise as many like to call it). Since I started looking at chaos and fractals many years ago, all of the research I've done and looked at, ranging from particle motion, to weather patterns, to fluid dynamics, to DNA, to Internet traffic, to images and textures, to EKG signals, and the list goes on, has had very non-Gaussian but still random characteristics. Our descriptions of the randomness were through chaos and fractal theories.

    I'm glad to see that this is getting some press time, but, it does seem strange to me since much of this has been known since well before the 1970s as quoted in the articles.

    I suppose it is time to get the word out a little more and throw off the limiting shackles of the Gaussian distribution and white noise
    (try brown and pink noise instead.. much more pleasing).

  22. Stats 101 by Anonymous Coward · · Score: 0

    This is interesting, but I think that, like many popular accounts, it is misleading. From my undergraduate statistics class I remember the Central Limit Theorem, which (loosely) states that the distribution of the mean of a large number of random samples from a population with a given mean and variance will be approximately Gaussian, regardless of what the population distribution is. The key here is the word approximately: what the article seems to indicate is that for large samples drawn from chaotically driven processes (a chaotic process being deterministic but extremely sensitive to initial conditions), their curve is a better approximation of the expected distribution than a pure Gaussian. I wouldn't be surprised, though, if as the sample size went to infinity their curve (really a family of curves, like any distribution) became approximately Gaussian too, albeit "slowly".

  23. Re:Having trouble understanding the graph... by Bacteriophage · · Score: 1

    Thanks, people. I was looking at the graph merely in one dimension, and didn't even consider that it represented an X-Y axis. Now I understand perfectly. :)
    "There are no shortcuts to any place worth going."

    --
    "Be regular and orderly in your life, so that you may be violent and original in your work." -Flaubert
  24. Chaos Theory by Anonymous Coward · · Score: 0

    I would just like to take this opportunity to say that chaos theory is completely misunderstood by almost everybody, especially journalists. Chaos theory is no use in predicting specific events, nor does it pretend to be. Chaos theory tells us that there can be order in random events, but that this order is not predictable.

  25. Rare Occurrences. by Shokwave · · Score: 1

    I think that anyone who deals with large amounts of computer hardware (ie. enough to be a statistically sound sampling) could attest to the claim made.
    Certain failures and other occurrences happen much more frequently than one expects from a straightforward analysis of uptimes and standard accepted failure rates.

    --


    I love you... Ok I love you AND the UNIX operating system, but then I've known it longer.
    1. Re:Rare Occurrences. by Anonymous Coward · · Score: 0

      Man, do I know what you're talking about.
      Sometimes, we just get the random motherboard that just DOESN'T POST, or works fine, except for the IDE controller or something else that you don't find until you've installed windo[ze] and then wonder why the hell it keeps crashing. I usually blame it on windo[ze], but the problem goes away after i replace the hardware.
      It's the same with any other component. The first P-3 system i built had a bad motherboard that caused it to run sluggishly and crash a lot... I don't know, that may just be how windo[ze] NT is supposed to run, but i think it should have some degree of usability...
      Oh well, this article has given me a lot to ponder over why i'm having so much trouble with that machine...

      --TwistedGreen
      (I didn't want A/C, but I couldn't log in!)

    2. Re:Rare Occurrences. by tlhIngan · · Score: 1

      Well, equipment curves aren't gaussian. If you plot Failure vs. Time, you'll find two sharp peaks: at the beginning (close to Time = 0), and near the end (Time = ??). In between it's a flat line (that's not near zero). Ideally, manufacturers would burn in equipment until it's in the "flat" region, but in this "gotta-have-it-now" age, I think testing is going down the tubes...

      (The other end is because the device dies from old age. Of course, the MTBF numbers are just another statistic...)

      No numbers, but it's the approximate curve.

    3. Re:Rare Occurrences. by Anonymous Coward · · Score: 0

      This, I think, is what is known as the bathtub curve - plot a graph of the curve described above... Steff (who can't remember his login details)
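
      One common way to sketch a bathtub curve is to add three failure rates: a falling one for early failures, a constant one for random failures, and a rising one for wear-out. The version below uses Weibull hazard rates, and every parameter is a made-up value chosen only to produce the shape, not real equipment data.

```python
def weibull_hazard(t, shape, scale):
    """Instantaneous failure rate of a Weibull lifetime model at time t > 0."""
    return (shape / scale) * (t / scale) ** (shape - 1)

def bathtub_hazard(t):
    """Sum of early-failure, constant, and wear-out failure rates."""
    # All parameters below are made-up illustrative values.
    infant_mortality = weibull_hazard(t, shape=0.5, scale=200.0)  # falls with time
    random_failures = 0.001                                       # flat floor
    wear_out = weibull_hazard(t, shape=5.0, scale=10.0)           # rises with time
    return infant_mortality + random_failures + wear_out

for years in (0.01, 0.1, 1, 3, 5, 8, 10):
    print(years, round(bathtub_hazard(years), 4))
```

      The rate is high right after time zero, dips in the middle, and climbs again toward the end, which is the shape described above.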

  26. I knew my teachers were wrong... by .pentai. · · Score: 1

    You know, this isn't really DIRECTLY on topic, however:

    In high school a few years back I was a teacher's aide, and my teachers all talked about the standard curve, blah blah blah.

    Yet, never once did I ever see the distribution - but no, I had to be wrong, right? I mean, who am I to tell the teacher they're wrong (not that I was ever slow to disagree >:P )

    Anyways, looking at the graph, it seems a bit more realistic than a standard curve, because in reality, intelligence grows fast but falls faster :)

    (much like my grades...quick to raise, quicker to fall)

    Oh, and anyone notice how if you turn your head sideways this kind of looks like half a turnip... with this and the poll option, has Rob revealed a secret fetish?

    1. Re:I knew my teachers were wrong... by Mathieu · · Score: 1

      I don't know what your teachers said. Nevertheless, for independent, identically distributed random variables the law is proven.

      The problem is that not all systems behave like that. One striking example: the Black-Scholes pricing model for options (calls and puts on stock) relies on the assumption of independence and a Gaussian law. More precisely, the hypothesis is that purchases and sales of a stock are independent. For the most part this is true. If you sell your stock to buy a new car, this is independent.

      Nevertheless, if one follows Black-Scholes to its ultimate consequences, one gets the conclusion that the probability of a stock market crash is very low, about one crash per millennium, and this is clearly false.

  27. Re:Sceptic in Slashdotia by Wah · · Score: 2

    also go here halfway down the page
    jump to "turcotte"

    --
    +&x
  28. What the curve looks like. by Derek+Pomery · · Score: 1

    http://www.ft.com/hippocampus/q14ae5a.htm
    Also, I don't understand how self-similarity would change the bell curve. You'd think every portion would still have the same probabilities, no?

    --
    -- perl -e'print pack"H*","6e656d6f406d38792e6f7267"' /. ate my old sig. Bastards.
    1. Re:What the curve looks like. by smale · · Score: 2

      from the second article:

      "The reason the systems behaved in the same fashion, they agreed, was that they shared a feature known as self-similarity. If an object is self-similar, it means it looks the same when viewed from far away or nearby. One example is the cauliflower: just as it is made up of individual florets, so each floret is made up of still smaller florets. If you were given a picture with no sense of scale, you could not tell if you were looking at a whole cauliflower or just one floret."

      I grepped the article for "fractal" and not once was it mentioned. Gee, I'm pretty sure that's the term used for what the author describes, or is the target audience so simplistic that the proper terms have to be dumbed down?

      Fear the popular press's interpretation of mathematical research data, especially when they need to mention Jurassic Park in the body of the story.

    2. Re:What the curve looks like. by Anonymous Coward · · Score: 0
      I grepped the article for "fractal" and not once was it mentioned. Gee, I'm pretty sure that's the term used for what the author describes, or is the target audience so simplistic that the proper terms have to be dumbed down?

      I have no idea what the scientists said to the reporters in this case, but FWIW:

      A fractal is an object where two different ways to compute its dimension (topological and Hausdorff) disagree. Many textbook examples of fractals are self-similar, in the sense that if you take two small regions of the object, parts of them will be related by a simple geometrical transformation. However, there's no intrinsic relation between self-similarity and fractal-ness. For instance, a line is self-similar but not fractal. And an example of something which is not self-similar but obviously fractal is... um... well, now I see why textbooks keep using snowflake curves. Maybe go outside and look at an oak tree for an example...
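
      For exactly self-similar shapes there is a simple shortcut: if the shape is made of N copies of itself, each scaled down by a factor s, the similarity dimension is log(N) / log(s). For well-behaved cases such as the two below this agrees with the Hausdorff dimension. A minimal sketch, with a hypothetical function name:

```python
import math

def similarity_dimension(copies, scale_factor):
    """Dimension D solving copies = scale_factor ** D for an exactly self-similar set."""
    return math.log(copies) / math.log(scale_factor)

line = similarity_dimension(2, 2)   # a segment is 2 copies of itself at 1/2 scale
koch = similarity_dimension(4, 3)   # a Koch snowflake edge is 4 copies at 1/3 scale
print(line, koch)
```

      The line comes out at exactly 1, matching its topological dimension, so it is self-similar but not fractal. The snowflake edge comes out at about 1.26 while its topological dimension is still 1, and that disagreement is what makes it a fractal.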

  29. I knew it by ch-chuck · · Score: 2

    this is just another ruse for the insurance company to raise my flood insurance premiums again!

    Chuck
    Conspiracy theorist

    --
    try { do() || do_not(); } catch (JediException err) { yoda(err); }
    1. Re:I knew it by Anonymous Coward · · Score: 0

      Move where it don't flood. Problem solved.

  30. It is distribution-distribution plot by craw · · Score: 2
    The graph is rather confusing. This is my interpretation of it. Go out in the field and count the number of critters and categorize them by their species (id). Then normalize this count by some factor (perhaps total number of critters that were counted). For instance, I counted 1K monkeys, 500 cats, 500 dogs, 480 turnips, 200 rats, 50 snakes, 10 roaches, 5 hippos, 3 programmers, and 2 script kiddies. Now plot this distro.

    The monkeys were less rare and therefore plot to the right, while the programmers, and script kiddies are rare and plot to the left. The "mean" value is the dogs and cats; this plots more to the right.

    So what they are saying is that there are more species that have a smaller (rarer) number of critters that they could find. The "most common" value corresponds to the "average" number of critters per species.
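
    Running the numbers from the critter count above shows the lopsidedness directly. This is only a sketch of that one made-up survey, not of the data behind the article.

```python
from statistics import mean, median

# Counts taken from the example survey in this comment.
counts = {
    "monkeys": 1000, "cats": 500, "dogs": 500, "turnips": 480, "rats": 200,
    "snakes": 50, "roaches": 10, "hippos": 5, "programmers": 3, "script kiddies": 2,
}

total = sum(counts.values())
average = mean(counts.values())
middle = median(counts.values())
# Species that are rarer than the average count per species.
below = [name for name, c in counts.items() if c < average]
print(total, average, middle)
print(below)
```

    Six of the ten species fall below the average count per species, and the median is well under the mean, so most species sit on the rare side of the average.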

    I'm guessing now, but if one did a similar survey of the world's population using nationality instead of species, one may get a similar type of distribution.

  31. Ooops! by Derek+Pomery · · Score: 1

    He had linked to both articles.
    sorry.

    --
    -- perl -e'print pack"H*","6e656d6f406d38792e6f7267"' /. ate my old sig. Bastards.
  32. Re:BBell Curve with a skew. by mighty+erudite · · Score: 1

    I think what they mean is that the breakthrough was the linkage between this particular skewed curve and a whole slew of previously 'unpredictable' events (e.g. demagnetisation of magnets using heat, turbulent flows, etc.). Also, the possible applications to other types of so-called 'self-similar' events, if their theory turns out to have some merit.

  33. As we all know... by mattdm · · Score: 3
    One-in-a-million chances happen nine times out of ten.

    --

    1. Re:As we all know... by BugMaster+ChuckyD · · Score: 1

      One-in-a-million chances always happen, but only if the chances are exactly one in a million!

    2. Re:As we all know... by Otto · · Score: 1

      >One-in-a-million chances happen nine times out of ten.

      Yes, but only in a large thaumic field. For those who don't know, the thaum is defined as the amount of magic needed to produce, out of thin air, three medium sized billiard balls or one small dove.

      ---

      --
      - Give a man a fire and he's warm for a day, but set him on fire and he's warm for the rest of his life.
  34. Einstein would be ROTFLHAO... by Anonymous Coward · · Score: 0

    From what I see here, few folks 'get' what these guys 'got.' They seem to have empirical evidence that randomness is an illusion, as per Einstein's own fervent declarations. I predict that upon closer examination, classic 'random' models of *anything* will be found to be seriously flawed. In Real Life, randomness does not exist. At All. Seemingly random events are the convergence of a literally infinite number of causations at one 'point' (again, a sucky term). You don't see those causations because they happen at all sorts of levels, from the quantum on up to that car coming at you. It's Mach's principle on acid. Make sense?

    1. Re:Einstein would be ROTFLHAO... by Anonymous Coward · · Score: 0

      Randomness is an illusion? Of course you can say something isn't truly random if it follows a distribution other than the uniform distribution. But just because it follows a distribution doesn't mean jack about perfectly identifying the next value of the random variable. The distribution only gives you hints about the likelihood of the random variable being some value. You are obviously one of the many folks that don't 'get' what these guys 'got.'

  35. Having trouble understanding the graph... by Bacteriophage · · Score: 1

    In the 1st article, there is a graph about midway that appears to illustrate the notion that, with the new curve, you are more likely to find the rarest creature than the least-rarest creature. I must not be interpreting right, and I tried reading that part a few more times.

    Also, unrelated to the above question, how come it took scientists so long to analyze the obviousness of the microcosm in such detail within the field of statistics? Shouldn't this have been obvious? Why do you think it wasn't? I have no clue.

    "There are no shortcuts to any place worth going."

    --
    "Be regular and orderly in your life, so that you may be violent and original in your work." -Flaubert
    1. Re:Having trouble understanding the graph... by Royster · · Score: 1

      If the curve represents a count of species, the most common species would be represented by a single point near the right hand side of the curve. The work is just an observation that, in certain apparently unrelated fields, a new probability distribution is operating.

      In my own field, the distribution of stock market returns is often taken to be log-normal. Unfortunately, under that assumption the extreme downturns that have actually been observed should be far too rare to occur with the frequency that they do. A new distribution that gives increased weight to rare events would be very useful.

      You ask, "Shouldn't this have been obvious?" No, not really. New distributions are not often found. One can mathematically derive any number of distributions, but they have little use unless you can find physical processes that exemplify them. With the development of chaos theory and fractal theory (the self-similarity referred to in the article) new physical processes have been defined. These have only been recognized in the last 25 years or so.

      --
      I have discovered a truly marvelous sig, unfortunately the sig limit is too small to contain i
    2. Re:Having trouble understanding the graph... by Anonymous Coward · · Score: 0
      Okay, the way I understand this is (probably an overly simplistic and/or inappropriate example. but...):

      You have a bag of coloured beads (yes, that one), say there's two yellow, fourteen black and eighty-four red, (100 in total) you pick one out at random, note the colour, and put it back. You repeat this for however long.

      Logically you'd expect an average of 2 in every 100 picked to be yellows, 14 in 100 to be black and 84 in 100 to be red (for an infinite sample), this fits the Gaussian distribution (as I understand this, correct me if I'm wrong). What I think they're saying is that they're getting values more like 3 in 100 yellow, 25 in 100 black and 72 in 100 red (also for an infinite sample, an eternal walk in the woods).
      I'm aware that what they're saying probably doesn't apply to such a simple system, but if you take the example of rare animals, when wandering through this habitat they recorded more sightings of the rare animals than they would expect to see given the number in the habitat, and recorded approximately the expected number of sightings of the more common animals.

      Now if it was just this you might be able to put it down to psychology and the researchers spotting the rare ones more often just because they are rare, and therefore more interesting and eyecatching. They've found it with turbulence though, which I imagine can give you a significant number of samples and produce meaningful results.

      Basically what they seem to be saying is that the logical probability of something happening is not necessarily the actual probability, and the more chaotic a system is, the closer it will adhere to the graph they gave.

      The only problem I can see with this is what happens when you start plotting the probability of a given system producing this graph :-)

      Could I italicise any more :-)

      Hudson (still waiting for my account)

    3. Re:Having trouble understanding the graph... by jafac · · Score: 1

      I wonder about that too, but there's a difference between a statistical graph and a graph of a mathematical function, and there's a disconnect between the two: the statistical graph doesn't give you a model for the underlying mechanism that causes the behavior. So while you may have a more accurate way to predict what the statistic is going to be, you still can't use this to predict the moment the earth starts shaking.

      "The number of suckers born each minute doubles every 18 months."

      --

      These are my friends, See how they glisten. See this one shine, how he smiles in the light.
    4. Re:Having trouble understanding the graph... by youngsd · · Score: 1

      Is it just me, or does the graph they showed look like a standard, everyday bell curve plotted on some kind of logarithmic scale? If so, then I wonder whether this is that big a deal -- instead of plotting along the X axis, you plot along the X^10 axis (or something similar) and you have the bell curve. Am I missing something?

      -Steve

      --
      Democracy is a poor substitute for liberty.
    5. Re:Having trouble understanding the graph... by Rares+Marian · · Score: 1

      It would be if you were measuring a system that interferes with itself.

      --
      The message on the other side of this sig is false.
    6. Re:Having trouble understanding the graph... by Anonymous Coward · · Score: 0

      I went back and had another look. Does seem that way, except you would have difficulty explaining a negative number on the x-axis. So this cannot really just be a log-scale Gaussian, as such a curve would leave points in the fourth quadrant. Just the same, it would be neat to see a fit of such a curve to these data points.

    7. Re:Having trouble understanding the graph... by youngsd · · Score: 1

      You're right. I always have been too quick to simply label things logarithmic versions of other things. Got to get over that. :-) It does have that smooshed-out logarithmic look though. Oh well.

      -Steve

      --
      Democracy is a poor substitute for liberty.
    8. Re:Having trouble understanding the graph... by trcull · · Score: 1

      I thought the graph worked like this:

      There are very few rare species and very few extremely common species and mostly sort of common species. I don't think the graph counts the number of *animals* IN a species, I think it counts the number of *species* IN an ecosystem.

      For example, a given ecosystem might have only 2 rare species, say a tree frog and a spotted owl, and only 2 common species, say a brown mouse and a blue jay, but a whole truckload of species that are neither rare nor common.

      What they're saying is that there are more different rare species than we think.

      IMHO

      --
      Programming is not a religion A hobby,job,profession,craft,art. But not a religion
    9. Re:Having trouble understanding the graph... by Squid · · Score: 2

      The graph makes more sense if you relabel its axes: x=number of individuals of a species, y=number of species with exactly that number of individuals.

      In other words: we aren't talking about the likelihood that you will encounter an individual of the species, we're talking about counting the species itself. A few really common species, a good spread of "average" species, and a few species represented by few individuals.

      'Course I could just be full of it. Wouldn't be the first time...

    10. Re:Having trouble understanding the graph... by Anonymous Coward · · Score: 2

      >In the 1st article, there is a graph about midway that appears to illustrate the notion that, with the new curve, you are more likely to find the rarest creature than the least-rarest creature. I must not be interpreting right, and I tried reading that part a few more times.


      It took me a minute, too - I'll try to distill my understanding into English. Assume that the rarity of a species is related to the number of times it is found (duh). The x-axis can be thought of as the number of findings of a given species. The y-axis can be thought of as the number of species that were found X number of times. Using the Gaussian distribution, you would expect a symmetric tail-off in both the more-rare and the less-rare directions from the peak value. {Yes, I know you can have asymmetric Gaussian distributions.} What this new curve is showing is that the tail-off is much less in the more-rare direction. In other words, assume the peak of the curve is at 100 sightings of a species, with a standard deviation of 10 sightings. You would expect some number of species to have 130 sightings (3-sigma). Under the Gaussian distribution, you would expect to see the same number of species that only have 70 sightings. This new distribution says that the number of species with only 70 sightings would be much higher than the number of species with 130 sightings.

      Fascinating - I will certainly have to explore this further.
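
      The 70-versus-130 comparison above can be checked numerically. This is only a rough sketch: the article does not give a formula for the new curve, so a Gumbel (minimum) density is used here purely as a hypothetical stand-in for "a curve with a fatter tail on the rare side". The names gaussian_pdf, gumbel_min_pdf, MEAN, SD and BETA are made up for the illustration.

```python
import math

def gaussian_pdf(x, mean, sd):
    """Density of a normal distribution with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def gumbel_min_pdf(x, mode, beta):
    """Density of a Gumbel (minimum) distribution, which has its long tail on the low side.

    Hypothetical stand-in for a skewed curve, NOT the curve from the article.
    """
    z = (x - mode) / beta
    return math.exp(z - math.exp(z)) / beta

MEAN, SD = 100.0, 10.0                  # peak and spread used in the comment above
BETA = SD * math.sqrt(6.0) / math.pi    # Gumbel scale that gives the same standard deviation

for sightings in (70, 130):
    print(sightings,
          gaussian_pdf(sightings, MEAN, SD),
          gumbel_min_pdf(sightings, MEAN, BETA))
```

      The Gaussian column should come out identical for 70 and 130, while the skewed stand-in puts far more weight on 70 than on 130, which is the shape of the argument being made.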

  36. Plagiarism by jafac · · Score: 2

    Isn't this just Murphy's Law?

    "The number of suckers born each minute doubles every 18 months."

    --

    These are my friends, See how they glisten. See this one shine, how he smiles in the light.
  37. Log Normal Linux by craw · · Score: 2
    How can one get a long-tailed statistical distribution as opposed to a symmetrical Gaussian distribution? There is one simple model that will generate this.

    Suppose that ppl's programming skills are statistically Gaussian distributed. These ppl then decide to produce a "new" OS called linux. The contributions of these ppl are then plotted up. One would find that the majority of ppl produced a lot of "minor" improvements, smaller programs, scripts, and responses on mailing lists. There would be a smaller group of ppl that contributed a lot of important stuff.

    This is the lognormal statistical distribution, IIRC. A bunch of ppl are capable of writing good code in support of this new OS. Unfortunately, only a smaller subset of these ppl have the time to work on the project for a long period of time. Then only a smaller subset of these ppl have the inclination to volunteer their services for this long period of time. Additionally, only a smaller portion of these ppl have the overall skills to do this. The result is that there are only a few ppl that have all of these attributes.

    Sorry for this simplistic explanation (it is late and I should really be sleeping now). A log normal is really a summation of normal distributions in log space (multiplication in regular space). Another way to view this is to ask yourself a bunch of statistical what if questions (the questions should really generate a set of answers that are Gaussian distributed). When you answer no then you are out of the game. More ppl are eliminated early.
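
    The "summation in log space, multiplication in regular space" idea can be sketched with a quick simulation. Everything here is an assumption made for illustration: each person's output is modelled as the product of 20 independent factors, each uniform on [0.5, 1.5], and product_of_factors is a made-up helper name.

```python
import math
import random
import statistics

def product_of_factors(rng, n_factors):
    """Multiply n_factors independent positive factors, each uniform on [0.5, 1.5]."""
    value = 1.0
    for _ in range(n_factors):
        value *= rng.uniform(0.5, 1.5)
    return value

rng = random.Random(0)   # fixed seed so runs are repeatable
samples = [product_of_factors(rng, 20) for _ in range(20000)]
logs = [math.log(s) for s in samples]

# In regular space the long right tail pulls the mean well above the median.
print("mean      :", statistics.mean(samples))
print("median    :", statistics.median(samples))
# In log space the same data is a sum of 20 terms, so mean and median nearly agree.
print("log mean  :", statistics.mean(logs))
print("log median:", statistics.median(logs))
```

    A few large products dominate the average while most values sit well below it, which is the "few ppl contribute a lot" picture.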

    1. Re:Log Normal Linux by PurpleBob · · Score: 1

      Oh great, you just HAD to bring Linux into this... now there's going to be a new kind of distribution wars.

      "I use Gaussian because it's mathematically pure!"
      "Fat-tails is a distribution for the REAL WORLD!"
      "Yeah, but fat-tails just copied Gaussian and added a little bit!"
      etc.
      --

      --
      Win dain a lotica, en vai tu ri silota
  38. In other words by rde · · Score: 3

    The new curve is broader and more gently sloping, suggesting that the rarest events occur more often than predicted by the bell-shaped curve.
    Or, as wizzards have known for years, million to one chances happen nine times out of ten.

    But seriously, folks. This reminds me a lot in terms of its applicability to pretty much everything of an article in New Scientist that I also found darn interesting.

    1. Re:In other words by Anonymous Coward · · Score: 0

      It's the same as the New Scientist article and the same as in the book "How Nature Works" by Per Bak. Gaussian is only useful for those rare large linear systems, homework problems, and academic publications.

      The self-similar power laws (event probability proportional to 1/event size raised to a power) they are talking about are good for the most common systems that are large, coupled, and non-linear, like the Internet.
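
      A small sketch of why a power law counts as scale-free, and how differently its tail behaves from a Gaussian one. The exponent ALPHA = 2 is a hypothetical value chosen only for illustration, and the code works with the tail probability P(X > x) of a Pareto distribution rather than the density.

```python
import math

def pareto_tail(x, alpha, x_min=1.0):
    """P(X > x) for a Pareto (power-law) variable with exponent alpha."""
    return (x_min / x) ** alpha if x >= x_min else 1.0

def gaussian_tail(x):
    """P(Z > x) for a standard normal variable."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

ALPHA = 2.0   # hypothetical exponent, chosen only for illustration

for x in (1.0, 10.0, 100.0):
    # Doubling the event size costs the same factor at every scale.
    ratio = pareto_tail(2 * x, ALPHA) / pareto_tail(x, ALPHA)
    print(x, ratio)

# An event of size 10: unlikely under the power law, essentially impossible under a Gaussian.
print(pareto_tail(10.0, ALPHA), gaussian_tail(10.0))
```

      The constant ratio is the self-similarity: the curve looks the same whether you zoom in on small events or large ones.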

    2. Re:In other words by SpinyNorman · · Score: 1

      If you're interested in this stuff then you should check out the work being done on complexity at the Santa Fe Institute, and also James Gleick's classic book on Chaos, and M. Mitchell Waldrop's on Complexity (I forget the full titles).

  39. Old Hat by bap · · Score: 1

    The Bayesian community has known about this for many years. It is a log Gaussian, which is the prior commonly used for SCALE PARAMETERS in Bayesian estimation. It is interesting that it applies to other scale parameters, but it's what you'd expect, not some big breakthrough.

    1. Re:Old Hat by Brett+Viren · · Score: 1

      I thought it was a Jeffreys prior one should use as a LIP for a scale parameter.

  40. Re:Baloney by richnut · · Score: 1

    Spoken like a true college student. Possibly even a grad student(?)

    Just because something does not fit the current model does not mean it's wrong (Well it does when you're in college).

    I bet you get A's, don't you? Fudge your lab data a lot?

    This might turn out to be on par with cold fusion or it might be significant. Let's wait for the additional research and find out.

    -Rich

  41. Another example by Anonymous Coward · · Score: 0

    The analysis of test data for composite materials, in order to determine a single value for yield strength to use for design purposes, uses the Weibull distribution. Metallic materials use Gaussian. If you are interested, track down copies of MIL-HDBK-5 (metallics) and MIL-HDBK-17 (composites) for aerospace applications.

  42. This is because of a crutch by color+of+static · · Score: 1

    If I remember back to my Random Processes class, not much really is Gaussian. There are two reasons that assumption is often made. The first is that we have so many tools that assume it, and they work OK if we are near it. The second is the abuse of the central limit theorem, which says (correct me here if I'm not precise) the sum of a large set of random variables tends toward a Gaussian distribution as the number of variables approaches infinity. The problem is that people tend to short the infinity part and exaggerate the "tends to" part.

    What we really need to do is stop teaching statistics classes that depend on a gaussian distribution. Down with standard deviations:-).
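
    The "tends to" part can be made concrete with a small simulation. This is a sketch under assumptions of my own choosing: the summed variables are independent exponentials (a strongly skewed distribution), and skewness and sums_of_exponentials are made-up helper names. A Gaussian has skewness 0, so the printed number shows how far each sum still is from symmetric.

```python
import random

def skewness(values):
    """Sample skewness: third central moment divided by the variance to the power 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

def sums_of_exponentials(rng, n_terms, n_samples=20000):
    """Draw n_samples sums, each adding n_terms independent exponential variables."""
    return [sum(rng.expovariate(1.0) for _ in range(n_terms))
            for _ in range(n_samples)]

rng = random.Random(0)   # fixed seed so runs are repeatable
for n_terms in (1, 5, 50):
    print(n_terms, round(skewness(sums_of_exponentials(rng, n_terms)), 2))
```

    In theory the skewness of such a sum falls off like 2 divided by the square root of the number of terms, so even 50 terms leave a visibly lopsided curve, and the tails are the last part to fall into line.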

    1. Re:This is because of a crutch by fart_face · · Score: 1

      You are not necessarily required to use a true Gaussian curve in statistical analysis, if my memory serves me well.
      As I recall, we were encouraged to map our distributions manually, to discover the shape of our curves, which resemble the Gaussian curve, but are skewed in one direction or another.
      The curve in the article looks much more like some of the curves we'd come up with, but the exercise was to demonstrate that the larger your sample, the more your curve began to look like the classic Gaussian curve.
      Remember, non-math people (like me), that all a statistic can show is that something happened that cannot be attributed to random chance alone. That's it. Naturally, the closer your sample is to reality, the more you can be sure that you have results that are statistically significant, and the more discernible this will be when compared with the Gaussian curve, which is intended to be a close approximation of what you get with a distribution of truly random events. The goal of using statistics is to attempt to prove a correlation between conjecture and reality. Statistics is the only way we have of doing this.
      Everyone should get a copy of "How to Lie with Statistics", because it explains much better than I can what exactly statistics actually "prove".

  43. Re:Another thought: When one side is near saturati by incubus · · Score: 1

    The infinitely defined portion of the Gaussian curve doesn't add anything... because as x -> infinity, gaussian(x) -> 0 much faster... off the top of my head, I can't remember the expression for the Gaussian though.

    Not every pseudo-random event that you plot will produce a Gaussian curve... tracking the rolls of a die will be just a flat linear curve, while tracking the sum of two dice rolled together will produce a Gaussian bell curve...
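
    The dice example can be enumerated exactly rather than simulated. A minimal sketch (sum_distribution is a made-up helper name): it counts every equally likely roll, so there is no sampling noise. Strictly speaking, two dice give a triangle rather than a true bell; the curve only rounds off as more dice are added.

```python
from collections import Counter
from itertools import product

def sum_distribution(n_dice):
    """Exact count of each total over all equally likely rolls of n_dice six-sided dice."""
    return Counter(sum(roll) for roll in product(range(1, 7), repeat=n_dice))

for n_dice in (1, 2, 4):
    counts = sum_distribution(n_dice)
    print(n_dice, [counts[total] for total in sorted(counts)])
```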

    There's also more fun you can have with a 'Lorentz' distribution.... as well as however many other distributions there are out there.

    If I remember correctly though, a poisson distribution is just a discrete gaussian distribution. Basically for n infinity.

    This article is nothing in and of itself.

  44. link to the scientific paper(s)? by jab · · Score: 1

    Could someone please provide a link to the actual scientific paper(s)?

    1. Re:link to the scientific paper(s)? by Axe · · Score: 1
      Articles:

      Trails of the Wandering Albatross

      Applying the mathematics of haphazard motion

      The flight patterns of the wandering albatross appear to fit a special type of random motion called a Levy flight, which may also prove useful for describing aspects of heartbeat rhythms and other biological phenomena.

      References:

      Klafter, J., M.F. Shlesinger, and G. Zumofen. 1996. Beyond Brownian motion. Physics Today (February):33.

      Viswanathan, G.M., et al. 1996. Levy flight search patterns of wandering albatrosses. Nature 381(May 30):413.

      Further Readings:

      Gillespie, D.T. 1996. The mathematics of Brownian motion and Johnson noise. American Journal of Physics 64(March):225.

      Hayes, B. 1994. Nature's algorithms. American Scientist 82(May/June):206.

      Lavenda, B.H. 1985. Brownian motion. Scientific American (February):70.

      Mandelbrot, B.B. 1983. The Fractal Geometry of Nature. New York: W.H. Freeman.

      Peng, C.-K., et al. 1993. Long-range anticorrelations and non-Gaussian behavior of the heartbeat. Physical Review Letters 70(Mar. 1):1343.

      Shlesinger, M.F. 1989. Levy flights: variations on a theme. Physica D 38:304.

      Slade, G. 1996. Random walks. American Scientist 84(March/April):146.

      Worsley, K.J. 1996. The geometry of random images. Chance 9(No. 1):27.



      --
      <^>_<(ô ô)>_<^>
    2. Re:link to the scientific paper(s)? by sgs · · Score: 1

      The "magic word" that you need for the search engines is "infinite variance".

      A quick look in AltaVista gives lots of hits, including a 1994 textbook.

      This ain't fresh news.

  45. stats all over again! by Rev.Servok · · Score: 1

    I wonder how this will affect the whole field of data analysis? If this curve proves to be pretty common, then wouldn't it affect the assumption (in at least the social sciences) that your distribution is normal (i.e. as in z & t-tests and ANOVA tests)? I guess if you can't assume normalcy then you will need to find other analysis techniques.

    --
    -------------------------------------------- It looks just like a Telefunken U-47! -Frank Zappa
  46. not really too surprising by melancholy_dane · · Score: 1

    This really isn't the biggest discovery ever. In fact what they've accomplished is to rediscover the base assumptions of the bell curve. The normal curve (bell curve) is a product of stochastic interactions between atomistic events; it pretty much only reflects behavior in systems where new actions are not affected by the history of the system. If you have a saturated system (like the ground being unable to absorb more water in the case of the flooding example) you've got a messed up curve. Any decent book on statistics will give you the basics about this.

    --
    -m.d.
  47. Re:Statistical Mechanics by Anonymous Coward · · Score: 0

    That's a very good question. Am I the only person who noticed that this curve - at least what we can see of it - looks very much like the black body radiation curve? If any one knows of a way to get a more complete graph, I would be very interested in seeing it.

  48. fsck scheduling by iainh · · Score: 1

    Least likely events happen when you least need them.

  49. Gaussian curve is mathematics and therefore right by Anonymous Coward · · Score: 0

    You can't prove established, proved mathematics wrong, because it doesn't claim anything about reality. It only states that, given some assumptions, the Gaussian distribution emerges. If the assumptions do not fit a certain application, then the Gaussian curve does not necessarily emerge. Assumptions behind the Gaussian curve are very general but certainly not universal. For sure there are phenomena that do not follow the Gaussian curve. (Roughly speaking, if you have lots of independent random variables, their sum follows the Gaussian distribution. That's why it emerges in so many places: most things we measure have effects of countless independent error sources included in an additive manner.) When you sum up those random variables, it's usually the tails which converge most slowly towards the Gaussian distribution. It's therefore easy to find examples where the tails behave in a non-Gaussian way. (No, I didn't read the article.)

  50. Chaos again by Anonymous Coward · · Score: 0

    Geee! Scientists have discovered chaos again (and again and again).


    Genesis, Wind and Wuthering,

    MartinS

  51. Very badly described - read this for clarity. by Anonymous Coward · · Score: 0

    In the figure given, the x axis (horiz) [if it is indeed a bell curve] is the value you are measuring, e.g. 'height of person' or 'percentage of slashdot readers in a given area who are mathematically incompetent' (which is actually quite high imo - computer literacy != mathematical competence, as I have frequently discovered). The y axis is how often that value of the measurement is found. So the least frequently found measurements are at either end of the x-axis (i.e. huge people or tiny people), where the y-value is going to be low (e.g. no one less than 2 inches high or more than 50ft high). Hope this clears up the facts a bit.

  52. This is just Poisson curve - 200 years old. by Anonymous Coward · · Score: 0

    Pretty stupid claim that the distribution is wrong. Gaussian is only an approximation, usually for infinite statistics. When you have a finite one you should use the Poisson distribution, and this is what their curve looks like. I am sure in the research they will find the binomial and exponential distributions next.
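
    A quick way to see the Poisson-versus-Gaussian point is to measure how lopsided a Poisson distribution is at different mean counts. This is only a sketch: the mean values are hypothetical, poisson_pmf and poisson_skewness are made-up helper names, and nothing here says the curve in the article actually is a Poisson.

```python
import math

def poisson_pmf(k, lam):
    """P(K = k) for a Poisson variable with mean lam, computed in log space."""
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))

def poisson_skewness(lam):
    """Skewness from the pmf; sums far enough out that the cut-off tail is negligible."""
    k_max = int(lam + 40 * math.sqrt(lam) + 40)
    third_moment = sum(poisson_pmf(k, lam) * (k - lam) ** 3 for k in range(k_max + 1))
    return third_moment / lam ** 1.5

for lam in (1.0, 4.0, 100.0):   # hypothetical mean counts
    print(lam, poisson_skewness(lam))
```

    The skewness of a Poisson is 1 over the square root of its mean, so at small counts the curve has a noticeably long right tail, and it only becomes symmetric (Gaussian-looking) as the mean count grows.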

  53. Conspiracy theories... by Anonymous Coward · · Score: 0
    As a number of readers have pointed out, this isn't a world-shaking issue.

    The interesting issue here is the reporting of the science. It makes a good story if you can paint a picture of a brilliant free-thinking scientist being oppressed by a conservative scientific majority. And so even the most trivial new theory often gets reported in this light. Think about it - more than half the scientific articles you read in the non-scientific press take exactly this form.

    And in all the cases in fields I understand, either the theory is reasonable but the story has totally misrepresented it, or the proponent is a crackpot after publicity.

  54. Is this what I think it is? by edheil · · Score: 1
    It sounds like what they're talking about is the Self-Organized Criticality model. (Do a Google search on Self-Organized Criticality -- you'll get tons of references.)

    The difference between this and the gaussian model is that with the gaussian you are merely dealing with the summed behavior of a large number of independent variables. With the SOC model you are dealing with a particular pattern -- the frequency of changes as a function of their magnitude is described by a power law. It's not just a bunch of stuff happening randomly, it's a particular state in which the rarity of a change is correlated in a precise way to its magnitude.
    If I'm correct and that *is* what they're talking about, this isn't all THAT new. I have a neuroscientist friend who's been working on applying the SOC model to brain function with some success for a couple years now.

    But the article is vague enough that it's not totally clear that's what they're talking about.

  55. This is new? by Anonymous Coward · · Score: 0
    Did I just fall asleep for the last 20 yrs? Stuff like self-similarity, fractals, 1/f^a and chaos is not exactly new, nor are its applications. This article makes it seem like this is hot new sh*t; it's not.

    BTW, the "new" curve that they show looks like a Rayleigh or a lognormal distribution.

  56. This says *absolutely* nothing by hawk · · Score: 2

    The authors have found things that were mismodeled as Gaussian and instead follow another distribution. So what? There are plenty of distributions besides the normal that are asymmetric and have fatter tails.

    It *may* be that they've found another distribution that appears in multiple fields, but there's not enough here to judge this as a statistician. If it has any parameters beyond mean and variance, I'm not likely to be impressed--I can probably produce a three parameter beta distribution that's close.

    hawk,wearing his Ph.D. statistician hat for the moment

    1. Re:This says *absolutely* nothing by Anonymous Coward · · Score: 0

      This guy got it right. You can stop reading the thread now.

  57. fractal, anyone? by chocolatetrumpet · · Score: 1

    Isn't this whole thing just a fractal?

    --
    Spoon not. Fork, or fork not. There is no spoon.
    1. Re:fractal, anyone? by smale · · Score: 1

      amen brother

      http://www.indep.k12.mo.us/THS/student/aforrester/mandelbig.html

  58. Self-Similar is key by astrophysics · · Score: 1

    The key that some people seem to be ignoring is that this is only for self-similar phenomena. This gives rise to the asymmetry and emphasis for rare events as compared to the gaussian distribution. The assertion is not that a Gaussian is incorrect, but that it does not accurately model certain self-similar phenomena.

  59. Distribution by jovlinger · · Score: 1

    Ok, I'm getting "arbetsskada". I read that and thought to myself, "Sweet name for a new distribution of linux."

    Slap.

  60. News for non-mathematicians, stuff that ... by Anonymous Coward · · Score: 0

    Hey Jeff, you have misunderstood the article. The article that you read was not very interesting; it was _vague_ reading made for people who do not understand maths. I strongly disagree with the topic: "Gaussian Distribution questioned"... The article was clearly pointless and the writer clearly didn't understand the subject. The Gauss distribution is what it is, and its rigorous construction cannot be undone by any new curve that predicts some events better than the Gaussian does.

  61. Relevant Sci. American article: COBE not Gaussian? by MobiusKlein · · Score: 1

    The September Sci American has an interesting blurb about a possible non-Gaussian distribution of the Cosmic Background radiation.

    http://www.sciam.com/1999/0999issue/0999scicit5.html

    Unfortunately, it does not reference the papers that it is based on. Sigh.

    MobiusKlein.

  62. crappy graph by Anonymous Coward · · Score: 0

    Not really your fault. It was labelled poorly (as others have explained) and it was misleading: the two curves that were being compared contained two different areas.

  63. Re:Baloney by Anonymous Coward · · Score: 0

    Given an infinite number of samples, the data will perfectly fit some distribution (maybe not any distribution we commonly use or even know about). So unless you are talking about infinite samples, there's no way in HELL that you're going to have your data ever perfectly fit a probabilistic distribution.

    All too often... well I do it too... researchers use the Gaussian distribution because it's very easy to use (only two moments). So the Gaussian distribution isn't wrong, it just isn't the correct choice of distribution for that particular experiment.

  64. me2 by Anonymous Coward · · Score: 0

    I saw the New Scientist article, too; it claims that Benford's Law describes the (only) scale independent distribution. Another description can be found in this Mathland column.

  65. Use the Web, Luke. by Anonymous Coward · · Score: 0

    You know, Web search engines are useful for this kind of thing. The Donald L. Turcotte homepage seems wildly out of date. But he also wrote a textbook explaining all this.

  66. Heart Data by mccaffer · · Score: 1
    I am currently studying ECG data from hearts. We have shown that a healthy heart follows just such a distribution as that mentioned by the article in the Financial Times.

    The interpretation of this is that the heart is in a self-similar state, that is, all lengths of time between heart beats occur, at all scales - the distribution of which is a power law. The heart is in a similar state to a condensed matter phase transition, that is, its control mechanism keeps the heart in a critically balanced state, ready to change period rapidly.

  67. Found Some of Scholars Mentioned by Anonymous Coward · · Score: 0
    I think I found some of the scholars mentioned in the article:

    Donald Turcotte

    Dr. Steve Bramwell

    John Harte

  68. Re:"universal" curves - a "universal" example by bogado · · Score: 1

    I guess that the difference with your example is that the equations used to explain the experiments you listed are rather basic ones. As I understood it, the curve described in the article is very complicated, and it would make sense for it to explain just one specific experiment well. But it explained a lot more than expected; actually, it turns out that it could be a basic curve as well.

    --
    "take the red pill and you stay in wonderland and I'll show you how deep the rabitt hole goes"

    --
    []'s Victor Bogado da Silva Lins

    ^[:wq

  69. Not quite the same thing? by Anonymous Coward · · Score: 0

    Equipment failures in general are generally assumed to follow the Weibull distribution, which has already been discovered (by Weibull)

    jsm

  70. not lognorm by Anonymous Coward · · Score: 0

    Indeed I suspect that they just have some variation on a lognormal curve

    Prob not lognorm, as it has negative numbers on it, but I would guess that it can be put together from beta and gamma functions. More generally, I wonder what the properties of this density function are? Social sciences (or at least, economics fer sure) don't really assume that things are distributed Gaussian -- they just recognise the fact that if you use some sort of screwy distribution that doesn't add, subtract and scale like the Gaussian, you're going to be an old man before you get a working model. On the other hand, if the new curve does scale & add, that would be a discovery ....

    jsm

  71. Why the Gaussian isn't dead by Anonymous Coward · · Score: 0

    Given an infinite number of samples, the data will perfectly fit some distribution (maybe not any distribution we commonly use or even know about).

    Well, actually, given an infinite number of samples, the data will (must) fit a Gaussian distribution. All distributions converge in the limit to the normal. It's called the "central limit theorem", and I don't think that it's going anywhere soon, no matter what the "self-similarity" bunch say.

    jsm

  72. Other interesting facts in the article... by Anonymous Coward · · Score: 0

    1. The earth is flat
    2. Sun revolves around the earth
    3. Moon landing was faked
    4. Elvis is not really dead
    5. Pink teletubby is really gay

  73. Re:Baloney by mizerai · · Score: 1

    You're right, the journalists missed the point. The part that's "wrong" is the over-application of the Gaussian distribution as a model of everything.

    --

    --Mizerai

  74. References anyone? by Anonymous Coward · · Score: 0

    More crap journalism. No references, as far as I can see, to the original source of the alternative curve so that we can find out what it's really about. Anyone know of a better reference or even the original publication?

  75. I think the authors missed the real point. by Dastardly · · Score: 1

    This doesn't prove a Gaussian curve is "wrong". What it is saying is that the new data is evidence that a Gaussian distribution is probably the wrong model to be using under the circumstances described. The theory is that self-similar phenomena do not follow a Gaussian distribution, but follow this new distribution. It seems to me that the deep mathematical analysis has not really been done, but the experimental evidence suggests the existence of this distribution. There is probably a lot of work ahead in coming up with a mathematical model for the new distribution. What would be real interesting would be if the mathematical model for the distribution reduces to Gaussian under specific conditions, kind of like how special relativity reduces to classical mechanics at low speeds.

    The biggest implication of the model is in the insurance industry. If it is found that floods, fire, earthquakes, and hurricanes follow the new distribution, it may allow insurers to go back to insuring against earthquakes and hurricanes because they can actually predict long term income and expenditures more accurately. Maybe they will actually do their job instead of claiming hardship whenever a disaster strikes somewhere.

    Insurance Exec: Oh wahhh!!! We can't pay a billion in claims, go to the government.

    Translation: We have taken in a net profit of 2 billion dollars over the last two years. But, the billion dollars for this disaster will affect our earnings numbers for the next quarter or two and my stock options will be worthless.

    1. Re:I think the authors missed the real point. by drstatgeek · · Score: 1
      Most people only know about the existence of one distribution: the normal distribution (aka bell curve, Gaussian distribution, many others). The fact is that statisticians routinely use quite a few different distributions, some (called "heavy-tailed") which are used to simulate "rare events."

      A few comments on the article:

      1. Nothing follows the Gaussian distributions (the "normal distribution" is really a family). This is something humans constructed to understand their world. By the same token, nothing follows this new distribution (yet another page in my handbook, I guess). They just do "well enough" (in addition, the normal distribution is the easiest of all to work with).
      2. Why is this a big deal? In fact, who was using the normal distribution to predict earthquakes and airplane crashes, anyway? There are different, more realistic models for this; a lot of studies are being done on "heavy tails" for rare event prediction, etc. etc. Again, just another distribution in the statistician's handbook.
      3. For that matter, the use of the Gaussian (and all the models that assume it) need to be re-examined in other fields. E.g. there is evidence that indicates that intelligence follows a bimodal distribution (many people tend to be near one of two "areas" of intelligence, instead of near one grand average like the Gaussian case).

      I guess this article shows the general public's lack of understanding of statistics, not that statistics is that easy to understand in the first place.

      --
      -drstatgeek (close enough, at least ...)
  76. You're Surprised?? by BadlandZ · · Score: 2
    Come on now people... Forgive me, but this is hardly shocking.

    I looked over the articles, and all I can say is "So what?" The Gaussian distribution is based on pure randomness. Did you expect everything to be a completely random event?

    Neither article seems to go into great detail about how the new curve was calculated, but it's simply a _FACT_ that applying the Gaussian distribution to most events is considered a "simplification" of the problem, assuming it's random. Take away some randomness, and of course the Gaussian distribution won't fit.

    Intelligence (however measured) will not be purely random, nor will floods, grade distributions, tornadoes, or anything.

    What's missing from both of these pieces is an explanation of the way the new curve was built, and on what foundation. The Poisson (spelling is way off there) distribution is frequently used in place of the Gaussian because it "fits better," but again, that doesn't prove that the events have much to do with the math.

    This is a case of "curve fitting gone wild" here, and unless I can see someone spell out in scientific detail the relationship between the events and the distribution, I don't buy it. So, they have a new equation, and a new curve; it doesn't mean that the events are related to the math directly. If you look for anything hard enough, you will start to find it everywhere.

    I do award them credit for a new curve that better fits some models, if the equation for their curve is manageable. If it's a complex equation, it's worthless, because the whole point is to make some equation fit a distribution of events. If theirs fits, and it's easy to calculate, it's beneficial. But it does not imply a direct correlation between the functions and the variables in the distribution. How do I explain this in SlashDot terms... (/me gets frustrated).

    Ok, take Moore's Law, you all know that, right? Processor power doubles every 18 months? Or, more accurately, I believe he stated something to the effect that the number of circuits would double every 18 months. Well, a loosely fit exponential function will almost match this trend (roughly). But then you have to "adjust" the month scale between 12 and 24 until the curve fits well. Now, that's a "model", but it does not prove scientifically that circuits and design engineers are behaving exactly as can be predicted. At some point in the future, everyone has predicted, Moore's Law will fail. See... It's a model! Curve fitting.... It doesn't PROVE anything about what's going on in developers' minds, or much tangible other than the "estimation" that things will get more powerful in the computing world.

    Now, take it a step further: say Moore's Law fails right as people develop a new method of increasing computing performance, like say 3D circuits, or something not yet conceived, and with fewer "countable circuits" you get more performance. Suddenly, new devices start to have a few less circuits, and more power. Now the Moore's Law curve goes down, slowly at first, leveling off, and maybe dropping just a tad, and it starts to look like a "bell-shaped curve" only half drawn. You could go "curve fitting crazy" and say "Hey, it's Gaussian, it's going to go down now, and within another 15 years, we will all be back to 8-bit processors!" That's just idiotic.

    In short, curve fitting is useful to predict many things, but it cannot be assumed that the curve implies natural phenomena. Any curve that fits data is useful. A curve that fits data does not directly imply complete correlation of events, or definitive proof that God does or doesn't play dice (hope he does personally, has to have fun sometime!). And furthermore:

    For those who continue to doubt that it could all be so simple, Prof Turcotte has a suitably direct response. "People say: 'You can't do it because it's too complicated a problem'," he says. "We say: 'Just look at the data'."

    So his data fit, so what? Any reasonable math whiz should be able to come up with a few dozen equations that fit a line. Doesn't prove a thing.

    Forgive my typos, bad grammar, and spelling; I got pretty pissed at tabloid junk science, and I had to vent. Feel free to prove me wrong. I would like to see how you can prove the new equation and chaos theory is the best "insight into the universe" we have... BTW, if you can prove it, you'll probably be up for a Nobel Prize too.

  77. Re:Baloney by ruff · · Score: 1

    So the Gaussian distribution isn't wrong but it just isn't correct selection of distributions for that particular experiment.

    My point exactly- the people doing that research probably never intended for the article to lean towards a "Scrap Gaussian! Look at us!" thesis, but that's what happens from time to time when you get journalists in the act of "reporting" science.

  78. Crutches and probable hype over the mundane. by Claudius · · Score: 1

    Statistics classes which teach Gaussian distributions are fine provided students learn what is required for the central limit theorem to apply. I think you may be mistaken regarding the "noninfinite" part. We can quantify the fluctuations from a Gaussian law in a rigorous manner in cases where a noninfinite number of variables are concerned. The real difficulty, on the other hand, is that people sometimes apply the CLT when it is not valid.

    You should add "...approaches infinity, provided the fractional contribution from any one random variable to the sum uniformly converges to zero in the limit as N -> infinity." This is an important distinction. For instance, Levy distributions are a class of stable limit laws for which this is not the case--the largest variable in the sum can in fact dominate the sum. Symmetric Levy distributions may superficially resemble Gaussian laws, but with tails that decay slower (like power laws rather than exponentially fast).
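    A rough numerical sketch of that distinction, not taken from the article or the book cited below: it compares how much of a sum is contributed by its single largest term, for a light-tailed variable (exponential) and for a Pareto variable with tail index 0.5, which is in the Levy regime. The function names, sample sizes, and seed are my own arbitrary choices.

```python
import random

def max_share(draw, n, rng):
    """Fraction of the sum of n positive draws contributed by the largest draw."""
    xs = [draw(rng) for _ in range(n)]
    return max(xs) / sum(xs)

def exponential(rng):
    # Light tail: finite mean and variance, so the CLT conditions hold.
    return rng.expovariate(1.0)

def pareto_heavy(rng, alpha=0.5):
    # Pareto tail with alpha < 1: the mean is infinite and sums fall in
    # the domain of attraction of a Levy-stable law, not the Gaussian.
    return rng.paretovariate(alpha)

def median_max_share(draw, n, trials=200, seed=0):
    """Median over many trials of the largest term's share of the sum."""
    rng = random.Random(seed)
    shares = sorted(max_share(draw, n, rng) for _ in range(trials))
    return shares[trials // 2]

light = median_max_share(exponential, 2000)
heavy = median_max_share(pareto_heavy, 2000)
print(f"median share of largest term, exponential: {light:.4f}")
print(f"median share of largest term, Pareto(0.5): {heavy:.4f}")
```

    For the exponential case the largest term is a negligible slice of the sum; for the heavy-tailed case one draw typically accounts for a large fraction of it, which is exactly the situation where the CLT condition fails.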

    This article is amusing if only because it is a nostalgic throwback to the days of P.R. and hype over "chaos theory." Call me dense, but I don't understand why something as simple as scale-invariance needs to be dressed in the extra jargon and hype. Assuming the author did not miss anything terribly fundamental, I don't see anything novel in what was reported. Perhaps someone in the know can fill me in on just how exactly this turns statistical physics on its head?

    [For those who are interested, Levy distributions are treated quite adequately in Limit Distributions for Sums of Independent Random Variables (Gnedenko and Kolmogorov) (c)1954].

  79. Exactly my thought by tilly · · Score: 1

    I had not heard about the New Scientist article, but I have known about Benford's law for some years. Indeed without a description of what curve these people think that they have, I am not sure how it *differs* from Benford's law in practical import!

    Indeed I suspect that they just have some variation on a lognormal curve. (Which does indeed show up in many different places.)

    Incidentally, one of the few things that I disagree with in Knuth is his presentation on Benford's law. Sure, the toy mathematical model he generates is fun and all, but he says nothing about why it applies to the real world. And hence his "proof" says nothing about why real numbers that appear in real computers follow Benford's law. I personally find the general explanation in the article you listed to be far more convincing...

    Cheers,
    Ben

    --
    My usual seat in the cluetrain is at http://pub4.ezboard.com/biwethey.ht
  80. Graph is incorrect by Cantara · · Score: 1

    Notice that they place 'most common' on the rightmost of the graph, instead of in the center of the curve.

  81. This is old, old, old... by Anonymous Coward · · Score: 0
    Even statistics texts will tell you that the value of the central limit theorem (that the distribution of sums of independent random variables tends towards the Gaussian distribution) is more theoretical than practical. Nothing new here. As for the role of self-similarity in statistics, there is a mathematician by the name of Benoit Mandelbrot who has been looking into this since 1963 (according to his recently issued collected papers).

    P.S. While I generally try to be tolerant of the differences between British and American English, "maths" really, really grates on my nerves. I wish they'd learn proper slang.

  82. The GOE isn't new... by HiRes · · Score: 1

    Actually, I was wondering if it weren't related to the Gaussian Orthogonal Ensemble (GOE) distribution, which was a result of much of Wigner's work pioneering Random Matrix Theory (RMT) decades ago.

    Mathematically, the GOE distribution characterizes the eigenvalues of a Gaussian distribution of orthogonal matrices containing random elements. (Forgive me if I've got the math a bit wrong; I'm a physicist by trade...)

    Physically, the GOE distribution has been popping up in increasingly many physical systems for a while now. Years ago (maybe by Wigner himself? not sure) it was noticed that the energy level spacings of atomic nuclei have statistical properties consistent with the GOE distribution. Some time later, people fooling around with microwave cavities began seeing these distributions as well. The quantum dot folks have also run into the GOE distribution, I believe.

    The GOE distribution seems to provide a good test for broken symmetries in a system. As a system's symmetry is gradually broken by, say, shaving off a corner of a piezoelectric crystal, the statistics followed by the eigenvalues (in this example, the resonant frequencies) gradually shift from GOE to Poisson, the latter which characterizes the eigenvalues of a truly random system.

    Now, two really cool things about the apparent universality of the GOE distribution are:

    • The distribution is parameter-independent, and does not contain any information about the system being analyzed. I can glom together energy level data from many different nuclei and still obtain the same GOE distribution.
    • There appears to be a connection to chaos.

    Neat, chaos! Well, sort of. If you take a classically chaotic system, say, a Sinai billiard, and quantize it (solve the Schroedinger equation), time after time you will discover that the eigenvalues of the quantized system have these nice statistical properties that happen to fall out of RMT, namely, the GOE distribution.

    So does that mean all quantum systems that follow GOE statistics are chaotic? No. In fact, it's difficult to define what "chaos" really means for a quantum system that has no classical analog. But it implies there's a connection, it certainly is fun to think about, and perhaps continued research will reveal a deeper universal phenomenon at work. I wonder if these researchers haven't taken another step in that direction.

    Dang, I wish I had something up on the web about how my research relates to all this... well, you can email me.

    --
    wcb
  83. Re:How does this relate to standard deviations? by Oms · · Score: 1

    First, let me say that the graph in the article is poorly labeled (or at least their example


    Not to mention that the area under the new curve in their graph is significantly more than that under the bell curve. Which means that the total probability is above 1. To use their example, we have a very neat species distribution, say 50% wolves, 50% rabbits, and 30% bears, for a total of 130%... My question is, is the Financial Times always that bad at math?

  84. Everything doesn't fit in nice places by Status+Quo · · Score: 1

    Maybe in the long term, this will be "proven" or "disproven," but does anybody remember the phrase "Assume a point mass ..." or "Assume a sphere ..." from way back in school? What happens in the long run is that it's never as nice as you'd like. That's why there are other distributions than the Gaussian. Why should we assume that everything that can be found has already been found? Take the time to think, folks. Maybe this is like cold fusion, but maybe it's like the transistor for model prediction. More advanced, more accurate models could be the result.

    --
    I'll never be as good as I want to be. I can only be as good as I am.
  85. Einstein by Terao · · Score: 1

    If memory serves me, the "God doesn't play dice" line came from a dispute Einstein had with Niels Bohr, where Einstein didn't believe in the randomness of quantum mechanics but was proven wrong later: God does play dice.
    Did I remember correctly?

  86. Re:Aren't you guys missing a point? by Aos · · Score: 1

    That's why I said "the article claims they use Gaussian for this and that". I thought someone will have more insight. Yeah, I also heard that insurance companies employ a few good men/women. I would imagine they would use some extrapolated curve based on claims data for a century back or something, not just some silly distribution.

    In which case, you're absolutely right: WTF is the target audience for the article? Someone speculating on insurance companies stock?

  87. Who is Mandelbrot? by anewc2 · · Score: 1

    One of the original researchers in this field is Benoit Mandelbrot, who applied it to financial markets, showing that price changes are not (as is frequently assumed) gaussian. Why the Financial Times did not pick up on this angle is beyond me.

    --
    As the mystic said to the hot dog vendor, "Make me one with everything"
  88. Re:Psychohistory? by NYFreddie · · Score: 1

    Yes, I realize that it is statistics. However, it deals with determining what seemingly random events are most likely to happen everywhere, not in a single closed environment. Statistics are compilations of data. How that data is used is not statistics. "1 out of 10 blah blah blah" is spouting of statistics. What causes that 1 out of 10 is not statistics. That's the gist of this article.

    The reason I spoke of Psychohistory is because it is supposed to be using the statistics on an advanced probability engine. This is a step toward that equation. The more refined we get to deciding that "1 out of 10 blah blah blah because [insert reason]" is the closer we get to figuring out the universe and how humanity acts as a whole.

    -NYFreddie

    --
    Barbie of Borg - She doesn't just Assimilate, She Accessorizes too!
  89. We're ASSuming A LOT here! by Anonymous Coward · · Score: 0

    One thing that no one has mentioned is the possibility that this technology could be applied to anything that uses Gaussian distributions to analyze data. That means compression routines like .Z, .jpg and .png could be improved. That is, of course, if the article isn't just a bunch of fluff....

  90. Is it real or perception? by Anonymous Coward · · Score: 0

    As I read this I wonder if the curve is a real distribution or whether it just models the human mode of perception. We are more likely to notice rare events than common ones. For example, take a field of flowers. It's 99% green, but what we notice are the colored bits...

    1. Re:Is it real or perception? by Anonymous Coward · · Score: 0

      boy thats a question going back a few thousand years? hmmm anyone up for aristotle?

  91. Re:Even more universal by Anonymous Coward · · Score: 0

    Chemical engineers study a similar relationship between different systems, in a course often called Transport Phenomena after the name of a classic textbook on the subject. Replace the electrical bits in this explanation of System Dynamics with heat flow, and change the names of some terms (since it's not electrical-based), and you've got Transport Phenomena.

  92. Gaussians can't be scale-invariant. by Anonymous Coward · · Score: 0

    It's not exactly surprising that Gaussians aren't scale-invariant. Scale invariance requires that the distribution obey a property like:

    f(ax) = b*f(x), where a is an arbitrary scale and b depends only on a.

    Gaussians ( e^(-cx^2) ) clearly can't meet that, because rescaling x changes the exponent by an amount that depends on x. Certain hyperbolic distributions (e.g. 1/(1+cx), for large x) meet this.

    It's a pity the article didn't give the function they used.

    It's also a really common mistake to use a Gaussian to describe a process where there are limits which make it invalid. Examples are the distribution of intervals in a line ( x e^(-x) ) or a Rayleigh distribution for noise intensity ( x e^(-ax^2), x >= 0 ). The Gaussian expectations will be systematically wrong for these cases.
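    The scaling property above is easy to check numerically. This is only a sketch: the pure power law x^(-k) stands in for the "hyperbolic" case, and the exponent k, the constant c, the scale a, and the test points are arbitrary values picked for illustration.

```python
import math

def power_law(x, k=2.5):
    # Pure power law: f(a*x) = a**(-k) * f(x) for every x > 0.
    return x ** (-k)

def gaussian(x, c=1.0):
    # Unnormalized Gaussian shape e^(-c x^2).
    return math.exp(-c * x * x)

def scaling_ratios(f, a, xs):
    """Return f(a*x) / f(x) at each x; scale invariance means all ratios agree."""
    return [f(a * x) / f(x) for x in xs]

xs = [0.5, 1.0, 2.0, 4.0]
a = 3.0
power_ratios = scaling_ratios(power_law, a, xs)
gauss_ratios = scaling_ratios(gaussian, a, xs)

print("power law:", ["%.6g" % r for r in power_ratios])
print("gaussian: ", ["%.6g" % r for r in gauss_ratios])
```

    The power-law ratios come out identical at every x (that common value is b), while the Gaussian ratios change by many orders of magnitude as x moves, so no single b exists.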

    Just my half-cents.
    rob
    (harrison@asterix.jci.tju.edu)

  93. Hurst processes, anyone? by Kaa · · Score: 2

    In my own field, the distribution of stock market returns is often taken to be distributed log-normal

    You can also start with log returns (instead of "normal" returns). This will give you an approximation to a Gaussian (as opposed to a lognormal distribution), plus they are summable across time. I work almost exclusively with log returns -- they are a pain when you need to calculate portfolios, but nice otherwise.
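    To make "summable across time" concrete, here is a tiny sketch with a made-up price series (the numbers are purely illustrative, not market data): per-period log returns add up exactly to the log return over the whole span, while simple returns do not.

```python
import math

# Hypothetical closing prices, for illustration only.
prices = [100.0, 104.0, 98.8, 103.74]

# Per-period simple returns: p1/p0 - 1
simple = [p1 / p0 - 1.0 for p0, p1 in zip(prices, prices[1:])]
# Per-period log returns: ln(p1/p0)
log_r = [math.log(p1 / p0) for p0, p1 in zip(prices, prices[1:])]

total_simple = prices[-1] / prices[0] - 1.0
total_log = math.log(prices[-1] / prices[0])

print("sum of simple returns:", sum(simple), "vs whole-span simple return:", total_simple)
print("sum of log returns:   ", sum(log_r), "vs whole-span log return:   ", total_log)
```

    The flip side is the portfolio pain mentioned above: a portfolio's simple return is the weighted sum of its assets' simple returns, but that is not true of log returns.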

    A new distribution that gives increased weight to rare events would be very useful

    There are several (e.g. Cauchy), but the problem is that they are much harder to deal with (analytically) than the Gaussian. And if you don't like any, you can always work with the empirical distribution -- no need to pollute the facts with assumptions about what they should be. However, not much of statistics will be useful to you -- the Bayesians offer some good tools.

    Getting back to the original point, I wonder if these guys heard of Hurst and Hurst processes. A persistent Hurst process (sometimes called black noise) will generate something like what they found, and Hurst himself developed his theory on the basis of natural phenomena (he started with the frequency of floods on the Nile which occurred, surprise, more often than should have been expected). Skim through Peters' "Fractal Market Analysis" for more information.

    I bet these guys rediscovered Hurst processes.




    Kaa

    --

    Kaa
    Kaa's Law: In any sufficiently large group of people most are idiots.
  94. What is the big deal? by Axe · · Score: 1

    Yes, the Gaussian is one case of the family of so-called "stable" distributions - the only one with a finite variance. A stable distribution is one such that if you add random variables with this distribution, you get a variable with the same distribution (up to shift and scale). As in the law of large numbers, but without finite variance.
    Infinite variance means the tails of the distribution fall off slowly - you are more likely to get an event far from the mean value.
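    A small simulation sketch of what "stable" means in practice, using the Cauchy distribution as the infinite-variance example (my choice of example; the trial counts and seed are arbitrary). Averaging 100 Gaussian draws shrinks the spread by about a factor of 10, but averaging 100 Cauchy draws gives back a variable with the same spread as a single draw.

```python
import math
import random

def cauchy(rng):
    # Standard Cauchy via the inverse CDF: tan(pi * (U - 1/2)).
    return math.tan(math.pi * (rng.random() - 0.5))

def normal(rng):
    return rng.gauss(0.0, 1.0)

def iqr(values):
    """Interquartile range, a spread measure that exists even without a variance."""
    s = sorted(values)
    n = len(s)
    return s[(3 * n) // 4] - s[n // 4]

def iqr_of_means(draw, n, trials=4000, seed=1):
    """Spread of the average of n draws, estimated over many trials."""
    rng = random.Random(seed)
    means = [sum(draw(rng) for _ in range(n)) / n for _ in range(trials)]
    return iqr(means)

cauchy_1, cauchy_100 = iqr_of_means(cauchy, 1), iqr_of_means(cauchy, 100)
normal_1, normal_100 = iqr_of_means(normal, 1), iqr_of_means(normal, 100)

print(f"Cauchy   IQR: single draw {cauchy_1:.3f}, mean of 100 {cauchy_100:.3f}")
print(f"Gaussian IQR: single draw {normal_1:.3f}, mean of 100 {normal_100:.3f}")
```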

    So fucking what? Big news? Hardly.

    Stable distributions have a lot of applications in many areas of physics and finance. Do a literature search on "Levy flights" for examples. There was a good article on Levy flights in one recent "Nature" (IIRC). For some financial applications, check out Mandelbrot's very easily written (but for a specialist kinda useless, IMHO) "Fractals and Scaling in Finance". It has some good discussion on the subject.

    Guys, you look like fools, making news out of a rather well known field. And discussing it rather childishly...

    --
    <^>_<(ô ô)>_<^>
  95. SETI Client by Anonymous Coward · · Score: 1

    Quick! Somebody fix the SETI client! We don't want to miss any of the alien signals!!

    SETI GBC Analysis

    Just kidding. :)

  96. t-distribution by Anonymous Coward · · Score: 0

    It has been known for some time now that the assumption of infinite degrees of freedom inherent to the Gaussian is inappropriate in many cases where it is applied, essentially due to laziness and its analytic simplicity. The t-distribution, which has fatter tails (ie. rare events are more common) is commonly a better fit, but a pain to use.

    1. Re:t-distribution by Axe · · Score: 1

      Arrgh... you missed your statistics class.
      The t-distribution is NOT a BETTER FIT. It is the distribution of the (standardized) sample mean of a Gaussian variable when you use the sample variance instead of the true variance (when it is unknown). As your sample grows bigger, the estimator of the variance becomes more accurate and your t-distribution approaches the Gaussian.
      There is nothing painful about using t once you've got a clue.
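      A quick sketch of that convergence, using the textbook Student-t density written with `math.lgamma` (the evaluation point x = 4 and the degrees of freedom are arbitrary picks for illustration). It prints how much more probable a "4 sigma" value is under t than under the Gaussian.

```python
import math

def t_pdf(x, nu):
    """Student-t density with nu degrees of freedom, computed in log space."""
    log_c = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
             - 0.5 * math.log(nu * math.pi))
    return math.exp(log_c - (nu + 1) / 2 * math.log1p(x * x / nu))

def normal_pdf(x):
    """Standard Gaussian density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2 * math.pi)

x = 4.0
ratios = {}
for nu in (2, 5, 30, 1000):
    ratios[nu] = t_pdf(x, nu) / normal_pdf(x)
    print(f"nu = {nu:4d}: t density / Gaussian density at x = {x} is {ratios[nu]:.3f}")
```

      With few degrees of freedom the tail is far fatter than the Gaussian's; as the sample size (and so nu) grows, the ratio heads toward 1.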

      As for the Gaussian, as I mentioned above - it is a particular case of a stable distribution : with a finite variance. This is hardly news, but some recent developments in the self-affine processes made it more other stable distributions more widely known..

      --
      <^>_<(ô ô)>_<^>
    2. Re:t-distribution by Axe · · Score: 1

      I need to read before I submit.. Try again:
      but recent developments in the study of the self-affine (fractal) processes made other stable distributions used more often..
      Still bad, but screw my English...

      --
      <^>_<(ô ô)>_<^>
  97. Re:no REAL information here by Brett+Viren · · Score: 1

    Yes, my sentiments also. Does anyone have a link to a page which actually SAYS something on this topic?

  98. Not so clearly - lognormal has a claim by tilly · · Score: 1

    Normal distributions only make sense if your fundamental operation is addition. If, however, your fundamental operation is multiplying the random variables of interest, then you get a distribution whose logarithm is normal. Hence the name lognormal.

    This is just as natural as a normal distribution, and appears more often than straight normal distributions in subjects like finance and stochastic analysis.
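    A simulation sketch of that claim (the factor range 0.5 to 1.5, the number of factors, and the seed are arbitrary assumptions of mine): multiply together many independent positive random factors, and the product is strongly skewed while its logarithm is close to symmetric, as a normal variable should be.

```python
import math
import random
import statistics

def skewness(values):
    """Sample skewness: the mean cubed z-score (about 0 for a symmetric shape)."""
    m = statistics.mean(values)
    s = statistics.pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)

rng = random.Random(42)
products = []
for _ in range(5000):
    p = 1.0
    for _ in range(50):
        # Each step multiplies by an independent random factor.
        p *= rng.uniform(0.5, 1.5)
    products.append(p)

logs = [math.log(p) for p in products]
skew_product = skewness(products)
skew_log = skewness(logs)

print(f"skewness of the products:     {skew_product:.2f}")
print(f"skewness of log(products):    {skew_log:.2f}")
```

    Taking logs turns the product into a sum, so the ordinary central limit theorem applies to log(product); that is all "lognormal" means.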

    Cheers,
    Ben

    --
    My usual seat in the cluetrain is at http://pub4.ezboard.com/biwethey.ht
  99. Fat tails by Anonymous Coward · · Score: 2

    This is a well-known fact in statistics and finance: so-called fat tails.

  100. Lots of Pratchett fans here? by tilly · · Score: 1

    Just in case people don't know where in the heck the above jokes are coming from, go out, buy some books by Terry Pratchett, and have a ball...

    Ben

    --
    My usual seat in the cluetrain is at http://pub4.ezboard.com/biwethey.ht
    1. Re:Lots of Pratchett fans here? by dillon_rinker · · Score: 2

      And if you want the technical exposition (rather than the narrative one he provides in the novels), then pick up Pratchett's Discworld RPG from Steve Jackson Games. :) Hmmm...if Bill Gates lived on the Discworld, who would come for him when he died? And since rare events are much more common under this newly discovered distribution than under a Gaussian distribution, and since the Discworld is said to reside under the far tails of the probability curve, does that mean there are more Discworlds than were previously believed to exist?

  101. unless... by RoLlEr_CoAsTeR · · Score: 1

    it's a one-in-a-million chance of winning the lottery...

    --

    Insert mind here.
  102. umm is least rare most rare? by bloosqr · · Score: 1

    Umm is that article just on crack or what? I am sure that graph (Learning curves : a fresh approach) has to be a misrepresentation of something... Notice how the "old" graph says that to the left we have a small distribution of "rare" species, and then (as you move to the right) it gets larger, and then for some reason known only to the business kidz the very unrare species become umm rare again :). Obviously this just isn't true, no one maps out Gaussian curves like that.. I think by the asymmetry of the "new" curve that they are showing that some things are better modelled using Poisson statistics.. that's what the "new graph" looks like anyway.. the question is what on earth are they trying to show.. what is the Y axis??

    Anyway i'll put money on the fact whoever these profs are are trying to scam cash from financial wallstreet types.. (New curves new ways to predict the stock market give us money *cough*) this article was a plant.. but i'm feeling kind of cynical today :)

    -avi

    1. Re:umm is least rare most rare? by Axe · · Score: 1

      It is actually a big deal to try to persuade MBA types to understand a thing about stable distributions. Their first reaction is "but it's got infinite variance!", and every paper they have seen refers to the variance of the process. They cannot understand that when you use some parameter to describe a process, this parameter is not an intrinsic characteristic of the process - it is rather a characteristic of your model. So when you calculate a variance to describe a stock price trajectory, it does not mean that every model of this process has to have a finite variance. You just made a fit based on a finite observation.
      Gaussians fit these processes very badly indeed, and a lot of "fixes" have been developed to account for that. But people are reluctant to change their paradigm altogether - they did not study this in that single business statistics class they took in school. Gaussian theory is 100 years old; fractals, self-affine processes, Levy flights etc. are all too recent and not so well understood. It will take time.

      This paper is still a scam anyway... A better model will not make you a better trader - there are too many other variables affecting security prices - the psychology of the buyers is one big unknown. With people trading over the net so fast, the effects of news on their selling behaviour are too hard to guess...
      Just my 0.02c...

      --
      <^>_<(ô ô)>_<^>
    2. Re:umm is least rare most rare? by bloosqr · · Score: 1

      Okay I went thru the slashdot notes and hurt my head a bit in trying to make out what the graph *really* meant :) so it seems it is saying that we expect there to be more rare species than unrare species. This makes sense, as rare species don't affect the others in quite the same way as a very unrare species (which would dominate resources, so you would expect there to be few of them). This really is just saying that there is species-species correlation, especially w/ dominant unrare species (i.e. not really random).

      -avi

  103. Re:"universal" curves - a "universal" example by David+Ishee · · Score: 1

    Here is an example of "universal" mathematical models from a Mechanical Engineer:

    In school, I took a class called System Dynamics which is all about modeling dynamic behavior of systems. There is an interesting similarity of behavior between electrical, mechanical, and hydraulic systems in the equations used and how you define them.

    Driver:
    Electrical = voltage
    Mechanical = force
    Hydraulic = pressure

    Flow:
    Electrical = current
    Mechanical = velocity
    Hydraulic = flowrate of fluid

    Resistance:
    Electrical: voltage = constant*current
    Mechanical: force = constant*velocity
    Hydraulic: pressure = constant*flowrate of fluid

    Capacitance:
    Electrical: voltage = constant*integral(current) over time
    Mechanical: force = constant*distance traveled
    Hydraulic: pressure = constant*integral(flowrate) over time

    Inductance:
    Electrical: voltage = constant*delta(current)/delta(time)
    Mechanical: force = constant*delta(velocity)/delta(time)
    Hydraulic: pressure = constant*delta(flowrate)/delta(time)
    (In the mechanical example, mass is the constant)
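    One way to see the analogy in action is to combine the resistance and inductance relations into a single first-order equation, driver = resistance*flow + inertia*d(flow)/dt, and reuse the same integrator for both domains. This is only a sketch with made-up component values; the function names and the forward-Euler scheme are my own choices.

```python
import math

def step_response(driver, resistance, inertia, t_end, dt=1e-4):
    """Forward-Euler integration of inertia * d(flow)/dt = driver - resistance * flow,
    starting from flow = 0 with a constant driver applied at t = 0."""
    flow = 0.0
    for _ in range(int(round(t_end / dt))):
        flow += dt * (driver - resistance * flow) / inertia
    return flow

def exact(driver, resistance, inertia, t):
    """Closed-form solution of the same equation, used as a cross-check."""
    return (driver / resistance) * (1.0 - math.exp(-resistance * t / inertia))

# Hypothetical values: (driver, resistance, inertia, time)
cases = {
    "electrical: 10 V, 5 ohm, 0.5 H": (10.0, 5.0, 0.5, 0.1),    # flow = current in A
    "mechanical: 20 N, 4 N*s/m, 2 kg": (20.0, 4.0, 2.0, 0.5),   # flow = velocity in m/s
}

results = {}
for name, (drv, res, ine, t) in cases.items():
    results[name] = (step_response(drv, res, ine, t), exact(drv, res, ine, t))
    print(f"{name}: simulated flow {results[name][0]:.4f}, exact {results[name][1]:.4f}")
```

    Nothing in the integrator knows whether it is handling a circuit or a mass with a damper; only the labels on the numbers change.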

    The equations are very similar, but you don't see me calling the press and saying I've found a "universal" mathematical model.

    Trying to claim a "universal" law is hype. That there is similar behavior for magnetic properties, turbulent flow, and distribution of species is interesting, but it doesn't suggest that everything is related in a similar way. I think that is why Mr. Turcotte got such a hostile reaction. Before you claim there might be a "universal law linking patterns of mineral deposits, floods and landslides" you had better look at the data first, and not argue from the specific to the general the way he did in this case.

    --
    Your password has expired, please login to change it.
  104. Gauss turns over in his grave by sl1006 · · Score: 1

    As many have pointed out, there is nothing new or surprising in the claim. What the statistical theory claims is that if a variable is truly (mathematically) random, its statistical distribution asymptotically approaches a Gaussian distribution (the bell curve). That's not an observation or a fact. That's a theorem which one can prove; in other words, it's more like a definition of the "true randomness" of a variable. Roughly speaking, if something is truly random, its distribution will begin to look like a bell curve. The real question is, "what is truly random?"

    It's almost nonsensical to state that nature does not follow the Gaussian curve just because a statistical variable does not follow it. Perhaps it tells you more about the variable itself. If a variable x has a perfect Gaussian distribution, the distribution of log(x) will look nothing like a Gaussian distribution. Does that tell us the Gaussian curve is not the normal curve? It only tells us that even if x is truly random, log(x) is not.

  105. Overstated case. by drunkenkatori · · Score: 1
    The journalist tries to hype up the result as somehow overturning a "universal truth", that is, ubiquity of the gaussian bell curve.


    First, the bell curve is ubiquitous because so many random processes satisfy the assumptions of the Central Limit Theorem. (finiteness...)


    However, there are lots of natural phenomena that don't meet those requirements and so we use lots of probability distributions in science. Lorentzian and Poisson distributions come to mind.


    It's fascinating, but unsurprising that self-similarity leads to a different kind of probability distribution.


    The journalist heads into "Golly Gee" territory once he starts calling it a "Universal Curve".


    DK

  106. Nor is it particularly right... by Anonymous Coward · · Score: 0

    It assumed its pre-eminence, precisely because it has nice analytic properties (i.e. one can prove pencil-and-paper theorems about it). That does *NOT* imply that all stochastic phenomena obey Gaussians. Their distributions are what they are, and trying to fit any/all data with Gaussians is a disease. Get over it.

    1. Re:Nor is it particularly right... by dillon_rinker · · Score: 2

      YEAH! RIGHT ON!

      I feel the same way about the "least squares" technique for determining the line of best fit. It is popular precisely because it is easy to do calculus on x^2.

  107. Multiplication, instead of addition? by Anonymous Coward · · Score: 0

    Judging from the curve, it looks like it can be built by multiplying several random 'factors' together, instead of adding.
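
    A minimal sketch of that idea, under assumptions that are not from the article: each 'factor' is drawn uniformly from [0.5, 1.5] and 20 of them are multiplied together. The product comes out lopsided with a long right tail, while its logarithm (which is a sum of logs) is close to symmetric. Whether this has anything to do with the curve in the FT piece is a guess.

```python
import math
import random
import statistics

def skewness(values):
    """Sample skewness: third central moment over the cubed standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum((v - mean) ** 3 for v in values) / (len(values) * sd ** 3)

def product_of_factors(rng, n_factors=20):
    """Multiply n_factors independent factors drawn uniformly from [0.5, 1.5] (an assumed range)."""
    result = 1.0
    for _ in range(n_factors):
        result *= rng.uniform(0.5, 1.5)
    return result

rng = random.Random(0)  # fixed seed so the run is repeatable
products = [product_of_factors(rng) for _ in range(20000)]
logs = [math.log(p) for p in products]

skew_products = skewness(products)  # long right tail: clearly positive
skew_logs = skewness(logs)          # log turns the product into a sum: close to zero
print(f"skewness of products:      {skew_products:.2f}")
print(f"skewness of log(products): {skew_logs:.2f}")
```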

  108. Central limit theorem by hawkestein · · Score: 1

    What you're talking about is called the "central limit theorem", which holds for the summation of iid (independent, identically distributed) random variables of any kind of distribution with finite mean and variance.
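
    A quick way to see the theorem at work, as a sketch only: the exponential distribution used here is an arbitrary choice of a skewed distribution with finite mean and variance. For a Gaussian, about 68.3% of values fall within one standard deviation of the mean; for a single exponential draw the figure is 1 - e^-2, about 86.5%. Averaging 100 iid draws should pull the figure toward the Gaussian one.

```python
import random
import statistics

def sample_mean(rng, n):
    """Average of n iid draws from an exponential distribution (mean 1, variance 1)."""
    return sum(rng.expovariate(1.0) for _ in range(n)) / n

def fraction_within_one_sd(values):
    """Share of values lying within one standard deviation of their mean."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(abs(v - mean) <= sd for v in values) / len(values)

rng = random.Random(1)  # fixed seed so the run is repeatable
single_draws = [sample_mean(rng, 1) for _ in range(10000)]
averages = [sample_mean(rng, 100) for _ in range(10000)]

frac_single = fraction_within_one_sd(single_draws)   # far from the Gaussian 0.683
frac_averaged = fraction_within_one_sd(averages)     # close to the Gaussian 0.683
print(f"single draws:        {frac_single:.3f}")
print(f"averages of 100 iid: {frac_averaged:.3f}")
```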

    --
    -- Will quantum computers run imaginary-time operating systems?
  109. This is a breakthru? by demon-D · · Score: 1

    If you measure a set of mostly random events you will end up with a bell curve.

    It seems to me that external, modifying events are removed from scientific studies as much as possible.
    This act automagically skews the results at least slightly, enough that you will find something else in nature.
    It would seem to me that it would be impossible to take EVERYTHING (i.e. everything) that might affect the results into account, so we don't bother trying.

    If we want to predict a mostly random event we apply the bell-shaped curve. But I say 'mostly random' because most things are not truly random.

    Just because we fail to predict or fully understand a problem does not mean that it is utterly random. This new curve helps to predict some things. Others might take a whole new curve. I do not believe that there will ever be a universally true curve. All that this points out (gasp) is that not all things are utterly random.

  110. Baloney by ruff · · Score: 3


    Just because your data doesn't precisely fit the distribution, it does not mean the distribution is "wrong." What it means is your data doesn't match your distribution.

    This appears to be another case where journalists have missed the point.

    The Gaussian distribution is not "wrong" in any shape or form.

  111. Psychohistory? by NYFreddie · · Score: 1

    It sounds like something out of an Asimov novel, actually. A common formula that can be used to judge seemingly random events when large masses are considered - the individual is random, but the collective is predictable.

    I wonder if I should take this back to school and demand they raise my grades for all those times I "created the bell curve".

    -NYFreddie

    --
    Barbie of Borg - She doesn't just Assimilate, She Accessorizes too!
    1. Re:Psychohistory? by Anonymous Coward · · Score: 0

      "It sounds like something out of an Asimov novel - the individual is random, but the collective is predictable." No. It's called *statistics* and it was invented, oh, about a few hundred years ago.

  112. For whom the Bell Curve Tolls , or something.... by The+Queen · · Score: 1

    Goddess, I love it when the status quo gets shaken up! Woo!
    To explain the 'rare more common than common' phenomenon, one need look no further than Hallmark or Precious Moments or crap like that: "We are all special, we are all unique, etc." Blah!
    Still giddy, this is cool!

    The Divine Creatrix in a Mortal Shell that stays Crunchy in Milk

    --

    The House Between - Original Sci-Fi Series
  113. Old news by Anonymous Coward · · Score: 0

    For people in the physical sciences. Many of us have been using binormal or power law (fractal) distributions to characterize our data for some time.

  114. How does this relate to standard deviations? by Mr+Z · · Score: 4

    First, let me say that the graph in the article is poorly labeled (or at least their example poorly chosen), IMHO, since "rarity" is related to the number of standard deviations you are from the mean (whether or not the distribution is symmetrical), whereas their graph has rarity monotonically decreasing from left to right. I guess in this sense ("rarity of a species"), rarity != probability.

    This new graph strikes me as a bit odd, since it's not symmetrical. With the bell curve, you only need to know how many standard deviations you are from the mean. With this curve, "above the mean" and "below the mean" are vastly different territories.

    This curve brings up two questions for me:

    • Are there processes/events for which the mirror-image of this curve is the more appropriate distribution?
    • Whatever happened to the other distributions we know and love, like the Poisson distribution? Not all random events are evenly distributed, and we've known this for a long time.

    I guess this new curve is just another way of saying that "Hey, there's a class of 'random' events out there that share a common non-uniform distribution!" While that's useful to know, I don't see it as the ultimate refutation of the Gaussian distribution.

    --Joe
    --
  115. Rarity of Species by Anonymous Coward · · Score: 0

    The point of this article isn't really about math, but that one side or the other in the conservation movement is going to get some ammo to use against the other side.

    I suspect from the way that the article is written, that the authors are claiming that the extrapolations about the numbers of species, and by implication the numbers of species that have gone extinct, are too high.

    This would imply that the ecological crisis claimed by the extremist conservationists is vastly overstated.

    So this is a political piece, not a technical one. (I'm not going to comment, one way or the other, about the ecological issues that are implicit in this article. That's a completely different discussion.)

    skg

  116. Can this make you RICH? by Anonymous Coward · · Score: 0

    Rumor has it that Wall Street quants use fractal correlations to predict/exploit financial time series. This has been a long-term interest of Mr. Fractal himself, Dr. Mandelbrot, who wrote a Scientific American cover story on it a few months back.

  117. Interesting not exceptional by PG13 · · Score: 5

    The use of the Gaussian curve is based on the assumption that the random variable we are considering is actually generated as an average of many, many independent random variables. It has been shown that for all 'reasonable' independent random variables, in the limit their average will have a Gaussian distribution. This is straightforward mathematics; no arguing with this.

    As such, from a mathematical point of view this has nothing to do with replacing the Gaussian curve... it is still clearly the most 'natural' mathematical curve. However, what I understand the authors to be claiming is that certain types of real world events are not actually Gaussian and are described better by this model. This shouldn't be that surprising, as often the 'extreme' cases are not caused by a mere sum of the independent random variables mentioned earlier.

    For instance, intelligence might be regarded as the influence of a great many small random variables (how some genes got arranged, upbringing, etc.) but the truly tail-end cases such as mental retardation do not occur because all of these factors go bad (someone who is retarded is usually the result of some genetic defect, not a combination of bad upbringing, poor nutrition, etc.). This is probably not the kind of thing the distribution describes, but it shows that the Gaussian really never has been the end-all and be-all.

    So while this is undoubtedly a very interesting subject, it really isn't that exciting. Oh, and the claim that the greater incidence of natural disasters disproves the Gaussian was really BS; while they may not be Gaussian, this doesn't appear to be a large enough sample size to make such definitive claims.

    --
    Marriage is the "pseudo-ethics" that cloaks the messy truth of sexuality in the raiment of propriety -- it's "Don't Ask,
    1. Re:Interesting not exceptional by jafac · · Score: 1

      I guess that's where I'm confused, and since others read the article and were similarly boffed, is the line this guy came up with mathematical, or simply an overlay of statistics? I mean, there's a formula for the gaussian curve, right (assuming you have "random" numbers). Is there a formula for this new beast (the beanie-baby curve)?

      "The number of suckers born each minute doubles every 18 months."

      --

      These are my friends, See how they glisten. See this one shine, how he smiles in the light.
    2. Re:Interesting not exceptional by Anonymous Coward · · Score: 0

      This is an accurate comment. I just want to add that THERE IS NOTHING NEW about deviation from the normal distribution in self-similar systems (including all financial markets, where most of the money is in playing with such distributions). The normal distribution quickly (well, at least, exponentially) becomes a very mathematical concept as we approach the tails, since you need unrealistically large numbers of trials to make a meaningful observation. Unknown :) Ph.D.

  118. Re:For whom the Bell Curve Tolls , or something... by jafac · · Score: 1

    you're suggesting we call this the beanie-baby curve?

    "The number of suckers born each minute doubles every 18 months."

    --

    These are my friends, See how they glisten. See this one shine, how he smiles in the light.
  119. Events in self-similar system are not independent! by Anonymous Coward · · Score: 0

    You have grasped half of the situation: the derivation of the Gaussian curve depends on the assumption of independence of events. When you deal with self-similar systems, there is an overall structure (e.g., a fractal structure), which means there is a coherence (correlation) between seemingly unrelated events (you just may not be able to articulate precisely what that correlation is; the fractal structure proves it exists nonetheless). In the real world, there is very often a low-level interdependence among events, and the existence of that interdependence means there is no reason whatsoever to expect those events to fall along a Gaussian distribution. What they have identified (as I interpret this very incomplete description of the discovery) is that there is apparently a universal form of distribution of the events causing evolution in self-similar (fractal) systems. This is something I definitely want to look into, but if something along the lines of my imperfect present understanding is in fact the case, this is a very very big deal!

  120. What's surprising? by Anonymous Coward · · Score: 0

    The bell-shaped Gaussian distribution is just a model. In many cases, there are good theoretical reasons why it makes sense to use the Gaussian model (e.g. the central limit theorem, easy to work with mathematically, etc.), but there are many different statistical models in use. For example, noise in undersea communications is better modeled using a Laplacian distribution than a Gaussian. So I don't really see what's surprising about the fact that real data doesn't really fit the Gaussian model.





  121. Math sup by zebul0n · · Score: 1

    This article is just confusing.
    Anyone out there who can give an honest description of this whole thing in mathematical terms?
    (would be nice if it was someone who was in maths-sup-math-spe...thanks...)

    zeb

    1. Re:Math sup by Mathieu · · Score: 1

      The Gaussian law appears in many contexts. For the most part it appears in uncorrelated situations (gas particles, photon gas, stock options, etc).

      The article is not clear, but I am not sure the thing is clear for the scientists involved either. Maybe they have found a new universality law, which would be a great event in science. Maybe this is just a random coincidence which relies on nothing.

      From what I understand, they think their law applies to a great range of correlated phenomena.

  122. Another thought: When one side is near saturation. by Mr+Z · · Score: 2

    I happened to think of one possible reason why so many phenomena might fit a lopsided curve better: The bell curve implies the possibility of infinite extension in both directions. If the mean of the distribution is near one physical extreme (for instance, looking at average rainfall levels -- you can't have negative rainfall), then the curve must become lopsided.

    Perhaps that's what they've stumbled onto?

    --Joe
    --
  123. Feigenbaum knew it by el_ted · · Score: 1

    That curve is just the probabilistic map taken from a column right after the period 3 in any bifurcation map. Other columns also give different curves.

    --
    -- You are in a twisty maze of passages, all alike.
  124. Aren't you guys missing a point? by Aos · · Score: 1

    Aren't you guys missing a point?
    We have different distributions and none are "good" or "bad" by default, as anyone with science degree knows, but:

    1. This article is in the Financial Times
    2. The scientists proposing the new distribution have established it partially based on data about the frequency of forest fires and air turbulence
    3. The article claims that for natural disasters insurance companies currently use the Gaussian distribution
    4. Insurance companies = Tera $$$ = what "financial" stands for in the web site's name

    So simply, if it is true that insurance companies use the Gaussian d. for e.g. forest fires, and research shows that the actual data doesn't fit that distribution, then there's a good chance those companies will utilize this research and insurance premiums for such events will change. For the finance world, that IS big news.

    If this wasn't related to $$$, you wouldn't see an article like this in the Financial Times. They don't give a damn about chaos theory or anything, unless it means $$$.

    For the implications of this new distribution in science, a better article with formulas and such would do the rest of us good.

    1. Re:Aren't you guys missing a point? by Axe · · Score: 1

      AFAIK, insurance companies do employ some pretty smart guys with a good background in statistics, physics and math. I know some people working in this area. They DO NOT just use a Gaussian distribution. This article is junk.
      In particular, the scaling distribution of earthquakes and other natural phenomena is well accepted and broadly used. I do not understand what the target audience for this article is. But this is not "news for geeks".

      --
      <^>_<(ô ô)>_<^>
  125. Uh, your math is seriously hosed... by tilly · · Score: 1

    The Gaussian distribution is 'the universal distribution' in the following sense:
    Consider a series of events that generate some value. For example, rolling of a dice, which generates a value from 1 to 6. Assume that these events are independent, meaning that, say, the 10th outcome will in no way influence, say, the 20th outcome. Now take the first N outcomes, add them together and divide by N. The larger you take N, the better the distribution of this average follows the Gaussian distribution. (And I should add that there are some mild conditions that have to be satisfied).


    Perhaps you should get the calculation right. You have to subtract N times the expected value, then divide by the square root of N to see a real Gaussian distribution emerge.
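
    The normalization described here can be sketched with simulated dice. The trial counts and seed are arbitrary choices for illustration. For a fair die the expected value is 3.5 and the variance is 35/12, so (sum - N * 3.5) / sqrt(N) should keep a variance near 35/12 (about 2.92) however large N gets, whereas the plain average sum / N has a variance that shrinks toward zero.

```python
import math
import random
import statistics

DIE_MEAN = 3.5          # expected value of one fair six-sided die
DIE_VARIANCE = 35 / 12  # variance of one fair six-sided die

def roll_sum(rng, n):
    """Sum of n independent rolls of a fair die."""
    return sum(rng.randint(1, 6) for _ in range(n))

rng = random.Random(2)  # fixed seed so the run is repeatable
normalized_variance = {}
average_variance = {}
for n in (10, 100, 400):
    sums = [roll_sum(rng, n) for _ in range(3000)]
    # Subtract n times the expected value, then divide by sqrt(n).
    normalized = [(s - n * DIE_MEAN) / math.sqrt(n) for s in sums]
    averages = [s / n for s in sums]
    normalized_variance[n] = statistics.pvariance(normalized)
    average_variance[n] = statistics.pvariance(averages)
    print(n, round(normalized_variance[n], 3), round(average_variance[n], 5))
```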

    Now what are they saying here? That the 'rareness' of species does not follow the Gaussian distribution? How do you quantify 'rareness'? How can this satisfy any kind of independence condition (where there's one rare animal, there are bound to be more).

    What's the weirdest of all, is the statement that rare species are more common that expected. What a joke! If something is more common than expected, then by definition it is not as rare as you thought!


    I would have thought that to be pretty darned obvious. The species is rare if there are few members of that species. Their claim is that there are more species with only a few members than one would expect. (My retort is that humans have a lot to do with how many rare species there are!)

    Let me say that again slowly.

    Step 1: Categorize species based on how many members they have.

    Step 2: Look at the distribution of species by the population of the species.

    The non-technical articles don't tell us anything useful when it comes to judging their math. But given your math, when it comes to labelling things mathematical nonsense I can only think, pot, kettle, black.

    Sincerely,
    Ben

    --
    My usual seat in the cluetrain is at A HREF="http://pub4.ezboard.com/biwethey.ht
  126. rare events by Anonymous Coward · · Score: 0

    So, what you're telling me is that according to this distribution certain rare events will happen more often......like Microsoft releasing stable and useful products, actually getting help from tech support, etc., etc..... ;-) Microsoft Enthusiast

  127. Re:Another thought: When one side is near saturati by Anonymous Coward · · Score: 0

    I don't think the new distribution is due to the mean being near one physical extreme. This situation is very common and has been studied by statisticians. In such cases the data is not expected to be normally distributed, although sometimes a "transformation" (e.g. 1/x or sqrt(x)) can be used to get a normal distribution from such a data set.

    What I gathered from the article is that the Gaussian distribution is still correct. They have just discovered a set of situations which follow a different distribution. While it may have important implications for some fields, this is not earth shattering. Many phenomena have non-Gaussian distributions (e.g. nuclear events follow a Lorentzian distribution).

    Too bad the article didn't give many details about this new distribution!

  128. Explains some things...my ranting... by Any_doom?_a_cow_runs · · Score: 1

    Why coincidences happen more often than they should, and its not all 'in our heads' like the psychobabblists like to tell us.


    My bet is that it has something to do with quantum mechanics. Those things always seem to crop up and show us how classical physics can be full of shit under the right circumstances.

    You really can read the other guy's mind. You just don't know it :)


    Anonymous Coward, get it? :)

    --

    Anonymous Coward, get it? :)
    Not bad spelling, bad typing
  129. Re:Mmmmm, curves by Anonymous Coward · · Score: 0

    Too bad you never see any other than in a magazine...

  130. Geometry, Nature, Dice and Quanta. by ninoles · · Score: 1
    The Gaussian curve is one of the most regular curves that a mathematician can obtain under ideal conditions. As said, it's what you obtain when adding intersecting independent uniform probability curves, the equivalent of throwing a die with an infinite number of faces an infinite number of times. This curve fits very well in the Euclidean universe, with its straight infinite lines and geometrical forms.
    However, chaos theory, which can be considered a new form of geometry, demonstrates, by strict observation, that nature is better represented by fractals than by Euclidean forms (even complex ones). A good example is the length of the coast of England: if you take a straight meter stick to measure it, you obtain a certain length. Now use something a foot long: your total will be higher, because you now have to follow irregularities that your previous meter stick had to ignore. Now take the smallest ruler you can, something near 10^-35 meter. If you have the patience to do so, you will obtain something near infinity. That's it: a finite space with an infinite boundary! This may shock your common sense, but it is still strictly a question of observation and mathematics. Nothing new, just the point of view.
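
    The ruler effect can be sketched with the Koch curve, used here as an idealized stand-in for a coastline (an assumption; a real coast is not an exact Koch curve). Each time the ruler shrinks by a factor of 3, four times as many ruler lengths are needed, so the measured length grows by 4/3 without bound.

```python
def koch_length(ruler_exponent):
    """Measured length of a unit-base Koch curve using a ruler of size 3**-ruler_exponent."""
    ruler = 3.0 ** -ruler_exponent
    steps = 4 ** ruler_exponent  # number of ruler-sized steps needed to walk the curve
    return steps * ruler

for k in range(0, 9, 2):
    print(f"ruler = 3^-{k}: measured length = {koch_length(k):.2f}")
```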

    Fractals and self-similarity are more natural than Euclidean geometry and Gaussian probabilities, IMHO. Early scientists used to name everything they found, distinguishing the aorta from the veins, fingers from toes, etc. In chaos theory, it seems easier to talk about the self-similarity of a single component. The vein is really just a small aorta, and fingers are really like toes that evolved differently because of different conditions. Chaos simply says this: Euclidean mathematics may be easy for the human mind, but it is not for nature. Nature prefers to use the simpler way of fractal geometry, with no regard for our simple minds. Chaos is not a new invention; it is only the tool that gives us a new model of the world around us, a model that is more precise than the previous one.

    Finally, a little remark on the context in which Einstein said "God does not play dice!" It was about quantum physics, which says that probability is inherent to particles, for example between their position and their energy. A particle CAN USE this uncertainty to, e.g., gain enough energy to get itself out of the atom. That's called radioactivity. The problem Einstein had with this is that, even theoretically, you can never predict what will happen. It's just as if God had said: "You're lucky to be here, but there's still some chance that all the air in the atmosphere will suddenly leave the planet. I made things this way, so even I can't know which will happen." In fact, one interpretation of quantum theory simply states that all the possibilities happen; we are simply not there to see the others.

    --
    Fabien Ninoles -- Debian GNU/Linux Developer
  131. Re:For whom the Bell Curve Tolls , or something... by Anonymous Coward · · Score: 0

    Damn discordians...

  132. Either the graph is backwards by Rares+Marian · · Score: 1

    This looks like a job for SLASHDOT...
    Seriously...

    Either the graph is backwards or journalismology, the study of hype and fluff in published science, is prevailing here.

    Actually it's worse: rarity IS VERTICAL, NOT HORIZONTAL.

    Put differently however...

    We should have two statistics:

    One for situations where there's a constant change that affects the system and indirectly itself.

    Use the new curve here.

    For example: Number of hours of studying and test scores.

    Small to high shouldn't be too steep in the beginning, but if you study for a very long time, there should be a quick drop because you don't actually understand the material and are attempting to memorise it.

    Which proves something I've known for a while:
    Computer Newbies Are Not Stupid. Give them some information and they get better. Spoonfeed them and they're helpless!

    The old curve should still apply to where a hundred rocks land if dropped from a height. There's no interaction between the objects themselves.

    As for the obviousness, BLAME IT ON NETWORK TV. All the mindnumbing leading brand detergent and pharmaceutical commercials are responsible for giving people a false sense of completion as far as statistical studies are concerned.

    My take on it is DUH! or 32DOHS.wav.

    --
    The message on the other side of this sig is false.
  133. Re:Another thought: When one side is near saturati by Anonymous Coward · · Score: 0

    That would make so little difference in most real world situations. If the average rainfall is 1 inch +- .1 inches, then according to the model, negative rainfall has probability > 0, but 10 sd's below the mean is ~10^-30 (top of the head estimate). That's not going to skew anything. AC
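
    The back-of-the-envelope figure can be checked with the standard normal tail, using only the numbers in the comment (mean 1 inch, standard deviation 0.1 inches). The exact Gaussian value comes out near 8 x 10^-24, larger than the off-the-cuff 10^-30 but just as negligible, so the conclusion stands.

```python
import math

def normal_lower_tail(z):
    """P(Z <= z) for a standard normal variable, via the complementary error function."""
    return 0.5 * math.erfc(-z / math.sqrt(2))

mean_rainfall = 1.0  # inches, from the comment
sd_rainfall = 0.1    # inches, from the comment

# Zero rainfall sits 10 standard deviations below the mean.
z = (0.0 - mean_rainfall) / sd_rainfall
p_negative = normal_lower_tail(z)
print(f"P(rainfall < 0) under the Gaussian model = {p_negative:.2e}")
```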

  134. Re:But what is it? by MG · · Score: 1
    I've tracked down a more informative article, published in a reputable journal.

    "Universality of rare fluctuations in turbulence and critical phenomena" S.T. Bramwell, P.C.W. Holdsworth and J.-F. Pinton, Nature 396, 552 (1998)

    Now why didn't the FT just give a reference, and save us all some time?

  135. "universal" curves by sjstuart · · Score: 1

    The concept that there could be a "universal" curve behind all statistical phenomena is silly. About as silly as the implication that scientists currently think Gaussian curves are universally applicable.

    Every statistical process has a different distribution. Some are Gaussian, some become Gaussian under certain limits, and some can be approximated as Gaussians. But many others are simply something completely different: Lorentzian, Poissonian, etc. (Many of which have fatter tails than a Gaussian distribution.)

    These guys may or may not have come up with something. But there's certainly no news value to the claim that some distributions are non-Gaussian.

    -Steve Stuart

  136. Another stats distro by El+Volio · · Score: 1

    All that this article really says is that it appears that certain natural data fit a distribution other than the Gaussian. OK, fine. So do other processes. Besides, without some details on the distribution itself (give me formulae!) there's no real way to evaluate it.

    --

    "You can never have too many elephants on your team."

  137. This isn't new. by Anonymous Coward · · Score: 0

    If you take the law of large numbers into account, you can make this assumption rapidly. It's obviously stated that anything outside the ordinary can and will happen due to external events and internal events. I highly doubt the "group of scientists" thought of this simple truth before making this assertion. The assertion you're making is based on a law that defines behaviour similar to this. If you state that the behaviour trails or follows another curve at a particular given sample, then another sample with another variant distribution will follow another. Keep in mind the Gaussian distribution assumes a lot of data output due to unknown factors. This keeps the values representative in the analytic major, but proves a central behaviour for a percentage, not an entirety. So to say that something follows a new curve is silly. It's nothing new. That's like skewing statistics for your own personal use. Please, teach your children math, so they don't grow up to make stupid assumptions.

  138. Stupid journalists. by TheKodiak · · Score: 1

    "Fear the scientists! They can CONTROL your life! They know WHEN FLOODS will HAPPEN!"

    Or maybe I'm reading too much into scientific illiteracy.

    But I really _must_ protest the whole "Donald Turcotte recalls the hostile reaction he received when he suggested there might be a universal law linking patterns of mineral deposits, floods and landslides..." *sigh*

    I hope someone comes up with a better article on this.

    --
    -=Best Viewed Using [INLINE]=-
  139. no REAL information here by Fourier · · Score: 0

    Phrases like "has discovered a new mathematical curve" and "is derived from chaos theory" may sound good but don't tell me anything. Does anyone have a link to more informative articles?

  140. Mmmmm, curves by Kismet · · Score: 2

    I personally prefer the more voluptuous curves.

  141. Statistical Mechanics by Anonymous Coward · · Score: 0

    I wonder what implications this has, if any, for mechanics (classical and maybe more so quantum). With quantum being in large part a statistical analysis, I wonder if some of these non-Gaussian events would better describe quantum mechanical events? I don't know. I'm just askin.

  142. Opensource it by PG13 · · Score: 0

    Now only if they GPL the curve. We should all write in and request they opensource the curve before microsoft can copyright it.

    --
    Marriage is the "pseudo-ethics" that cloaks the messy truth of sexuality in the raiment of propriety -- it's "Don't Ask,
  143. hrmmmm.... by Anonymous Coward · · Score: 0

    one thing that i sort of wondered is if there
    was a possibility that the curve is somehow
    changing shape over time?
    -i mean, theoretically, universal laws could
    be allowed to change slightly over millions of
    years, yes/no?

    assuming that every rare occurrence has the
    probability of spawning an infinite number
    of new probable and improbable outcomes, one would
    think the gaussian model would be correct,
    but i'm sort of looking at this from the perspective that
    an infinite number of rare occurrences might not yet exist
    (or is this simply things that already exist?)

    hence, probable outcomes could only increase
    every time a rare occurrence becomes probable,
    whereas rare outcomes have the capability to
    increase...well, randomly?

    -adam
    showtell.com
    javanet.com/~user

  144. One of the "authors" replies by Anonymous Coward · · Score: 2

    September 3, 1999

    Wow!

    It is interesting to see the response that this "research" article in the Financial Times generated. I'm a research associate (Bruce Malamud) working closely with Donald Turcotte. A student wrote me about the discussion your web site was having. Donald Turcotte was one of the scientists "quoted" in the Financial Times article. My research area has been in the areas of "time-series analysis" and also applying ideas of fractals and self-organized criticality to natural hazards. I did my Ph.D. with Donald Turcotte and am now doing a brief stint as a postdoc while I look for a "real" job in the world.

    First of all, this Financial Times article was a "quickly" researched article on the part of the person who wrote it. Donald Turcotte was contacted and interviewed by phone on Tuesday/Wednesday, with no contact afterwards from the Financial Times to see how correct they got the overall picture. This is how things are and he and I both gulped when we saw how the article appeared. We quickly prepared a short "response" from him (below) to the deluge of e-mails and telephone calls that he received yesterday.

    Bottom line, he was a bit misquoted, but the general idea holds. We are talking about applying the ideas of power-law frequency-size distributions (i.e., fractals) to extreme events, including floods, forest-fires, earthquakes, landslides, etc. Donald Turcotte has been active for many years in the area of applying fractals, self-organized criticality, and chaos theory to the earth sciences, and yes, he knows very well that he did not "invent" the idea, just made many applications (well, a bit more than that, but read his book).

    On the most basic level (and no, I'm not trying to be insulting, I'm sure many people on this site know what I'm talking about already as this is basic statistics), the idea is a very simple one. Plot the frequency-size distribution of a set of data and see what curve best fits the data, i.e. what might be the underlying distribution. For some sets of data (such as forest-fire burn areas, earthquakes, and many other "natural" data sets) the frequency-size distribution follows a nice straight line in log-log space, i.e. it follows a power-law (fractal or self-similar) distribution. Although one cannot say for SURE what an underlying distribution is, one can make certain (statistical) guesses as to whether a distribution follows more a Gaussian, log-normal, power-law, etc.
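
    As a rough sketch of the log-log check described above, run on synthetic data rather than any real hazard record: event sizes are generated from a power law with a made-up exponent of 1.5, the fraction of events at least as large as each size is computed, and a straight line is fitted in log-log space. The fitted slope should come out close to minus the exponent. The names pareto_sample and loglog_slope are hypothetical helpers, not anything from the cited papers.

```python
import math
import random

def pareto_sample(rng, alpha, x_min):
    """Inverse-transform draw with P(X > x) = (x / x_min) ** -alpha for x >= x_min."""
    u = 1.0 - rng.random()  # lies in (0, 1], which avoids dividing by zero
    return x_min * u ** (-1.0 / alpha)

def loglog_slope(sizes):
    """Least-squares slope of log(fraction of events >= size) against log(size)."""
    ordered = sorted(sizes)
    n = len(ordered)
    xs = [math.log(s) for s in ordered]
    ys = [math.log((n - i) / n) for i in range(n)]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

rng = random.Random(3)  # fixed seed so the run is repeatable
true_alpha = 1.5        # hypothetical exponent, chosen only for illustration
sizes = [pareto_sample(rng, true_alpha, 1.0) for _ in range(20000)]
slope = loglog_slope(sizes)
print(f"fitted log-log slope: {slope:.2f} (data generated with exponent {true_alpha})")
```

    A straight line on such a plot is only suggestive; log-normal data can also look fairly straight over a limited range.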

    Once one "believes" that a set of data follows a certain distribution, one can then begin to make some guesses as to what an "extension" of that curve might bring in time. If one has 30 years of flood-discharge data, one might then be able to make certain predictions as to the "size" of what the 100-year flood might be. Same with earthquakes. One has a better idea of the probability of having a certain size or greater earthquake, flood, forest-fire, etc. each year. It just happens that many of these events appear to follow power-law distributions, and these are not as "accepted" in the statistical community.
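
    The extrapolation step can be sketched as follows, assuming (purely for illustration) that the chance of a yearly event exceeding size x is (x / x_min) ** -alpha. Setting that probability to 1/T and solving gives the T-year event size x_min * T ** (1 / alpha). The function name and the numbers below are hypothetical, not fitted to any real river.

```python
def return_level(x_min, alpha, years):
    """Size exceeded on average once per `years`, if yearly P(X > x) = (x / x_min) ** -alpha."""
    return x_min * years ** (1.0 / alpha)

x_min = 1000.0  # hypothetical yearly baseline discharge, arbitrary units
alpha = 2.0     # hypothetical power-law exponent

for years in (10, 30, 100):
    print(f"{years}-year event: {return_level(x_min, alpha, years):.0f}")
```

    With only 30 years of record, the fitted alpha is itself uncertain, and small changes in it move the 100-year estimate a great deal.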

    Don just came in and is looking over my shoulder. He adds (to my above comments) that statisticians do not in general recognize power-law distributions because one cannot define a pdf for them. (Although one can define pdf's for certain distributions that are similar to power-law distributions, such as the Pareto distribution.)

    So...in terms of the insurance community, they are of course very interested in whether a given "natural hazard" appears to follow more a power-law distribution vs. log-normal or Gaussian, as the resulting recurrence intervals will be very different. Power-law distributions tend to be very conservative for extreme events, i.e. one would expect more large events in a given period of time than with, say, a Gaussian distribution. Others of course interested in this underlying distribution would be engineers trying to decide how big a flood one might expect in a given area in a given amount of time (and yes, we're dealing with extreme events, so the statistics are small and unsure), so as to know where people can build houses, how deep to make the bridge supports, etc. Bottom line is the statistics are unsure because the data sets are small, but people need some sort of a starting point, as a lot of money rides on the answers of what the "underlying" distribution might be.
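
    A small sketch of how different the tails are. It compares the upper tail of a standard Gaussian with a Pareto-type tail; the scale `x_min = 1` and exponent `alpha = 2` are arbitrary choices for illustration, so only the trend matters, not the exact numbers.

```python
import math

def gaussian_tail(k):
    """P(Z > k) for a standard normal variable Z."""
    return 0.5 * math.erfc(k / math.sqrt(2.0))

def power_law_tail(x, x_min=1.0, alpha=2.0):
    """P(X > x) for a Pareto variable with scale x_min and exponent alpha."""
    return (x_min / x) ** alpha if x > x_min else 1.0

for size in (3.0, 5.0, 10.0):
    g = gaussian_tail(size)
    p = power_law_tail(size)
    # A recurrence interval is the reciprocal of the exceedance probability.
    print(f"size {size:>4.0f}: gaussian={g:.3e}  power_law={p:.3e}  "
          f"recurrence {1.0 / g:.3e} vs {1.0 / p:.3e}")
```

    The Gaussian tail collapses far faster than the power-law tail, so the same "large" event gets a vastly longer recurrence interval under a Gaussian assumption.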

    There are also many scientific implications, ranging from simply "describing" what distribution a data set best follows, to understanding better (or in a different way) the underlying basic physics or equations that describe a given natural phenomenon, by comparing the statistics resulting from the equations with the actual data. In addition, many scientists are now beginning to think that the pervasive power-law distribution in nature is a general indication of self-organized critical behavior. One definition of self-organized critical behavior is when one has a complex system with a small steady input, and a power-law distribution of the "avalanches" (the events). Donald Turcotte and I wrote a paper (in Science, see below) applying this general idea of self-organized criticality to computer models and forest fires. Of the references listed below, this is probably the easiest for people to get.

    OK, before I start babbling. Below is the "reply" that Donald Turcotte wrote to many of the e-mails that came in during the last day.

    Bruce Malamud

    _________________________________________
    Wednesday September 2, 1999
    Ithaca, NY, USA

    Dear Interested Reader:

    Due to the large number of e-mails and telephone calls I have received with respect to the articles by Michael Peel, "New Curve Makes Life Predictable" and "Redrawing the Curve Reveals New Pattern of Events", that appeared in the Financial Times, September 2, 1999, I have prepared a short general reply. If you have further questions or comments after reading the below "comment" to the article, please do not hesitate to contact me for further information.

    These Financial Times articles emphasize the importance of power-law (also called fractal or fat-tail) distributions in estimating the probability of occurrence of extreme events. It is unfortunate the article implies that I invented the idea of power-law distributions, which have been recognized now for many decades. For instance, earthquake hazard assessment is based mainly on the Gutenberg-Richter relation, which is a power-law distribution of the number of earthquakes as a function of their magnitude [for some papers where I discuss this, see DLT, Annual Review of Earth and Planetary Sciences, Vol. 19, p. 263-281, 1991; DLT, Physics of Earth and Planetary Interiors, Vol. 111, p. 275-293, 1999].

    My work in power-law distributions is based on the concept of fractals, which is due to the pioneering work of Benoit Mandelbrot [for instance, see his book, The Fractal Geometry of Nature, Freeman, San Francisco, 1982]. Mandelbrot, along with many other researchers, has applied the concept of fractals to many phenomena in the natural and "man-made" world, including financial time series. Other distributions similar to the power-law, such as the Pareto distributions, have also been used for a long time. A good web page which discusses fractals and has many links is The Spanky Fractal Database (http://spanky.triumf.ca/).

    My own contributions have concerned applications to natural hazards and related phenomena. These are set forward in detail in my book [DLT, Fractals and Chaos in Geology and Geophysics, 2nd ed., Cambridge University Press, Cambridge, 1997] and in a major review paper on self-organized criticality [DLT, Reports on Progress in Physics, Vol. 62, 1999, available as a pdf document (preprint) which can be sent upon request].

    The principal contributions of my group have been the applications of fractal distributions to:

    (1) Fragmentation (by explosions in asteroids, etc.). [DLT, Journal of Geophysical Research, Vol. 91, p. 1921-1926, 1986]

    (2) Mineral deposits. [DLT, Economic Geology, Vol. 81, p. 1528-1532, 1986]

    (3) Floods. [DLT and L. Greene, Stochastic Hydrology and Hydraulics, Vol. 7, p. 33-40, 1993; DLT, Journal of Research NIST, Vol. 99, p. 377-389, 1994; B.D. Malamud, DLT, and C.C. Barton, Environmental and Engineering Geosciences, Vol. 2, p. 479-486, 1996. The last paper is available as a pdf document at http://coastal.er.usgs.gov/barton/pubs_online.html ]

    (4) Landslides. [J.D. Pelletier, B.D. Malamud, T. Blodgett, and DLT, Engineering Geology, Vol. 48, p. 255-268, 1997; available as a postscript file at http://www.gps.caltech.edu/~jon/]

    (5) Forest Fires. [B.D. Malamud, G. Morein, and DLT. Science, Vol. 281, p. 1840-1842, 1998; available as a pdf document for subscribers of Science, web site: http://www.sciencemag.org/]

    Many extreme-value events are directly related to time series that exhibit persistence or memory (for instance, time series of temperature, river discharge, the stock market, etc.). A good reference to applying persistent techniques (and a discussion of how to apply the techniques) is Advances in Geophysics, Vol. 40, B.D. Malamud, J.D. Pelletier, and DLT.

    Two other colleagues who have applied power-law techniques to natural hazards are Dr. Bruce D. Malamud (Cornell University, e-mail: Bruce@Malamud.Com) and Dr. Christopher C. Barton (USGS, e-mail: barton@usgs.gov, home page: http://coastal.er.usgs.gov/barton/).

    Again, please do not hesitate to contact me for further questions.

    Donald L. Turcotte
    Maxwell Upson Professor of Engineering

    :::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::
    :: Donald L. Turcotte
    :: Department of Geological Sciences
    :: Cornell University, Snee Hall
    :: Ithaca, NY 14853-1504, USA
    :: Office: 607-255-7282; Fax: 607-254-4780
    :: e-mail: turcotte@geology.cornell.edu
    :::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::


  145. # of different species in a square meter? by HuvahCraftah · · Score: 1

    "In the same way, the distribution of species in 1 sq km of forest ought to be similar to that found in one square metre of the same habitat. Likewise, a turbulent or magnetic system is made up of a series of miniature systems, each of which is made up of a set of yet smaller arrays."

    Who comes up with this? Are the scientists expecting us to believe that this is valid information?

  146. Is there a technical publication of this online? by Anonymous Coward · · Score: 0
    Lots of people are passing judgement on this discovery on the basis of a popular press account. All too often in these cases, the journalist doesn't understand what they are trying to explain, or omits important details in an effort to simplify things. Does anyone have a link to a technical description of this theory?

    The Gaussian is very useful and very important, but it is easy to misuse it. The Gaussian *cannot* be strictly correct when measuring a quantity that cannot be negative (such as number of individuals in a species, or an absolute strength of magnetization) because it extends with non-zero probability infinitely in both directions. (It may still be a very good approximation, however, if the mean is many standard deviations above zero.)
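
    To put a number on that, here is a small sketch of how much probability a Gaussian puts below zero, using P(X < 0) = 0.5 * erfc(mean / (std * sqrt(2))). The helper name `prob_negative` is just for illustration.

```python
import math

def prob_negative(mean, std):
    """Probability mass a Gaussian with this mean and std assigns to x < 0."""
    return 0.5 * math.erfc(mean / (std * math.sqrt(2.0)))

# Mean expressed in standard deviations above zero (std fixed at 1).
for mean in (1.0, 2.0, 3.0, 5.0):
    print(f"mean = {mean:.0f} std above zero: P(X < 0) = {prob_negative(mean, 1.0):.3e}")
```

    With the mean only one standard deviation above zero, a sizeable fraction of the probability sits on impossible negative values; a few standard deviations further up, the leak becomes negligible.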

    Another point is that any deviations between your true distribution and the Gaussian are likely to be most noticeable at the extremes. Studies of extreme values should not use Gaussian distributions unless there is good theoretical and/or experimental support for doing so. Assuming a Gaussian distribution of heights is fine if you want to sell trousers, but is likely very poor if you want to recruit basketball players or jockeys.

  147. can we make money off of this by Anonymous Coward · · Score: 0

    does the new curve apply to the rolling of two six sided dice and if so how can one use it to make money at the craps table?

  148. What is really new in that? by Anonymous Coward · · Score: 0

    Could someone tell me what is really new in that? Looking at those articles (examples), the presented curves are a simple convolution of the Gaussian function and phase space (sorry for the slang; probability density?).

  149. Re:Sceptic in Slashdotia by Royster · · Score: 1

    What these guys are saying is that they have discovered a series of apparently unrelated problems that appear to share the same, non-gaussian distribution.

    Processes driven by the mechanism that gives rise to this distribution are not Gaussian but slightly skewed in their outliers. I'm waiting to read something more technical before I conclude that they are full of shit.

    In my own field (financial modelling) we don't have a good distribution for stock market activity. It's usually modelled lognormal, but it is clear that there are too many big drop days for the true distribution to be lognormal. People try to patch lognormal distributions, but the results are mixed. We use the lognormal distribution because it's the best one that we've got realizing that we're not going to catch those big drop days in the models. If a new distribution has a reasonable physical model underlying it with parameters that are readily estimable, you can bet that we'll use it. But we just can't tell from *this* article what exactly they've found here.

    --
    I have discovered a truly marvelous sig, unfortunately the sig limit is too small to contain i
  150. Never heard of the Gaussian distribution by Zoinks · · Score: 1
    Is it based on RedHat? Does it come with KDE?

    ;-)

  151. Sceptic in Slashdotia by Enoch+Root · · Score: 4
    Alright. I don't buy it.

    The problem here is how you define and measure a rare occurrence. Let me give you an example.

    Let's say one night you watch the results of the lottery on TV, and the numbers '1-2-3-4-5-6' come up. Is that a rare occurrence? No. That sequence is as likely to occur as your birthday and your girlfriend's birthday combined into esoteric equations.

    Example number 2: I'm with this girl one night. I say my astrological sign is Scorpio. "Really!" she exclaims, "I'm Scorpio too!" What are the probabilities of that happening? 1/144? No, just 1/12. At one point (and cryptos will be familiar with this) if you add people, it becomes a rare event that you do not find people with the same sign.
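
    A quick sketch of that calculation, assuming all 12 signs are equally likely (not quite true for real birthdays). The function name is made up; the logic is the same as the classic birthday problem.

```python
from fractions import Fraction

def prob_all_different(people, signs=12):
    """Probability that `people` randomly chosen persons all have different signs."""
    if people > signs:
        return Fraction(0)
    prob = Fraction(1)
    for i in range(people):
        prob *= Fraction(signs - i, signs)
    return prob

for n in (2, 3, 4, 5, 8, 13):
    shared = 1 - prob_all_different(n)
    print(f"{n:>2} people: P(at least two share a sign) = {float(shared):.3f}")
```

    For two people the chance of a match is 1/12, and under this assumption it passes one half once five people are in the room.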

    All that graph is showing me is that the guys (I'm hesitating to call them scientists - I mean, they published in "serious papers"? Come on. Names, please) looked purposefully for freak occurrences, discarding other "rare" occurrences that were perfectly normal. That's why the left side of the graph is wider.

    Thing is, the Gaussian curve doesn't come out of nowhere; it's not arbitrary. For instance, in statistical mechanics and quantum mechanics, you get bell curve distributions precisely because of the distribution of particle states.

    All these guys are saying is, "rare events are not as rare as we think they are". That's not because the bell curve is wrong, it's because we seem to forget how huge a sample the Earth provides.

    What are the odds of being struck by lightning twice? One in a billion? We're 6 billion on this Earth. It's bound to happen to someone. Same thing with winning the grand prize lottery once or twice.
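
    A back-of-the-envelope sketch of that argument, taking the one-in-a-billion odds and the 6 billion population at face value and assuming people are independent. Both figures are the rough ones quoted above, not measured rates.

```python
import math

def prob_at_least_one(p, n):
    """P(at least one success in n independent trials, each with probability p)."""
    # log1p/expm1 keep precision when p is tiny and n is huge.
    return -math.expm1(n * math.log1p(-p))

P_EVENT = 1e-9               # "one in a billion", as quoted above
POPULATION = 6_000_000_000   # "6 billion", as quoted above

expected = P_EVENT * POPULATION
p_any = prob_at_least_one(P_EVENT, POPULATION)
print(f"expected number of people affected: {expected:.1f}")
print(f"P(it happens to at least one person): {p_any:.4f}")
```

    An event that is wildly improbable for any one person is close to certain to happen to somebody once the sample is the whole planet.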

    And, again, same thing with floods or tornadoes. Yes, in themselves they're rare. When taken alone they seem improbable. But on the scale of the planet, that's the kind of thing that happens.

    Alright, anyone got another article on cold fusion lying around?

    "There is no surer way to ruin a good discussion than to contaminate it with the facts."

  152. But what is it? by Anonymous Coward · · Score: 0

    What a frustrating article. There are no technical details, and no links. Does anyone know what this wonderful, new, universal curve actually is?

  153. References by Anonymous Coward · · Score: 0

    Curiously, the article seems devoid of references to any articles on this new curve -- any one out there know the journals/articles? Any hard math posted online somewhere?

  154. Bell Curve with a skew. by Anonymous Coward · · Score: 0

    Yeah, that explains one side; but specifically (I think) all they are saying is that the bell curve is skewed in some cases. BFD, this is not a major breakthrough based on my reading of the article. Perhaps the author is not explaining a real breakthrough well or a silly researcher has deluded himself into thinking he's found something new.