Why the Cloud Cannot Obscure the Scientific Method

FYI by imstanny · 2008-06-26 00:45 · Score: 0, Offtopic

Link error in story.

Re:FYI by NoTheory · 2008-06-26 00:51 · Score: 1

The original story made me cringe. I would have imagined that it would have made Norvig do the same. Hope the rebuttal hits the right points.

Here's the link:

http://arstechnica.com/news.ars/post/20080625-why-the-cloud-cannot-obscure-the-scientific-method.html

--
There are lives at stake here!
Re:FYI by Sethus · 2008-06-26 00:52 · Score: 2, Funny

The author's head is completely up in the clouds...

--
Posting with out proof reading since 2001.
Re:FYI by Wolfbone · 2008-06-26 01:22 · Score: 1

The original story made me cringe

Hehe! Yes indeed - it reminded me of this: http://www.youtube.com/watch?v=C0c5yClip4o
Re:FYI by Thiez · 2008-06-26 02:32 · Score: 1

That video is HORRIBLE. That thing will give me nightmares for days.
Please someone put that woman out of her misery.
Re:FYI by mckorr · 2008-06-26 02:47 · Score: 1

A shame we can't drop that idiot on a neutron star, then let her tell us that mass isn't important...
Re:FYI by CaptDeuce · 2008-06-26 02:57 · Score: 1

The author's head is completely up in the clouds...

Rather than a meteorological reference as to the location of his head, may I make a suggestion that is biological -- or more specifically, anatomical.

--
"Where's my other sock?" - A. Einstein
Re:FYI by Anonymous Coward · 2008-06-26 03:29 · Score: 1, Insightful

The worst part would be that we can 'leave out mass' in the E=mc^2 formula because the total amount of mass in the universe is so tiny (it isn't?).
So let's assume mass = 0 for all things (even though that makes no sense at all, she thinks it does). That means E = 0*c^2 = 0 -> there is no energy. Since she claims homeopathy works by changing your energy, AND that people have no energy (because they have no mass (?!) and Einstein's formula E=mc^2 applies), homeopathy cannot work.
Re:FYI by Anonymous Coward · 2008-06-26 08:49 · Score: 0

Thank you, the video was very informative.

datasource != process by Bandman · 2008-06-26 00:45 · Score: 5, Insightful

Because a datasource isn't a process?

--
Check out my sysadmin blog!

Re:datasource != process by Anonymous Coward · 2008-06-26 01:24 · Score: 0

Additionally ice cream != Paris Hilton's cleavage.
(Though, I'd like to lick that instead..)
Re:datasource != process by jeiler · 2008-06-26 01:40 · Score: 1

Paris Hilton has cleavage? A couple of bandaids and some Clearasil would fix that!

--
If you haven't been down-modded lately, you aren't trying.
Sacred cows make the best hamburger.
Re:datasource != process by bitocul · 2008-06-26 09:33 · Score: 1

babe that's so funny, hehe... i think you're missing an important point. That's why you might want to reconsider your argumentation and give more insight into your thinking process.

Where's the link? by Anonymous Coward · 2008-06-26 00:46 · Score: 0

Where's the link to the Ars Technica story?

Re:Where's the link? by Breakfast+Cereal · 2008-06-26 00:50 · Score: 1

The Cloud was supposed to take care of details like that, but
Re:Where's the link? by aproposofwhat · 2008-06-26 00:52 · Score: 1

My bad - should have checked the links better :o(

--
One swallow does not a fellatrix make
Re:Where's the link? by sveard · 2008-06-26 02:01 · Score: 1

Nah that's why we have editors
oh wait... I guess not

Link by Anonymous Coward · 2008-06-26 00:49 · Score: 0

http://arstechnica.com/news.ars/post/20080625-why-the-cloud-cannot-obscure-the-scientific-method.html

missing link by lhorn · 2008-06-26 00:51 · Score: 4, Insightful

http://arstechnica.com/news.ars/post/20080625-why-the-cloud-cannot-obscure-the-scientific-method.html
I like the fact that the web and search/aggregate engines may combine vast amounts of data in ways we now
cannot imagine - it expands the field for new scientific research enormously. Replace science? No.

--
accept no limits but time

Re:missing link by kalirion · 2008-06-26 02:02 · Score: 3, Funny

What, you mean I can't just google for "unified field theory" and get the right answer? Why does the universe have to be so hard?????
Re:missing link by ScrewMaster · 2008-06-28 03:16 · Score: 1

What I can't figure out is why understanding science is so hard for some people.

--
The higher the technology, the sharper that two-edged sword.

Why the Cloud CAN Obscure the Scientific Method by sm62704 · 2008-06-26 00:55 · Score: 1

Crack cocaine makes you stupid.

Oh, you were talking about the "information cloud" the crackheads at Wired always talk about. Never mind.

--
mcgrew's razor: Never attribute to stupidity that which can be explained by greedy self-interest

Re:Why the Cloud CAN Obscure the Scientific Method by ceoyoyo · 2008-06-26 03:14 · Score: 1

I think I figured out why they use "the cloud." Obviously all the good patents for "... on the Internet" have been taken, so they're just making possible a new round of frivolous patents with the phrase "... in the cloud."

Bullshit bingo by Anonymous Coward · 2008-06-26 00:58 · Score: 5, Funny

Latest addition to bullshit bingo cards:

CLOUD

It's a good rebuttal by Hoplite3 · 2008-06-26 01:01 · Score: 5, Insightful

I'd say that the models are the science. They're how you explain your data. They provide evidence that the experiments make sense, and they guide you by making predictions you can test.

Moreover, SIMPLIFIED MODELS are good science. Understanding which details can be omitted without impacting the predictive ability of your model shows you know which effects are important and which aren't.

--
Use the Firehose to mod down Second Life stories!

More Google marketing by 12357bd · 2008-06-26 01:07 · Score: 0, Offtopic

another obvious history.

I am sorry Google, but your ad business model will be terminated by random page requests. It is already happening, no 'pseudo' articles will help.

--
What's in a sig?

Data Deluge Since Davinci by Doc+Ruby · 2008-06-26 01:13 · Score: 0, Redundant

Leonardo da Vinci is reputed to be the last person who "knew everything" that there was to know during their lifetime. Even that wasn't true. But the scientific method has been the key to both creating and coping with a "data deluge".

Science suffers when there's too little data: scientists then must generate more by observation, or do something else that isn't science (and doesn't work nearly as well). Too much data is only a problem if you're willing to settle for imprecise/inaccurate results. I'm sure there are a lot more lazy scientists now than in Leonardo's time, just with the inflation of the scientist population, but that doesn't mean we should dumb down scientists who just want to own a computer that spits out answers to the data they put in.

--
make install -not war

Correlation is not causation by tist · 2008-06-26 01:14 · Score: 4, Informative

A large source of data that has a correlation does not somehow imply causation. Even if it works under some conditions (or even all conditions). The science happens when the causation is determined and then applied.

Re:Correlation is not causation by damburger · 2008-06-26 01:18 · Score: 1

Yup. Mathematicians gushing about clouds and implying they have made science obsolete need to have that branded on their butts then be sent back to the mathematics department. They've already done quite enough giving us string theory (look! it's internally consistent! it sounds cool! ergo it's real!)

--
If we can put a man on the moon, why can't we shoot people for Apollo-related non-sequiturs?
Re:Correlation is not causation by maxume · 2008-06-26 01:24 · Score: 1

Of course correlation implies causation. When things are correlated, it is often a good place to look for causation. That's exactly what "imply" means.
Correlation doesn't *prove* causation.
There is a difference.

--
Nerd rage is the funniest rage.
Re:Correlation is not causation by damburger · 2008-06-26 01:38 · Score: 3, Informative

Wrong - imply has a very specific meaning to mathematicians and scientists. 'A implies B' means that if A is true, B MUST be true also.

--
If we can put a man on the moon, why can't we shoot people for Apollo-related non-sequiturs?
Re:Correlation is not causation by Anonymous Coward · 2008-06-26 01:39 · Score: 0

'Imply' can mean either entail or suggest. In the statement 'correlation does not imply causation', it's used to mean entail. Correlation suggests causation, but does not entail causation.

Dictionaries are your friends.
Re:Correlation is not causation by OeLeWaPpErKe · 2008-06-26 01:51 · Score: 1

Actually there is a statistical concept "causation" as well.
So yes, correlation does not imply causation. The reverse is true, though: causation implies correlation. There is only one mathematical relation between "things that correlate" and "causes" that supports this outcome : intersection. All causes correlate.
So you only need another mathematical property of causation, take the intersection of the concepts and there you'll have a much more precise source for causation.
You could also simply take the temporal aspect in time series. Increases and decreases of solar output can be shown to correlate with temperatures a few months later (and the sun has entered a low-activity cycle, so temp is going to drop, no matter what the goracle (I want what he's smoking) says). This means that solar output causes temperature (doesn't mean it's the sole cause obviously, let's not get into it, it does explain over 98% of temp. variation though). If correlation occurs with a temporal shift, it is trivially simple to separate cause and effect.
There are other properties that imply causation as opposed to correlation : you can already see the concept in Bayesian theory. Every Bayesian spam filter "knows" that an occurrence of "viagra" causes spam.
The problem is much simpler : everybody (well for the moment esp. the UN and specifically the goracle) wants to politicize science.
But the scientific attitude "we doubt everything" (that means that if the earth surface temperature rises to 5000 degrees after we double CO2 output, that the scientific response to "does CO2 cause temp rises ?" remains "we doubt it"), is the very antithesis of policy. We don't know how the climate responds to CO2. For the moment we don't know at all.
This is, to say the least, not what Obamatons want to hear.
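On the Bayesian spam filter mentioned above: a minimal sketch of what such a filter computes for a single word, assuming made-up message counts (the function name and all numbers are hypothetical). It yields the conditional probability P(spam | word), which is a statement of association; whether that deserves the word "causes" is exactly what this thread is arguing about.

```python
def p_spam_given_word(spam_with_word, spam_total, ham_with_word, ham_total):
    """Bayes' theorem: P(spam | word) estimated from message counts."""
    p_spam = spam_total / (spam_total + ham_total)   # prior P(spam)
    p_ham = 1.0 - p_spam                             # prior P(ham)
    p_word_given_spam = spam_with_word / spam_total  # likelihood in spam
    p_word_given_ham = ham_with_word / ham_total     # likelihood in ham
    numerator = p_word_given_spam * p_spam
    evidence = numerator + p_word_given_ham * p_ham  # P(word)
    return numerator / evidence


# Hypothetical counts, for illustration only.
posterior = p_spam_given_word(spam_with_word=300, spam_total=1000,
                              ham_with_word=5, ham_total=4000)
print(f"P(spam | word) = {posterior:.3f}")
```

The filter never needs to know why the word shows up in spam; it only needs the counts.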
Re:Correlation is not causation by NewbieProgrammerMan · 2008-06-26 01:52 · Score: 2, Informative

Hey, don't try to pin all that stuff on mathematicians: the original cloud-gushing author, Chris Anderson, says, "background is in science, starting with studying physics and doing research at Los Alamos."

--
[b.belong('us') for b in bases if b.owner() == 'you']
Re:Correlation is not causation by maxume · 2008-06-26 01:57 · Score: 2, Interesting

Fine. I'll try to restate my point using more specific language.
The fact that correlation does not imply causation isn't nearly as troublesome as the volume of "Remember folks correlation!=causation" would have us believe; lacking other evidence, it is a reasonable assumption to start with.

--
Nerd rage is the funniest rage.
Re:Correlation is not causation by damburger · 2008-06-26 01:59 · Score: 2, Interesting

But nobody said that here, so your whole point is a strawman. I think it's safe to assume that nobody on /. thinks correlation!=causation because that would make all science impossible.

--
If we can put a man on the moon, why can't we shoot people for Apollo-related non-sequiturs?
Re:Correlation is not causation by eli+pabst · 2008-06-26 02:13 · Score: 3, Interesting

You're exactly right. In fact if anything, science has started moving *away* from the kind of purely computational and statistical correlations that you get through data mining. Granted they are extremely important for generating hypotheses, but journals are much less likely to accept a paper without some kind of experimental validation.

The large scale genetic association studies are a great example. There was a time when you could publish a paper solely describing a correlation between a variant in gene X and its association with disease Y. However, because of the way we do statistics in science, sooner or later you'll find a statistically significant correlation simply due to chance alone. In fact the epidemiologist John Ioannidis wrote an article about this (that I believe appeared on Slashdot as well). Now you're often required to show some kind of experimental validation that there is a biological basis that verifies the statistical correlation. The scientific method is not going away anytime soon.
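The "significant by chance alone" point can be sketched with a small simulation. The assumptions are mine, not the poster's: 1000 random "variants" that by construction have nothing to do with a random "disease" score, 20 subjects each, and a cutoff of |r| > 0.444, which is roughly the two-sided 5% threshold for a Pearson correlation at that sample size.

```python
import math
import random


def pearson_r(xs, ys):
    """Plain Pearson correlation coefficient."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def count_chance_hits(num_variants=1000, num_subjects=20,
                      r_critical=0.444, seed=0):
    """Count unrelated variants whose correlation clears the cutoff."""
    rng = random.Random(seed)
    disease = [rng.gauss(0, 1) for _ in range(num_subjects)]
    hits = 0
    for _ in range(num_variants):
        # Each variant is pure noise, independent of the disease score.
        variant = [rng.gauss(0, 1) for _ in range(num_subjects)]
        if abs(pearson_r(variant, disease)) > r_critical:
            hits += 1
    return hits


hits = count_chance_hits()
print(f"{hits} of 1000 unrelated variants look 'significant'")
```

Around 5% of the noise variants should clear the bar, which is why a correlation found by screening thousands of candidates needs independent experimental validation.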
Re:Correlation is not causation by maxume · 2008-06-26 02:18 · Score: 1

People say it all the time.

--
Nerd rage is the funniest rage.
Re:Correlation is not causation by damburger · 2008-06-26 02:21 · Score: 1

Whatever. You waded in saying
Of course correlation implies causation
and then when I pointed out the flaw in your argument you backtracked. Nobody said correlation!=causation, now saying
People say it all the time.
just makes you sound like you won't admit you are wrong.

--
If we can put a man on the moon, why can't we shoot people for Apollo-related non-sequiturs?
Re:Correlation is not causation by zeromorph · 2008-06-26 02:29 · Score: 1

... and correct if you ask some logicians and linguists. For us, imply means "something meant although not said but (through different mechanisms) conveyed" and entail means "if A is true, B MUST be true also".

--
"Hannibal's plans never work right. They just work." Amy/A-Team
Re:Correlation is not causation by maxume · 2008-06-26 02:30 · Score: 1

I admitted that I wasn't using precise language. As the AC that also replied to me pointed out, imply does happen to mean suggest in normal usage.
All the time was apparently an overstatement, but look at the tone surrounding that exact phrase:
http://www.google.com/search?hl=en&q=%22correlation!%3Dcausation%22+site%3Aslashdot.org
and the words:
http://www.google.com/search?hl=en&q=correlation+causation+site%3Aslashdot.org

--
Nerd rage is the funniest rage.
Re:Correlation is not causation by nodrogluap · 2008-06-26 02:39 · Score: 1

The correlation != causation tag is usually applied because either:
1. There are obvious confounding factors the article fails to mention
2. There's a good chance the direction of the arrow of causation is incorrect. e.g. just because firemen tend to be where you see big fires, doesn't mean they cause them. Or perhaps less obviously, aluminum doesn't cause Alzheimer's, it builds up in the brain as a consequence of Alzheimer's.
Statistical inferences are only as good as the data available to you, and you need theories to drive the data collection...that's where the original article's logic fails.
A valid point the article could have made in the biological sciences is that we are returning (including my research group) to an observation-driven approach rather than a theory-driven approach to initial experimental design. What do I mean? For probably the last 50 years, you needed to have a specific target (e.g. a given protein, mRNA, etc.) in order to test for its presence or concentration, etc. So you came up with a theory for the condition of interest, and tested for the target it implied. With new techniques, you can test an experimental condition for many thousands of targets at once, allowing you to build theories from the observations, then design more experiments to confirm the theories you developed. Science is not dead, it just has a better leg up now.
Re:Correlation is not causation by maxume · 2008-06-26 02:43 · Score: 1

The article makes the mistake of assuming that new methods that can be used when you have bigger piles of information will make the old methods less powerful. As you say, it is often the case that they can be used together, resulting in faster/better/cheaper results.

--
Nerd rage is the funniest rage.
Re:Correlation is not causation by zacronos · 2008-06-26 02:46 · Score: 1

If correlation occurs with a temporal shift, it is trivially simple to separate cause and effect.

I have to disagree with that -- it's kinda correct, but I think it oversimplifies and misses some situations. (Note that I'm talking about the general case, not your solar output example in particular.)

As one example, imagine someone without an understanding of the physics of weather discovered that, at least 10 minutes prior to the arrival of any major thunderstorm, all birds in a particular forest stopped chirping and sought shelter. And in fact, every observed time that the birds stopped chirping and sought shelter, a major thunderstorm occurred. A naive application of your statement implies that the storm could not have caused the birds to seek shelter, since they happened in the wrong order. In fact, might it be possible that birds are the cause of thunderstorms? Perhaps the immediate cessation of all flying by the birds in the forest somehow triggers the thunderstorm by changing the flow of air? This is the sort of mistake the ancients would make -- assuming that because the observed phenomena happen in a certain order, the earlier observed event is the cause. The problem here, of course, is that the pressure drop preceding a major thunderstorm happens before the birds seek shelter, but if that isn't observed, the order seems backwards.

Another place this breaks down is where 2 events are correlated, but neither causes the other; instead, it is possible both have a common cause. Imagine this (very contrived) example: every time Bob presses a certain button, a bell rings in Rover the dog's doghouse. Rover has been trained to go fetch the newspaper and bring it to you whenever that bell rings. However, whenever Bob presses his button, 5 minutes later there is another effect -- a light comes on in your room. If it takes Rover no more than 2-3 minutes to bring you the newspaper, you will observe that, 2-3 minutes after Rover brings you the newspaper every morning, a light comes on in your room. Does Rover bringing you the newspaper trigger the light? No, not at all.

Correlation plus temporal shift does not equal causation.
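The Bob/Rover example can be sketched as a toy simulation. Assumptions for illustration: Bob presses the button on a random half of the mornings, and "intervening" means someone hand-delivers the newspaper every day regardless of the button. All names are hypothetical.

```python
import random


def simulate(days, press_prob, force_newspaper, rng):
    """Return one (newspaper_arrived, light_came_on) pair per morning."""
    records = []
    for _ in range(days):
        pressed = rng.random() < press_prob
        # Normally the newspaper arrives only because the bell rang.
        newspaper = True if force_newspaper else pressed
        # The light depends on the button alone, never on the newspaper.
        light = pressed
        records.append((newspaper, light))
    return records


def light_rate_when_newspaper(records):
    """Fraction of newspaper mornings on which the light came on."""
    lights = [light for newspaper, light in records if newspaper]
    return sum(lights) / len(lights)


rng = random.Random(1)
observed = simulate(1000, 0.5, force_newspaper=False, rng=rng)
intervened = simulate(1000, 0.5, force_newspaper=True, rng=rng)

print("watching only:  ", light_rate_when_newspaper(observed))
print("after intervene:", light_rate_when_newspaper(intervened))
```

Passive observation shows the light following the newspaper every single time; delivering the newspaper by hand breaks the pattern, which is the kind of controlled test that a pile of logged data cannot run for you.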
Re:Correlation is not causation by 99BottlesOfBeerInMyF · 2008-06-26 03:08 · Score: 2, Insightful

In science, the phrase usually used is "correlation does not imply a specific causation." It does, of course, imply some causation, and most of modern science is noticing correlations and testing for causation.
Re:Correlation is not causation by Anonymous Coward · 2008-06-26 03:09 · Score: 0

Better to say "Causation implies correlation". That is, correlation is necessary to causation, but not sufficient for it.
Re:Correlation is not causation by OeLeWaPpErKe · 2008-06-26 03:10 · Score: 1

Yes but those birds and the thunderstorm do have a very important connection :
these events SHARE CAUSES. This is true for your second example as well. They would never satisfy the second part of the causation demand : A correlates with B (with a timeshift) but B never decorrelates with A (with or without a timeshift).
In other words : it is a specific type of deviation in correlation that implies causation in statistical data.
Re:Correlation is not causation by mckorr · 2008-06-26 03:14 · Score: 2, Interesting

I'm a mathematician, and I have never heard a colleague make the claim that science is obsolete.
Mathematics is the language of science, and there has never been an advancement in either one without an accompanying advance in the other.
A mathematician might "gush" about clouds of data, and work on the mathematics of it, but if he insisted it made science obsolete he'd be tossed out on his ear.
Oh, and string theory? That was the physicists. The mathematicians were pissed off that someone found a use for topology, which we considered pure mathematics for its own sake and unconnected to the real world. Damned physicists ruined our fun.
Re:Correlation is not causation by mckorr · 2008-06-26 03:26 · Score: 1

Correlation does imply causation, but it does not prove it. It is possible for items of data to correlate, but have unrelated causes. There really are coincidences.
Let's see if I can put this in symbolic logic for you. Two data sets, A and B :
A --> C              (if A then C)
B --> C              (if B then C)
Therefore A --> B    (error)
This is a logical error. Both A and B correlate to C, but insisting that that means that A and B correlate to each other does not pass rigor. Science is needed to prove that the two data sets have the same cause.
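The invalid step above can be checked mechanically. A minimal sketch, assuming A, B and C are treated as plain true/false propositions and "-->" as material implication (a simplification of the post, which speaks of data sets and correlation): enumerate every truth assignment and keep those where both premises hold but the conclusion fails.

```python
from itertools import product


def implies(p, q):
    """Material implication: false only when p is true and q is false."""
    return (not p) or q


# Assignments where A --> C and B --> C hold, yet A --> B does not.
counterexamples = [
    (a, b, c)
    for a, b, c in product([True, False], repeat=3)
    if implies(a, c) and implies(b, c) and not implies(a, b)
]
print(counterexamples)
```

A single surviving assignment is enough to show that the conclusion does not follow from the premises.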
Re:Correlation is not causation by mckorr · 2008-06-26 03:30 · Score: 1

And that is exactly what you want. Data mining can show a correlation between genes X and Y, but that doesn't tell you how to fix it. For that you need the scientific method.
Re:Correlation is not causation by maxume · 2008-06-26 03:32 · Score: 1

Dude:
http://slashdot.org/comments.pl?sid=596003&cid=23947925

--
Nerd rage is the funniest rage.
Re:Correlation is not causation by ceoyoyo · 2008-06-26 03:49 · Score: 1

Yes, it does imply causation. Just not necessarily the obvious one. The correlation != causation meme is technically accurate, but the writer of the previous article, as do so many people here, managed to screw it up completely by assuming that a correlation between two associated factors that is not a causal relationship between those factors is coincidence. It isn't. For a sufficiently strong correlation it implies a causal relationship between those two factors and a third factor.
Re:Correlation is not causation by paratiritis · 2008-06-26 06:23 · Score: 1

If correlation occurs with a temporal shift, it is trivially simple to separate cause and effect.

Clocks A and B strike every hour.
Clock A strikes the hour. 10 seconds later clock B does.
1 hour later: Clock A chimes the hour. 10 seconds later clock B does.
2 hours later: ....
Does clock A cause clock B to strike? We have correlation and temporal shift so obviously yes. Or not?
Re:Correlation is not causation by RadioElectric · 2008-06-26 06:42 · Score: 1

"Or perhaps less obviously, aluminum doesn't cause Alzheimer's, it builds up in the brain as a consequence of Alzheimer's."

Do you have a source for this?
Re:Correlation is not causation by zacronos · 2008-06-26 07:03 · Score: 1

Yes but those birds and the thunderstorm do have a very important connection : these events SHARE CAUSES. This is true for your second example as well.

Yes, exactly, that's what I was getting at.

They would never satisfy the second part of the causation demand : A correlates with B (with a timeshift) but B never decorrelates with A (with or without a timeshift).

You said "If correlation occurs with a temporal shift, it is trivially simple to separate cause and effect." If you were implying additional constraints, that was lost on me. However, that still doesn't refute my examples. (BTW, "decorrelate" doesn't seem to be a word I can find in an online dictionary. I'll assume "B never decorrelates with A" means "B never occurs without A".) Unfortunately, if there is an event that has multiple causes, then you can have the effect without the cause. Similarly, if there is an event with multiple observed effects (only one of which will occur for a given instance of the cause, based on other, possibly unobserved circumstances), then you can have the cause without the single effect we're investigating.

I'm not saying you don't understand all these things, I'm just pointing out that you have grossly oversimplified the situation. "Proving" causation through merely statistical techniques is much trickier than you make it sound, especially when you can't do tests of your own to control other variables -- which brings us back to the topic of the discussion: you still need the scientific method in order to gather the data necessary to draw proper conclusions, you can't just look at a data cloud and use whatever it gives you unless that's your only option.
Re:Correlation is not causation by nodrogluap · 2008-06-26 09:09 · Score: 1

There is ongoing debate within the Alzheimer's community about the role of aluminum, even 28 years after the correlation was found. This is yet another example of why the "cloud" argument of the original article is bunk. Aluminum is probably no worse an aggravator of amyloid-beta plaque formation than other common metals such as iron:

The impact of aluminum on AD via oxidative stress
may be the same at that seen in iron or copper intake,
or any other oxidative stressors.*
*Takashima, A. (2007). "Does Aluminum Contribute to Alzheimer Disease Directly, Indirectly, or At All?" Journal of Alzheimer's Disease 11(4):431-432.

Interestingly, AD [Alzheimer's Disease] and Down's syndrome are linked in that in both conditions
the gastrointestinal absorption of Al is significantly increased (Moore et al.,
2000; Moore et al, 1997). Whether the increased production of Ap in both
of these diseases is as a consequence of a higher body burden of Al or vice
versa is not known.^
^Exley, C. (2005) "The aluminium-amyloid cascade hypothesis and Alzheimer's disease" Subcellular Biochemistry 38:225-234.
There is some evidence that aluminum prevents the breakdown of plaques, etc., etc. so it may be that there is a positive feedback loop. In any case, the arrow of causation is not always the "obvious" choice, or easily discernible (even after 28 years!).
Re:Correlation is not causation by RadioElectric · 2008-06-26 09:15 · Score: 1

I'm familiar with the research you've presented here; Dr Exley actually happens to be my research supervisor. I just felt that you'd presented quite a controversial area in a bit of a glib fashion.

All models are wrong, but .... by gopla · 2008-06-26 01:16 · Score: 4, Insightful

All models are wrong, but some are useful.

We still need scientific methods to develop useful models and understand and refine the existing models. When Newton defined his mechanics that was the state of the art in his era, and now we have progressed to quantum mechanics which might be refined tomorrow.

But mere observation of some phenomena is not sufficient to postulate the behaviour in a changed condition. A scientific model and its rigorous application is required for this. Correlations drawn from the cloud cannot substitute it.

gopla

Re:All models are wrong, but .... by 99BottlesOfBeerInMyF · 2008-06-26 03:10 · Score: 4, Insightful

All models are wrong, but some are useful.
All models are wrong, to some degree. A better way to put it is all models are imprecise, but some are precise enough to be useful. 'Wrong' is a very flexible word and can easily lead to a misunderstanding in this context.
Re:All models are wrong, but .... by PCM2 · 2008-06-26 08:52 · Score: 1

all models are imprecise

I'm not sure it's even helpful to state it this way. It starts to sound like what you're saying is that a model is not a precise representation of real life, but rather is a simplified representation designed to make it easier to extract pertinent data. Mind you, I'm not trying to put words into your mouth or anything...

--
Breakfast served all day!
Re:All models are wrong, but .... by Anonymous Coward · 2008-06-26 10:21 · Score: 0

Uh... he was quoting George Box, I don't think you want to get in the business of correcting George Box on matters related to statistics.
Re:All models are wrong, but .... by Shadowlore · 2008-06-26 10:59 · Score: 1

"All models are wrong, to some degree." == All models are wrong. Either they are wrong or they are not wrong.
Precision is not the implication, correctness is. A model is a model because it is incorrect in some way - it is an approximation. "Only a little wrong" does not make it not wrong.
Buffalo buffalo buffalo.
The reason the distinction of all models being wrong is important is to limit people believing the model is the real world. Far, far too many "scientists" these days do all of their work in models and believe they are doing real world stuff. Then the real world occasionally slaps them upside the head with the fact that they were wrong. Unfortunately many vocal ones seem to not then realize the model is wrong and is no substitute for the real world.
Not unlike many among the /. community with regards to their models of women.

--
My Suburban burns less gasoline than your Prius.

Don't blame the author's incompetence by ruin20 · 2008-06-26 01:19 · Score: 2, Interesting

The point of the last story was horribly miscommunicated. There were two main points. The first is that data is expanding in such scope that hierarchal organization systems don't work, and the second is that we're approaching a time where the method or analysis of data to show causation will come from correlation, because you can determine all the variances due to the fact that all the variables have been accounted for. Look at the human genome project or folding at home. I don't think this is completely true, but lets not bash the idea or miss the point just cause the original author's a complete bumbling moron.

--
Oh honey look... How cute... an angry slashdotter!

Re:Don't blame the author's incompetence by phobos13013 · 2008-06-26 01:39 · Score: 3, Insightful

You seem to be missing a fundamental flaw in the argument. No matter how many parameters you account for a) you can never account for ALL parameters of this system we call life (if for no other reason, there may well be some we don't know about yet!), and b) most importantly, even if you DO have all the parameters and the results show a correlation, there is no logical jump one can make that says it is the cause of the observed behavior.

Truly what yesterday's article was saying is that causation or correlation is meaningless if you have a mimic of the real world in the form of a collection of data. You don't need a model that is accurate or valid or anything. You just need to run the data in the exact replica of reality. This is the simulacrum. The first problem is that data does not just run itself. At the least it needs an algorithm to be processed to a result. That's the model; without it, it's just useless data, which was already mentioned in yesterday's comments. But second, the problem with even ATTEMPTING such an idea is that you lead yourself into a situation where you "predict" the future and then operate to become that future, thus destroying the creative nature of humanity and becoming the self-fulfilling prophecy of machine code!

Keep in mind I speak mostly of social sciences that try to pattern human behavior. For hard sciences, etc., all you have done is created a simulation of reality, but it tells you nothing about the reality. It merely mimics it. There is no insight into creating a map the size of the United States, at best it is a work of art.

--
...and it should be known by now
Re:Don't blame the author's incompetence by oh_my_080980980 · 2008-06-26 01:41 · Score: 0, Flamebait

"but lets not bash the idea or miss the point just cause the original author's a complete bumbling moron."

No, but you are if you think that was the point of the article. First, nowhere did the author speak of "hierarchal organization systems." In fact what are "hierarchal organization systems?" Are you speaking of XML? Object Oriented databases? But regardless that was not his point.

Second, "...analysis of data to show causation will come from correlation," is gibberish. It means nothing. It underscores your profound lack of knowledge of how scientific experiments are conducted, how data from experiments are collected and analyzed. In fact you are very much like the Wired author who speaks without knowing.

You might want to read books on Statistics.
Re:Don't blame the author's incompetence by ruin20 · 2008-06-30 04:17 · Score: 1

Thank you for the dissertation (although I agree, it's such a common stance on slashdot, and supported by the submitted article, I felt the need to push the contrary position)
Despite the fact that I agree with you, I do believe the author makes a point. I'm a design engineer and I look at this similarly to the sensitivity matrices that we make when dealing with the design space.

In other words, the typical discovery cycle is to observe, hypothesize, challenge, validate. Now, the idea in the article is that with current data collection techniques and the current rate of testing, when we observe a correlation we can use the simpler hypothesis of A->B and validate that. We don't have to wait for the why, as large numbers of tests can show this to a high degree of statistical certainty. If we have a sufficiently discrete set of A's and matching B's we can form a regression that can be used to interpolate across the gaps.

Essentially the purpose of a model is to relate inputs to outputs. What makes a model good is when its outputs for a given set of inputs closely match the outputs of the real system. That is the measure of success of a model. And a model working does not mean it is correct, it simply means that it works. The success of a model does not prove that there is no other model that might also work.

Essentially what's being said here is that due to the large amount of experimentation being performed we can back out from the data a matrix that relates each input to all the outputs proportionally. The matrix might adjust with the values of the inputs but still, we can derive a sensitivity to each input based on experimentation. The matrix is the model, and there is no theory involved in its creation other than the assumption that all the state is completely documented.
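
As a rough illustration of backing a sensitivity matrix out of experiments, here is a minimal Python sketch. It assumes we can rerun the system with one input nudged at a time; `black_box`, `estimate_sensitivity`, and the step size are hypothetical names and values picked for the example, not anything taken from the article.

```python
def estimate_sensitivity(system, baseline, step=1e-3):
    """Estimate S[i][j] = change in output i per unit change in input j.

    Each column comes from one extra "experiment" in which a single
    input is nudged by `step` while the others stay at the baseline.
    """
    base_out = system(baseline)
    n_out, n_in = len(base_out), len(baseline)
    matrix = [[0.0] * n_in for _ in range(n_out)]
    for j in range(n_in):
        perturbed = list(baseline)
        perturbed[j] += step
        out = system(perturbed)
        for i in range(n_out):
            matrix[i][j] = (out[i] - base_out[i]) / step
    return matrix


def black_box(inputs):
    """Hypothetical stand-in for a test rig: two inputs, two outputs."""
    a, b = inputs
    return [2.0 * a + 0.5 * b, a * b]


# The matrix is only valid near the baseline it was measured at.
S_low = estimate_sensitivity(black_box, [1.0, 3.0])
S_high = estimate_sensitivity(black_box, [4.0, 3.0])

for row in S_low:
    print([round(value, 6) for value in row])
```

Note that the second row differs between `S_low` and `S_high`, which is the "matrix might adjust with the values of the inputs" caveat: no theory went into building it, but it is only trustworthy around the conditions that were actually tested.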

This is already a common practice, and can be seen in things like, for example, state tables for compressed, superheated steam. There's a real gas law model that should predict the nature of steam, but no one uses it, as the tables have proven more accurate over time. And as a designer I don't really care why the steam does what it does, I just care that the tables actually match what happens in reality. They're a good mimic, and that's the point of the model.

Again, this assumes that we can test all the configurations or at least quantify and document them. Despite this being a false assumption for many systems, it is true for many others, and is especially useful when dealing within sets of knowns for design issues and optimization. I don't necessarily think it makes good science, but it does however yield valuable goods for engineering and design in the absence of theory and should not be summarily dismissed. Although I believe there is added value in forming a theoretical model based on assumptions of how the inner workings of a system function, I don't discard the value of not needing such a model in order to have a clear understanding of what outputs will occur for a given set of inputs.

--
Oh honey look... How cute... an angry slashdotter!

Nice rebuttal, bad example. by Angostura · 2008-06-26 01:20 · Score: 4, Informative

In general I'm right behind the rebuttal. However John Timmer chooses a very bad real-life example as his rebuttal champion.

He asks: ...would Anderson be willing to help test a drug that was based on a poorly understood correlation pulled out of a datamine? These days, we like our drugs to have known targets and mechanisms of action and, to get there, we need standard science.

These days we may like our drugs to have these attributes, but very often they don't. There are still quite a few medicines around that clearly work and are prescribed on that basis, but for which there is only the haziest evidence as to how exactly they work.

The good thing about the scientific method, however, is that it gives us a framework to investigate these drugs' actions - even if the explanation is still currently beyond us.

Re:Nice rebuttal, bad example. by DrJay · 2008-06-26 02:55 · Score: 1

Well, what I was trying to say is that no drug company pursues anything without knowing the molecules it targets, the role they play in the cell, etc. It's doubtful that the FDA would approve the testing of a drug if all the company came up with is "we dump it on cells, and it does X, but we have no idea why."
You're absolutely correct that this sort of knowledge isn't often that deep - we know what serotonin reuptake inhibitors do on the biochemical level, but what that means for the brain is pretty hazy. But there's still a large gap between this sort of shallow knowledge and the "well, it came out of a datamining session" level of understanding.

--
______ This mind intentionally left blank.
Re:Nice rebuttal, bad example. by Anonymous Coward · 2008-06-26 03:04 · Score: 0

My thoughts exactly, he has no idea what a screening data set looks like (often 10^7 - 10^8 data points in size). However, new medicines usually have at least some model implicit in the design of the high throughput assays which are used to find leads. For example, the model that a derived cancer cell line can tell us anything about actual cancer: vague, but still a model at some level. About half the time the model was wrong and something different and interesting is happening, but there was a model in place at some point (otherwise they wouldn't have looked for that particular trait). In fact, often these surprising results start someone looking for a mechanism, which has launched and continues to launch people's careers. Also, the FDA tends to approve drugs only when you have some clue how they work, not necessarily a predictive model, but at the "it interacts with BLAH BLAH BLAH in XYZ tissues" sort of level.
A related way to state it: don't confuse medicine with the science of medicine. They are VERY different.
The other thing at least partially missed by both articles is that real science can be done pretty differently with the cloud of data out there. I can go from idea, to model, to prediction, to rough test using existing data VERY quickly (1-4 days), then refine the model, improve predictions and design a good experiment which specifically tests the idea with a comparatively small data set. The cloud isn't the science, but it leads to the science. What we really need is a good database of published results that can parse the information like a trained scientist (in particular one that has the ability to detect incorrect conclusions and possible other explanations not written into the text). When that happens maybe the cloud will replace science and we'll all become librarians, but until then actual scientists will have to do the work.
-sk

Marketing is not a Science by phobos13013 · 2008-06-26 01:20 · Score: 4, Insightful

Truly, the whole reason someone like Mr. Anderson could claim the end of science because of data is that he is a writer, a thinker, and in large part a businessman. Businessmen do not think about Science and how to use it to come up with a method that produces a conclusion. They use information to come up with ways to illicit a reaction in people. So to him data is more important than science because he uses it for his purposes. That is marketing, and the "science" of marketing has almost always been that way.

Mr. Anderson was not prescient in any way, he was just speaking from his perspective. The only thing is that we must be careful about even considering his proposition as a valid reality worth pursuing. Not for true scientists, but from a social perspective, or it will truly be the end of science. There are some in power who are already attempting to make this happen.

That said, I almost consider responding to yesterday's article as falling for the argument. But, since it hit the /. this article is as cogent a rebuttal as one can make.

--
...and it should be known by now

Re:Marketing is not a Science by Red+Flayer · 2008-06-26 02:20 · Score: 1

to come up with ways to illicit a reaction in people
elicit == v. evoke; illicit == adj. illegal

BTW, it seemed obvious to me that he equated data discovery with scientific discovery, which is a big mistake. Adding to the sum of human knowledge is not the same as adding to the sum of human understanding, and using datamining and other automated tools for correlation determination does not in any way increase understanding.

Data discovery is about increasing knowledge. Scientific discovery is about increasing understanding.

--
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
Re:Marketing is not a Science by ceoyoyo · 2008-06-26 04:00 · Score: 1

Even from a social perspective I don't think his argument holds water. It's akin to the origin of superstition: when I make a sacrifice to the rain gods, in my experience it tends to rain. Therefore, I should believe in the rain gods.
His central example, Google, doesn't actually support his argument. Google uses an implicit model (which they carefully protect) to rank the likely relevance of search results. Then they give you a giant pile, in order of ranking, and let you sort through it. So not only does Google use a very sophisticated model but they let the searcher, and whatever model is implicit in the searcher, perform the final selection.
Re:Marketing is not a Science by Anonymous Coward · 2008-06-26 04:37 · Score: 0

One point perhaps worth bringing up is that Anderson has a degree in physics and served as an editor at the preeminent science journals Nature and Science, so his perspective may not be as skewed as most journalists'. That being said, I agree with your points.

I'm moonlighting in bioinformatics by damburger · 2008-06-26 01:24 · Score: 5, Interesting

And can back up this rebuttal with a practical example. I am a physicist, I know sod all about blood samples, or proteins, or cancer. I get a pile of mass spec data (about a billion data points or so on some days) and through binning, background subtraction, and a string of other statistical witchcraft I produce a set of peaks labeled according to intensity and significance.

This does not make me a cancer researcher. This data has to go back to the cancer guys and they have to pick out the Biomarkers and thus develop new diagnostic tests, based on principles that I don't understand. I am master of the information but entirely blind as far as the science is concerned. Same goes for google.

--
If we can put a man on the moon, why can't we shoot people for Apollo-related non-sequiturs?

Re:I'm moonlighting in bioinformatics by ceoyoyo · 2008-06-26 04:05 · Score: 1

I'm a computer scientist who was morphed over the last six years into a biomedical researcher. As a computer scientist I can do all kinds of things to an image, including a bunch of statistical magic to tease out any patterns in the database. As a biomedical researcher I know that many of those associations are going to be due to the way the image was collected, or otherwise irrelevant features of the patient. Some may even be introduced by my processing and statistical methods.
Re:I'm moonlighting in bioinformatics by damn_registrars · 2008-06-26 05:38 · Score: 1

Often you are exactly the type of help that we so badly need in bioinformatics and proteomics. You know how to deal with the data in a non-biased manner.

As a biochemist myself, I know that it is far too easy to approach a data set knowing what a given m/z corresponds to, and then choose the data grooming strategy that most favors that peak. And since we don't really have truly "standard" algorithms for approaching proteomics mass spec data, we need people who know the fundamentals of the techniques well enough to keep us researchers from falling into our own pits.

--
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.

Duh! by es330td · 2008-06-26 01:26 · Score: 5, Insightful

When I read the original article my thought was that someone was just trying to write something to get noticed. The Scientific method, IMHO, is all about a person or group of persons using a logical process to determine the validity of an idea. Observing massive amounts of data can reveal relationships that may not have been noticed in other ways, but at the end of the day the process of "I think X, I wonder if it is true", the heart of the scientific method, can no sooner become obsolete than we can stop being human. The questions of What, Why and How are so fundamental to humans as humans that nothing short of total omniscience will ever replace the logical process represented by the scientific method.

Re:Duh! by 12357bd · 2008-06-26 04:49 · Score: 1

The questions of What, Why and How are so fundamental to humans as humans that nothing short of total omniscience will ever replace the logical process represented by the scientific method.

There's a lot of 'faith' in this statement.

1) Human > logical being
2) Logic > Science

So: I am sorry, but to expect that science answers any 'What', 'Why' or 'How' is just to expect too much. Science has its limits; probably some philosophy and empathy will also be needed.

--
What's in a sig?

exploratory experimentation by johnrpenner · 2008-06-26 01:35 · Score: 1

traditionally, science forms its hypothesis, and performs an experimentum crucis to test the hypothesis; rinse & repeat. it seems to me that 'the cloud' refers to a hitherto statistically huge number of samples of data points from which to extract our knowledge of the world -- a sort of broad collection of facts derived from constantly and systematically varying the experimental conditions -- an exploratory experimentation. goethe outlines a method of Exploratory Experimentation in the essay The experiment as mediator between subject and object.

"Theory-oriented and exploratory experimentation are not exclusive categories, but rather members of a spectrum of experimental research strategies. Which is more productive in a given context depends on many factors, including a field's state of development, the sort of knowledge (for example, underlying mechanisms versus phenomenal regularities) sought by the physicist, and the complexity of the system being studied. Our aim in emphasizing the exploratory path has been to bring to light an experimental style that has played an important, but hitherto underrecognized, role in the history of physics."

Physics Today Article

I agree, but... by wfolta · 2008-06-26 01:37 · Score: 3, Insightful

What you say is true, Hoplite3. The big issue I see is how people define "model". My guess is that quite a few unfortunately define it as "I got 3 asterisks in the significance test", whether the "model" (say, linear regression) makes sense or not.

I forget where I read it, but I've been studying linear regression, and there was a fascinating example where if they'd used linear regression techniques on the early "drop the cannonball and time its fall" data, they would have come up with a nice, highly-significant linear regression for gravity.
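
A small sketch of that trap, assuming idealized, noise-free drop data (d = g t^2 / 2 with g = 9.8 m/s^2) sampled only between 1 and 2 seconds. The helpers `fit_line` and `r_squared` and the sampling window are made up for illustration; this is not the original example the comment refers to.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(xs, ys, slope, intercept):
    """Fraction of the variance in ys explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


G = 9.8  # m/s^2, assumed constant


def true_distance(t):
    """Distance fallen after t seconds, ignoring air resistance."""
    return 0.5 * G * t * t


# Eleven idealized measurements between 1.0 s and 2.0 s.
times = [1.0 + 0.1 * k for k in range(11)]
distances = [true_distance(t) for t in times]

slope, intercept = fit_line(times, distances)
r2 = r_squared(times, distances, slope, intercept)

# Extrapolate the straight line well outside the sampled window.
predicted_4s = slope * 4.0 + intercept
actual_4s = true_distance(4.0)

print(f"R^2 = {r2:.4f}")
print(f"line predicts {predicted_4s:.1f} m at 4 s, physics says {actual_4s:.1f} m")
```

Within the sampled window the straight line looks excellent (R^2 above 0.99), yet at 4 seconds it predicts roughly 48 m where the quadratic law gives 78.4 m. A significance test on the fit alone would not flag the problem.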

Then there is the whole issue of explanation versus prediction. Something can be predictive while providing no explanation, and perhaps that's where the petabyte idea is going: who cares about explanation if prediction is accurate enough? (Not my philosophy, BTW.)

Re:I agree, but... by Hoplite3 · 2008-06-26 01:58 · Score: 4, Interesting

Yes, I think that prediction without explanation is fascinating, but I don't know if it's what I like about science :) Have you ever heard Leonard Smith speak? I saw him at SAMSI, but his MSRI talk is online and is roughly the same. He's a statistician who works in exactly this.
Some fancy-pants technique he has is better at predicting the future behavior of chaotic systems (like van der Pol circuits or the weather) than physical models. But he also points out that these predictions don't tell you what type of data to collect to make better predictions, and that they don't generalize. One nice "model" he has can predict the weather at Heathrow better than physical weather models (from the same inputs: wind speed, temperature, pressure, etc), but it's useless for predicting the weather in Kinshasa until the model is re-trained.
I think these types of data analysis tools will be very important in the future, but they won't replace the explanatory power of models. Just like how scientific computing is useful, but never replaced actual experiments.

--
Use the Firehose to mod down Second Life stories!
Re:I agree, but... by aurispector · 2008-06-26 02:01 · Score: 4, Insightful

Thank you. Sure, there's a ton of data out there, but how was it collected? What statistical methods were used to analyze the data? How did you select the data set you're analyzing? Nothing I understand about science really applies to data mining a so-called "cloud". Prediction without explanation is just observation. Observation in and of itself is not science. You might have data, but is it the right data?
I see all this petabyte stuff as interesting and even as a valuable adjunct to real science, but a basic requirement of science is reproducibility and you can't reproduce the data collection.

--
I have mod points. The reign of terror begins now.
Re:I agree, but... by mckorr · 2008-06-26 03:02 · Score: 1

Until they shot said cannonball out of a cannon and noticed that it doesn't follow a straight line. For that you need a quadratic regression.
Linear regression is good for making predictions given strong correlation between items in a data set, but the linear equation you get is a probability, not the solution to the actual data. To show this, plug in the values for any given data point and see if the equation produces the exact results.
Granted, at the quantum level we are dealing in probabilities, but for a satellite, which is traveling in a relativistic framework, we want an exact orbit, not one that will "probably" keep it moving around the planet. This goes for other areas of scientific endeavor. Our computers can run every regression we can think of until they find one that fits the data set perfectly, but to really understand what that data means requires hard science, not more data. And that doesn't include making intuitive connections between apparently disparate data sets, which currently only a human can do.
Re:I agree, but... by ceoyoyo · 2008-06-26 03:46 · Score: 1

That's kind of a bad example. Galileo basically did just that: rolling marbles down inclined planes and looking for a simple relationship that fit the data. Correcting for the inclination of the plane, he found one.
I don't remember how far Galileo got in explaining what the various terms in the relationship were, but Newton certainly finished the job. Only when that experimental relationship was explained did we get the theory of gravity and kinematics.
Re:I agree, but... by mysticgoat · 2008-06-26 04:33 · Score: 1
The discussion needs to bring in some other terminology.
- Mapping: Google creates the largest and most detailed mappings of some subjects that we have ever seen. Further, it provides a number of map manipulation tools that are incredibly fast and easy to work with.
- Territory: The map is not the territory; what Google delivers is always suggestive of the way the world actually is, but should never be mistaken as reality. The data Google draws on is abstracted from reality, and there may be several metadata processes in between the Real World and what is provided to Google users. This becomes more of an issue as one approaches the frontiers of human endeavor. And of course science is often done on those very frontiers.
- Algorithm: In the context of this discussion, the highly specific, technical definition needs to be used, rather than the way the term is bandied about in casual conversations. See this definition, whose first paragraph should be sufficient for this thread while remaining accessible to all of slashdot's readership.
- Heuristic: Basically, any problem solving strategy that might provide an adequate solution to a class of problems. See this description, whose first section should be good enough for this discussion. All algorithms are heuristics, but not all heuristics are algorithmic. A heuristic may lead to a wrong answer and still be considered good, if the cost of working with a wrong answer is low compared to other benefits, like speed or ease of use.
For the most part, Google relies on non-algorithmic heuristics to generate its results.
The scientific method can be described as a set of algorithms designed to select among all possible hypotheses the few that seem to best model real world events. These models are in turn used to suggest new hypotheses that can be tested with the scientific method; it is iterative. The (possibly unreachable) goal is to eventually find connections that tie all the separate models into one universal supermodel; a strong secondary desire is to simplify each model as much as can be done while preserving its ability to predict real world events.
Note that implicit in the above is the core of the Copenhagen interpretation: science is all about our intellectual models of reality, and is not about reality itself, which might or might not be humanly understandable. We stay within the scope of what we know we can comprehend, which comprises the models that our minds have built. Reality is separate from that: we test our models against reality, but reality is external to our modeling space.
In these terms, what Google presents us with is another view of reality that may or may not be distorted by the viewing process, but which is very easy to manipulate. However Google is not part of the scientific modeling process, and cannot replace those activities.
Google is however very good for engineering things, where the elegance of scientific models is often trumped by the pragmatics of Getting Things Done.
Re:I agree, but... by TapeCutter · 2008-06-26 14:23 · Score: 1

I agree with most of your post, however since Google runs on a computer it MUST use an algorithm to search. Once the algorithm returns the hit list then it is up to the human to use heuristics to determine if the results are useful or not.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
Re:I agree, but... by Shauni · 2008-06-27 02:14 · Score: 1

You might have data, but is it the right data?
Who cares? The idea is, if you have all the data (and I do mean all of it), and you've got a computer powerful enough to analyze all the statistical trends, you can identify the right data.
That's the theory, in any case. But the problem is, we can't collect all the data with the equipment we have now. In order to receive data at all, we need models that previous generations have established.
The conclusion is that some day, we will only be building conclusions based on statistical analysis, which are based on other conclusions based on analysis, and these will explain the universe, and science as we know it will be obsolete.
I can't say I like that idea. But I like truth too much to reject a reality in favor of my personal likes and dislikes.
Finally: At the end of the day, of course the scientists will say that science is still useful and the statisticians will still point to the usefulness of these new "blind predictions." Science will claim statistics as merely a "stepping stone" on the way to truth, and statisticians will claim scientists in the same way.
What's the real truth? Find it yourself, you lazy git, and find it whatever way you can.

Amen by wfolta · 2008-06-26 01:45 · Score: 1

You're right about the medicine example. It's odd that medicine has an incredibly rigorous statistical process before approval, yet many medicines are basically black boxes.

Look at statins (cholesterol medication), which are one of the most widely-prescribed medicines in the world -- and which I take. There's a legitimate question as to whether their main effect is to reduce cholesterol levels, or whether it's actually a specific kind of anti-inflammatory which happens to reduce cholesterol levels.

Or how about ulcers, which were chalked up to personality and stomach acid, and treated as such, until a "crank" pushed the medical community for decades and they finally realized that a bacterium was behind most of it. The medicines were (and are) effective, but no amount of modeling along those lines could find the actual, root cause of most ulcers.

(I also take medicine for stomach acid, and interestingly I am one of the 10% whose ulcer was not caused by bacteria.)

Rise of Engineering over Science? by starfire-1 · 2008-06-26 01:46 · Score: 5, Interesting

I have always viewed this debate in the context of scientist vs. engineer. That is, one who views data as "good and true" vs. "good enough". That's not a slam on engineers (I am one), but a reflection of the balance between the two. A scientist who never applies theory sits in an empty room. An engineer who builds things without science sits in a cluttered room surrounded by useless objects.

I do find interesting though that the advent of "google data" may indicate a flip in order of the two disciplines. Historically (IMHO) science has led engineering. A theoretical breakthrough, provable by the scientific method, may take years to give birth to a practical application. Now, with enormous piles of data and the knowledge that "good enough" is often good enough, we may be creating useful objects that will take science many years to explain and model.

The biggest issue and omission in both of these pieces is that this "cloud" of data does not represent "truth" (as the scientist may seek), but rather a summation or averaging of the "perception of truth" as seen by the individual authors. The cloud, therefore, is only as useful as humans' ability to divine truth without the scientific method.

My two cents. :)

Re:Rise of Engineering over Science? by maxume · 2008-06-26 02:24 · Score: 3, Insightful

I have a theory that some of the best engineers are scientists, and some of the best scientists are engineers.
Scientists often need to build crazy stuff to figure things out, and engineers often need to figure things out to build crazy stuff. Because they are each result-oriented, they don't get hung up on the things that someone in the field would.

--
Nerd rage is the funniest rage.
Re:Rise of Engineering over Science? by ceoyoyo · 2008-06-26 04:10 · Score: 1

I think it's the other way. Engineering got a head start on science. When we pile up rocks just so, they tend to stay where we put them, even if you walk across them. Voila, a bridge. Science came along later and explained why those particular arrangements are stable. That explanation lets the engineer investigate other bridge designs that he might not have seen before.
There are perhaps a few areas in which the availability of massive amounts of data may let the engineer go back to his "I've seen it, therefore I'll try it" methodology, but I think in the vast majority of cases he'll wait until science figures out WHY it works. Engineering by trial and error is simply too expensive when you have a viable alternative.
Re:Rise of Engineering over Science? by mlwmohawk · 2008-06-26 05:05 · Score: 1

I seriously disagree with this opinion as you discount engineering as sort of inferior to the science.
Engineering absolutely requires a scientist. If you're an engineer and don't understand the theories and science you use professionally, you are a poor engineer. Typically speaking, a scientist furthers the scientific theories and an engineer applies them. Sometimes there is overlap where engineers do further the theories and scientists do apply them. Nowhere would I say that engineering is a profession of "good enough." In fact, understanding what is "good enough" would actually require a great deal of knowledge.
Re:Rise of Engineering over Science? by Danny+Rathjens · 2008-06-26 08:22 · Score: 1

The biggest issue and omission in both of these pieces is that this "cloud" of data does not represent "truth" (as the scientist may seek), but rather a summation or averaging of the "perception of truth" as seen by the individual authors.

I didn't realize we were discussing wikipedia. :)

knowledge != understanding by mlwmohawk · 2008-06-26 01:54 · Score: 3, Insightful

I have a problem with the google generation, sure, they can parrot facts and find things in an instant, as can any slashdotter I'm sure, but knowing something is not the same thing as understanding something.

A coworker asked me yesterday "how do you call a C++ class member function from C [or java]?" The question is an example of pure ignorance.

If they "understood" computer science, as a profession, this would be a trivial question, like how do I or can I declare a C function in C++. The second question is what google can help you with while having to ask the first question means you are screwed and need to ask someone who understands what you do not. Not understanding what you do for a living is a problem.

How programs get linked, how environments function, virtual machines vs pure binaries, etc. These are important parts of computer science, just as much as algorithms and structures. You have to have a WORKING knowledge of things, i.e. an understanding.

Google's ease of discovery eliminates a lot of the understanding learned from research. Now we can get the information we want, easily, without actually understanding it. IMHO this is a very dangerous thing.

Re:knowledge != understanding by zeromorph · 2008-06-26 02:54 · Score: 1

Wow, one of the best postings I have read for months.
Although I wouldn't call it "very dangerous", you are so right about the difference between, what you call, knowing and understanding. Raw data and number crunching is only one step towards understanding. Interpretation of the data and in the end really grasping the problem and hopefully a solution are something different.
Theories may have gone wild in some sciences in the sense that theorizing is overvalued compared to data munching, but theories and models will remain an integral part of any sane science.

--
"Hannibal's plans never work right. They just work." Amy/A-Team
Re:knowledge != understanding by ceoyoyo · 2008-06-26 04:12 · Score: 1

Mr. Miyagi?
Re:knowledge != understanding by kabocox · 2008-06-26 07:53 · Score: 1

Google's ease of discovery eliminates a lot of the understanding learned from research. Now we can get the information we want, easily, without actually understanding it. IMHO this is a very dangerous thing.
Yes, because people can instantly learn whatever answers they want and not actually get the accepted viewpoint stamped into them at the same time. That's extremely dangerous. There is no telling what people will come up with if they don't have their government's, employer's, school's, church's, or parents' viewpoint stamped onto every bit of information that they learn. Google can get the exact answer only, without many of those built-in biases that those teaching are trying to stamp/mold into their students.
Here is a simple one. Cheating is wrong. Teachers and graders have always tried to cram that one into their students. The real answer is that getting found out about your cheating is wrong, so never get found out. With Google, the internet, and outsourcing today, the modern student can outsource their assignments and get reliable usable answers back. How many times have you gotten usable answers from your teachers?
They want to pretend like they know everything, but really they want you to figure it out yourself. Why should I figure it out? What if I just want an answer that I know will work for my problem set? I don't care about learning everything under the sun, I just care about the min tools needed to get my tasks done. Heck, I don't even care about the tools, I just want/need the task done with the min amount of personal effort.
Take databases, I don't care about SQL, access, or mysql or any of that. I just want my user's data stored in a nice tidy database that I don't have to have a mental melt down to manage or for them to use. If I could commune with the Google DB Design AI and have it instantly make the database that I need, I and many others would use it rather than alternative methods.
Re:knowledge != understanding by Anonymous Coward · 2008-06-26 08:22 · Score: 0

"how do you call a C++ class member function from C [or java]?" The question is an example of pure ignorance.
How is this an example of pure ignorance? Sure, your co-worker could have googled a bit and found out about the Java Native Interface but sometimes it's just quicker to ask someone than read 10 pages of documentation on inlining native code into Java. I guess calling a C++ function from C makes little sense, but it's certainly possible. You're pretending this isn't all just assembly flying through the CPU and C++, C, and Java are all "different" somehow. It's complicated to mix languages, but sometimes necessary.
Re:knowledge != understanding by mlwmohawk · 2008-06-26 08:55 · Score: 1

"how do you call a C++ class member function from C [or java]?" The question is an example of pure ignorance.

How is this an example of pure ignorance? Sure, your co-worker could have googled a bit and found out about the Java Native Interface but sometimes it's just quicker to ask someone than read 10 pages of documentation on inlining native code into Java. I guess calling a C++ function from C makes little sense, but it's certainly possible. You're pretending this isn't all just assembly flying through the CPU and C++, C, and Java are all "different" somehow. It's complicated to mix languages, but sometimes necessary.

You are sort of making my point for me. Knowing these things is part of the game.
Re:knowledge != understanding by biobogonics · 2008-06-26 17:28 · Score: 1

Google's ease of discovery eliminates a lot of the understanding learned from research. Now we can get the information we want, easily, without actually understanding it. IMHO this is a very dangerous thing.
It's still GIGO - "Garbage in, Garbage out" - except now there is a LOT more garbage.
I recently read an entire book (Super Crunchers) whose substance was that regression analysis was the greatest data analysis tool since sliced bread. Nonsense.
Finding associations is relatively easy. Making sense of them is hard. In order for an association to have meaning there has to be a mechanism involved. Back when I studied epidemiology it was called "biological plausibility".
Otherwise you might think that there is a real relationship between the number of ordained ministers and root beer consumption.
Really this is driven by marketing. So I expect to see more root beer ads at the local theological seminary!
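
To put a number on "finding associations is easy", here is a minimal sketch (an illustration only, not anything taken from Super Crunchers). It generates series that are pure noise and unrelated by construction, then searches every pair for the strongest correlation. The series count, series length, and seed are arbitrary assumptions.

```python
import itertools
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def strongest_spurious_pair(n_series=200, n_points=10, seed=0):
    """Return (|r|, i, j) for the most correlated pair of unrelated random series."""
    rng = random.Random(seed)
    # Every series is independent Gaussian noise: no mechanism links any two.
    series = [[rng.gauss(0, 1) for _ in range(n_points)] for _ in range(n_series)]
    return max(
        (abs(pearson(series[i], series[j])), i, j)
        for i, j in itertools.combinations(range(n_series), 2)
    )


if __name__ == "__main__":
    r, i, j = strongest_spurious_pair()
    print(f"series {i} and {j} have |r| = {r:.2f}, yet share no mechanism at all")
```

With roughly twenty thousand pairs to choose from, a very strong "relationship" turns up by chance alone, which is why a plausible mechanism has to come from somewhere other than the search itself.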

new adds to old, but doesnt end it by peter303 · 2008-06-26 02:08 · Score: 1

Petabyte technology suggests new avenues of scientific investigation, but doesn't end science or older alternative ways of doing things. The clever thing is to be first to discover the new possibilities.

we are merely neurons by sneakyimp · 2008-06-26 02:16 · Score: 1

I would agree that the scientific method is not dead, but I like this rebuttal. The scientific method as I understand it is
1) Observe
2) Form a hypothesis or create a model to explain some phenomenon
3) Experiment and gather empirical data to support or refute the hypothesis/model

We still do all that but the emphasis does seem to be shifting away from traditional models that are sweeping generalizations (e.g., "An atom has a nucleus of protons and neutrons surrounded by moving electrons") to more nuanced, numerous, highly specific, and esoteric observations which are cobbled together into a patchwork of quasi-models that collectively define a distributed understanding of the real underlying concept. No single person understands the big picture in its entirety and no single model dominates scientific disciplines. Nay! Controversy is rampant.

These quasi-models manifest themselves as scientific papers, correspondence between academics, and flame wars on vBulletin or phpBB sites, and in practice people subscribe to them a la carte as if they were ordering at McDonald's or something. They stitch together their own stylized scientific philosophy from a vast menu of options.

In my opinion, all these claims that "we scientists are still doing science and we do understand the universe" are actually kind of pathetic. To call your data on the propagation of a particular gene variant in D. melanogaster a 'model' is hubris. You are a technician, not a scientist. You are a cog in the machine. We are all just neurons in the collective brain.

Number one pet peeve with my doctor by bamwham · 2008-06-26 02:25 · Score: 2, Interesting

He makes statements about treatments, causes, and outcomes as if they were God-given truths proven to the world beyond all doubt. In truth, medicine seems to this mathematician to be a field governed solely by statistical correlation, with next to no concern over (a) what the actual cause is, or (b) testing the hypothesized cause in any meaningful way. I've read study after study that goes through a wonderfully presented statistical analysis to conclude that such and such drug works well at treating such and such symptom; they then close with a couple of paragraphs as to why (they think) the drug is working, often not using any qualifiers such as "we don't know but our guess is..." or "it would be nice to find out if it is ...."

To the vast majority of practicing physicians I've met, "cause" just doesn't seem to be the important question. Which I think is why things happen like my pharmacist declaring that two drugs prescribed by my doctor are going to cancel each other's effects, or why I take a drug to treat a painful toenail and end up with bleeding in my stomach.

Re:Number one pet peeve with my doctor by ColdWetDog · 2008-06-26 04:01 · Score: 2, Interesting

In truth medicine seems to this mathematician as a field governed sooley [sic] by statistical correlation with next to no concern over (a) what is the actual cause is, (b) testing the hypothesized cause in any meaningful way. I've read study after study that goes through a wonderful presented statistical analysis to conclude that such and such drug works well at treating such and such symptom; they then close with a couple of paragraphs as to why (they think) the drug is working often not using an qualifiers such as "we don't know but our guess is..." or "it would be nice to find out if it is ...."

You are unfortunately quite correct and it's very frustrating. I speak as a physician with a strong background in experimental biology. MOST medical research is complete and utter garbage. Statistically correct garbage, but crap nonetheless. However, in defense of my current field - it's awfully hard to do "experiments" in human research. Hell, it was hard enough to do on eukaryotic culture cells. Which is why much of the underpinning of modern biological sciences was done on "simple" organisms like bacteria and phages.
Another, more empirical way of looking at what most of medical science is doing comes from the realization that if you "cure" or "improve" a disease process, at some level it makes no difference whether you understood what you were doing or just managed to get a valid correlation between treatment and effect. To use a previous example, when you take a statin to reduce cholesterol, you (as the patient) don't do this to "lower your cholesterol" - you do it so you live longer / healthier / disease free. The statin -> reduce cholesterol correlation may have led researchers to the treatment regimen in the first place, but the end point is staying alive longer. Thus, if the actual mechanism for that is channeling his noodliness, the treatment still works.
Of course, that's not science (or at least not very good science). But it IS the state of medical therapy.
Biology is fiendishly complex and we, as usual, make lots of baby steps and stutters. However, anybody that thinks a doctor in the latter part of this century is going to look back at 2010 medical practice and decide it's "butchery" is smoking some good stuff.

--
Faster! Faster! Faster would be better!
Re:Number one pet peeve with my doctor by Angostura · 2008-06-26 07:19 · Score: 1

He makes statements about treatments, causes, and outcomes as if they were God given truths proven to the world beyond all doubt.
He has to; if he doesn't, he'll bugger up the efficacy of the placebo effect, which is a pretty important element in prescribing.
I'm only half joking. /Disclaimer: My wife is a hospital consultant and she's really good and interested in root cause.

What? What in the world are they talking about? by yoinkityboinkity · 2008-06-26 02:43 · Score: 1

This whole thing makes no sense. It's all ambiguous concepts. What? Lots of data means we don't need to use theories? Lots of data != omniscience. In fact, lots of data is not even information yet. You still need to find how it applies. It's the people at Wired making a religion out of new technology that causes them to say crazy things like this.

science-open , clouds-? by GodWasAnAlien · 2008-06-26 02:44 · Score: 2, Insightful

Science and openness go together.
Without openness, we all are reinventing private wheels, which we destroy the plans to when there is no profit.
If you work in software, consider for a moment how scientific your work is, considering the work of other companies doing similar work.

This Clouds thing is the "billion monkeys/humans typing on keyboards" model.
Yes, it really can work (with humans).
But, as with science, the chaos development model only works with openness.

Of course, organized science along with a little chaotic development would work even better.

There are forces in our society that do not like any open model. The Microsofts, the MPAA, the RIAA. These types of organizations thrive on closed models. More copyright controls, more DRM, longer copyright and patent terms.
These forces would prefer to own, control and close science and clouds of data. They are unaware of the inevitable impact of such actions.

In a free capitalist society, we are naturally driven by contrary forces.
A desire to hide discoveries, to maximize profits, even at the expense of innovation.
A desire to share discoveries, to contribute to society and for credit.

While it is possible to profit when ideas are shared,
it is more difficult to contribute to society by hiding information indefinitely.

Re:science-open , clouds-? by Anonymous Coward · 2008-06-26 13:45 · Score: 0

Bingo!

Because by Thelasko · 2008-06-26 02:50 · Score: 1

There are coefficients we use in models that we don't fully understand in the physical world. We obtain those coefficients through empirical data. To rely solely on those models for design ignores the fact that those coefficients may change for any reason in the real world, because we don't fully understand what factors influence them.

In my experience this only applies to certain sciences. Most of my experience with such systems is in the area of fluid mechanics, and thermochemistry. Models can save years of lab work, but in the end, the model still needs to be verified.

--
One of our competitors trademarked the term "hypothesis". From now on, we will call them "boneheaded ideas".

silly labels by illlfates · 2008-06-26 02:53 · Score: 1

I believe science is a direct descendent of the capacity for deduction as granted by our model-making brains. In our internal symbol sense, we often use the subjunctive tense, if when why hypothetically depicting, don't call it science, fine, it's still predicting what will happen from what has happened. -- that's what i'm rappin'

Missing the point by dylanr · 2008-06-26 03:02 · Score: 1

The Wired post was a bit over-reaching, sure... but that's Wired for you.

The bigger point is that science is about testability, not story-telling. There may soon come a day when our analysis can prove that something is true without our being able to explain why it is true.

We are already there in many respects, but will be much further along when the current crop of Bayesian diagnostics hits the market. Combine those with the flood of information that personal genomics companies hope to make available and you might see an explosion of insight into diagnosing disease states.

Does that mean we're all done with lab science? Of course not. But our research may come to focus more on understanding what our diagnostics have already proven, rather than on charting new frontiers of knowledge.

Call it what you will, that's a pretty big change in how people organize and gather knowledge.

If Google ads can be so bad... by linhares · 2008-06-26 03:12 · Score: 1

...then WTF??

Some time ago some researchers came out with a book which was supposed to be called "The End of Intuition". The name of the book actually became "Super Crunchers", because people would click more on that ad than on the "End of Intuition" one. I wondered why the final name shouldn't be "hot college lesbians".

The Eliza effect is so huge that any nice trick machines do seems to give us the immediate feeling that "It's alive!", and it has deep meaning.

Nonsense.

As a researcher of psychologically plausible AI models, I found the whole idea disgusting, and submitted a paper to a journal explaining why the whole thing is bogus.

Expect to see more of this overexcited nonsense in the future.

Too much information can be a bad thing too by Anonymous Coward · 2008-06-26 03:18 · Score: 1, Insightful

Another point missed here is that background noise can obscure real results. Much of the data cloud is utter garbage. Picking out the useful information is often a complicated and difficult process; in some cases it's easier to just go and do the measurement yourself. I've heard the "a few days in the library can save you weeks at the bench" line about as often as the reverse. I think they're both true.

-sk

Hmmm... this is how our ancestors did things by foniksonik · 2008-06-26 03:18 · Score: 1

Ever wonder how early humans discovered medicinal qualities of plants? They didn't use models and scientific method... they used vast amounts of trial and error results. Then they used prediction based on what they had learned to narrow down what kind of plants to try out next. They didn't understand the underlying mechanisms and test out new findings based on that type of model... they used cheap and dirty statistics and record keeping.

This is just an extension of what humans have been doing to discover new correlations, for our entire history... just faster.

I come up with theories all the time based on cross-referenced science articles. Unfortunately I'm not in a position to test any of them, so the best I could do is blog about it - but then I'd join the ranks of the armchair scientists out there and that just seems lame, for now.

One day I'll come across a community that accepts crackpot ideas as the basis for experimentation... lets the community vote on which ones to carry out and takes small donations to fund the projects... then I'll submit my ideas. Hmmm... sounds like a fun community, off to Google to see if one exists already.

--
A fool throws a stone into a well and a thousand sages can not remove it.

WTF?? comment on QM in article by Prune · 2008-06-26 03:19 · Score: 1

The article states that "we know quantum mechanics is wrong on some level". Oh really? That's news to me. Any serious proposed theories of everything have been quantum in nature. It's amusingly hypocritical that the Ars Technica article refers to the Wired author as unscientific, yet makes such a claim itself.

The only thing "wrong" with quantum theory is that it doesn't fit human intuitions. But this is only because people ignore the psychology of perception and are not careful about interpretations; it's easy to create a very reasonable interpretation of QM that doesn't invoke weird stuff like saying QM must have something wrong with it, or strange consciousness stuff, etc. An example is Mohrhoff's http://arxiv.org/abs/quant-ph/0412182 (also check Marchildon's review of this class of interpretations, it's in the arxiv somewhere linked to this).

--
"Politicians and diapers must be changed often, and for the same reason."

systems thinking / 5 categories by sidething · 2008-06-26 04:09 · Score: 1

thoroughly fed up with trying to register on wired to argue as the article seemed seriously wrong.
what i was intending to post there, but can't - finally somewhere to post it beyond the g/f's email!

Copied from http://www.systems-thinking.org/dikw/dikw.htm as I was looking for a reference to the information but think that this page sums it up at least as well as I could:

"The content of the human mind can be classified into five categories (Russell Ackoff):

Data: symbols

Information: data that are processed to be useful; provides answers to "who", "what", "where", and "when" questions

Knowledge: application of data and information; answers "how" questions

Understanding: appreciation of "why"

Wisdom: evaluated understanding."

The interpretation on the given url sees understanding as a process that represents the transition between each stage rather than as a stage in itself - information is the understanding of the relationships between data, knowledge is the understanding of patterns of information and wisdom is the understanding of the principles that underpin knowledge and hence make extrapolation to the future possible.

Whichever way it is looked at, the first categories relate to the past with wisdom (the ability to extrapolate) being the only one which relates to the future.

Applying this to your example of J. Craig Venter, it can be seen (from my viewpoint) that his research has expanded the amount of data available to us and possibly even the amount of information.
However it provides no answers, that I can see - from what is provided in your article - to the questions of "How?" or "Why?" and therefore provides no increase in knowledge, understanding or especially wisdom.

I would argue that it is the scientific method of hypothesize, model, test that provides the answers to the how and why questions and therefore increases Knowledge and Wisdom.

To me what you are arguing is not that the data deluge makes the scientific method obsolete but rather that it provides a new basis for experimentation by the analysis of statistics - it provides a new medium for testing, but provides no ability to hypothesize or test and hence does not increase knowledge, understanding or wisdom.

As such it is a very beneficial development, but the results must be treated with the same caution, indeed more, as any experimental results gained by more traditional means. To blindly accept the findings without factoring in all the parameters and testing against a hypothesis (indeed, unless you hypothesize how do you determine what to vary and what to test?) seems to me to be very dangerous and indeed a step backwards in thinking.

Human Comprehension is Limited by Anonymous Coward · 2008-06-26 04:13 · Score: 0

Chris Anderson has foreseen the most profound change since the age of reason. Man has reached the point where his "understanding" can impede evolution. It is time to concede that some processes may be beyond our comprehension.

Research in the area of Artificial General Intelligence provides a crystal clear demonstration of the problem. A half century of research has led to "intelligent" data mining and voice response systems and very little else.

However, Koza, Fogel and many others have observed evolutionary computation machines creating solutions to real world problems. In some cases these are patentable solutions beyond previous human achievement, and some of them defy understanding.

Unless you have unlimited funding and lots of time, it's not necessary to understand why every complex solution works. It may not even be possible.

A million MRI's of functioning brains are not likely to result in any Lisp program for AGI, so the search for AGI seems to be coming full circle back to the "baby bootstrap". Even Ben Goertzel is looking to virtual babies to mine the clouds.

Like others who have managed to see beyond the horizon, Anderson will be widely misunderstood. He is not rejecting scientific method, he is simply showing us its limitations.

Say... by denzacar · 2008-06-26 04:13 · Score: 1

I wondered why the final name shouldn't be "hot college lesbians". Have you ever worked in marketing? You might want to think about giving it a shot if you haven't already.

I have a feeling you could have a brilliant career in that field.

--
Mit der Dummheit kämpfen Götter selbst vergebens

Re:Say... by linhares · 2008-06-26 04:17 · Score: 1

well, if selecting a book's title is all about clicks...
Re:Say... by denzacar · 2008-06-26 04:37 · Score: 1

Well of course a title like "hot college lesbians" would be all about cli...
Oh... "cliCKS"... well... yes... I guess... it can be about clicks too...

--
Mit der Dummheit kämpfen Götter selbst vergebens

Statistical models are not new by Iluvatar · 2008-06-26 04:39 · Score: 1

When I was taking experimental physics 101, I remember we verified basic laws in mechanics by sliding and throwing stuff around many many times, then fitting equations and calculating confidence intervals. Sure, we didn't have petabytes, it all fit in one square-ruled piece of paper.

Several centuries ago, the holy grail of theory was perfect causality and inference of all the minute details. Chris Anderson seems to be stuck there. Quantum mechanics changed that for good, by talking in terms of statistical properties of position and momentum of particles. But that turns out to be a very useful set of models, with many practical uses. (String theory, on the other hand, through my limited understanding takes a different tack: it adds so many dimensions that it's possible to fit almost any kind of data -- as Feynman once complained, more or less.)

So, now we have petabytes of particles (so to speak). We can throw the dice many times and make observations and draw inferences that are statistical in nature. But we're still dealing with models and confidence intervals. The fundamentals are the same, maybe there is a relative shift in focus between theory and experiment, or between perfectly causal and statistical models, but that's about it in my view.
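
For anyone who never sat through that lab, here is a minimal sketch of "fitting equations and calculating confidence intervals". It is an illustration under stated assumptions, not the actual course exercise: the fall distances are simulated with made-up Gaussian noise, `simulate_drops` and `fit_g` are hypothetical names, and the model is simply d = g * t^2 / 2 fitted by least squares through the origin.

```python
import math
import random


def fit_g(times, distances):
    """Least-squares fit of d = g * (t^2 / 2); returns (g_hat, standard_error)."""
    xs = [t * t / 2 for t in times]
    sxx = sum(x * x for x in xs)
    g_hat = sum(x * d for x, d in zip(xs, distances)) / sxx
    residuals = [d - g_hat * x for x, d in zip(xs, distances)]
    # One fitted parameter, so n - 1 degrees of freedom for the noise variance.
    sigma2 = sum(r * r for r in residuals) / (len(xs) - 1)
    return g_hat, math.sqrt(sigma2 / sxx)


def simulate_drops(g=9.81, noise_sd=0.05, seed=1):
    """Hypothetical lab data: fall distances (m) at 0.1 s steps with Gaussian noise."""
    rng = random.Random(seed)
    times = [0.1 * k for k in range(1, 21)]
    distances = [0.5 * g * t * t + rng.gauss(0, noise_sd) for t in times]
    return times, distances


if __name__ == "__main__":
    times, distances = simulate_drops()
    g_hat, se = fit_g(times, distances)
    t_crit = 2.09  # approximate 95% two-sided Student-t value for 19 degrees of freedom
    print(f"g = {g_hat:.3f} +/- {t_crit * se:.3f} m/s^2 (95% CI)")
```

Swap the twenty simulated drops for petabytes of measurements and the interval gets narrower, but the structure is the same: a model, a fit, and a statement of how far to trust it.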

Collapse of the clue vector? :) by argent · 2008-06-26 05:05 · Score: 1

I think the biggest problem in QM is the idea that the "collapse of the state vector" actually describes anything real. It's one of those questions like "when does life start" or "what's really a planet" that doesn't really have anything to do with science. It's just a metaphor that makes certain kinds of reasoning about QM easier, and provides guidance as to where you can simplify your model to make the calculations practical.

Yes, but... by Anonymous Coward · 2008-06-26 05:17 · Score: 1, Insightful

Chris' article was nonsense and the Ars article shows very well, at least, that Chris has drawn some inappropriate conclusions regarding "the Cloud" by citing contradictions in the very article that was posted on Wired. However I found another article (link below), written apparently by a Physics Ph.D. student, that goes into a little more depth regarding the nature of Chris' misunderstanding. He raises the question: is what Chris is referring to actually "knowledge"?

http://thatsprettylame.blogspot.com/2008/06/end-of-reason-why-data-deluge-will-not.html

Wired + Ars Technica owned by same company by Anonymous Coward · 2008-06-26 05:41 · Score: 1, Insightful

They are both owned by Conde Nast. It's sorta funny seeing them duking it out. I believe Ars Technica has a better team of journalists than current-day Wired. Wired is pretty much run by graphic designers now...

Actually, He seems to support a weak version... by hjsolbrig · 2008-06-26 05:54 · Score: 2, Insightful

While he does a good job showing that science itself isn't going away, he actually lends credence to the position that cloud computing implies a lot of useful information will be generated outside of science. Moreover, he also might be supporting the position that science isn't necessarily going to catch up and explain this data any time soon. So, the "strong" position, that Google makes science irrelevant, is naturally false. But the "weak" position, that Google represents a new kind of inquiry that is going to be increasingly used and relevant, seems intact and supported. So cheers to Google and science, HJS

Using big words to explain something simple by relguj9 · 2008-06-26 06:35 · Score: 2, Insightful

I think the consensus is that the original article is a bit presumptuous and flawed. He says that science will be replaced, which implies that there is a hardened definition for how science is to be performed currently, which there isn't. There is no ONE definition of science or the scientific method.

From a junior high school site about the scientific method:

"Six steps of the S. M.
State the problem: Why is that doing that? Or Why is this not working?
Gather information: Research problem and get background info
Form a hypothesis: a possible explanation for the problem using what you know and what you observe.
Test the hypothesis: Make observations, build a model and relate to real-life or experiment.
    Experiment: testing the effects of one thing on another using controlled conditions.
    Variable: a quantity that can have more than a single value. (Dependent vs independent)
    Constant: a factor that does not change when other variables change.
    Control: the standard by which the test results can be compared
Analyze data: recording data and organizing it into tables and graphs.
Draw conclusions: based on your analysis of your data, you decide whether or not your hypothesis is supported."

This "cloud" is just a buzzword for massive amounts of data collected for no good reason other than to collect it, i.e. before you form a hypothesis. Using this junior high model, a hypothesis is created from observation (seeing a correlation in the data), then you go back to the data or collect more data to prove or disprove that hypothesis.

Massive amounts of data and algorithms that sift through it are TOOLS in the box for performing the scientific method. They don't replace it.

I think his argument would be better if he stated that these tools, in certain cases, allow you to reasonably prove and create a hypothesis in a single step.

Links need thought by FlyingBishop · 2008-06-26 06:50 · Score: 3, Interesting

I had a nice example of the complete inadequacy of Google's thought-agnostic approach to links while browsing around looking for information on Samba and FUSE under Linux. Google's ad bars, completely misinterpreting the context, offered links to fuse boxes, as in wiring, and samba lessons, as in dancing. But then, maybe I'm not giving Google enough credit. It might have actually recognized the pointlessness of trying to market software to a Linux user, and taken the obvious step of throwing in some complete non sequiturs in the hopes of catching something of value.

Applying these methods to baseball by relguj9 · 2008-06-26 06:56 · Score: 1

I just remembered.. they collect "clouds" of statistical information about baseball players so that they can create great "correlations" to post on the scoreboard..

When you just let the algorithms try to make meaningful conclusions you get such gems as..

"On every third Sunday in June in an election year, catchers have a 35% chance of hitting a 420 yard home run in the third inning."
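
A minimal sketch of how a scoreboard "gem" like that gets manufactured (an illustration with made-up numbers, not real baseball data): every simulated at-bat has the same home-run probability, yet slicing by enough arbitrary conditions always produces a slice that looks special. The 3% rate, the categories, and the `dredge` function are all assumptions for the sake of the example.

```python
import random

WEEKDAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]
INNINGS = list(range(1, 10))
POSITIONS = ["C", "1B", "2B", "3B", "SS", "LF", "CF", "RF", "P"]


def dredge(n_at_bats=50_000, true_rate=0.03, seed=42):
    """Simulate at-bats with one constant home-run rate, then hunt for a 'hot' slice.

    Returns (overall_rate, best_slice, best_rate).
    """
    rng = random.Random(seed)
    counts = {}  # (weekday, inning, position) -> [at_bats, home_runs]
    for _ in range(n_at_bats):
        key = (rng.choice(WEEKDAYS), rng.choice(INNINGS), rng.choice(POSITIONS))
        tally = counts.setdefault(key, [0, 0])
        tally[0] += 1
        if rng.random() < true_rate:  # same odds for everyone, every day
            tally[1] += 1
    total_ab = sum(ab for ab, _ in counts.values())
    total_hr = sum(hr for _, hr in counts.values())
    best_slice = max(counts, key=lambda k: counts[k][1] / counts[k][0])
    best_ab, best_hr = counts[best_slice]
    return total_hr / total_ab, best_slice, best_hr / best_ab


if __name__ == "__main__":
    overall, (day, inning, pos), best = dredge()
    print(f"Overall home-run rate: {overall:.1%}")
    print(f"On {day}, inning {inning}, position {pos}: {best:.1%}")
```

The "winning" slice means nothing, because the generator has no weekday, inning, or position effect in it at all; the algorithm found it only because it was allowed to look in hundreds of places.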

It was an easy job, really. by Saint_Waldo · 2008-06-26 07:10 · Score: 2, Insightful

"Because it came from WIRED," should have been enough reason to discard this bullshit from day one. Why not ask some REAL scientists in a REAL peer reviewed scientific journal about what the "cloud" is doing instead of letting a bunch of insular technophiles indulge in masturbatory fantasies about how their "culture jamming" is "shifting paradigms" all while convincing themselves the same shit wasn't going on in the 60's, 70's, 80's and fucking 90's, and is indeed the sort of thing that led to WIRED's kind in the first fucking place. If science and its titular method could both create and survive the atomic bomb, radar, TANG and LSD, it can certainly handle a fucking "cloud" of bits.

Francis Galton and the Ox ... by frogzilla · 2008-06-26 08:10 · Score: 3, Informative

Wasn't this all demonstrated 100 years ago by Francis Galton and an Ox? What's new is that there are more data points and better techniques to identify interesting correlations. Probably this is what we do internally anyway. All of our sensory input is correlated and the interesting bits are filtered out by specific algorithms trained by evolution. What is fascinating to many are the times when these algorithms are spectacularly wrong.
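
A minimal sketch of the ox-weighing effect, as a simulation rather than Galton's actual data: each guesser is individually quite far off, but the median of many guesses lands close to the truth. The weight value, crowd size, and noise level are illustrative assumptions, and the result depends on the guesses being independent and unbiased, which real data clouds do not guarantee.

```python
import random
import statistics

TRUE_WEIGHT = 1198  # pounds; an illustrative value for this simulation


def crowd_vs_individual(n_guessers=800, noise_sd=75, seed=7):
    """Return (error of the crowd's median guess, mean error of an individual guess)."""
    rng = random.Random(seed)
    # Assumption: each guess is the true weight plus independent, unbiased noise.
    guesses = [rng.gauss(TRUE_WEIGHT, noise_sd) for _ in range(n_guessers)]
    crowd_error = abs(statistics.median(guesses) - TRUE_WEIGHT)
    typical_error = statistics.mean([abs(g - TRUE_WEIGHT) for g in guesses])
    return crowd_error, typical_error


if __name__ == "__main__":
    crowd_error, typical_error = crowd_vs_individual()
    print(f"median of the crowd is off by {crowd_error:.1f} lb")
    print(f"a typical individual is off by {typical_error:.1f} lb")
```

Give the guessers a shared bias (say, everyone anchors on the same wrong rumour) and the median inherits it, which is one way such aggregation goes spectacularly wrong.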

Cause for optimism? Yes, but ... by rangergordon · 2008-06-26 21:21 · Score: 1

I'm as pleased as anybody that the development of large pools of widely accessible data may lead scientists to find and consider correlations which they may not otherwise have observed.

However, Wired does tend to breathlessly enthuse when it comes to stories about how the Internet has changed everything, everywhere, forever and ever! (Look back 11 years at "The Long Boom" for an example of this unbridled enthusiasm. Today, to our great sorrow, this seems a bit ... overoptimistic.)

In the current political climate, any claim that the Internet has made information universally available is hopelessly naive. And the veracity of the information that is available is, at best, mixed.

This is not to say that scientists would resort to sources such as Wikipedia for their sole source of information. Even so, statistical modeling is not a new science. If the emerging massive data cloud makes this kind of research an increasingly important scientific tool, it is cause for optimism.

However, anybody who claims that his/her hypothesis does not require testing, verification and review--or that scientific hypotheses in general have become obsolete--cannot be taken seriously.

That would be a typical Wired overstatement.

Why the "cloud" may obscure the scientific method by kuka · 2008-06-27 00:47 · Score: 1

One aspect nobody seemed to address is that although the "Google method" cannot replace science in the sense of being a substitute for the scientific method, it may become the prevalent method for psychological and social reasons.

First notice that not all scientists are created equal. Only a handful of us are in the position or able to create new theories or advance existing ones using data collected by them or others. The bulk of scientific research is devoted to the collection and publication of (more or less) valuable data. Just look at the scientific papers! Most of it may be summarized this way: "We studied this and this and measured or calculated these and these, and maybe found some correlations." Of course to do this we must know our field of research well enough to be able to determine what kind of data should be collected and how. But as the usage of the "cloud" becomes ubiquitous it may seem to be more economical to just measure everything possible and then find the correlations using an AI or ANN. In that way we only need to pay a bunch of cheap "sciworkers" who are able to handle the equipment. This means cheaper education and lower costs. And more people can work in "science", which makes good statistics. And hey! Even those with poorer abilities can do it! This will not advance our understanding of the Universe, that's true, but who cares if the technical advancements still continue?

And there are other factors to consider.

Even now nobody is able to learn everything there is to know in any field of science or technology. And the situation will worsen. We cannot overcome this by simply prolonging the duration of education. The "cloud" seems to be a solution to this problem. Just take it a step further by automating the data acquisition too.

Nowadays more people are interested in esoteric pseudoscience than in science, because it's easier to digest, doesn't require hard work and gives instant answers to all problems. And we want answers right now! The "cloud" seems to be the perfect solution for this problem too. It's easy to believe that if we have enough correlated data the computer can give the correct "answer" for all of our queries without too much work on our part. So why pay scientists?

Slashdot Mirror

137 comments