Laying the Groundwork For Data-Driven Science

← Back to Stories (view on slashdot.org)

Laying the Groundwork For Data-Driven Science

Posted by samzenpus on Wednesday October 1, 2014 @02:00PM from the collecting-the-numbers dept.

aarondubrow writes The ability to collect and analyze massive amounts of data is transforming science, industry and everyday life. But what we've seen so far is likely just the tip of the iceberg. As part of an effort to improve the nation's capacity in data science, NSF today announced $31 million in new funding to support 17 innovative projects under the Data Infrastructure Building Blocks (DIBBs) program, including data infrastructure for education, ecology and geophysics. "Each project tests a critical component in a future data ecosystem in conjunction with a research community of users," said said Irene Qualters, division director for Advanced Cyberinfrastructure at NSF. "This assures that solutions will be applied and use-inspired."

45 of 55 comments (clear)

Min score:

Reason:

Sort:

Horrible attempt to communicate to a broad audienc by Anonymous Coward · 2014-10-01 14:53 · Score: 1

This sounds suspiciously like something written by someone with an online MBA: "Each project tests a critical component in a future data ecosystem in conjunction with a research community of users," said said Irene Qualters, division director for Advanced Cyberinfrastructure at NSF. "This assures that solutions will be applied and use-inspired."
If we want the public to continue to support federal funding of the sciences we have to do better than this. I understand the point, but it this needlessly laden with buzz-phrases and it is clumsy.
Alan Alda and others have done work with scientists where it comes to communicating to non-scientists, and I'm grateful for it.
pork, politics ? by swell · 2014-10-01 14:54 · Score: 1

from TFA:
"In fiscal year (FY) 2014, its budget is $7.2 billion. NSF funds reach all 50 states through grants to nearly 2,000 colleges, universities and other institutions. Each year, NSF receives about 50,000 competitive requests for funding, and makes about 11,500 new funding awards. NSF also awards about $593 million in professional and service contracts yearly."
and: "awards support research in 22 states"
This particular investment is a tiny fraction of the budget. A low priority.
Note that each congressperson attempts to get government funding for his/her state as part of the obligatory re-election process. Often this funding is for nonsense activity that may provide jobs or incentive for corporate supporters.
Not saying that any of this is pork, but I'd like to know.

--
...omphaloskepsis often...
Difficult, not impossible. by oneiros27 · 2014-10-01 15:36 · Score: 3, Insightful

If the NSF grant process is like the one for NASA, there's still a little bit of flexibility for the program manager after they've gotten the scores.
I know because I was on a panel that specifically gave two proposals 'poor' reviews (the lowest possible), and the program manager asked us to consider changing it. In this case, he's a rather nice guy, and it may just be that he didn't want to have to write the 'your proposal sucks' letter to them ... but those of us on the panel knew that there is _no_ way for them to fund a 'poor'. They have leeway with any other score, and could give something with a marginal rating some seed money (fund 'em for a year, so they might be able to put in a more competitive bid next round).
We told the program manager that no, we wanted to make sure that there was no possible way that those two proposals could get funded.

--
Build it, and they will come^Hplain.
The problem with data driven science.. by Karmashock · 2014-10-01 16:00 · Score: 1

... is that data isn't evidence. And the simple fact that most people don't understand that simply underscores the danger of it.
Now, science must be empirical. It must be based on observation, experimentation, and the results should drive theory.
However, something that has been worrying for years is a lazy tendency for people... scientists included... to grab a data set, point out some correlating variables, and then conclude a discovery... or propose a theory that is supposed to be taken seriously.
That is wrong. And we all know that is wrong. I'm fine with it if we don't take the study seriously or if they don't just cite correlative statistics. But they do that with depressing consistency. Correlative statistics are not evidence. It is data. But basically anything is data. Having data isn't an accomplishment. It is having some readings or information that could mean anything including nothing at all.
Serious efforts have to be taken to ensure the data is pure of distorting influences. And then you have to set up devil's advocate tests/experiments to make sure that there is some causation going on in the data. Often as not, this isn't happening especially with the "data driven" science which in so far I have seen is code for people that sit at their computers calling up spreadsheets and then concluding things from them. That isn't good enough.
Here someone is going to say I know nothing and that people that call up spreadsheets are doing entirely valid science. Which ignores all sorts of points such them not knowing exactly where the data came from or how it was collected. Oh sure, they might have some notation that says how that was done... but who really knows. Scientists need to be willing to get their hands dirty and get the data themselves. The arm chair stuff is good to a point when the data is known to be good or when someone else went to the effort to sort out all the problems with it. But that isn't terribly common. Most of the sets have issues even after being declared good.
Anyway... I hope this all works out for the best and that my fears in this matter are unfounded. Truly. I just worry that this is going to be more of a giant waste of time.

--
I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
1. Re:The problem with data driven science.. by binarstu · 2014-10-01 17:09 · Score: 4, Insightful
  
  The problem with data driven science... is that data isn't evidence.
  
  Correlative statistics are not evidence.
  I think you are confusing "evidence" with "proof". Data, and more specifically, the patterns in data, most certainly are evidence. If that were not true, then there would be no reason to even try doing science.
  
  Having data isn't an accomplishment.
  Any scientist who has spent years obtaining a hard-won dataset would strongly disagree with you. Consider, for example, the ground-breaking data generated a few years ago by the Human Genome Project, or the current explosion of data about exoplanets. These data most certainly do represent substantial intellectual and technical accomplishments. Now, if what you mean is that simply downloading someone else's data from the Web is not an accomplishment, then I agree with you.
  
  Scientists need to be willing to get their hands dirty and get the data themselves.
  I think you will find that, in the hard sciences at least, that's usually how it's done. The researchers who write the papers are usually the same people who were involved in collecting the data. However, for very large-scale studies (e.g., global biodiversity research), there is no way that a single scientist, or even a single research team, could gather all of the necessary data. In these cases, the only way to make the research tractable is to integrate multiple datasets.
  Your points about the importance of understanding where the data one uses in a study came from, how they were collected, and any potential biases are all well taken. However, ignoring any of these factors is simply sloppy science, no matter whether the researcher collected the data him or herself, or if someone else collected it.
2. Re:The problem with data driven science.. by slimme · 2014-10-01 18:55 · Score: 2
  
  I also see a trend that people look for correlations, find correlations and then draw some conclusion without any proof of causation. To me it strikes me most for economics. Public policy is set based on those correlations.
  It is very counterintuÃtive but correlation research means nothing, especially in economics. Correlation research would be an amusing way to spend your time and get to know some variables, but correlation research is being used to inflence people. Repeat after me: correlation means nothing. If you find a correlation luck has hit you. Or luck has been manipulated to serve some point. Correlation means nothing whatsoever. Articles describing correlation are a waist of your time. You should not act based on correlation research.
  Now if you take big datasets with lots of variables and you test correlations between those variables, you will find strong correlations. Correlation here, correlation there, correlation everywhere. If you do millions of tests en tweak your parameters, correlation is all yours.
  But luckily now you know: correlation has no pratical use in your live.
3. Re:The problem with data driven science.. by StripedCow · 2014-10-01 22:09 · Score: 2
  
  Data isn't evidence, but it can be used to find useful hypotheses, starting points for further research.
  Remember:
  
  The most exciting phrase to hear in science, the one that heralds new discoveries, is not “Eureka” but “That’s funny...”
  (Isaac Asimov)
  
  --
  If Pandora's box is destined to be opened, *I* want to be the one to open it.
4. Re:The problem with data driven science.. by tribeca.kaji · 2014-10-02 02:56 · Score: 1
  
  It is not correct to say that correlations mean nothing. The fallacy most arguments fall victim to is for some correlation between X and Y and some correlation between Y and Z someone mistakenly correlates or even jumps to causation of X and Z. Correlations are meaningful in that we know that some relationship exists and the relationship becomes stronger the more acute that secant angle becomes. Correlations are not a solution but they are small pieces of a big puzzle. If we could gather enough data and generate a vast array of correlations and begin to sift through those that are stronger, we may be able to deduce something meaningful. The problem being solved using big data sets is that we get a shot at analyzing lots of correlations. Our status quo merely has us picking a few variables and seeing if they correlate and when we find one we jump up and down. Find the set of all relevant correlations for a given problem and we can move on from the misuses of correlation.
5. Re: The problem with data driven science.. by Vesvvi · 2014-10-02 03:16 · Score: 2
  
  Volume 338, 2012: "Detecting Causality in Complex Ecosystems" http://www.uvm.edu/~cdanfort/c...
6. Re:The problem with data driven science.. by Karmashock · 2014-10-02 03:16 · Score: 1
  
  We can play words games if you like... I'm quite good at them. But I frankly find the prospect to be boring. So I'll just win.
  Data (/ËdeÉtÉ(TM)/ DAY-tÉ(TM), /ËdÃ¦tÉ(TM)/ DA-tÉ(TM), or /ËdÉ'ËtÉ(TM)/ DAH-tÉ(TM))[1] is a set of values of qualitative or quantitative variables; restated, pieces of data are individual pieces of information. Data in computing (or data processing) is represented in a structure that is often tabular (represented by rows and columns), a tree (a set of nodes with parent-children relationship), or a graph (a set of connected nodes). Data is typically the result of measurements and can be visualized using graphs or images.
  Data is simply information. Data does not imply analysis or even meaning. If I collected all the numbers from a random number generator... that file would be "data" just as much as anything else.
  You have a different definition for data? Great... then we're having a semantic debate which is possibly the most boring dispute possible. By your definition of data I will probably agree with you. If you accepted my definition then you'd probably agree with me. Who cares.
  Next issue.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
7. Re:The problem with data driven science.. by thrich81 · 2014-10-02 04:39 · Score: 1
  
  Albert Einstein, for one example, is not known for acquiring any of the raw physical data from which he did his work on the Photoelectric Effect, Special Relativity, or General Relativity. All he did was sit at his desk and write equations based on data acquired by others.
8. Re:The problem with data driven science.. by binarstu · 2014-10-02 05:48 · Score: 1
  
  No, I don't have a different definition of "data". My point was that your original post repeatedly confuses "evidence" and "proof". As I said, data, and more specifically, the patterns in data (correlative statistics are one example), are used as evidence all of the time in science. That is, in a nutshell, how science works. Data provide evidence, not proof, for or against alternative hypotheses. The strength of the evidence depends on the strength of the data, which encompasses all of the potential data problems you discussed in your original post. None of this has anything to do with disputing the definition of "data". Data are pieces of information (just as your Wikipedia article says), and collectively, they can provide evidence for or against scientific hypotheses. Another way to state it is to say that in science, evidence comes from data.
  Your blanket statements that "data isn't evidence" and that "correlative statistics are not evidence" are not supported by the way real scientists actually use data. Scientists frequently use the results of statistical analyses, including correlative statistics, as evidence. Evidence does not imply proof of causality or any other underlying explanation, and that is where your original post seemed to get things mixed up. Evidence for a particular hypothesis simply means that the patterns observed in some data are consistent with the hypothesis.
9. Re:The problem with data driven science.. by AK+Marc · 2014-10-02 13:38 · Score: 1
  
  Data is typically the result of measurements
  Data is simply information. Data does not imply analysis or even meaning.
  So you are saying that measurements have no meaning. That is why I object to your argument.
  
  Data [...] is a set of values
  Data is a set of values. DNA is a set of values. DNA is data. That data is important.
  
  Data is simply information.
  Information is good. Every book is "simply information".
  
  --
  Learn to love Alaska
10. Re:The problem with data driven science.. by Karmashock · 2014-10-02 13:59 · Score: 1
  
  Measurements CAN have meaning... but they do not require meaning. I can take measurements that don't mean anything all day. I can set the output of my phone's accelerometer to output to a spreadsheet... then record the data. Will the data mean anything? I can take data furthermore in completely irresponsible ways and it will still be data. I can cherry pick my results and it is still data.
  Data in and of itself is not meaningful. Data must be collected in a certain way to preserve its integrity and then it must be analyzed in a specific way to turn it into something meaningful. But prior to that point... it has only the POTENTIAL of being meaningful. Prior to that point... no meaning.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
11. Re:The problem with data driven science.. by Karmashock · 2014-10-02 15:57 · Score: 1
  
  And if most scientists stuck to mathematics then they could get away with that too. Do anything else and you're going to need a bit more.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
12. Re:The problem with data driven science.. by Karmashock · 2014-10-02 16:06 · Score: 1
  
  The patterns in data are not data. The data is not the analysis of the data which would be a pattern in the data.
  Your lack of basic reading comprehension has run out of my patience with you.
  Good day.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
13. Re:The problem with data driven science.. by binarstu · 2014-10-02 18:39 · Score: 1
  
  The patterns in data are not data. The data is not the analysis of the data which would be a pattern in the data.
  Okay. Against what, exactly, are you arguing? When, at any point, have I claimed that "the data are the analysis of the data" or any such nonsense?
  Let me remind you of one of your original claims:
  
  Correlative statistics are not evidence.
  Do you not understand that "patterns in data" includes correlative statistics? If not, let me make this clear: You originally claimed that neither data, nor the patterns in data, are "evidence". I've tried to explain why, to scientists, patterns in data, including correlative statistics, most certainly are evidence. That is all. For some reason, instead of responding to any of that, you want to keep arguing about the definition of "data", which, as I've also explained, was never in dispute.
  
  Your lack of basic reading comprehension...
  Cheap insults aren't necessary. Look -- your original post simply reflected a misunderstanding of how a particular bit of terminology is used by scientists. I thought it might be helpful to explain why. I apologize if I inadvertently offended you somewhere along the way.
14. Re:The problem with data driven science.. by mick129 · 2014-10-10 07:41 · Score: 1
  
  I can set the output of my phone's accelerometer to output to a spreadsheet... then record the data. Will the data mean anything?
  Of course it will! The data will convey how much your phone as moved. Why do you think that this data is meaningless? Perhaps you mean "useful"?
  
  --
  Move along, no sig to see here.
15. Re:The problem with data driven science.. by Karmashock · 2014-10-10 08:15 · Score: 1
  
  I'm not getting in a word game with you.
  When I say meaningful, I refer to whether that data automatically tells you something of significance.
  If you want to use the word "useful"... we can see if your use of it is the same as mine. But I've a very low tolerance for semantics games.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
16. Re:The problem with data driven science.. by mick129 · 2014-10-10 08:29 · Score: 1
  
  Some people use their accelerometer data as a pedometer, some don't. Is the data meaningful or not? Whether data is meaningful, or useful or significant (all are fine with me) is subjective. Accelerometer data is not meaningful to you? Fine. But, you are not the arbiter of what is meaningful.
  
  On the other hand, if you'd like an example of data which is (nearly?) universally recognized as meaningless, how about /dev/random > data.txt?
  
  --
  Move along, no sig to see here.
17. Re: The problem with data driven science.. by mick129 · 2014-10-10 10:12 · Score: 1
  
  > how about /dev/random > data.txt
  Unless, of course, you happen to have a need for some random data. Then it's useful.
  
  --
  Move along, no sig to see here.
18. Re:The problem with data driven science.. by Karmashock · 2014-10-10 15:24 · Score: 1
  
  I'll just use that random data set to justify everything then.
  If that argument I just made doesn't make sense it is because your counter to my argument made no sense.
  That is how different the two things we're talking about are right now.
  So thank you for your response, but you're not talking about what I'm talking about and I'm not going to talk about what you want me to talk about. I was talking about something that I cared about as relates to this topic. And your attempt to strawman my argument while not unexpected isn't going to do anything.
  I know what I said and what I was talking about. Attempting to change my argument to something else only makes your own counter argument sound incoherent.
  I was being quite clear but if we're just going to play word games then game on. Fine. We'll just do that. I won't even attempt to make a point if that's all you're going to do. Instead, like you, I'll go out of my way to confuse things just for the lolz.
  Happy? I am.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
19. Re: The problem with data driven science.. by mick129 · 2014-10-13 09:08 · Score: 1
  
  Well, I think I've caught the flu, but yes, I'm generally happy. Thanks for the concern. Glad to hear you are too, buddy.
  
  One question though, does your happiness have anything to do with this semantics argument? If so, don't write them off as boring. Seek them out! Follow your bliss!
  
  --
  Move along, no sig to see here.
20. Re: The problem with data driven science.. by Karmashock · 2014-10-13 15:42 · Score: 1
  
  I find some satisfaction in turning silly rhetorical tactics against their users.
  I am by intention if nothing else a hyper rationalist. I hold reason and the mental framework that supports and allows for reason to be of primary importance in any discussion. For without them we will not have a reasonable discussion and will not know if what we have discussed was or was not reasonable.
  This means defining what you're talking about, keeping a clear image of what our goals are in the discussion, and then working through several different mental and rhetorical stages to determine the proper method of getting from point A to point Z.
  What so often happens with these discussions is that rather then talking about B, C, D, etc someone will move the origin point or the destination point without explicitly noting that they've done that. Or just as bad they'll attempt to define the opposition's argument and then argue against that point.
  Both of these only work if the opposition in question doesn't have a clear idea of all these things in their mind whenever they read/listen to the arguments as presented.
  If you do... it is a bit like watching someone cheat at chess while you watch them. I mean... you know which side of the board you're on. You know which color of pieces are yours. You know he can't just randomly take pieces off or move them. And by the same token one knows what the rules of the game are and what they are not. If someone starts moving pawns like a knight or something then it isn't going to work.
  And that is about 80 percent of arguments on the internet. Really really silly attempts to win arguments that probably work against really stupid or really unaware people.
  I'd like to THINK I am neither. But who knows... I can't... if I've learned anything it is that you're always the last one to know. But that is what your argument looked like to me. And I'm naturally not going to stand for it.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
21. Re: The problem with data driven science.. by mick129 · 2014-10-14 08:09 · Score: 1
  
  I agree that defining terms is important. FYI, I think you should consult a external reference for a definition of meaningful, as you did much higher up the thread with "data". Your usage of the word seems unusually specific, specifically the "automatically" part referring to data without further analysis. Collecting and analyzing data is what researchers do, so requiring data to be useful without analysis is baffling. Several commenters have tried to correct your definition.
  
  Even data without pristine integrity can be collected and analyzed to create something meaningful. For an example of that, how about neurosynth.org - this uses an automatic scraper to collect coordinates from published neuroimaging papers. The scraper does as scrapers do and collects all sorts of false positives, so many of the "coordinates" are in fact not. The overall result produces very strong results.
  
  --
  Move along, no sig to see here.
22. Re: The problem with data driven science.. by Karmashock · 2014-10-14 12:43 · Score: 1
  
  data absent analysis isn't meaningful. And just because you analyzed data doesn't mean the analysis is meaningful.
  Data only obtains meaning by establishing cause and effect via the data. Absent that you have white noise.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
23. Re: The problem with data driven science.. by mick129 · 2014-10-15 05:48 · Score: 1
  
  Data's meaning is made accessible through analysis. Absent analysis, its meaning may be obscured or unintelligible, but it's in there regardless of if we observe it. The data sharing initiative from the article is allowing future researchers the chance to find that meaning in the data.
  
  --
  Move along, no sig to see here.
24. Re: The problem with data driven science.. by Karmashock · 2014-10-15 07:27 · Score: 1
  
  I told you I wasn't going to have a semantics argument with you. You want to define the world "is" mean "pineapple" that is your business.
  good day, sir.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
25. Re: The problem with data driven science.. by mick129 · 2014-10-15 12:38 · Score: 1
  
  Actually, you said if it was a semantics argument you would "just win".
  
  Data is simply information. Information is meaningful.
  
  But yes, have a nice day.
  
  --
  Move along, no sig to see here.
26. Re: The problem with data driven science.. by Karmashock · 2014-10-15 14:58 · Score: 1
  
  Actually I didn't.
  This is what I said when you started with this foolishness:
  ""I'm not getting in a word game with you.
  When I say meaningful, I refer to whether that data automatically tells you something of significance.
  If you want to use the word "useful"... we can see if your use of it is the same as mine. But I've a very low tolerance for semantics games.
  ""
  I am not going to debate you on the definition of basic english words that you seem determined to turn into an argument about nothing.
  I will simply skip that and move on.
  We're done. This bores me.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
27. Re: The problem with data driven science.. by mick129 · 2014-10-15 16:12 · Score: 1
  
  This has been a semantic argument since the first reply telling you that you're using words wrong. Your response: "We can play words games if you like... I'm quite good at them. But I frankly find the prospect to be boring. So I'll just win."
  
  Best of luck finding something to interest you.
  
  --
  Move along, no sig to see here.
28. Re: The problem with data driven science.. by Karmashock · 2014-10-15 17:06 · Score: 1
  
  Wrong.
  Full quote:
  ""
  We can play words games if you like... I'm quite good at them. But I frankly find the prospect to be boring. So I'll just win.
  Data (/ÃdeÃtÃ(TM)/ DAY-tÃ(TM), /ÃdÃfÂ¦tÃ(TM)/ DA-tÃ(TM), or /ÃdÃ'ÃtÃ(TM)/ DAH-tÃ(TM))[1] is a set of values of qualitative or quantitative variables; restated, pieces of data are individual pieces of information. Data in computing (or data processing) is represented in a structure that is often tabular (represented by rows and columns), a tree (a set of nodes with parent-children relationship), or a graph (a set of connected nodes). Data is typically the result of measurements and can be visualized using graphs or images.
  Data is simply information. Data does not imply analysis or even meaning. If I collected all the numbers from a random number generator... that file would be "data" just as much as anything else.
  You have a different definition for data? Great... then we're having a semantic debate which is possibly the most boring dispute possible. By your definition of data I will probably agree with you. If you accepted my definition then you'd probably agree with me. Who cares.
  Next issue.
  ""
  I won there. The argument literally ended there. Everything since has been my vain attempt to explain something very simple to someone very stubborn and very wrong.
  Good day.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
29. Re: The problem with data driven science.. by mick129 · 2014-10-15 17:25 · Score: 1
  
  Data is simply information. Information contains meaning.
  
  information
  [in-fer-mey-shuh] noun
  1. knowledge communicated or received concerning a particular fact or circumstance
  
  Have a nice day! :)
  
  --
  Move along, no sig to see here.
30. Re: The problem with data driven science.. by Karmashock · 2014-10-15 17:53 · Score: 1
  
  You fail at reading comprehension. No where in that definition does it imply the data was analyzed in any manner.
  You were always wrong and I think you know it. The sad thing is that you think I don't know it... or that confusing the issue will make you less wrong. At best you might confuse me... but you'd still be wrong.
  You didn't even accomplish that though.
  Kindly put your thumb at a 90 degree angle, lift off your seat about eight inches, pull down your pants, place your hand thumb up on the seat perpendicular to the chair, and gently sit down.
  Twit.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
31. Re: The problem with data driven science.. by mick129 · 2014-10-15 18:00 · Score: 1
  
  I wasn't the one who required analysis for there to be meaning, that was your position.
  
  I'll take vulgarity as conceit.
  
  --
  Move along, no sig to see here.
32. Re: The problem with data driven science.. by mick129 · 2014-10-15 18:03 · Score: 1
  
  Hmm, my dictionary says I meant conceding.
  
  --
  Move along, no sig to see here.
33. Re: The problem with data driven science.. by Karmashock · 2014-10-15 18:45 · Score: 1
  
  Right... because when Galieo insulted people that said the sun was the center of the solar system, it meant he was wrong.
  Logical fallacies for the win, fucktard?
  I'm insulting you because you deserve to be treated with contempt and after trolling me for all these posts with your idiocy there isn't any courtesy left.
  As to my requirements for meaning, I know yours didn't include implicit qualities that are required for their proper function. That was me... because I'm not a moron.
  Seriously... Fuck off.
  
  --
  I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
34. Re: The problem with data driven science.. by mick129 · 2014-10-16 01:29 · Score: 1
  
  "She still moves" isn't an insult.
  
  --
  Move along, no sig to see here.
Walrus census by jamesl · 2014-10-01 16:10 · Score: 1

Perhaps they should start with an annual walrus census (see story following this one).
Re:Horrible attempt to communicate to a broad audi by ClickOnThis · 2014-10-01 16:21 · Score: 1

This sounds suspiciously like something written by someone with an online MBA: "Each project tests a critical component in a future data ecosystem in conjunction with a research community of users," said said Irene Qualters, division director for Advanced Cyberinfrastructure at NSF. "This assures that solutions will be applied and use-inspired."
If we want the public to continue to support federal funding of the sciences we have to do better than this. I understand the point, but it this needlessly laden with buzz-phrases and it is clumsy.
I understand your point about the technobabble. However, Ms. Qualters' résumé appears to be somewhat less fluffy that the quote would suggest.

--
If it weren't for deadlines, nothing would be late.
Re:Bullshit filtered explaination requested. by phantomfive · 2014-10-01 17:17 · Score: 1

The NSF groups it's funding towards various targets. If you know what they are targeting, then it's easier to get funding.

In this case, they are targeting computer related things, and grouping it under the name, "Data Infrastructure Building Blocks." The actual funding goes towards things as diverse as an MOOC, and some kind of scientific library for super computers.

--
"First they came for the slanderers and i said nothing."
science driven science? by louic · 2014-10-01 19:34 · Score: 4, Insightful

All science is data driven. Without data there is no hypothesis, and without hypothesis there is nothing to test (falsify). This is just another hype, like nanotechnology or now nanobiotechnology etc. Nearly all molecules are nanoscale: their size is measured in nanometers, and in the same way all science is data driven.

There is nothing wrong with good old "science driven science" where people think, do experiments, and think again.
1. Re:science driven science? by radl33t · 2014-10-02 03:36 · Score: 1
  
  Let's just eliminate all specificity and refer to everything as physics. Sounds useful to me! Anything else is just more hype for charlatans and fools! Nuance and differentiation are for the birds!
science driven science? by Vesvvi · 2014-10-02 03:28 · Score: 2

This particular push may not be effective, but it's not hype.
Science may be data-driven, but historically scientists have not been trained to be good data custodians. They know reasonably well how to use data, but they don't know how to store it, label it, transfer it, etc. Go pick an article from 5 years ago which is data-heavy and try to get the original dataset from the authors: 95 times out of a hundred you'll spend a month emailing people and you'll end up with nothing. Four more out of the 100 you'll get an Excel spreadsheet without labels on the columns. Scientists desperately need to become better at managing data.
Personally, I think that this program is targeting a small subset of the people who need help, and as such it won't be very effective. These look like infrastructure projects, but infrastructure only drives trends in extremely rare cases. Here's a quote from one funded proposal:

This project develops web-based building blocks and cyberinfrastructure to enable easy sharing and streaming of transient data and preliminary results from computing resources to a variety of platforms, from mobile devices to workstations, making it possible to quickly and conveniently view and assess results and provide an essential missing component in High Performance Computing and cloud computing infrastructure.
Will that project help teach scientists they shouldn't email files to themselves as a method of long-term archival? Yes, that really is extremely common. We should be focusing on building data tools which are extremely simple, extremely broad in scope, and encourage or force adoption of those tools.
Save Money by Areyoukiddingme · 2014-10-02 04:19 · Score: 1

I can save the NSF a bunch of money with this initiative. There's a data center in Utah that's not being used (for anything legal) with a huge amount of data storage capacity. The NSF should have it.