Slashdot Mirror


Obama Administration Places $200 Million Bet On Big Data

wiredmikey writes "As the Federal Government aims to make use of the massive volume of digital data being generated on a daily basis, the Obama Administration today announced a 'Big Data Research and Development Initiative' backed by more than $200 million in commitments to start. Through the new Big Data initiative and associated monetary investments, the Obama Administration promises to greatly improve the tools and techniques needed to access, organize, and glean discoveries from huge volumes of digital data. Interestingly, as part of a number of government announcements on big data today, The National Institutes of Health announced that the world's largest set of data on human genetic variation – produced by the international 1000 Genomes Project (At 200 terabytes so far) is now freely available on the Amazon Web Services (AWS) cloud. Additionally, the Department of Defense (DoD) said it would invest approximately $250 million annually across the Military Departments in a series of programs. 'We also want to challenge industry, research universities, and non-profits to join with the Administration to make the most of the opportunities created by Big Data,' Tom Kalil, Deputy Director for Policy at OSTP noted in a blog post. 'Clearly, the government can't do this on its own. We need what the President calls an 'all hands on deck' effort.'"

47 of 72 comments (clear)

  1. Great QOTD by smwny · · Score: 5, Insightful
    The QOTD at the bottom of the page so perfectly matched this story.

    All the taxes paid over a lifetime by the average American are spent by the government in less than a second. -- Jim Fiebig

    1. Re:Great QOTD by the+eric+conspiracy · · Score: 1

      It isn't possible to buy anything without paying taxes.

    2. Re:Great QOTD by Jesse_vd · · Score: 1

      It's called Amazon. It was even in TFS!! ;)

    3. Re:Great QOTD by the_Bionic_lemming · · Score: 2

      It isn't possible to buy anything without paying taxes.

      Public aid cards are used by people who do not pay taxes with money they earned - the money they use and the money spent on taxes are earned by others.

      Ergo, it is possible for people to buy things without actually paying for it themselves - or paying taxes themselves.

      --
      _ _ _ Go for the eyes Boo! GO FOR THE EYES!
    4. Re:Great QOTD by owlnation · · Score: 2

      It isn't possible to buy anything without paying taxes.

      You can buy politicians without paying taxes.

    5. Re:Great QOTD by LifesABeach · · Score: 1

      Your units are incorrect, it's Multi-Millionaire. Adjust your data accordingly.

    6. Re:Great QOTD by Anonymous Coward · · Score: 2, Insightful

      I understand your joke but in reality there are about 40-50% that pay no federal income tax at all and a decent percentage of that group get back more than they paid during the year (they MAKE money when they file).

      On a side note.. I really don't think most people understand the tax refund concept and tax withholdings from your paycheck. They don't realize that they are getting money back that they gave to the government already with the exception of those I mentioned above that get even more back then they paid in.

    7. Re:Great QOTD by Anonymous Coward · · Score: 1

      Actually, the politicians can buy elections easily by manipulating the 40-50% of people that pay no federal taxes at all. The first president or candidate that comes up with an honest sound plan to reduce our national debt would never get re/elected because those 40-50% would never vote for him/her. Politicians can keep claiming that the top 1-5% can fix our economy and support the entire countries budget but the reality is they can't. It gets votes though. The hole is getting deeper and deeper and no one in public office will do anything about it because there is an election coming up (BTW, there is always an election coming up). As time goes on, we will hear more and more about how the "previous administrations" bad decisions in the past got us in this mess and its not the current administrations fault but they will also do the same thing the previous administrations did. Spend more and take in less and don't accept any long term responsibility for it.

    8. Re:Great QOTD by sonamchauhan · · Score: 1

      Thanks goodness government is a non-profit enterprise these days.

      Harken back to the times of kings!

    9. Re:Great QOTD by the+eric+conspiracy · · Score: 1

      The value of the public aid card belongs to the recipient after he receives it. When he uses the card some of the value of the card goes to pay the taxes on the services he purchases.

      He's paying taxes.

    10. Re:Great QOTD by the+eric+conspiracy · · Score: 1

      Far fewer than 40-50% of people avoid paying Federal taxes. For example when you purchase an alcoholic beverage, gasoline, cigarettes, or have any kind of paying job you are paying some form of Federal tax.

    11. Re:Great QOTD by the+eric+conspiracy · · Score: 1

      And the thing you get from Amazon? Do you think that no portion of the price of this goes to pay for taxes, even indirectly?

    12. Re:Great QOTD by the+eric+conspiracy · · Score: 1

      Politicians aren't a thing.

    13. Re:Great QOTD by the_Bionic_lemming · · Score: 2

      No, it doesn't.

      If you are a leech on society, and produce noting - then you cannot claim to have done anything other than being a leech.

      The taxes are paid by the people who are forced to allow the leeches to live.

      If you are a leech and think you are paying taxes - or trying to convince others that you are doing anything other than being a leech, rest assured, we know better.

      --
      _ _ _ Go for the eyes Boo! GO FOR THE EYES!
    14. Re:Great QOTD by FrootLoops · · Score: 2

      That's kind of a silly statistic to use, since it's sensitive to population size. There are about 310 million Americans, and 1 year contains about 31 million seconds. You'd expect (roughly) 10 years of an average taxpayer's taxes to be spent every 1 second if revenues == expenditures, everything is inflation adjusted, and everyone pays taxes. If the country instead had only 31 million people you'd expect 1 year per second. The implied point--that government spending is out of control and/or hugely wasteful--depends crucially on factors not mentioned in that one-liner. Without further analysis all it does is illustrate how huge and unintuitive national scales are.

      I wonder if the original quote is inflation adjusted. It would be an easy mistake to make to forget that today's dollars and dollars of the past aren't directly comparable and should not be aggregated naively. It would also account for a large part of the deviation between my very rough estimate and the quote (~1 lifetime's taxes / 4 seconds vs. 1 lifetime's taxes / <1 second).

    15. Re:Great QOTD by BlueStrat · · Score: 1

      Politicians aren't a thing.

      Yes, they are.

      But who, over 10 years old, still calls it a "thing"?

      Strat

      --
      Progressivism (aka US 'Liberalism'): Ideas so good they need a police/surveillance-state to enforce.
    16. Re:Great QOTD by DarkOx · · Score: 1

      Yea but as a percentage the kings took less wealth from a serf than the us tax payer is asked to fork over today, if you look at those who pay some income tax.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
    17. Re:Great QOTD by squiggleslash · · Score: 1

      I understand your joke but in reality there are about 40-50% that pay no federal income tax at all and a decent percentage of that group get back more than they paid during the year (they MAKE money when they file).

      Yes, but there are more taxes than income tax.

      The reality is that virtually everyone pays taxes, usually directly, and obviously indirectly. Buy something and it's almost certainly subject to sales taxes. Own a home and it's covered by property taxes. Drive a car and... you get the idea. And if you try to avoid most of those things, you end up paying indirectly. Your landlord pays the property taxes you'd have paid on your rented home, and obviously prices your rent accordingly, they don't magically disappear.

      It may be a meme on the right that the country is 50% populated by "Lucky Duckies" who use government services on your dime, but it isn't remotely true. Find a state that doesn't charge property taxes, grow your own food, and never travel beyond the borders of your property and you might avoid them, but otherwise...

      --
      You are not alone. This is not normal. None of this is normal.
  2. Privacy? by GeneralTurgidson · · Score: 5, Insightful

    When it comes to big data, there's going to be little privacy.

    1. Re:Privacy? by Bigsquid.1776 · · Score: 1

      I agree with you good sir. When I hear about things like this... I feel like selling everything I own and walking off into the mountains.

    2. Re:Privacy? by Tsingi · · Score: 2

      Yeah. Considering what the data is that they want to process.

    3. Re:Privacy? by Sarten-X · · Score: 2, Interesting

      That's an absolutely unfounded concern.

      I worked at a Big Data company. About 90% of my job was improving privacy while maintaining the integrity of medical data. The patient's zip code was reduced to 3 digits. Any references to states were removed and forgotten (because there are some zip codes that cross state lines). Any names were removed, as were any user-entered comments (doctor's notes, etc.) that might possibly contain personal information. Any personal information that is necessary for the system but might be identifiable is salted and hashed twice before it ever leaves the source (hospital, insurance provider, etc).

      That in itself isn't good enough for privacy, so we then used some proprietary methods (that was kinda outside my job, so I don't know much about them) to intentionally screw up the data we provided to our users. A user could find out, for example, that between one and fifty people in the vicinity of Denver had a particular medical condition on a particular date, and received a particular drug. Narrow down results more than that, and my company's system simply wouldn't fulfill the request.

      This isn't really the exception to how many Big Data companies treat their data. Believe it or not, Big Data providers take privacy seriously, and are willing to sacrifice perfect accuracy to run an ethical operation. Anyone interested in Big Data is running on statistics anyway, so statistically-insignificant methods are easy to preserve privacy.

      --
      You do not have a moral or legal right to do absolutely anything you want.
    4. Re:Privacy? by Bravoc · · Score: 2

      Yea but.... we're not talking about a company where they could become the target of civil litigation. We're talking about the US Federal Govurn-munt. Need I say more? I don't feel good about this at all.

    5. Re:Privacy? by WaywardGeek · · Score: 1

      That's pretty cool. I'm hopefully having my "exome" sequenced soon, as part of a clinical trial. I don't think my data will make it into a public database, but I would be for it, so long as my name etc were removed.

      Just some dumb thoughts on TFA: The 1000 Genome project is hosting 200TB of data?!? Haven't these guys ever heard of compression? It seems that an individual's entire genome can be compressed to about 4MB, so the ~2000 genomes produced by the 1000 genome project should easily fit on my microSD card. $20 worth of storage in 1/20th of a cubic centimeter. Also, how many downloads to they expect to get? Even one? Not at 200TB containing 8GB of useful data! Why does this require the government to pay Amazon loads of cash? And... how do I get in on this scam? I'm thinking I could host a database of all zip codes in only 500TB from my house using 500 external 1TB drives. The exact boundaries between zip codes, taking into account tectonic plate shifting in real time, can be a lot of data if I write a stupid program to generate stupid data. I'd be willing to do that for the government for only $10M.

      --
      Celebrate failure, and then learn from it - Nolan Bushnell
  3. Doesn't this bother anyone? by Anonymous Coward · · Score: 3, Insightful

    Clearly, the government can't do this on its own. We need what the President calls an 'all hands on deck' effort

    So the Obama wants to pick and choose how this will be handled but he wants everyone else to do it? Whatever happened to representation?

    1. Re:Doesn't this bother anyone? by Anonymous Coward · · Score: 2, Insightful

      Why does every criticism of Obama lead to howls of racism? You spent 70% of a post shouting about racism when not a racist word was said. And yes, some of us were against Bush just as much as Obama. We don't all fit into tiny little slots, some of us don't wear partisan labels. How about you?

    2. Re:Doesn't this bother anyone? by LandoCalrizzian · · Score: 5, Insightful

      I'm black, and I think that even if Obama is "my nigga", he cannot be trusted, and he betrayed most of those who voted for him with all his false promises.

      I intend to vote against "my nigga" next time.

      I guess you can call me an Uncle Tom, but I'd rather be called that than be called a sucker or an idiot.

      Posting as AC and talking about trust is an irony in itself. Anyone naive enough to think that the President can drastically change the way a nation works in 4 years is a sucker and an idiot. Government is slow to work and slow to react because it is run by a committee of people who have their own agenda that doesn't always align with the President's agenda. It doesn't matter who is in office, if the kids can't play nice then nothing gets done. Let's set the record straight, a President has the least control on how a nation is run, the body of Congress has most of the power. If you want one man to run the country move to Syria. If you want change, you should be more selective of your Congressional representatives instead of only paying attention to 2 people for ~6 months every 4 years.

    3. Re:Doesn't this bother anyone? by Politburo · · Score: 1

      Your interactions with the government must be fairly limited. The ability to access large amounts of data from anywhere has been a large improvement to many citizens.

      Yesterday it took me all of 5 minutes to get the USDA test results for e.coli in beef processing for all of 2011. How long would that have taken in a manual system?

      The other day, I searched a database of agency decisions going back to 1970. Being able to narrow the search results meant that I only had to manually go through 15 records instead of 20,000.

      Instant access to USGS and FEMA maps, tax forms and instructions, legislative history, the US code, the CFR, economic data, transportation data, energy data, agriculture data, etc etc etc.. all of which would previously have required a trip to the library (or DC) and then manually going through records. Even if you don't do this stuff personally, people are doing it and these systems save time, which saves money, which generally results in lower costs.

  4. And all over the world by overshoot · · Score: 1

    spies are ecstatic over the goodies that Uncle Sugar is about to drop in their laps.

    --
    Lacking <sarcasm> tags, /. substitutes moderation as "Troll."
  5. Data mining by Caerdwyn · · Score: 1

    With, of course, the certainty that the data mining capabilities will never be used for evil such as monitoring American citizens for the purposes of identifying nonviolent (but loud) political dissidents.

    The difference between this and Google is that you can haul Google into court when they do evil.

    --
    Everybody gets what the majority deserves.
    1. Re:Data mining by Caerdwyn · · Score: 1

      Except that Google (and other non-governmental entities) must respond when caught with their hands in the cookie jar.

      One of many examples: such as when Congress hauls their butt into a hearing over their privacy practices.

      The government, after all, is jealously seeking a monopoly on invasive privacy violations.

      --
      Everybody gets what the majority deserves.
  6. Just another corporate hand-out? by Anonymous Coward · · Score: 1

    That is what it looks like to me.

    I think Obama gave Solendra $500 million about one week before the company declared bankruptcy. The execs refused to tell anybody where the money went, and acted offended that anybody would ask.

    Big political contributions are probably about the best investment you make, I figure about a $10 return for every $1 invested.

  7. Re:Big Data or Big Pork? by c0lo · · Score: 1

    Big Data or Big Pork?

    Given it's about a bet, I would think of a race horse. Given the amount to be spent, I'm inclined towards a milking cow.
    'know what? Let's settle in between: it is actually a bet placed on a milking race cowrse.

    --
    Questions raise, answers kill. Raise questions to stay alive.
  8. Right investment, right time by macwhizkid · · Score: 4, Interesting

    I'm a hard science/computer science guy who's livelihood is working on various NIH/NSF projects. A common thread talking to other scientists the past few years has been the theme that the tools for data analysis have not kept pace with the tools for data acquisition. Companies like National Instruments sell sub-$1000 USB DAQ boards with resolution and bandwidth that would make a scientist from the early 1990's weep for joy. But most data analysis is done the same way it's been done since that same era: with a desktop application working with discrete files, and maybe some ad-hoc scripts. (Only now the scripts are Python instead of C...)

    The funny thing is, most researchers haven't yet wrapped their brains around the notion of offloading data onto cloud computing solutions like Amazon AWS. I was at an AWS presentation a couple months ago, and the university's office of research gave an intro talking about their new supercomputer that has 2000 cores, only to get upstaged 10 minutes later when the Amazon guys introduced their 17000 core virtual supercomputer (#42 on the top 500 list, IIRC). There's a lot of untapped potential right now for using that infrastructure to crunch big data.

    1. Re:Right investment, right time by Ruie · · Score: 1

      I was at an AWS presentation a couple months ago, and the university's office of research gave an intro talking about their new supercomputer that has 2000 cores, only to get upstaged 10 minutes later when the Amazon guys introduced their 17000 core virtual supercomputer (#42 on the top 500 list, IIRC). There's a lot of untapped potential right now for using that infrastructure to crunch big data.

      Big Data is about I/O not cores... How many GB/sec from disk can that cloud support ?

    2. Re:Right investment, right time by frosty_tsm · · Score: 1

      I was at an AWS presentation a couple months ago, and the university's office of research gave an intro talking about their new supercomputer that has 2000 cores, only to get upstaged 10 minutes later when the Amazon guys introduced their 17000 core virtual supercomputer (#42 on the top 500 list, IIRC). There's a lot of untapped potential right now for using that infrastructure to crunch big data.

      While very cool, some problems require more communication between threads and might not scale well on more distributed VMs. Still, very cool.

    3. Re:Right investment, right time by tomhath · · Score: 1

      This doesn't seem like a hardware problem. The government has been talking about "data fusion" for decades. They collect way more data than they know how to use. At a very high level this initiative sounds reasonable, but with no specific goals I wouldn't expect anything tangible to come out of it.

    4. Re:Right investment, right time by mapkinase · · Score: 1

      The sequencing data produced is of inferior quality than 10 years ago. Somebody might be weeping with joy, but not the assemblers downstream in this data flow.

      Ironically, previously quality did not matter much, when mapping genes was good enough. Nowadays when we are talking SNPs, reads are arguing with each other, MiSeq assemblies in disarray with Ion Torrent. Every sequence variation in alignment is screaming "I am Spartacus".

      The problem of opened Pandora boxes is solved by opening hundred more of them. Which is good - slap NGS on your LinkedIn experience and you are set for at 150G.

      --
      I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
    5. Re:Right investment, right time by mapkinase · · Score: 1

      If you need GB/sec for downloading data, you are asking wrong questions. AWS can not only store but also compute.

      BJI is producing so much data they are going back to shipping hard drives (flashback from 2000 for me)

      --
      I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
  9. Big Data? by Freddybear · · Score: 1

    Given this administration's record on "investments" and "betting on the future" that's just another $200,000,000 into the pockets of Big Democrats.

  10. The Act needs to be renamed.... by HotNeedleOfInquiry · · Score: 1

    The Oracle/Cisco/IBM Full Employment Act

    --
    "Eve of Destruction", it's not just for old hippies anymore...
  11. Transparency? by Anonymous Coward · · Score: 1

    When Obama said his administration would be the most transparent, it is apparent it was about how the government would make all of our private details transparent to government while making what the government does totally opaque to us.

  12. How about we use the data we already have first? by Anonymous Coward · · Score: 1

    Sure it's great that we can start to analyze lots more data but does anyone else think we should start using the data we've already got? The next time I hear a politician propose a policy that directly contradicts current research on the subject in order to pander to their constituents or Jesus I think my brain will explode.

  13. And the cost to move the data? by oneiros27 · · Score: 2

    Amazon is using the idle time of their huge cloud when it's not being used for christmas shopping ... so the cost of CPU is relatively cheap. Bandwidth and storage is *not* with most cloud sevices.

    So, say I need to calibrate a year's worth of SDO/AIA data ... that'd mean pushing to them somewhere in the range of 500TB of data, and then pulling it back again. They've changed their pricing so the transfer in is now free ... but if I'm doing the math right, that'd cost somewhere on the order of $30k for the transfers, and if we assume we're pushing it in and deleting it as soon as it's done, we don't need a lot of storage. For other processes, people *do* need the storage, which runs around $100/TB/month, so $50k ... per month.

    It's not as impressive, but it's more cost effective in the long run to build in your own processing near the data. Would it be nice to redo two years of calibration in a day, rather than the ~3hrs to process 1 day's data that it takes now? Yes, but we don't have the funding to pay for it. (every launch delay costs money (gotta keep the scientists employed, store satellites in machine rooms, pay for offices, etc.) ... and that money, without fail, gets taken from the actual running of the mission and the data analysis.

    What I'd personally like to see is more large scale infrastructure coordination, and for any project where the PI team's composed entirely of physicists yet they're designing and implementing their own data system be immediately de-funded.

    I'm not going to say that everyone should be using iRODS or OODT or whatever the next new sexy thing is ... but a physicist writing the drivers that run the tape drives? That's a sign something's gone horribly wrong, and yet it's still happening.

    --
    Build it, and they will come^Hplain.
  14. Military departments? by dutchwhizzman · · Score: 1

    It's all spent in military departments, according to TFA. I really doubt the community will see much back from this investment in the form of better open source tooling, since a lot of it will be used to deal with military secret stuff, no doubt.

    Also, given the amount of black ops money spent there and the "regular budget", this is nothing. The F22 project alone has budget overruns that make this look like pocket change.

    --
    I was promised a flying car. Where is my flying car?
  15. Solyndra The Sequel? by Squidlips · · Score: 1

    Have no fear, I am sure it will be run as well as other government ventures such a your local Registry of Motor Vehicles....

  16. Thanks Barry by TonyXL · · Score: 1

    Thanks for gambling my $$$.