Slashdot Mirror


Why Data Is the New Coal (theguardian.com)

An anonymous reader shares a report on The Guardian: "Is data the new oil?" asked proponents of big data back in 2012 in Forbes magazine. By 2016, and the rise of big data's turbo-powered cousin deep learning, we had become more certain: "Data is the new oil," stated Fortune. Amazon's Neil Lawrence has a slightly different analogy: Data, he says, is coal. Not coal today, though, but coal in the early days of the 18th century, when Thomas Newcomen invented the steam engine. A Devonian ironmonger, Newcomen built his device to pump water out of the south west's prolific tin mines. The problem, as Lawrence told the Re-Work conference on Deep Learning in London, was that the pump was rather more useful to those who had a lot of coal than those who didn't: it was good, but not good enough to buy coal in to run it. That was so true that the first of Newcomen's steam engines wasn't built in a tin mine, but in coal works near Dudley. So why is data coal? The problem is similar: there are a lot of Newcomens in the world of deep learning. Startups like London's Magic Pony and SwiftKey are coming up with revolutionary new ways to train machines to do impressive feats of cognition, from reconstructing facial data from grainy images to learning the writing style of an individual user to better predict which word they are going to type in a sentence.

75 comments

  1. But... by sbrown7792 · · Score: 4, Funny

    I thought data was Oreos!?

    1. Re:But... by Yvan256 · · Score: 0

      What do you mean, African or European Oreos?

    2. Re:But... by Anonymous Coward · · Score: 0

      I thought data was Oreos!?

      Nope, it's Orwellian!

    3. Re:But... by Matt.Battey · · Score: 5, Funny

      Bacon, ya I see it but, I propose data is like the new donut. Looks great on the outside, a little empty on the inside, and once you've eaten three, you wish you hadden't.

    4. Re: But... by Anonymous Coward · · Score: 0

      What do you think the filling in an Oreo is? That's why it has so many calories!

    5. Re: But... by Anonymous Coward · · Score: 0

      Aids?

    6. Re:But... by davester666 · · Score: 1

      Great. Now all I'm going to think about for the rest of the day is bacon-wrapped donuts.

      --
      Sleep your way to a whiter smile...date a dentist!
    7. Re:But... by Anonymous Coward · · Score: 1

      Data is like Sticky Notes. Having a few is useful and easy to manage. But it's easy for it to get out of control and become useless. That's where Big Sticky Note comes in.

    8. Re: But... by Anonymous Coward · · Score: 0

      Don't forget the donut holes or Timbits.

    9. Re:But... by garlicbready · · Score: 1

      Can someone please re-phrase the story in the form of a car analogy

  2. uh... by Anonymous Coward · · Score: 0

    what?

  3. Some editing, please! by sandmaninator · · Score: 1

    buy coal in to run it.???
    The problem is similar: blahblahblah
    Nonsense!

  4. A particularly grim metaphor by Anonymous Coward · · Score: 1

    Especially when you think about who the coal miners are, who owns the coal, and who the "coal" is.

    1. Re:A particularly grim metaphor by PPH · · Score: 2

      So when Microsoft buys Twitter, that will be like a mine collapse?

      --
      Have gnu, will travel.
    2. Re:A particularly grim metaphor by NetNed · · Score: 1

      No. More like a minecraft collapse.

    3. Re:A particularly grim metaphor by Anonymous Coward · · Score: 0

      Silicon Valley is the new West Virginia, whatever the hell that means.

    4. Re:A particularly grim metaphor by Anonymous Coward · · Score: 0

      As someone from West Virginia, I can tell you that it's not good.

  5. What the fuck is this??? by Anonymous Coward · · Score: 1, Interesting

    God. I'm sure this was written by a hipster. With think black glasses and a muslim-style beard. The whole nine yards, with bicycle and record player.

    1. Re:What the fuck is this??? by Anonymous Coward · · Score: 1, Funny

      Close. It was dictated into a record player/recorder by a hipster while biking to a remote coffee shop (you've never heard of it, don't ask). Then it was transcribed to paper by another hipster with a vintage 1884 typewriter. Later, the "masterpiece" was given to guy with a real job who spends much of his free time questioning his decision in friends so he could scan it, run text recognition, fix the errors in conversion, then upload it for all his "friends" to see on their phones as they sip microbrews and play canasta.

  6. Data is the new Uber by PopeRatzo · · Score: 0, Offtopic

    If you can't come up with a proper automotive analogy for a technology, get off my fucking Internet.

    --
    You are welcome on my lawn.
    1. Re:Data is the new Uber by Zontar+The+Mindless · · Score: 0

      I see your secret admirer's been hard at work again.

      --
      Il n'y a pas de Planet B.
    2. Re:Data is the new Uber by JaredOfEuropa · · Score: 0

      An analogy is... just a thought... with another thought's hat on.

      --
      If construction was anything like programming, an incorrectly fitted lock would bring down the entire building...
  7. But the internet is pipes by Chrisq · · Score: 1

    But the internet is pipes. Surely if data is coal it should be conveyor belts.

    1. Re:But the internet is pipes by Anonymous Coward · · Score: 1

      But the internet is pipes. Surely if data is coal it should be conveyor belts.

      You kids. Back in the day we had to shovel our data.

    2. Re:But the internet is pipes by PopeRatzo · · Score: 1

      There's still a data chute on the side of my house that leads down to the data-fired server.

      --
      You are welcome on my lawn.
    3. Re:But the internet is pipes by drinkypoo · · Score: 1

      There's still a data chute on the side of my house that leads down to the data-fired server.

      And there's a chute on the back side of my ass that leads to analogies about data being coal.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    4. Re:But the internet is pipes by Anonymous Coward · · Score: 0

      And there's a chute on the back side of my ass that leads to analogies about data being coal.

      No... that's a crappy metaphor.

    5. Re:But the internet is pipes by demonlapin · · Score: 1

      I can't believe I'm having to say this here, but the internet is a series of tubes, not pipes.

  8. That's "so true" by Anonymous Coward · · Score: 0

    In other words, it's beyond true.

  9. More like 'privacy invasion is the new coal' by Anonymous Coward · · Score: 0

    Keep your goddamned 'deep learning' machines out of my -- and everyone elses business -- or else.

    I hear rumblings about an upcoming 'civil war' or even a 'race war', but I think the next 'civil war' will not be over race or ethnicity, I think it'll be over our private lives being infiltrated and violated by so-called 'big data' assholes and their asshole machines, sticking their silicon-based noses into things that aren't any of their goddamned business.

    I love the smell of datacenters burning in the morning!

  10. Dear article writer: Listen to yourself by nitehawk214 · · Score: 4, Insightful

    "Data is coal, not oil."

    You sound like a moron.

    Sometimes things do not fit into your analogies. No matter how hard you try to force it.

    --
    I'm a good cook. I'm a fantastic eater. - Steven Brust
    1. Re:Dear article writer: Listen to yourself by meta-monkey · · Score: 1
      --
      We don't have a state-run media we have a media-run state.
    2. Re:Dear article writer: Listen to yourself by bluefoxlucid · · Score: 1

      Dissent on the +2 Troll moderation. This guy is an angry prick but I'm pretty sure the analogy makes no sense, at least not to any layman. Even to my senses, coal and oil are the basis of economy: all economy runs down to energy. Hunter-gatherers are solely concerned with food to power human muscle to hunt and gather; agrarian societies are similarly concerned, until they invent animal power (still food) and mills (water, wind, coal, oil, solar power). Societies require human time to produce the things required to live, and they reduce that time by technology, which eventually requires non-human energy: tractors harvest food more-quickly, mills process grain more-quickly, and we're freed from human labor time by extracting energy from coal and oil.

      Data is a commodity processed by technology, like cloth or sand. It's not a source. Data requires energy, and energy doesn't require much data at all--so little, in fact, that just basic human knowledge such as knowing that Africa is sunny and has geothermal hot spots in the north-east can tell you where to drop your solar and geothermal power plants. Data might be nice for squeezing out 1% more efficiency--and 1% of ENERGY is a hell of a lot--but society runs on energy, and data *needs* energy; energy doesn't actually need data, and society can get all of its energy needs without energy being built using big data infrastructure.

    3. Re:Dear article writer: Listen to yourself by nitehawk214 · · Score: 1

      I hope I can get this to +5 Troll.

      The thing I am most angry and prickish about is that people try to force analogies where they just don't belong in order to falsely direct the conversation. Analogies might be intuitive, but they are often so intuitive that it gives people the false sense of actually knowing what they are talking about.

      Internet of tubes and all that.

      --
      I'm a good cook. I'm a fantastic eater. - Steven Brust
    4. Re:Dear article writer: Listen to yourself by UnknownSoldier · · Score: 1

      > all economy runs down to energy.

      ^ THIS.

      > society can get all of its energy needs without energy being built using big data infrastructure.

      I'm not sure how you missed the fact that (Big) Data leads to Knowledge which leads to Power and Energy and ultimate Money.

      That's why Apple, Amazon, Google, Microsoft, etc. are all building huge data centers. They want a piece of the pie of influencing & controlling because ultimately it will bring profits.

    5. Re:Dear article writer: Listen to yourself by bluefoxlucid · · Score: 2

      The thing is big data lets you go to East Africa and use gajillions of samples to map out a statistical analysis of exactly what square meter of ground you want to tap into to get the most-likely absolute-best geothermal energy production. Rough knowledge lets you do ... about the same thing, just without taking it to planck scale.

      We're not talking about the difference between a 500 gigawatt production facility and a 900 gigawatt production facility; we're talking about 500 gigawatt versus 500.1 gigawatt.

      That's why Apple [datacenterknowledge.com], Amazon, Google, Microsoft, etc. are all building huge data centers. They want a piece of the pie of influencing & controlling because ultimately it will bring profits.

      Big data makes the difference between 30%-effective advertising and 70%-effective advertising. Big energy can go outside and run a thermal scan of the ground (from an air plane, using IR cameras) and then just pick somewhere for geothermal; THAT'S HOW ADVERTISING WORKS WITHOUT BIG DATA! If you just bluntly advertise based on a survey of demographics, you get significantly less conversion. You go into a city and say, "Hmm, lots of black people here, kind of poor, thug life, ok. Put up billboards about Ciroc featuring buff black dudes in do-rags with face tattoos." With big data, instead of running online ads that say, "You're in regional Baltimore, so let's show racially-profiled ads that basically assume you're a black gang thug," they can try to pinpoint exactly what behaviors describe the recipient of a particular ad, and serve an ad that matches their interests, thus get much more conversion.

      So, again, while advertisers might more than double their effectiveness by churning through piles and piles of data, all that effort gets power companies roughly zero over just taking a fly-over with thermal or sonic imaging. The most important data tool in oil prospecting is AUTOTUNE. They don't much benefit at all from big data. Neither does most other things (farming, manufacturing, music production, pharmacology, chemistry).

    6. Re: Dear article writer: Listen to yourself by Anonymous Coward · · Score: 0

      Coal was a raw material that powered the industrial revolution. Once you had coal, you could power steam machines and do all the things that a water wheel or windwill but at your speed and not dependent on the weather. Grinding grain, powering looms, threshing machines. The only limit of what you could do was your knowledge of machinery. The steam engine gets replaced with the combustion engine, the diesel engine, then the electric motor.

      We're seeing that evolution with CPU's and GPU's. First CPU's were slow and could only handle small amounts of data (8-bit @ 1MhZ). Now we have thousandcore GPU's with Gigabytes of dataspace arranged in on demand grid computing.

    7. Re:Dear article writer: Listen to yourself by Anonymous Coward · · Score: 0

      Best comment I've read in awhile.

    8. Re:Dear article writer: Listen to yourself by Anonymous Coward · · Score: 0

      I agree with the sentiment of your post and I laughed out loud, but honestly half the reason I hated this article was that the metaphor has potential. Coal when subjected to intense pressure and temperature transforms into diamond, which is useful in all sorts of ways.

  11. Call me dumb but... by Anonymous Coward · · Score: 1

    Am I the only one who found this to be gibberish that makes zero sense? Seriously what is this supposed to mean?

    1. Re:Call me dumb but... by nitehawk214 · · Score: 1

      Data mining causes global warming.

      --
      I'm a good cook. I'm a fantastic eater. - Steven Brust
    2. Re:Call me dumb but... by Anonymous Coward · · Score: 0

      It makes lots of sense. Machine learning techniques are like a steam engine and data is like the coal that makes them work. Machine learning techniques (like early steam engines which were only used at coal mines) aren't yet at the point of usefulness and efficiency to make them work for people who don't already have a lot of data (i.e. Google or Facebook).

    3. Re:Call me dumb but... by Anonymous Coward · · Score: 0

      Simple analogue taken too far equals an analogue lost in the deluge of data. See, what I did there? ;)

    4. Re:Call me dumb but... by Anonymous Coward · · Score: 0

      Well it sure generates a lot of hot air.

  12. Breathless breathless.. by Anonymous Coward · · Score: 1

    omigod omigod THIS is the NEXT BIG THING !! Don't MISS OUT in this ONCE IN A LIFETIME opportunity to get in on the GROUND FLOOR of the biggest thing since COAL !!!!

    cc: Mr Bulschiter and Associates,
      As per our agreement, Oh-pinion Makerz has placed 200 stories on social media designed to ignite the hype cycle for your company's offerings. This fulfills the terms of our contract.

    Please send fee to our offshore associates.

    Thank you.

  13. Data is the new beer. by SolemnLord · · Score: 3, Insightful

    It goes down smooth and tastes great but if you get more than you can handle you end up driving your car through a house.

  14. We thought data was the new beer... by SolemnLord · · Score: 0

    But it turns out that data is more like oranges: once you strip away the rough outer layers you've got something wonderful but you still have to divide it up or squeeze it.

  15. We thought data was the new orange... by SolemnLord · · Score: 0

    But now we know that data is the new web article: if you do a lot of digging you can find nuggets of vital importance in even the blandest drivel!

  16. We thought data was the new web article... by SolemnLord · · Score: 0

    But just here me out: data is the new joke format. It gets old fast and you immediately want something new and relevant that will give you an edge over the competition.

  17. Devonian? by Anonymous Coward · · Score: 0

    Man, he was old.

  18. They're not similar at all by Solandri · · Score: 1

    Coal was a new energy source - a way to replace human and animal labor with machine labor. This resulted in huge productivity gains (measured in productivity per person - productivity per Joule expended actually went down because coal energy was so much cheaper than human labor meaning inefficient machines could still be cheaper). The MO was dirt simple - take anything that used to require people or animals to expend effort to do, make a machine to do it, and power the machine with coal.

    Data is just data. Aside from a few data-processing tasks which have already been automated (OCR, statistical analysis), there is no dirt simple way to use data to reduce human labor. You can eek out a small productivity gain by using it to improve the efficiency of marketing (e.g. don't show bra ads to men), but that's pretty much it. The productivity gain is what's necessary to make it "better" than previous ways of doing things. Improvements in economic efficiency show up as productivity gains.

    Popularity is one way (probably the best way) to leverage data. You can use it to determine what's popular and position the marketing of your products in that direction. But that's a zero-sum game. Any increased sales you gain because you marketed your products better directly reduces sales of other competing products. This is totally different from coal (and oil) which enabled new methods of production, and thus weren't zero-sum.

  19. Sure, but... by fahrbot-bot · · Score: 4, Funny

    After sitting through many code reviews, I can tell you it's not clean coal.

    --
    It must have been something you assimilated. . . .
  20. Just like Coal by Oswald+McWeany · · Score: 1

    Just like coal- my data is polluted. I deliberately try and pass as much misinformation (when I can) into companies that collect my data. Obviously a lot of it I can't.

    Part of it is for self-protection and privacy- and part of it is because it amuses me and I have a weird sense of humor.

    --
    "That's the way to do it" - Punch
    1. Re: Just like Coal by Anonymous Coward · · Score: 0

      Please tell us how you do this so we may also do this. Us idiots figured we had no control.

  21. A new achievement in Slashdot's recent history by jenningsthecat · · Score: 2

    For the first time since I can remember, TFA was actually written more poorly than TFS. Of course, that wasn't not too hard; TFS only contained one paragraph from the article, while TFA itself went on and on and on in an a meandering, fuzzy-headed, buzz-word-filled fashion that said nothing and went nowhere. As a bonus, that 'coal' metaphor seems to have come straight from a cannabis-induced moment of "enlightenment".

    --
    'The Economy' is a giant Ponzi scheme whose most pitiable suckers are the youngest among us and the yet-unborn.
  22. Typical near-top-of-bubble stuff by ErichTheRed · · Score: 3, Insightful

    Just like the dotcom bubble, there are entire companies whose fate hinges on massive uptake of the "big data" and "deep learning" revolutions. And just like the hype cycles from the last bubble, there's some truth to them but people really take it to an extreme to get headlines and clicks. I think when the bubble pops, there will be plenty of "real" big data problems for serious qualified people to solve, as well as legions of unemployed "data scientists" and "cognitive champions."

    I think applying data analysis techniques to societal problems (emergency response, environmental issues, etc.) is a good thing. I don't think the current focus of ever more intrusive advertising and behavior analysis is going to add much value in the long run. This isn't a tinfoil-hat style rejection of tracking, it's my belief that even the dumbest of consumers are going to reach a point where they can't stand having ads shoved in their face anymore and demand that it stop. Ever notice how commerce sites email you when you put an item in your cart, then don't buy it? Lots of sites have at least buried a setting somewhere in their account configs that let people turn this off. No one ever went broke overestimating the stupidity of the average consumer, but pushing things on every channel (phone, computer, tablet, streaming ads, browser ads, etc.) will lead to consumer fatigue.

    1. Re:Typical near-top-of-bubble stuff by Anonymous Coward · · Score: 0

      So being old enough to work through the AI Bubble of the late 1980's I'm having Deja Vu!

  23. Data does reduce human labor by sjbe · · Score: 1

    Aside from a few data-processing tasks which have already been automated (OCR, statistical analysis), there is no dirt simple way to use data to reduce human labor.

    Complete nonsense. Computers are how you use data to reduce human labor and tons of tasks have been automated. To take the analogy further oil by itself is useless. You need a machine to do something useful with it. Data is the same way. By itself it is comparatively useless but with a computer you can do a lot to reduce human labor. For example CAD or bookkeeping or inventory are all data processing tasks which substantially reduce human labor with the help of data.

    1. Re:Data does reduce human labor by Anonymous Coward · · Score: 0

      Big DATA = Go AI and Chess AI.

  24. Okay, if big data the new coal... by slew · · Score: 1

    Okay if big data the new coal, we should stop using it now because although it is currently cheap and plentiful with apparently many applications, we know eventually it lead us to the collapse of civilization.

    Maintaining access to big-data will eventually cause political conflicts and maybe even wars, and continuing unrestrained usage of big data will eventually cause inconvenient problems in our daily lives that will make our world unliveable and our society unsustainable. The money exploited by the early adopters in the big-data industrial complex will dominate the political landscape and prevent us from doing anything about constraining this monster until it is too late.

    If you could have put a cap on companies like the Peabody coal company back in the early days, you wouldn't ever hear statements like this today from coal company analysts...

    “We have never seen leases of more than a billion tonnes and we are starting to see that under the Obama Administration.”

    If the Obama administration's department of Interior can be bought-and-sold by a coal company with annual revenues of only $5B, what hope do governments have against big-data companies with annual revenues of $74B?

    Any other analogies to coal people would like to say about big-data?

  25. Re:Ah, I see by Mister+Transistor · · Score: 0

    You mean mindlessly repeated buzzwords like "pivot point"?

    --
    -- You are in a maze of little, twisty passages, all different... --
  26. There's Not Enough of It and Never Will Be.... by littlewink · · Score: 1

    to justify hitting it with new deep learning/NN technology. Also, most of the old data has been "mined out" with standard statistical techniques.

    Oh, they can do the exercise: chase phantoms, declare that they "found" something new (the same old trends previously found again), with the very occasional true discovery. But by and large, 15 minutes after the data is released, everything we already know will be confirmed and anything new we can know or will know will be revealed.

    Nothing here, Folks. Move along please.

  27. Incoherent summary is incoherent by robot256 · · Score: 1

    Ignoring the grammatical errors and typos, I assume the closer was supposed to be something like, "But right now, the only organizations actually using deep learning techniques are the ones who produce the big data in the first place, and they are using it for their own purposes. We have not yet reached the point where big data or deep learning are being commercialized by third parties."

  28. Re: Ah, I see by Anonymous Coward · · Score: 0

    Pivot point also means dick. How can it be a buzzword? Oh I get it, all buzzwords actually mean dick.

  29. As Jeffrey Albertson might say by martinX · · Score: 1

    Worst analogy ever.

    --
    When they came for the communists, I said "He's next door. Take him away. Goddam commies."
  30. I Won! by Anonymous Coward · · Score: 0

    When I saw "coal" on my Buzzword Bingo sheet, I thought I was screwed!

    But someone renamed something old and they named it coal and I won!

    America, fuck yeah!

  31. Missing the Key Part To Make Sense by crunchygranola · · Score: 1

    The summary should have included the next paragraph of the article to make any sense:

    And yet, like Newcomen, their innovations are so much more useful to the people who actually have copious amounts of raw material to work from. And so Magic Pony is acquired by Twitter, SwiftKey is acquired by Microsoft – and Lawrence himself gets hired by Amazon from the University of Sheffield, where he was based until three weeks ago.

    Or else the poster should have just written a terse summary and not just cut and paste paragraphs. Yeah I know, this is slash-dot...

    --
    Second class citizen of the New Gilded Age
  32. Peak Data.. by Anonymous Coward · · Score: 0

    The good news is that Peak Data lies just ahead, predicted for early next week. Things should settle down from there on...

  33. Big Data leads to social cooling by Anonymous Coward · · Score: 0

    In the humanities there has been a lot of research into the many metaphores used for data, and how they effect society's view on data-hungry operations:
    http://dismagazine.com/discussion/73298/sara-m-watson-metaphors-of-big-data/

    Personally I think both oil or coal are more useful metaphores than gold, because in contrast to gold, the public understands these as having both up- and downsides. The problem with the gold metaphore is that it doesn't point to the serious problems data gathering can lead to.

    I worry that the pumping of data will create a lot of chilling effects and self-censorship. The widespread use of oil lead to global warming. I think the widespread use of data could lead to "social cooling".