Slashdot Mirror


As Big Data Plateaus, Data Science Education Grows

gthuang88 writes: Even as the hype around big data has died down, opportunities for data scientists are expanding. Johns Hopkins, NYU, and MIT are among the schools offering courses in data science, and IBM and other big companies are investing heavily in training programs. Now a startup called DataCamp has raised $1 million to expand online courses in R programming, Apache Spark, and other topics. The deal speaks to the opportunity that venture capitalists see in training the next generation of data scientists and business analysts. It also shows online education is specializing beyond platforms like Codecademy, Coursera, and Udacity.

41 comments

  1. Data ?Science ? by Crashmarik · · Score: 1

    Much more an engineering discipline than a science.

    1. Re:Data ?Science ? by Mr+D+from+63 · · Score: 1

      I'd call it a scientific instrument. Like a microscope. Scientists learn how to use it, engineers design and build it.

  2. "Data Science"? by Anonymous Coward · · Score: 2, Interesting

    "Data Science"? Why the fuck isn't it just called "Statistics", like it has been for centuries?

    1. Re:"Data Science"? by Anonymous Coward · · Score: 0

      Because it is Web Scale!

    2. Re:"Data Science"? by Crashmarik · · Score: 1

      Well give them something there is the information extraction going on. Don't see how that rises to a science.

    3. Re:"Data Science"? by umafuckit · · Score: 1

      This is somewhat fair, but when I think "statistics" I think fitting a GLM to extract some parameters or test for significance. When I think data science, I think of stats but I also think of dimensionality reduction, visualisation, lots of programming, clustering, etc.

    4. Re:"Data Science"? by LaurenCates · · Score: 1

      "Statistics" isn't cool enough, and since it doesn't have "science" or "mathematics" in the name, it doesn't fit into the acronym "STEM".

      And you can't invoke the mighty need for "STEM" if you have to make people think about how it's related.

      --
      Some people don't believe in fairies. I don't believe in The Patriarchy.
    5. Re:"Data Science"? by Anonymous Coward · · Score: 0

      Statistics means "numbers of/for the state". Those approaches can be used by Gov't funded academics. Private organizations will use data science.

    6. Re:"Data Science"? by Anonymous Coward · · Score: 0

      In the best "Big Data" effort, you are attempting to establish baselines from your data (control versions of the sampling) and control samplings of your data in an attempt to invalidate your null hypothesis.

      That said, there's a lot of people out there that just yell the equivalent of "5 is a really important finding!" Also there's the big cohort of people using big data tools to gather reports that have little to do with science, as there is little attempt to invalidate their assumptions.

    7. Re:"Data Science"? by gweihir · · Score: 1

      Because then it would not sound "modern" and "hip". Personally, whenever I hear "Data Science", I think "bullshit", because it really is just statistics, but with big egos and small skills.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    8. Re:"Data Science"? by Anonymous Coward · · Score: 0

      It has shards and map-reduce!

    9. Re:"Data Science"? by invalid_user · · Score: 1

      Actually, don't worry so much about the ego thing. I teach data science, and I am fairly candid about the "b******t" thing. Many other people I know who does "Big Data" knows and admits it's "b******t", too.

      As for the skills part... hmmm, you want to pick a fight with me? :)

    10. Re:"Data Science"? by gweihir · · Score: 1

      Well, if you admit that your respective skills are in "bullshit", then there is obviously no need to fight.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    11. Re:"Data Science"? by penguinoid · · Score: 1

      "Data Science"? Why the fuck isn't it just called "Statistics", like it has been for centuries?

      Because statistics show that you can get more money by calling it data science.

      --
      Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
    12. Re:"Data Science"? by invalid_user · · Score: 1

      No, I do not admit my skills are "bullshit", though the skills required of current "Big Data" stuffs are.

      The fight is on? ;)

  3. Filter Firehose entries rated red or better by Anonymous Coward · · Score: 0

    WTF and when i click the link nothing happens. Also when I do a search from the page of a single story and not the front page nothing happens except that single story just reloading. I feel like there's literally no one left at Slashdot offices just a bunch of Dice shmucks who check in once a week to fuck things up at this site little by little. After Beta failed they just fired everyone and decided to just come in every now and then and ruin the classic UI slowly enough the resistance is too small to cause a big enough uproar. Dice, you suck.

  4. This field has already yielded a lot of nonsense by Karmashock · · Score: 0

    The issue is that if you massage the numbers hard enough they'll say whatever you want. And often what is being done to them is so complicated that it is hard for anyone that isn't very familiar with the specific algorithms to even know what happened.

    A big issue here is that it is very hard to audit this sort of science and if you are inclined to sensationalize your paper or make it appear more interesting than it would otherwise... it doesn't appear to be hard to do.

    What I hope comes out of this education is clever young people that will hold their peers to reasonable standards and know the math well enough to spot nonsense when they see it.

    --
    I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
  5. Saying about fields with word "science" in title by blahbooboo · · Score: 1

    If a field has the word "science" in the title, it's probably not science.

  6. Viagra scholarship by Anonymous Coward · · Score: 0

    Is it just me or does the title read like a scholarship from Pfizer?

  7. Re:This field has already yielded a lot of nonsens by Mr+D+from+63 · · Score: 1

    I agree, and think you articulated it well. There is too much abuse of data, often but not always unintentional. Better education in how to work with data should be a good thing. Even in base curricula, not just science.

  8. R programming by tomhath · · Score: 2, Insightful

    There's no point in learning R if you don't have a solid background in statistics.

    1. Re:R programming by Anonymous Coward · · Score: 0

      > statistics

      Data science, please.

    2. Re:R programming by gweihir · · Score: 2

      You can still produce a lot of nonsensical results. Like people that learn Java or JavaScript but have no solid ComSci background produce a lot of bad software.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    3. Re:R programming by Anonymous Coward · · Score: 0

      Math. It's called math. If you're good with it then you can program in QBASIC, but if not all the R/Python/Javascript/Custom-lisp-tailored-to-your-neurolinguistic-patterns won't help you.

    4. Re:R programming by umafuckit · · Score: 3, Insightful

      I disagree. By learning R I developed a much better grounding in statistics than I had previously. This is because it doesn't hold your hand and you have to actually think in order to use it.

    5. Re:R programming by Anonymous Coward · · Score: 0

      Frankly, R shouldn't be your first language, and probably not even your second.

      Take a C, C++ instead of a bullshit humanities class, and take a Python class instead of a US History class. Then take R instead of Anthropology/Sociology/Human Sexuality.

      With so much "fluff" in your typical college diploma, the "computer science skills shortage" is hard to take seriously. Why not divert all the people taking Spanish/German/Chinese/Russian/Latin/Arabic/American Sign Language/[insert foreign language requirement] in to a Python class for example?

      Unlike learning to speak sign language, Python might make your average STEM worker more effective in their job. I don't mean to suggest that deaf people don't count as real people(no comment) but it's pretty much irrelevant anyway because by the time a change like this has been made to academics: Google/Dragon Natural Speaking will have Google Glass translating spoken language to written text in real time.

      I've seen what happens when you give someone with zero grounding in real programming languages a high level scripting language like Matlab/.m and it isn't pretty.

    6. Re:R programming by Anonymous Coward · · Score: 1

      With so much "fluff" in your typical college diploma... Why not divert all the people...

      If you aspire to be a code monkey for the rest of your life, don't go to college. Just try to convince a prospective employer that your awesome C++ skill is just what they need; maybe it is, this year.

    7. Re:R programming by twistedcubic · · Score: 2


      Frankly, R shouldn't be your first language, and probably not even your second.

      R is not a general purpose programming language, even though you can write R programs. There are R programming courses (Johns Hopkins, Coursera), but you can get pretty far with using R as a statistical calculator, without ever writing a complete program (ala Octave or Matlab).

    8. Re:R programming by Anonymous Coward · · Score: 0

      Actually, I've found that learning the syntax of another natural language has a lot in common with learning a highly structured language like C or Python. I wouldn't say that learning a foreign language is a total waste of time. The cognitive gymnastics you have to go through to master irregular verbs, word order, etc parallels picking up programming idioms and idiosyncratic frameworks where developers have settled upon an at least somewhat arbitrary convention (a 'signifier', according to semioticians). In both cases (natural vs programming languages) though, immersion is ultimately what counts the most (especially in terms of how deeply you process a language).

  9. Really??? by Anonymous Coward · · Score: 1

    " IBM... ...investing heavily in training programs"

    Forgive my cynicism, but that doesn't sound like the IBM I know.

    1. Re:Really??? by Anonymous Coward · · Score: 0

      " IBM... ...investing heavily in training programs"

      Forgive my cynicism, but that doesn't sound like the IBM I know.

      Perhaps IBM is investing heavily in selling training programmes not providing training to their employees?

    2. Re:Really??? by tomhath · · Score: 1

      IBM sells consulting. They have highly trained experts available in whatever technology is hot this year, just write the check and all your problems will go away.

  10. Re:Saying about fields with word "science" in titl by gweihir · · Score: 1

    Fortunately then for me that I graduated in "Informatics", not in "Computer Science". Of course, all you people in the English-speaking countries are screwed...

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  11. Data Scientists == Web Masters by Anonymous Coward · · Score: 0

    I say it every time... Data Scientists are this bubble's Web Masters.

  12. Combo of CS, Stats, Analytics, & domain knowle by langelgjm · · Score: 4, Insightful

    While I think this has changed over time, initially I think some statisticians were suspicious of techniques coming out of computer science, e.g. SVMs. And still, machine learning is a rather niche field of statistics that requires a fluency in CS that many statisticians don't have (or need). Check out this discussion.

    Of course there are some statisticians who are also good CS people (think Trevor Hastie and Rob Tibshirani). And a lot of stats people have great domain knowledge in their areas. But I think "data science" is supposed to be the combination of stats, CS, practical programming ability (e.g., cleaning and manipulating large datasets, which is definitely not part of traditional CS or stats education), ability to communicate results effectively, maybe throw in some visualization, knowledge of how to query databases, and domain knowledge to interpret what data mean. Also, some types of data (e.g. text with the aim of NLP) are pretty infrequently touched upon in stats education.

    That said, I get the sense that a lot of places looking for "data scientists" are actually just looking for business intelligence people.

    --
    "Anyone who [rips a CD] is probably engaging in copyright infringement." - David O. Carson
  13. Headlines should rhyme, all of the time. by mark-t · · Score: 1

    It's catchy and bright, if one's not uptight,
    To catch up on news with a poetical muse.

    No?

    (crickets)

    (goes away and sulks in corner)

  14. Re:Combo of CS, Stats, Analytics, & domain kno by invalid_user · · Score: 1

    Yes, and if America can produce enough data scientists, you don't have to H1B-hire SAP "consultants" (shoot me please for saying that) from India.

    It's a win(Americans)-lose(SAP(German company))-lose(Indian consultants) situation. :)

  15. Hot skills become commodity skills all the time by Anonymous Coward · · Score: 0

    Just another example of yesterday's hot skill becoming today's commodity skill with a glut of "coders" who can do it.

  16. Re: Combo of CS, Stats, Analytics, & domain kn by Anonymous Coward · · Score: 0

    I actually did the Coursera things from JHU (auditing Stanford, eyeballing U of I) because I wanted to do things w/ text cheaply (not NLP necessarily, textual analysis dates to either the Babylonian Captivity or the Council of Nicea in 325 a.d., or maybe earlier.)

    Big strength of the online courses was you got to see what languages people are using. JHU it was R, Stanford had Octave, UofI had no programming, though a few in the forums were swapping Java/scala/Python.

    While bogus data stuff flourishes, especially in politics, I was offered a job when I called up my local police department, asking for crime stats to use in a final project, and they were very aggressive in trying to get me, so I'm not sure the hype has gone out quite yet.