Slashdot Mirror


It's Time to End the 'Data Is' vs 'Data Are' Debate (vice.com)

dmoberhaus writes: After receiving too many irate emails about using "data" in the singular, a reporter spoke to two lexicographers about how the language changes over time and why it's perfectly acceptable and perhaps even "standard" to use data as a singular noun, rather than a plural noun in an attempt to settle an old debate. Peter Sokolowski, a lexicographer for the Merriam-Webster Dictionary, told the reporter that data's transition between its historical roots and contemporary use is related to a lexical phenomenon called "semantic bleaching," where a word's original meaning is lost or diminished over time. An example of semantic bleaching include the contemporary use of the word "literally," whose Latin root, littera, means "letter." In the case of "data," it has transitioned from "things given" to mean something like "a collection of information in aggregate" when used in everyday speech.

31 of 248 comments (clear)

  1. Is it? by TechJones · · Score: 2

    Are we sure it is?

    1. Re:Is it? by wonkey_monkey · · Score: 2

      I'm pretty sure it aren't.

      --
      systemd is Roko's Basilisk.
    2. Re:Is it? by Waffle+Iron · · Score: 5, Insightful

      Are we sure it is?

      We're pretty sure, but we need to wait until more data is available before we officially close the debate.

      We're pretty sure, but we need to wait until more data are available before we officially close the debate.

      Well, that settles it: The second form just feels weird and stilted, like a grammar rule from a musty out-of-date dictionary. Debate closed.

    3. Re: Is it? by Austerity+Empowers · · Score: 5, Funny

      This is why Dr. Pulaski lasted only one season, she was an inhuman monster.

    4. Re:Is it? by CanadianMacFan · · Score: 4, Insightful

      We're pretty sure but we need to wait until we have more data before we officially close the debate.

      When in doubt change the sentence so that you get around the tricky bit.

    5. Re:Is it? by Hallux-F-Sinister · · Score: 3, Informative

      You've (perhaps deliberately) misunderstood the argument. In your example, is you/are you, the choice of conjugation of the verb "to be" is based on "you" not "data". This is like people arguing about the difference between rope and line (on a boat or a ship) and you come along and say "a line is the shortest distance between two points, and that rope you're all arguing about is coiled up, so..." which has nothing to do with what they're talking about.

      Incidentally, is it my family IS or my family ARE? Data can be multiple pieces of information even about multiple things, and as a single umbrella name for that information, it would be appropriate to say "the data is..." just like "my family is". Data is information, and we don't say "the information are missing". We say "the information IS missing"...

      The data is all in, and my family entirely agrees with me on this point. (See, data here, multiple pieces of information is treated as a singular noun, just like family.)
      You could also say "The data are ..." Hmm... actually I don't like that one. I came here to make ONE point and ended up taking a side instead. DAMNIT!

      Okay, put me down as "Data IS". NOW... is it DAY-tuh, (first syllable rhymes with LAY) or DAA-tuh? (first syllable rhymes with CAT)?

      I'm in the "rhymes with LAY" camp. So "The DAY-tuh IS..."

      Damnit. I was going to make ONE comment and leave. ONE lousy stupid comment and get on with my fucking life... DON'T get drawn into this stupid arguement, I said to myself...

      (Wanders off muttering to self)

      --
      Our reign has gone on long enough. Indeed. Summon the meteors.
  2. At least we've still got the gif debate by Anonymous Coward · · Score: 3, Funny

    I hate when people pronounce gif as gif instead of gif. Everyone knows it's gif!

  3. What debate? by cybersquid · · Score: 2, Insightful

    How is this a debate? Look in a dictionary. The word has a literal dictionary definition. :-)

    The two dictionaries I just checked say data is the plural of datum.

    People who use it differently either knowingly choose to (which is fine with me) or they are ignorant.

    1. Re:What debate? by dgatwood · · Score: 4, Informative

      The problem is that the word is more commonly used now as a synonym for "information". You would never say "informations". At this point, it is mostly treated as plural in scientific contexts, and even there, it has often been superseded by the compound word "data point", which is obviously and trivially pluralizable.

      BTW, Oxford weighed in a while back.

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    2. Re:What debate? by dgatwood · · Score: 5, Informative

      For example, you never say: "I found a data" just like you never say "I found a stuff".

      I agree. However, the reason you would never say that is because "stuff" is uncountable. Using the article "a" is nonsensical, because that implies that there can be exactly one of something, and thus it must be countable. Just as you can't have exactly one, you can't have more than one, hence it is neither singular nor plural, per se. If "data" can't be used in that way for the same reason, then it, too, is an uncountable mass noun.

      So no... the word "data" cannot be singular.

      Except uncountable nouns in English always take a singular verb, e.g. "This stuff is gross," not "This stuff are gross". "The flour is in the cupboard," not "The flour are in the cupboard," and so on.

      The only way "data" can be plural is if you treat it as the plural of datum, which only makes sense if you are talking about a specific, countable set of data points. The result of an experiment produces data that is a collection of datum, hence ostensibly countable, so using it in the plural form is acceptable. When we start talking about the flow of data across a network, that's not really countable in any meaningful sense, because it varies from moment to moment, so it is uncountable, and must take a singular verb.

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    3. Re:What debate? by dgatwood · · Score: 2

      Sorry. I'm new here. :-D

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    4. Re: What debate? by crunchygranola · · Score: 2

      Data is a collective noun. You can treat it as either singular or plural as long as you stick with it.

      As for datum, that's not a word people use. It's data or piece of data.

      "Datum" is in active use, but not as a singular for "data". Instead it means some kind of reference point as in , for example geodetic datum.

      --
      Second class citizen of the New Gilded Age
  4. Data are an Android by Anonymous Coward · · Score: 5, Funny

    "Data are an Android." No sir, I don't like it.

  5. How do you tell if someone is an idiot? by JoeDuncan · · Score: 2

    They participate in inane meaningless debates like "is data plural or singular"?, while the rest of the world just laughs at them and keeps on.

    1. Re:How do you tell if someone is an idiot? by cybersquid · · Score: 4, Funny

      Thank you for your participation. ;-)

  6. No by wonkey_monkey · · Score: 4, Funny

    No it aren't.

    --
    systemd is Roko's Basilisk.
  7. like deer and fish, but the other way around by XXongo · · Score: 5, Interesting
    "data" is the plural of "datum", but it's also the singular of "data".

    kinda like "deer" is both the singular of "deer", and also the plural of "deer".

    1. Re:like deer and fish, but the other way around by plague911 · · Score: 2

      So either use would be correct. "The deer is" and "The deer are" would both be valid, but have different meanings.

    2. Re:like deer and fish, but the other way around by fibonacci8 · · Score: 5, Funny

      The plural of "pope" is "the Western Schism".

      --
      Inheritance is the sincerest form of nepotism.
  8. Plural in Latin, singular in English by Tough+Love · · Score: 4, Informative

    Plural in Latin, singular mass noun in English, does it need to be any more complicated? Strictly speaking, if you mean to write the latin word in English prose then you should italicize it.

    --
    When all you have is a hammer, every problem starts to look like a thumb.
  9. What about hospital? by 110010001000 · · Score: 3, Interesting

    I still want to know why in the UK they say "in hospital" instead of "in the hospital". What's up with that? And don't get me started with "math" vs "maths".

    1. Re:What about hospital? by JoeDuncan · · Score: 3, Insightful

      That one's easy. When you say "in the hospital", the use of the definitive article "the" implies that it is a *specific* hospital they are in (usually with some understanding that the reader/listener is already aware of which *specific* hospital is under discussion).

      When you say someone is "in hospital" it is a more general statement, saying that they are in a hospital somewhere receiving medical treatment, but does not imply that the *specific* hospital in question is already shared knowledge with the listener.

      USians tend to use "in the hospital" for the most part because their health care system sucks balls and in most places there is only ONE local hospital which you could be referring to.

      In other countries with proper healthcare, there are multiple possible hospitals, and the specific hospital can't be assumed by context.

    2. Re:What about hospital? by JoeDuncan · · Score: 4, Interesting

      Spoken like a true and proper douche who has never left their own town. Is it really any wonder why there's a strong anti-europe sentiment in the US when there's people as condescending as yourself?

      Wow. You really are an idiot.

      I'm Canadian, moron. Also, the only douches here are the ones posting AC LOL...

      Realistically, I can take any socialized health care system and point out to a way that it's inferior to the US system.

      No, you can't, because they aren't. The US has the worst health care in the developed world; Mexico and Cuba are better.

      The thing people as naive as yourself don't realize is every system has its positives and every system has its negatives.

      Nope. The thing USians don't realize - because they've deliberately lobotomized their educational system, and therefore have a hopelessly parochial and myopic view of the world - is that health care is better just about ANYWHERE in the world that isn't a third-world banana republic, and even some of THOSE have better health care!

      Honestly, I don't know what else to expect from a degenerate culture that uses their own children for target practice though...

  10. Just came here to argue the opposite case by SuperKendall · · Score: 3, Insightful

    ...but then I read the summary and was too bored to figure out what the opposite case is supposed to be.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
  11. Re: What debate? Mass vs Count noun by Anonymous Coward · · Score: 2, Interesting

    The real distinction is between a 'mass noun' and a 'count noun.' When your server asks you "would you like french fries or mashed potato?", french fries is a count noun and mashed potato is a mass noun. We usually use water as a mass noun, but biblically, waters meant multiple (countable) bodies of water.

    So the question about 'data' is whether it has transitioned from something countable to something measured in bulk ( e.g. ounces of rice versus grains of rice). The claim here is that more people are using data as a mass noun, and I tend to agree.

    Full disclosure: decades ago I was a C student in highschool Latin.

  12. Let's compromise by reboot246 · · Score: 3, Interesting

    I'll stop saying "data are" when the rest of you stop using plural verbs with collective nouns. The team "IS" winning, not the team "are" winning. Sure, they may say it like that across the pond (the UK), but here (the US) collective nouns are singular and take a singular verb (most of the time - there are a few exceptions).

    1. Re:Let's compromise by bigdavex · · Score: 2

      So much this. If you want a plural meaning, using a plural noun. THOSE PLAYERS are winning.

      --
      -Dave
  13. Re: What debate? Mass vs Count noun by Anonymous Coward · · Score: 2, Insightful

    That's nice that you knew some Latin and are also familiar with the mass/count distinction, but I question whether you're a native English speaker, when you claim that a server asks if you want "mashed potato". It's always always always mashed potatoes.

  14. Re:Whatever by mikesum32 · · Score: 2

    That typo claim factually incorrect.

  15. It's worse with dollars by slickwillie · · Score: 4, Insightful

    As in " One million dollars was spent . . ." instead of "were spent".

  16. Re: Whatever by Anonymous Coward · · Score: 2, Informative

    It was named aluminum by the discoverer and renamed because it didn't sound like those other metals.

    At this time, both are accepted.