Slashdot Mirror


Tracking The (English) Words We Use

Zugok writes "Wordcount.org has an interactive presentation of the 86,800 most frequently used English words. In addition they have Query Count which is a dynamic database of what are the most queried words on Wword Count. Then there is the conspiracy corner where certain words seems to end up in some sort of eerie order. Cowboy comes 14834 and Neal comes 18928. Bebop comes 70673."

67 of 332 comments (clear)

  1. another word by mrpuffypants · · Score: 4, Informative

    fuck is number 5598

    Actually, I expected this to be higher since I watched Goodfellas last night.

    1. Re:another word by Rosyna · · Score: 4, Funny

      Bite: 5922
      My: 69
      Shiny: 8590
      Daffodil: 27591
      Ass: 15036

      I am actually quite disappointed that this wasn't from the bite-my-shiny-daffodil-ass dept. Tsk, tsk. Hemos.

    2. Re:another word by Hinhule · · Score: 2, Funny

      I think it just went up quite a bit since the hosts realized their servers had been slashdotted.

  2. Nice flash by bizpile · · Score: 2, Interesting

    That has to be the coolest use of Flash I have ever seen that wasn't simply an animation. I guess I won't adblock it.

    1. Re:Nice flash by Anonymous Coward · · Score: 5, Interesting

      Except it is one of the most annoying interfaces I've used. There's no way to know what you're suppose to click on, and there's limited space that displays like 3 words at a time in a giant font. I would prefer to see even the most rudimentary HTML so I could scroll through a list of 100's or 1000's of words at a time.

    2. Re:Nice flash by rdc_uk · · Score: 5, Interesting

      Pretty,
      but possibly the most useless UI for list-format data ever; I can only read the first (counts) 19 entries, (can't read the numbers after 10). After that you have to do random sampling.

      Browsable Lists - the past and future of basic data presentation!

    3. Re:Nice flash by julesh · · Score: 2, Insightful

      Yeah, very cool.

      "A script in this movie is causing Macromedia Flash Player 6 to run slowly. If it continues to run your computer may become unresponsive. Do you want to abort this script?"

  3. gee.. by g-to-the-o-to-the-g · · Score: 3, Interesting

    Hm...I would have thought things like "the", "and" or "or" would have beat out "dog" "pussy" "sex".

    1. Re:gee.. by drunkennewfiemidget · · Score: 5, Funny

      You're grossly over-estimating the general public. ;)

  4. Love Hate by richardoz · · Score: 5, Insightful

    At least love @384 ranks above hate @3107

    I think the world isn't so bad...

    --
    All the worlds indeed a .sig, and we are mearly players..
    1. Re:Love Hate by bizpile · · Score: 5, Interesting

      At least love @384 ranks above hate @3107

      But war(304) beats peace(1155).

    2. Re:Love Hate by KRYnosemg33 · · Score: 5, Insightful
      Sure, but what you fail to realize is that the word is most often used in the following cases:
      "I love Britney Spears"
      "I love M. Jackson"
      "I loved Gigli"
      "I love [insert political candidate] because of [insert extremely dumbass reason]"

      You realize the world isn't the poetic and romantic place you think it is.

    3. Re:Love Hate by Lemmy+Caution · · Score: 4, Funny

      "I loved Gigli"

      I suspect that yours is the first use of this phrase, ever.

    4. Re:Love Hate by DrEldarion · · Score: 3, Insightful

      Yet people are still terrified of saying "I love you".

  5. NSFW! by welshwaterloo · · Score: 5, Informative
    In case anyone's curious at work - don't click the link to see what other people are searching for.

    I mean, I guess I should've known, but I didn't expect the font size to be so damned *large*!


    (Not, of course that anyone would waste work time by reading /.)

    1. Re:NSFW! by Cederic · · Score: 3, Funny


      Hmm. Thing is, there's pretty few words likely to appear high on the search list that I don't use verbally every day anyway.

      Unless lots of people are searching for 'theocratic'. I don't use that one much.

      ~Cederic

    2. Re:NSFW! by mykdavies · · Score: 2, Insightful

      Fortunately, the webmaster's decision to use Flash means that that wasn't a problem for anyone with Flash disabled.

      But, using Flash to display a list of words???

      --
      The world has changed and we all have become metal men.
  6. You know this world is in trouble by Lispy · · Score: 5, Insightful

    when the word "money" makes place 227 while "love" is at 384. Or maybe I am just turning into some sort of postmodern hippie. ;-)

    1. Re:You know this world is in trouble by ceeam · · Score: 5, Funny

      I love money.

    2. Re:You know this world is in trouble by PMuse · · Score: 3, Interesting
      It takes somewhat of a long time to get past all the pronouns, articles, prepositions, to-be verbs, etc. Once we do, we can start to see what things people are talking about.

      people (81)

      first (86)

      down (97)

      think (102)

      work (103)

      years (106), year (122)

      right (112)

      government (140)

      day (141)

      man (142)

      world (149)

      ...and it was at that point that the slashdot effect killed the flash app

      --
      "We reject as false the choice between our safety and our ideals." --The American President (20.1.2009)
  7. no flash, please by latroM · · Score: 2, Insightful

    This is so bad. No one should make their information to depend on non-free software. I will not install flash to see this.

  8. Re:I have looked up all the rude words: by gazbo · · Score: 3, Funny
    You seem to have posted a couple of typos - I've figured out what you meant by looking up the words by rank:

    Fuck = 5598
    Cunt = 18636

    HTH.

  9. Re:I have looked up all the rude words: by imsabbel · · Score: 3, Interesting

    Well, if people could write fuck, cunt, bitch, motherfucker, ect in the web without being censored by "lets be nice" moderatores, irc-bots, php-bbses,ect, their rank would be quite a bit higher.
    I guess fuck should be at least in the top 1000.

    --
    HI O WISE PRINCE. WHT TOOK U SO DAM LONG?
  10. 86,800 most frequently used English words??? by ceeam · · Score: 4, Insightful

    Bloody hell, I wonder what other words are _not_ so frequently used then.

    1. Re:86,800 most frequently used English words??? by shadowcabbit · · Score: 2, Informative

      Gigarectum is one that's probably not so frequently used. Same goes for Xenomorph, flagellate, moribund, logorrhea, sialoquent, genetrix, and bolection.

      (Most of these I got from here: http://phrontistery.50megs.com/ihlstart.html)

      --
      "Why Subscribe?" Good question...
    2. Re:86,800 most frequently used English words??? by mrmagos · · Score: 2, Interesting

      Well, considering that is somewhere in the neighborhood of 1/10th of the words estimated to be in the English language, there are quite a few not in the list. The actual number varies by source however, estimated between approximately 800,000 and 1,000,000 words.

      --
      Never start vast projects with half-vast ideas.
    3. Re:86,800 most frequently used English words??? by jonadab · · Score: 3, Insightful

      > I wonder what other words are _not_ so frequently used then.

      Use Google, and try to get the lowest number you can get for the number of
      pages. Yes, this is a variant on GoogleWhacking, but with only one word.

      Some quick attempts: Google finds 76,500 pages using 'rotund' (round),
      31,000 for 'pneumatology' (the study of the [sS]pirit), 13,900 for 'cromulent'
      (valid), 818 for 'pimola' (a stuffed olive), 242 for 'anatopism' (something
      that is out of place), and only 31 for 'propretonic' (preceding the syllable
      before the accent).

      I chose "pimola" because I happen to know that it's not listed in the OED, so
      I figured it was fairly uncommon, but it turns out that a couple of the other
      words I tried are even less common. I was surprised that "propretonic" isn't
      used more often. FWIW, the sites that do use it probably use it numerous
      times each.

      --
      Cut that out, or I will ship you to Norilsk in a box.
  11. Linux is currently not in the archive... by Lispy · · Score: 4, Funny

    but Windows ranks at a disturbing 1169. ;-)

    1. Re:Linux is currently not in the archive... by Lispy · · Score: 4, Funny

      Noticed that smiling face at the end of my post?? You know, this means that I might be kidding.

  12. Flash!? by avalys · · Score: 5, Insightful

    It would be nice if the list were available in plaintext form, instead of this slow and miserable Flash presentation.

    This is a prime example of Flash being misused. It's not needed at all, and only serves to slow things down. It also makes it impossible to use the data for anything useful.

    --
    This space intentionally left blank.
    1. Re:Flash!? by FinestLittleSpace · · Score: 2, Insightful

      I agree that there should be a simple listing of it (although any half-decent programmer could try packet catching and watch what it requests.. its probably very easy to make a bespoke script to go get the data)....

      BUT.. the flash implementation is very clear, easy to use and a good bit of coding. So Ner.

    2. Re:Flash!? by Angostura · · Score: 4, Funny

      It's ART goddamit. It's not meant to be FUNCTIONAL :-)

    3. Re:Flash!? by julesh · · Score: 4, Insightful

      BUT.. the flash implementation is very clear, easy to use and a good bit of coding. So Ner.

      Sorry, according to the copy I've just downloaded, there are NaN words in the archive, and the word I've just clicked on (""), is at position NaN.

      I wouldn't say it was exactly the greatest coding in the world.

  13. Re:Flash? by Astrorunner · · Score: 5, Funny

    "WordCount was designed with a minimalist aesthetic, to let the information speak for itself."

    Which explains their logical use of Flash.

  14. Re:I have looked up all the rude words: by kaleco · · Score: 2, Insightful

    I don't have Flash. A couple of non-php static html pages generated once a day could have handled a league table, with a PHP search.
    Did the Conspiracy page report a coincidence of the words 'Fu*c' and 'Microsoft'? :)

    --
    Prosperity is only an instrument to be used, not a deity to be worshipped. Calvin Coolidge
  15. words we DO NOT use by theMerovingian · · Score: 4, Funny


    1) que
    2) centre
    3) colour
    4) dialogue
    5) program
    6) pyjamas

    Why yes, I am american :)

    --
    "If you think you have things under control, you're not going fast enough." --Mario Andretti
    1. Re:words we DO NOT use by rainman_bc · · Score: 2, Funny

      Hehehehe... My favourite is when I phone American hillbillies and have to spell something with "z" in it. I get a response "zed"? What's "zed"? (in the rest of the world, the letter zee is reall the letter zed)

      It gets even better. These same hillbillies spell my fiance's last name Cadezed when she spells it to them - read "C-A-D-E-Z(ed)".

      And it would be okay if it was only in America, but then they come to Canada and as such stupid questions too! "zed"? What's "zed"? "Oh you mean zee!"... No we didn't mean "zee". If we meant "zee" would have said "zee".

      --
      09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
    2. Re:words we DO NOT use by iantri · · Score: 2, Funny
      What do Americans have against pyjamas?

      IAC(Canadian).

  16. Compression by SavedLinuXgeeK · · Score: 2, Interesting

    I know there are already types of compression that take the most common letters of a document, and then builds a binary dictionary off of it, to create the most efficient way of storing the data. Perhaps this database could be used, as a static dictionary, and compressing documents could be even better, though the db queries might slow it down.

    --
    je suis parce que j'aime
  17. Huh? by brianjcain · · Score: 4, Funny

    "Grok is not currently in the archive"

  18. Does anyone know... by Suit_N_Tie · · Score: 2, Funny

    what the ranking is for the word /.?

  19. Re:I like my tin foil in the microwave please by Stargoat · · Score: 5, Funny

    Funny that you mention this, because they'll definitely be adding the verb "slashdotted" after today.

    --
    Hoist Number One and Number Six.
  20. Reminds me of flaming logos by QuietLagoon · · Score: 2, Insightful

    Nice concept for a web site, but the gratuitous use of technology gets in the way.

  21. I wonder what rank by jayhawk88 · · Score: 4, Informative

    "Slashdot" and "effect" are located at?

  22. where are.... by bob_herzog · · Score: 2, Funny

    ...teh, noob, and haxor... I have trouble believing they aren't ranked heheh

    --
    "I'll waste 'em with my crossbow!" ~Bob Herzog, Power Gamer
  23. Re:Flash? by alatesystems · · Score: 5, Insightful

    I really am sick of sites that require flash to get actual information. It should be part of the usability guidelines of the web that information be required to be in at least format.

    Take these two sites for example. I work in the healthcare profession and we don't run our machines as administrators, and flash isn't installed default on Win2k. When you go to Ochsner's Health Plan website, you can't do anything unless we, as administrators, log in and install flash for them from the activex control, just to log in as a provider.

    Also, Houston RoadRunner is the exact same.

    I hate flash, a lot, and It annoys me because you can't manipulate fonts, you can't use scroll wheel most of the time, all the control is taken AWAY from the user. I love flash when used for hilarious web cartoons, but using it for content is ridiculous.

    Chris

  24. Word flashmobs by G4from128k · · Score: 5, Interesting

    Perhaps sites like this will encourage the creation of word flashmobs. A group of people would conspire to overuse some obscure word to boost its rating. Bombing the word within blogs, web pages, and postings might help the word spread into wider use and rise in the rankings. It could even be a competitive sport -- two teams pick two words of adjacent rank and the team whose word rises the most wins.

    --
    Two wrongs don't make a right, but three lefts do.
    1. Re:Word flashmobs by pipingguy · · Score: 3, Funny

      What a craptacular idea!

  25. Found one for the conspiracy corner. by Chess_the_cat · · Score: 4, Funny
    Troll: discarding coexistence.

    Words 29350-29352.

    --
    Support the First Amendment. Read at -1
  26. Did their sources include AIM and ICQ? by Mordaximus · · Score: 2, Interesting

    I half expected this wordcount thing to, well, count real English words. OMG ranks at 43712.

    P.S. WTF Did not rank :/

  27. Well duh. by Yaztromo · · Score: 2, Interesting

    The archive bills itself as "...an interactive presentation of the 86,800 most frequently used English words."

    Last I checked, "Linux" is not a word in the english language.

    For the same reason, you're not going to find "Slashdot", "jSyncManager", or "iPod", regardless of how many times they're used online.

    Yaz.

  28. I need this for German. by torpor · · Score: 2, Interesting

    I have to learn German. I need the 86,000 most-commonly used German words. This would give me a nice target of words to get to know in the process of learning it ...

    --
    ; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --
  29. CoS by lovebyte · · Score: 4, Funny

    1941-1945:
    faith establish facts requires membership

    Tom Cruise hacked their website!

    --

    I'll do it for cheesy poofs.

  30. Something wrong? by Ronald+Dumsfeld · · Score: 3, Funny
    There must be something wrong with this.

    Book comes in at 357, Television comes in at 1022 and TV comes in at 1577.

    Ah, now I know what's wrong with it... It's "Artistic" so it doesn't have to mean anything. I mean, nobody would find it useful if the number of occurrences of a word was given.

    Here's the bit that would make you choke on your cornflakes...
    WordCount recently won AIGA's (American Institute of Graphic Arts) 2003 Award for Information Design.
    Tell me, what was the award trophy? A chocolate tea pot?
    --
    Where's the Kaboom?
    There's supposed to be an Earth-shattering Kaboom.
  31. Cool idea by wombatmobile · · Score: 3, Insightful

    That has to be the coolest use of Flash

    It is a cool idea and it has been implemented with Flash.

    I'd like to see it implemented without Flash. What is cool would then be more accessible and available faster. That would be more compelling.

  32. True but... by Savage-Rabbit · · Score: 4, Funny

    .. 'Microsoft' is at a disturbing 4304 which puts this word ahead of 'Fuck' at 5589!

    This means that either:

    1) That people at large think more about Microsoft than copulating. (Unlikely)

    2) They used a bunch of /. readers as a basis for working out their word collection.

    --
    Only to idiots, are orders laws.
    -- Henning von Tresckow
  33. like is #67 by chyne · · Score: 2, Funny

    What happened to the days when only California teenagers/surfers used 'like' for every second word. I really noticed this when I went back to university recently. It's really, like, annoying to listen to, like, the kids today, like, use the word 'like', like, five times in one, like, sentence.

  34. I looked for this word... by laejoh · · Score: 2, Funny
    but couldn't find it, which seems appropriate

    The word is
    semprini


    More info on Just the words (Monty Python)
  35. NaN! by Swedentom · · Score: 3, Funny

    Apparently, the 'word' NaN is used a lot! :-)
    NaNNaNNaNNaNNaN

    Slashdotted?

    --
    Sig Nature
  36. Slow news day? by Krafty+Koder · · Score: 2, Informative
    I submitted a story about WordCount way way back in August.

    It was rejected.

    "Word Count Tuesday August 03, @06:04AM Rejected "

    992-995 america ensure oil opportunity
    3046-3051 iraq winner, fucking smooth, nick votes

  37. for the record, the 7 dirty words not allowed by gemtech · · Score: 2, Informative

    on TV (according to George Carlin) are: shit, piss, fuck, cunt, cocksucker, motherfucker, and tits.

    --
    Insanity: doing the same thing over and over again and expecting different results. Albert Einstein
  38. Also, by ambrosen · · Score: 2, Informative

    The BNC only goes up to 1990, as well. Linux wasn't a word then. Microsoft ranks 5293 on the list I've got, occurring 1704 times in 100 million words

  39. Spam filter uses? by danharan · · Score: 4, Interesting

    To fight keyword stuffing, I believe keeping track of the word use distribution in an email would help us judge the spam potential.

    --
    Information: "I want to be anthropomorphized"
  40. floccinaucinihilipilification by the_twisted_pair · · Score: 2, Funny

    ..means 'the act of estimating as worthless.'

    -To you and me, it means calling something shit.

    (teehee. finally found a way to post that one)

  41. How many people speak English? by Jonti · · Score: 2, Informative
    Well over a billion people speak English. Sure, around a quarter of them live in the US, but that still means most do not.

    Even so, I kinda agree with what you say, that the site is close to misrepresenting itself. But the greater dishonesty is surely that the bloody thing is just grandstanding with public data -- it's almost useless, presumably by design, for practical purposes. So, yes, I too would rather the authors had been clear about their American background.

    Here's some stats ...

    • English has official or special status in at least seventy five countries with a total population of over two billion
    • English is spoken as a first language by around 375 million and as a second language by around 375 million speakers in the world
    • speakers of English as a second language probably outnumber those who speak it as a first language
    • around 750 million people are believed to speak English as a foreign language
    • one out of four of the world's population speak English to some level of competence; demand from the other three-quarters is increasing.
    • It looks to me as if the sums work like this:
      375m (1st language)
      375m (2nd language)
      750m (learned English as a foreign language)
      -----
      1500m

      http://www.britishcouncil.org/english/engfaqs.htm

  42. History of the English language by siskbc · · Score: 4, Informative
    If it helps, think of American English as a foreign language. You wouldn't call someone in Spain on the phone and insist on speaking English, would you? Similarly, when calling an American, it would serve you well to make accomodations for their knowledge of your language, particularly if you expect that you are more knowledgeable of American English than the person to whom you are speaking is of UK English.

    Also, it's not as if you are "correct" and the American "incorrect." Languages are fluid. Languages evolve, including English. Brits (I include Canadians here, having severed ties only quite recently) have really screwed up the proper German you were taught ~1500 years ago too. And the Norwegian you were taught ~1200 years ago. And the French you were taught 968 years ago. As such, would you consider the entire English language "incorrect?" Many words had various spellings in the 1600s when English was brought to America. As such, it's not accurate to claim that the American spelling is incorrect, when we simply chose one of the accepted spellings at the time and the Brits chose the other. It might be different if the English language had an established spelling for a certain word by 1500 and Americans changed, but this is not the case. For all the pedantic spelling and grammar correction, many Brits (and Canadians) seem to be ignorant of the history of their own language.

    One might also suggest that you not engage in such displays of self-superiority - "When in Rome..." one might say. You seem to share the attitude of tourists in foreign lands who expect to have waiters (for example) speak their own native language and become irate when the waiter can't or won't. Admittedly, Americans are one of the major contributors to the image of the self-righteous tourist, and I find that disgusting too. Ultimately, one can adapt to your host nation - even if it's simply over the phone - or one can maintain self-righteousness and deal with the inevitable inaccuracies. What does one gain from this exchange, anyway?

    As for the Americans in Canada you cite, their mistakes are borne of ignorance rather than self-righteousness. The difference borne of ignorance is correctable. I would politely, without condescencion, inform them that the letter they refer to as "zee" is called "zed" by the rest of the English speaking world. If they insist on maintaining their behavior, then your ire would be well-placed - if you didn't insist on doing the same, that is.

    All in all, there's really no need for this "whose language is correct" debate. Language is a tool. If you can effectively with the other party, you have no problem. Your problem is you intentionally choose not to simply due to ego, which I find baffling.

    --

    -Looking for a job as a materials chemist or multivariat

  43. Re:Flash? by fiannaFailMan · · Score: 2, Informative
    I've refuted this line of 'reasoning' in my jounral. Have a look. Flash, when implemented properly, is the perfect tool for delivering content in certain applications where a lot of interaction is required. It is a lot more efficient than re-loading a whole page of HTML just for the sake of updating a few words on it. If the whole page changes, then HTML begins to have an advantage.

    Please don't present an argument about technical issues based on how you 'hate' a technology. We have to examine technologies and their implementation on their own merits, not based on emotion.

    --
    Drill baby drill - on Mars