Slashdot Mirror


'Reading Level' Filter Added To Google Search

entotre writes "A new feature has been added to the advanced Google search: reading level. From the blog post: 'The feature lets you filter or annotate the search results by reading level. The reading levels include basic, intermediate and advanced. You can either have Google label or annotate the results with those labels, only show basic results, only show intermediate results or only show advanced results.' At the time of writing, Slashdot is 1 % advanced, 64 % intermediate and 34 % basic."

44 of 266 comments (clear)

  1. But... by Kev92486 · · Score: 5, Insightful

    How am I supposed to choose the correct filter when I don't know what the word "intermediate" means?!

    1. Re:But... by TheWarp · · Score: 3, Insightful

      Perhaps Google should set it on basic by default. It's not like people would notice the internet getting any dumber.

    2. Re:But... by mysidia · · Score: 5, Interesting

      How am I supposed to choose the correct filter when I don't know what the word "intermediate" means?!

      I assume this act of Google means reading level will soon be influencing page rank, results sorting, and more basic documents will begin to appear first

      No problem. Stories will be at the top. The top ones will explain what intermediate is

      Website operators will have to act. To keep their top spot.

      Writers will need to make their sites basic.

      Advanced grammar will go away.

      Compound sentences will be banned.

      Most pronouns will be banned.

      Most contractions will be banned.

      Making lists of things in one sentence will be banned.

      Pages that do banned things will be hard to find.

    3. Re:But... by noidentity · · Score: 4, Interesting

      This seems to be a useful junk filter. Do your search normally. If you get too much spam, try restricting to intermediate or advanced. I'm going to be using this all the time now.

    4. Re:But... by Opportunist · · Score: 2

      Judging from the Google results it has something to do with some sexual practice I didn't WANT to know about!

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    5. Re:But... by Opportunist · · Score: 3, Funny

      Why do I have this horrible vision of LOLcats pages getting the first page on any result you might be looking for on a "basic" setting?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    6. Re:But... by shadowfaxcrx · · Score: 3, Informative

      I can haz slashdot.

      --
      "I disagree with you" does not equal "flamebait."
    7. Re:But... by Will.Woodhull · · Score: 2

      Any damn fool can write "advanced" jargon: it only takes a bit of time to study some vocabulary lists to do so.

      But to be able to communicate clearly and concisely in basic English-- now that takes intelligence. That is the kind of writing that is worth reading.

      --
      Will
    8. Re:But... by Arancaytar · · Score: 2

      We can't put the broken part in the machine. It wouldn't smash the right tiny things together. Then the machine might break. That would be very sad.

    9. Re:But... by the_womble · · Score: 2

      Basic does not necessarily mean easier to read for a literate audience. A longer sentence may be better constructed, or link related ideas more naturally. Long words may allow more concise writing or be more precise.

    10. Re:But... by mcvos · · Score: 2

      It almost seems like anything that isn't "kid's book level" or "PHD degree" gets lumped into intermediate.

      I think that's a perfectly fine distinction.

    11. Re:But... by vidnet · · Score: 3, Informative

      After playing around with it, I get the impression that it's not literary reading level, but technical reading level. Unlike the Fleisch-Kincaid test that uses the lengths of words and sentences, Google's test seems less concerned with long sentences and more with the choice of words. This is arguably a better way to go about it, but it's a luxury Fleisch-Kincaid can't afford in it's single line definition.

      For example, searching for random phrases from War and Peace by Tolstoy returns 0% Advanced results. The simple english wikipedia page for dissection, which is readable to excess but contains some technical terms ("To dissect is to cut up a body so as to reveal its structure. The body could be that of a human, an animal, or a plant. ") classifies as Advanced.

      I definitely agree with your view on basic grammar, and Google's method ensures that basic grammar about advanced topics will still be classified as advanced.

    12. Re:But... by AmiMoJo · · Score: 2

      I get the impression this will be an optional setting mainly of use to children and those with learning difficulties. Google already filter results by default to prevent adult material showing up unless you specifically search for it. It is most visible when using image search but the main web search and instant/suggest do it as well.

      In fact Google already ranks results based on the language you use when searching. If you type in "Newton's Laws" you get introductory material written for laypeople, while "Newtonian physics" ranks pages with maths and more formal discussion higher. This just appears to be an extension of that which allows the user to state what they want instead of Google just guessing.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    13. Re:But... by theaveng · · Score: 3, Insightful

      Oooo fun! (from highest to lowest Reading Level)

      foxnews is 2% advanced and 73% intermediate
      cnn.com is 2% advanced and 70% intermediate
      pbs.org/news is 1% advanced - 84% intermediate
      slashdot is 1% advanced and 64% intermediate.
      And the surprise:
      MSnbc is 0.5% advanced and 55% intermediate

      "Tut-tut. I think I am experiencing cognitive dissonance. Obviously this google formula is flawed because everyone knows NBC is the best source for unbiased news. And FOX #1 in reading level? Bah. Humbub." - MSnbc viewer smoking his pipe. (I'm just joking - put down the guillotine.)

      --
      FOX NEWS.com should be BANNED from television and internet. Have the Congress take it over and give us Truespeak.
    14. Re:But... by theaveng · · Score: 2

      Left out some of my favorites from the bottom & lowest ranked:

      english.aljazeera 1% advanced - 92% intermediate
      france24.com 0.5% advanced - 94% intermediate
      russiatoday 0.5% advanced - 86% intermediate
      MSnbc is 0.5% advanced and 55% intermediate
      euronews 0% advanced and 100% intermediate

      Revised ranking:

      1 reason.com 4% advanced - 83% intermediate
      2 foxnews.com 2% advanced - 73% intermediate
      3 www.bbc.co.uk/news 1% advanced - 95% intermediate
      4 cnn.com
      5 pbs.org/news
      6 aljazeera
      7 france24
      8 russiatoday
      9 msnbc
      10 EUROnews
      etc...

      --
      FOX NEWS.com should be BANNED from television and internet. Have the Congress take it over and give us Truespeak.
  2. Simple English Wikipedia by MaxOfS2D · · Score: 2

    http://www.google.com/search?hl=en&lr=&safe=images&tbs=rl%3A1&q=site%3Asimple.wikipedia.org&aq=f&aqi=&aql=&oq=&gs_rfai= Basic 28% Intermediate 55% Advanced 16% I think someone didn't live up to his claims!

    1. Re:Simple English Wikipedia by The+Archon+V2.0 · · Score: 4, Interesting

      http://www.google.com/search?hl=en&lr=&safe=images&tbs=rl%3A1&q=site%3Asimple.wikipedia.org&aq=f&aqi=&aql=&oq=&gs_rfai= Basic 28% Intermediate 55% Advanced 16% I think someone didn't live up to his claims!

      My word, if you made it any simpler you'd be down to words of three letters or less.

      (Tries it on own site.)

      100% BASIC?!? Oh, hell no. You don't use words like "beset" in basic writing.

      I do hereby put on my smartypants crown and declare this b0rken.

    2. Re:Simple English Wikipedia by camperdave · · Score: 5, Funny

      You don't use words like "beset" in basic writing.

      Sure you do. "I want my TV to beset to channel 8".

      --
      When our name is on the back of your car, we're behind you all the way!
    3. Re:Simple English Wikipedia by thepotoo · · Score: 2

      This is probably correct. PubMed: 98% advanced

      Nature: 61% advanced

      Science: 94% advanced

      PNAS: 99% advanced

      Can anyone figure out why science is so much more "advanced" than Nature? Both seem pretty similar to me.

      Oh, and by way of a control group:

      I Can Has Cheez Burger, surprisingly 11% intermediate

      --
      Obligatory Soundbite Catchphrase
    4. Re:Simple English Wikipedia by PPH · · Score: 2

      You don't use words like "beset" in basic writing.

      Sure you do. "I want my TV to beset to channel 8".

      Well, you're losing advanced points for that remark.

      WWE is on channel 8 right now.

      --
      Have gnu, will travel.
  3. 99% advanced by MBCook · · Score: 2

    99% advanced. On the other hand, Wikipedia is quite evenly distributed.

    --
    Comment forecast: Bits of genius surrounded by a sea of mediocrity.
  4. Okay quick by Amorymeltzer · · Score: 5, Funny

    Everyone sound smart!

    Derrida began speaking and writing publicly at a time when the French intellectual scene was experiencing an increasing rift between what could broadly be called "phenomenological" and "structural" approaches to understanding individual and collective life. For those with a more phenomenological bent the goal was to understand experience by comprehending and describing its genesis, the process of its emergence from an origin or event. For the structuralists, this was a problematic and misleading avenue of interrogation, and the "depth" and originality of experience could in fact only be an effect of structures which are not themselves experiential. It is in this context that in 1959 Derrida asks the question: Must not structure have a genesis, and must not the origin, the point of genesis, be already structured, in order to be the genesis of something?

    (source: http://en.wikipedia.org/wiki/Deconstructionism#Theory)

    --
    I live in constant fear of the Coming of the Red Spiders.
  5. Simple English Wikipedia not marked very simple... by Khopesh · · Score: 2

    The Reading Level for site:simple.wikipedia.org is currently ranked 29% Basic, 52% Intermediate, 17% Advanced, implying that Slashdot is easier to read than the version of Wikipedia specifically tasked with being approachable to those with only basic English language comprehension. Google's filter fails here, though I suspect Wikipedia is failing to a small degree too.

    --
    Use my userscript to add story images to Slashdot. There's no going back.
  6. Re:DURRRRR by somersault · · Score: 4, Funny

    Hahaha. Soon "Advanced" will be renamed to "Faggy and retarded" to aid comprehension.

    --
    which is totally what she said
  7. Re:Reading level is useless by geekoid · · Score: 2

    IT's not useless. It's a guideline.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  8. This. Is. AWESOME. by Seumas · · Score: 4, Funny

    Finally, I can just set Google to "filter everything below a third grade level" and never have to see 'Yahoo! Answers' spam cluttering up my search results!

  9. The following option is req'd for 95% of Americans by Godskitchen · · Score: 3, Funny

    -aliterate

  10. Re:High school math versus college math by nobodylocalhost · · Score: 3, Interesting

    It's English used in those math sites. You can express complex ideas in simple terms, and simple ideas in complex terms. It has nothing to do with the actual content.

    --
    Where is the "Ignorant" mod tag?
  11. P.O.R.N. by gilbert644 · · Score: 5, Funny

    My quest for advanced level porn brought me here: http://en.wikipedia.org/wiki/Progressive_outer_retinal_necrosis :(

  12. What about keyword searches? by tomp · · Score: 3, Interesting

    That's great and all, but what would be *really* cool, is if Google provided some way to search for pages that contain a specific word or phrase. Yeah, that would be cool. Some kind of search engine where I type in words and the search engine returns only pages that contain those words. Can Google work on that next?

    1. Re:What about keyword searches? by Anonymous Coward · · Score: 5, Insightful

      Yeah, that would be sweet. Especially if it didn't filter out special characters commonly used in programming languages, like .:()[]{}

    2. Re:What about keyword searches? by metamechanical · · Score: 3, Insightful

      For Pete's sake, I've never understood why they didn't support some simple subset of regular expressions. Just "simple" stuff, like character classes and multipliers.

      Also, while I don't mind being corrected on my spelling (being that, despite trying to be diligent, I certainly make mistakes), what the heck is up with google flatly refusing to search for my exact text? It was fine when you searched for 'x,' they asked "do you actually mean y?" But now, it takes me three searches before I figure out the magic phrasing that will actually do my search and not return "corrected" results.

      --
      If I had a nickel for every time I had a nickel, I'd be richcursive!
    3. Re:What about keyword searches? by TempeTerra · · Score: 2

      Speaking from knowledge of search engine indexing; you can't search on regular expressions because the search index is heavily heavily optimised and the only way to search for a regex would be to generate all the possible expansions and search for them individually. You could do it, it would just fuck up everyone's processes.

      If you've done anything with SQL think about how slow wildcard text searches are compared to an indexed primary key search, then multiply by a couple of orders of magnitude since search engines throw away generality for extreme performance on a particular kind of search.

      --
      .evom ton seod gis eht
  13. Re:Reading level is useless by MichaelSmith · · Score: 2

    Yeah my son is eight years old and reading long novels now, but when he was at pre-school age he would take DVDs he liked (say Ben-10) and type the titles one letter at a time into google to get the youtube related videos list. Then he would be set for hours. Most of it was above his reading level but all he needed to kow was that B on the title matches B on the keyboard.

    And once they get the hang of reading they fly past the "levels".

  14. Re:The following option is req'd for 95% of Americ by noidentity · · Score: 5, Funny

    They already have that option, but it's labeled Images.

  15. Re:The following option is req'd for 95% of Americ by adisakp · · Score: 2

    aliterate/litrit/
    Noun: An aliterate person.
    Adjective: Unwilling to read, although able to do so

    I believe he meant illiterate though which is unable to read rather than unwilling to.

  16. Might as well make this political by Albinoman · · Score: 2

    Democratic National Committee: 21% Basic, 77% Intermediate, less than 1% Advanced
    Republican National Committee: 11, 87, less than 1 (DNC has .org site and RNC has .com? Weird)
    Whitehouse: 6, 87, 5
    Or Wikileaks: 1, 42, 56
    Of course the epicenter of stupid, Sarah Palin's Facebook page, 64, 33, 1

    A few Slashdot worthy ones:
    Microsoft: 12, 77, 9
    Apple: 48, 49, 2 (anyone surprised here?)
    Linux: 4, 91, 3

  17. Re:Please, Google dudes, automatic translation! by gblackwo · · Score: 2

    What happens if you come across a slug? a fathom? a league? your automatic translation is gonna screw alotta stuff up that wasn't intended.

  18. Unreported choices by SuperKendall · · Score: 2

    34% basic

    Oddly discarded from the reported results were 2% COBOL and 4% Lisp. C results were discarded for using the "wrong" brace style (regardless of style used).

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
  19. Re:Reading level is useless by plover · · Score: 5, Insightful

    In what sense is it a "guideline"? Perfectly clear text can get a poor readability index, incomprehensible text can get good readability.

    A reading index is just like a measuring tape. It can't tell you that you built a crappy house with crooked walls and a leaky roof; it can only tell you that something is 40 feet long by 30 feet wide.

    A reading index is a tool that simplifies understanding, reducing a very complex thing to a simple number that's useful for comparisons. Just like you can use the measurements of the house to figure out that it's 1,200 square feet, you can compare that to a house that is 2,400 square feet. Neither measurement tells you the quality of the construction, the color, the flooring, the lot size, or the neighborhood. But if you're looking for a home for a family of six, knowing the floor space is one thing that can help weed out the useless candidates quickly. If you're looking for a book for first graders, you don't trot out a book with a reading index of 18.

    And claiming it doesn't work on incomprehensible text is like complaining that a measuring tape can't tell you the color of a house. A reading index is not an interpreter of syntax, grammar, spelling, or any other attribute of text. It just measures one simple set of dimensions of text.

    A reading scoring system can only give you an indication, not a guarantee, of what kind of audience should be able to comprehend a given piece of text; and it can give you an indication of relative difficulty. For example, the widely used Flesch-Kincaid Readability Index bases its score on the average number of words per sentence and the average number of syllables per word, and outputs a "grade level". The grade levels were probably modeled on the textbooks and lesson books of the era in which it was developed. Is it still relevant? Perhaps the actual grade levels are different these days, but it's still a widely accepted model because it's useful for what it does provide.

    --
    John
  20. Re:The following option is req'd for 95% of Americ by Arancaytar · · Score: 2

    Ah? Are Americans actually all alliterate?

  21. Re:The following option is req'd for 95% of Americ by Adambomb · · Score: 5, Funny

    Actually, alliteration almost always annoys any average American audience.

    --
    Ice Cream has no bones.
  22. Re:Reading level is useless by toetagger · · Score: 5, Funny


    A reading index is just like a measuring tape. It can't tell you that you built a crappy house with crooked walls and a leaky roof; it can only tell you that something is 40 feet long by 30 feet wide.</p></quote>

    Not true!

    If the measuring tape is wet, then the roof must be leaking!
    If the measuring tape is swinging, then the house must have a draft!
    If the measuring tape is white, then even snow is getting in!
    If you can't see the measuring tape, then your electricity is out!
    And if you have a candle, and you still can't see it, then it must be foggy!

    I'm sure there is more than this that a measuring tape could tell you, if you would be creative!

  23. Re:I hate to break it to you guys... by BigSes · · Score: 2

    Ok, fine then, I won't post an entire Larry the Cable Guy stand-up routine in hopes of zeroing out our 1%.