Slashdot Mirror


A Useful Grammar Checker?

burtdub asks: "With the amount of raw text data available, there seems to be no shortage of ambitious language projects on the horizon, from Universal Language Translators to Junk Email Filtering. However, the mess that is the English language still seems to elude commercial attempts while being relatively ignored by the open source community. What would it take to make a useful, functional grammar checker?"

15 of 503 comments (clear)

  1. How about LEARNING the English language? by TripMaster+Monkey · · Score: 3, Insightful


    What would it take to make a useful, functional grammar checker

    How about a competently taught highschool English class?

    Seriously, people...learn to use the language...you'll be better off.

    --
    ____

    ~ |rip/\/\aster /\/\onkey

    1. Re:How about LEARNING the English language? by Haeleth · · Score: 3, Insightful

      Agreed! We already have the problem of people not knowing how to spell (reliance on spellchecking) and people not being able to do basic math (reliance on calculators) - this would just dumb people down even more.

      And don't forget the problem of people not knowing how to shoe a horse (reliance on motor vehicles), or light a fire (reliance on electricity), or plough a field (reliance on supermarkets).

      Wait, those aren't problems, they're examples of how the advance of technology has completely obsoleted things that used to be vital life skills. Whereas clearly spelling, grammar, and basic maths are completely different, and we should not be making any effort to help people take their mind away from niggling details and let them concentrate on the content of their writing or the implications of their calculations.

      No, wait, I'm still not quite following the logic here...

    2. Re:How about LEARNING the English language? by Deanalator · · Score: 5, Insightful

      Not to be a jerk, but how is that insightful? Its not even really that funny. An open source grammar checker would be extremely useful. Everyone mistypes from time to time, and often times spellcheckes are unable to catch it.

      To the best of my knowledge, its one of the harder open problems in the OSS community. Im actually surprised that someone didnt enter something like that into the google summer of code. If I had any idea where to start, I know I would have (and I did consider it). It's a very valid question, and I look forward to seeing if anyone here comes up with any good answers.

  2. AI by Roguelazer · · Score: 4, Insightful

    Grammar can often only be determined by context, especially in English, where the rules of grammar change so much. Until a computer can for itself understand context, no grammar checker can be successful (or even marginally useful). Thus, my answer to your question is two words: "Artificial Intelligence." Artificial stupidity can also be used to simulate bad English.

    1. Re:AI by Haeleth · · Score: 2, Insightful

      there are some things a grammar checker could readily do . . . see if a sentence ends in a preposition, etc.

      Wait, why should a grammar checker be used to enforce pointless and arbitrary style guidelines which have never had any foundation in the usage even of the best writers, let alone served any purpose other than to require hideous contortions from anyone daring to employ phrasal verbs - like "to put up with", in that famous example which I'm sure I don't have to quote?

      Bonus points for anyone spotting the other stupid and arbitrary "rule" which I've flouted several times in this post. One which Word's detestable "grammar checker" does attempt to pick up on. (Ooh, I just ended a sentence with a preposition!)

  3. What do you need? by slashname3 · · Score: 2, Insightful

    How about a dictionary and classes in english, like those given in schools. Should be all that is needed.

  4. English needs to be mutable. by vertinox · · Score: 4, Insightful

    One of the concepts that most people should realize is that the main success (and downfall) of the English language is that it can mutate quite easily.

    Remember... English is the bastard child of Celtic, Latin, and various other Germanic languages. Language also affects the way the way we think and also is the key limiting factor in grasping concepts.

    If your language cannot express a certain concept then you need a way to bend the rules (which English has a bad habit of doing) so that you can share that idea with others.

    To enforce a view or a proper method of speaking will often stagnate a societies ability to assimilate new ideas or methods. George Orwell pointed this out when he came up with the idea for new speak in which society can restrain itself from unwanted aspects by removing societies ability to even discuss it.

    We obviously do not speak Elizabethan English or the olde English of the Middle ages. Should our descendants be forced to speak an archaic language 200 years from now because we demanded to have our software set in stone what is the proper way to express ideas and communication.

    Man, this sounds a bit hippy-esque, but hopefully you understand what I mean.

    Still there should be some ground rules to what proper English is and should be so we can understands each other without going "Huh?" but it shouldn't be a hard-line stance that is unchangeable for the next 50 years.

    --
    "I am the king of the Romans, and am superior to rules of grammar!"
    -Sigismund, Holy Roman Emperor (1368-1437)
    1. Re:English needs to be mutable. by dbIII · · Score: 2, Insightful
      We obviously do not speak ... the olde English of the Middle ages.
      Try reading "Canterbury Tales" aloud - once you get past the spelling it is surprisingly similar to the english we speak nearly 800 years on. If more people on slashdot read this there would be less people complaining about spelling and grammar here, it really doesn't matter in an informal forum, and even in some more formal settings.
      ground rules to what proper English is and should be so we can understands each other without going "Huh?"
      The theme song to the comedy "The Nanny" was one of those situations for me and probably most non-US english speakers that heard it. Live with the regional differences and the fact that the language of the net is broken english.
  5. To everyone... by MaestroSartori · · Score: 2, Insightful

    ...saying "Just learn the grammar correctly in the first place", here's a question: can you really see no use in a computerised tool to help you learn correct grammatical usage?

    It's like someone coming on asking about natural media painting apps being told "Just go to art school and learn how to use REAL paint, you lazy bastard!" - you're missing the point entirely. A grammar checker would be useful even for people with a decent grasp of grammar, as a double-check. Like spell checking, do you get it yet?

  6. Re:Make it for Latin by ari_j · · Score: 4, Insightful

    All those different forms and the nearly syntax-free sentence structure are precisely why it is easier to parse Latin than English.

  7. Erm, um, actually... by Hosiah · · Score: 2, Insightful
    What grammer-checking was done in M$ Word, as I recall, bore no resemblance to English as given in my English books. I quit using it, like I did eventually with all M$ products.

    Grammer checking is a thousand-fold more complicated than most people realize. English's hoary syntax, which pretty much boils down to "8 million exceptions in search of a rule", doesn't parse easily into computer code.

    But I, too, would be interested in seeing this field develop - because it has the side effect of making bot AI better! Now, a voice-activated console that understood commands in plain, sloppy English would be worth striving for. Grammer-checking in a word-processor usually just provokes me: "How *dare* you red-line this sentence; I'm quoting *Shakespeare*, you illiterate rock!"

    But we'll have perfect machine-generated grammer before we've reached the level of innovation required to put a spell-checker on the comment box on Slashdot!

  8. Re:Make it for Latin by Sahib! · · Score: 2, Insightful

    Precisely. GPP said 140 different forms as if that would be a large number for a computer.

    --

    I prayed about it, and God said, "Don't do it!" But I thought, "I know better."

  9. Re:Make it for Latin by dgatwood · · Score: 5, Insightful
    The thing is that most Romance languages also have word order restrictions. French, for example, adjectives come after the noun they modify.

    What makes English such a pain in the backside is that the language has been so utterly simplified over the millenia that we have lots of words with identical spellings, but different parts of speech. This makes the word order critical.

    Technically, word order isn't critical in English. I can say "Campus green and tow'ring trees" and you understand I'm talking about a green campus. This was actually common usage in the not-so-distant past.

    The problem, though, is that words have become overloaded and/or multiple words combined to a single term. For example, the green lantern is probably something you carry around to provide light when the power goes out. The Lantern Green is probably a place where they play cricket.

    We're seeing this happening with things like "it's vs. its" and "their vs. they're vs. there" in some people's usage as well. Every time the spelling distinction between words breaks down, it becomes significantly more difficult for anything short of a person to get meaning out of a sentence. That's why there are so many spelling/grammar nazis on slashdot. If we don't, in a matter of just a few years, we'll get to the point where nobody can understand anything.

    There is another theory which states that this has already happened.

    --

    Check out my sci-fi/humor trilogy at PatriotsBooks.

  10. Re:How about a useful, functional grammar? by Anonymous Coward · · Score: 1, Insightful
    Bollocks. What you call useless and malfunctional, I call rich. If you want consistency and uniformity, learn symbolic logic. A human language with a completely rational grammar would not only not last very long before "corrupting" but it would also be desperately boring. English grammar uncertainties create wonderful cracks whereby style may enter. What could be more useful than that?

    Grammar is, to a large extent, nothing more than a post hoc description of the expressive customs which have arisen amongst a particular community of speakers. Consider this: all living languages are in constant flux. Given that, when a particular alteration occurs, do you really think the collective Spirit of Grammar first makes a check for internal consistency? Language is imitative. People say things because they hear other people say them. Other than by attempting to influence the reader toward or away from a particular construction, grammar manuals can never be anything but historical documents.

  11. Re:Make it for Latin by brpr · · Score: 5, Insightful
    It's depressing being a linguistics student. Every time a language-related topic is raised you have to listen to people who don't know what they're talking about spouting off and getting modded +5 insightful (or whatever the non-Slashdot equivalent of this accolade may be).

    What makes English such a pain in the backside is that the language has been so utterly simplified over the millenia

    No, it hasn't been simplified. At least, you won't find any linguist or student of Old or Middle English who'll claim that it has simplified as opposed to changed. Presumably you'll back up this outlandish statement with, say, a detailed analysis of the history of the case system in English from the Norman conquest onwards?

    that we have lots of words with identical spellings, but different parts of speech.

    Yeah, just like every other language. Do you have any data suggesting that English is unusual in this respect?

    This makes the word order critical.

    Word order isn't critical because of homographs, it's critical because the rules of English grammar are strict about word order. From a more practical point of view, it's critical because English is too poorly inflected for a parser to work out the structure of a sentence without reference to the order of the words. In any case, there's nothing particularly difficult about parsing languages with strict word order rules, or parsing languages with homophones and homophones, or parsing languages with both.

    Every time the spelling distinction between words breaks down, it becomes significantly more difficult for anything short of a person to get meaning out of a sentence.

    Not really. The problem of people writing "their" instead of "they're" is absolutely trivial compared to the staggeringly difficult task of accurately parsing natural language, or machine translation, or any other NLP problem of similar complexity. For God's sake, just list "their" as a synonim for "they're" in your parser and it will figure out which meaning was intended from the grammatical structure (there are few, if any, syntactic contexts in which more than one of "there", "their" or "they're" is correct).

    If we don't, in a matter of just a few years, we'll get to the point where nobody can understand anything.

    People have been saying this for hundreds of years.

    So, basically, you've taken one of the most difficult areas of AI (NLP) and argued that it's really difficult these days because sometimes people spell "they're" incorrectly. Weird.

    --
    Freedom is not increased by mere diminuation of government. Anarchy is freedom for the strong and slavery for the weak.