Computers To Mark English Essays

No! That is what they want you to do by Norsefire · 2009-09-25 16:20 · Score: 3, Funny

Having failed to kill him, SkyNet sent a Terminator back in time to make John Connor fail English.

Re:No! That is what they want you to do by kindigth · 2009-09-25 16:32 · Score: 3, Funny

After Terminator: Salvation, i'd take it.

Graduate Record Exam by ub3r+n3u7r4l1st · 2009-09-25 16:21 · Score: 5, Informative

The GRE Writing portion is already using it.

From http://www.ets.org/portal/site/ets/menuitem.1488512ecfd5b8849a77b13bc3921509/?vgnextoid=ebd42d3631df4010VgnVCM10000022f95190RCRD&vgnextchannel=54c846f1674f4010VgnVCM10000022f95190RCRD

"For the computer-based Analytical Writing section, each essay receives a score from at least one trained reader, using a six-point holistic scale. In holistic scoring, readers are trained to assign scores on the basis of the overall quality of an essay in response to the assigned task. The essay score is then reviewed by e-rater, a computerized program developed by ETS, which is being used to monitor the human reader. If the e-rater evaluation and the human score agree, the human score is used as the final score. If they disagree by a certain amount, a second human score is obtained, and the final score is the average of the two human scores."

If you find a way on what the algorithm look for, even a software-generated essay can get 6's.

--
New Economic Perspectives

Re:Graduate Record Exam by icebike · 2009-09-25 19:45 · Score: 3, Interesting

It also scores great writing and even greater speaking very inconsistently.
When fed Kennedy's "I am a Berliner" speech these systems always scored it rather low. Repetitious. Gratuitous use of foreign words: Ich bin ein Berliner.

--
Sig Battery depleted. Reverting to safe mode.
Re:Graduate Record Exam by markov23 · 2009-09-25 23:32 · Score: 4, Interesting

The paper scoring technology that I am familiar with ( used by the GRE's and some high school English classes ) cant be fed a random paper -- it needs to be trained on a particular assignment. Then it can score papers for that assignment. The success that they get with these is pretty surprising -- but the application is limited to these types of tests or curriculum that is designed around the assignments it has been trained for. The more interesting affect from this type of system reported from students ( not gre takers ) is that it lets them write a paper -- get it scored, make changes and see if they are getting better. When I was writing papers in high school -- you wrote it -- handed it in, then a week later got a grade and never thought about it again. This type of technology actually allows you to learn a lot more from one paper by iterating several versions and getting direct and specific feedback on how to improve.
Re:Graduate Record Exam by mhelander · 2009-09-25 23:33 · Score: 4, Informative

Um, you should google that. Current consensus, I believe, is that his German was fine and that the donut in question isn't even called a Berliner in Berlin.
http://en.wikipedia.org/wiki/Ich_bin_ein_Berliner

Cheatcode by sam0737 · 2009-09-25 16:24 · Score: 4, Funny

Includes "Edexcel iddqd" should do it.

Don't they already do this? by darkshot117 · 2009-09-25 16:32 · Score: 3, Insightful

I seem to remember back in school my English teachers would grade as if they were a computer, failing to actually read into the meaning of things and simply complain about obscure grammar errors (which no one in the real world even knows about) and simple typos. From the sound of this, nothing is going to change.

Re:Don't they already do this? by Anonymous Coward · 2009-09-25 17:25 · Score: 5, Insightful

As a writing instructor, let me put it this way: I very, very seldom see a paper with misspellings and grammar mistakes that is nonetheless a well-written paper. It happens, but not often. Grammar and spelling mistakes are a symptom of sloppiness, as are poor reasoning, lack of organization, and lack of adequate support. If you can't be bothered to remember primary-school English, it is not likely that you are willing to master rhetorical structure.
When we read a paper, we actually don't care what you're saying. There usually isn't an "interesting" score. In my case, I evaluate on three, ten-point, holistic scales: Content (which basically refers to amount and quality of support), Organization (rhetorical structure), and Mechanics (yes, grammar, vocabulary, adhering to the style guide, etc.). I do this so I don't have people claiming that their hopeless muddle of a paper got marked down for "obscure grammar errors (which no one in the real world even knows about) and simple typos".
Guess what? Writing is not speaking. Those "obscure rules" are, indeed, usually only applied in writing. I ramble, swear, and disregard the conventions of "proper" English when speaking. But that is because those rules do not really apply in that sociocultural setting. In formal writing--you know, what you're being taught in writing class--they matter a great deal. If you don't follow them, you sound like an idiot, and no one will listen to you.
Why are these "obscure" rules used as a "canary test" of your intelligence and noteworthiness?
Because of what I wrote in my first paragraph. Intelligent, methodical, and rational people care enough to follow them.
I'm sorry, but that's how it works in the "real world".
Re:Don't they already do this? by DirePickle · 2009-09-25 18:16 · Score: 3, Insightful

What?
Re:Don't they already do this? by jonadab · 2009-09-25 18:31 · Score: 5, Informative

> As a writing instructor, let me put it this way: I very,
> very seldom see a paper with misspellings and grammar
> mistakes that is nonetheless a well-written paper. It
> happens, but not often.

It happens most often when the writer is not a native speaker of the language. They'll write an essentially sound paper but make weird and obvious mistakes, like using the wrong preposition or spelling ph words with f. Depending on their native language they may also make other kinds of mistakes, e.g., Japanese people will frequently mess up grammatical number.

But the other poster may have been talking about grammatical structures that are actually a regular part of English grammar but are nonetheless consistently marked down by many English teachers, for obscure reasons. Examples of this kind of thing include split infinitives, the second-person imperative, the use of the second person pronoun to refer to anyone in general, and the use of objective-case pronoun forms in the predicate after certain verbs (particularly being verbs). Linguistically speaking these aren't actually mistakes as such, and in fact some of the contortions used to avoid them actively impede clarity, but they frequently get marked as "mistakes" nonetheless.

--
Cut that out, or I will ship you to Norilsk in a box.
Re:Don't they already do this? by PCM2 · 2009-09-25 21:57 · Score: 3, Funny

Funny, cuz your response is exactly what I was going to ask of you. Until, that is, I learned you were just a drunk dude who was trying to sound intellectual. Thanks for being honest. And, cheers.

--
Breakfast served all day!
Re:Don't they already do this? by digitig · 2009-09-25 23:20 · Score: 4, Informative

Indeed the rules of grammar can be seem obscure and almost arbitrary. However the rules of grammar8 actually grew naturally (i.e. not via committee, despite appearances) from a need of educated people to greatly clarify their communication.
Partly, but not entirely. There was a deliberate move in the 19th century to rid English of all those nasty Germanic influences and arbitrarily impose grammatical rules from the classical language onto English. The reason was nothing more nor less than intellectual snobbery, and the result was rules like not splitting infinitives and not ending sentences with prepositions. Those rules have no natural place in English; they were only put there to marginalise those who did not have a classical education.

--
Quidnam Latine loqui modo coepi?
Re:Don't they already do this? by SanityInAnarchy · 2009-09-26 02:04 · Score: 3, Insightful

Just because I write a book of philosophy that is grammatically incorrect but possibly deeply insightful doesn't not make it any less important.
If you're capable of writing a book of philosophy that is deeply insightful, you should also be capable of writing one that's grammatically correct. Doing so would set you apart from someone who is capable of neither, and it'd set you apart at a glance.
It's also common courtesy to the reader. Generally, people have no trouble reading something that's grammatically correct, no matter how poor their own grammar is. However, it's at least annoying, and sometimes frustrating and difficult to understand something that's incorrect. Depending how incorrect you are, I might decide that deep insight you have isn't worth the effort of reading your book.
In other words, if you want your philosophy book to actually be read, you'll proofread it, spellcheck it, and clean it up -- just as, if you want to actually be hired, you'll shower, shave, and put on a tie for the interview.

how many authors have had no editors?
An editor is helpful for two reasons: To catch the mistakes you don't, and to ensure that the publisher's name doesn't get tarnished by subpar writing.
It shouldn't be the editor's job to remind you to capitalize the first word in a sentence. Meet them halfway.
What's more, we're rapidly moving towards mediums that don't need a "publisher", per se -- anyone can start up a blog, or ramble on Slashdot, without any editor at all. If you think it's worth having an editor correct your grammar in a dead-tree book, surely it's worth having correct grammar in what you write online -- but do you really want to hire an editor for your blog? At that point, it just makes economic sense to learn some "basic common English" skills yourself.

--
Don't thank God, thank a doctor!

Depressing by Comatose51 · 2009-09-25 17:00 · Score: 3, Interesting

Not sure if things were any better at one time but the way writing is taught today in public schools generates horrendous results. I remember being taught a very formulaic way of writing essays: six paragraphs, introductory paragraph, concluding paragraph mirrors the introductory paragraph, and all paragraphs start and end with some transition to next paragraph. Then there is the need to satisfy some specific length, although this is quite understandable. It took a college education and many years of reading to undo these "lessons" and really discover the joy of writing essays. Thank you Paul Graham and Nicholas Kristof among many others. I see the same thing happening to high school students I am mentoring. They write very boring essays with a ton of fillers full of sentences structured in a way to use more words than necessarily and make the meaning more ambiguous. Poetry aside, writing is to convey ideas and the value is in the ideas themselves, not really in the words and sentences. The way writing is taught today, the words and sentences get in the way of the ideas. The trend of using computers to grade papers is only adding to this rigid, boring way of writing. One thing I've learned about high school students is that even the low scoring ones are very clever at getting around rigid rules. I had seen a student who knew very little about biology do her homework by scanning in her book for specific phrases mentioned in the questions and looking for some semblance of an answer once she's found the phrases. By the time she was done, she hasn't even read the chapter but her answers would probably get her a "C" -- good enough for her. I'm afraid students will do the same in writing once they realize that computers are grading them.

--
EvilCON - Made Famous by /.

Re:Depressing by psnyder · 2009-09-25 18:41 · Score: 5, Insightful

I had seen a student who knew very little about biology do her homework by scanning in her book for specific phrases mentioned in the questions and looking for some semblance of an answer once she's found the phrases. By the time she was done, she hasn't even read the chapter but her answers would probably get her a "C"
This is the way I always did it, and it got me A's. In fact I was taught to do this in a 6th grade "Study Skills" class. Ironically, it's a very good skill to have in the "real world" as it's a way of quickly obtaining the information you need. You could even draw a parallel between this and Googling something or any kind of computer "find" or "search".

The ability to skim for an answer is not a problem. It's one of the solutions that children employ to deal with a school system that puts more emphasis on grades rather than inspiring them to actually learn a subject. The "inspiration" to get good grades works for some (especially with parental support), but with "average" being a 'C' (often a very shallow understanding), it can be argued that it's not working for most.

As you said, "It took a college education and many years of reading to undo these "lessons" and really discover the joy of writing essays."

Skimming is a skill. Learning a system, and figuring out to survive in it is also a skill. The emphasis on that 'joy' is what's usually lacking. Get a student inspired and the rest usually takes care of itself.

Re:Context... by Anonymous Coward · 2009-09-25 17:20 · Score: 5, Funny

Yeah because when it's written we can see the spelling difference between 'flies' and 'flies' and that ruins the joke.

Re:kairos by Anonymous Coward · 2009-09-25 17:39 · Score: 3, Insightful

Although I take no part in this debate, I would ask you not to mistake an appeal to authority as factual knowledge.

Re:I doubt it! by kklein · 2009-09-25 17:44 · Score: 5, Interesting

As an English prof myself, I'd like to confirm that we spend a lot of time on students' papers. Good papers are easy to breeze through, but the worse the paper, the more time it takes.

As for machine-grading goes, people have been working on that for 30 years. I have no doubt that, statistically, it can provide useful results.

The problem I'm seeing in these comments, however, is a common confusion of testing for assessment and standardized testing. I can't imagine using software to grade a student's paper in class. The student-teacher relationship is a personal one. That person is paying me to help them get better at writing, for example. It is my job to pore over that paper and show them where and how they can improve.

I am also a tester (I actually mostly work with multiple-choice data, but I've also worked on performance rating--speaking and writing). The relationship between a rater and an examinee is very different from that of a teacher and student. The examinee is paying the rater to put them on a scale with other people. This is not a fine-grained assessment; it is always done at extremely "low resolution." When rating a paper for something like the GRE or other standardized test, it is the rater's job to compare the paper to scoring rubrics and make a call on which box of text best describes the paper, and then make note of the number in that box. That's it. It can't really go any more in-depth than that.

For this reason, your comment about "five-paragraph themes" is an important one: Test task design always needs to be clear about what kind of performance is expected, because it is nigh impossible to write rubrics that can be applied to any performance (believe me on this, I beg of you). However, this is actually a question of test specification, not of the software or raters in question. Personally, as someone who works in EFL, I am actually in favor of retaining the "five-paragraph" formula, at least for timed essay tasks. That format is at the heart of all good rhetoric. Yes, it's stilted and silly, but if you can do it, it means that you know basically how information is expected to be organized in Western, especially Anglophone, societies. No good writer would actually use it, but any good writer could.

Again, this is about putting people in boxes, not reading their essays. I can rate a 1-page essay in about 2 minutes, with excellent model fit (I have always used many-facet Rasch modeling for my multi-rater performance testing). I have no doubt that software could be employed whose ratings would be highly predictive of those of human raters.

Bland? by nick_davison · 2009-09-25 18:02 · Score: 3, Funny

"or are people going to have to learn an especially bland form of English to pass exams?"

Forget bland. I'm waiting for the first student to figure out how to write an exploit that hacks the software from within their essay.

Whether:

"It was the best of times, it was the worst of times \'$grade=100;"

or

"Johnny, why did your essay contain slightly over thirty two thousand spaces followed by some weird looking codes?"

No and no by grikdog · 2009-09-25 18:46 · Score: 5, Interesting

I've scored English essays for professional testing services, and I've seen the results of robot scoring. It's pretty shoddy. No, computers are not able to distinguish between a paragraph of As I Lay Dying (William Faulkner) and a gallon of sophomoric babble by say, yours truly. However, within the confines of a particular exam, where the topic is known, responses are predictable, and all the supplicants hew to the general line, the 'bots can detect subpar, adequate, above average and (sometimes!) abnormally brilliant expository prose, thereby ranking papers reasonably well on the usual six point scale.

It's worth pointing out that certain types of exams are designed to elicit extraordinary prose from respondents, that which yields a sense of competence or even brilliance, say. In these cases, the idea is not so much to detect the high end of the bell curve, but to identify the tiny pool of applicants who may be capable of Nobel Prize work in future realms of science or service. No 'bot can do that job, just as no 'bot except Deep Blue can beat Gary Kasparov, and no 'bot at all deserves the monicker Fujiwara no Sai (although Go-playing 'bots are approaching the mid-levels of highly ranked amateur players).

That's the objective part. My personal opinion is that using robots to sort the hopes and aspirations of college-bound men and women is just begging for lawsuits. It's an approach in which differences of opinion quickly escalate to class action against universities as well as test administrators, and would not be an approach I could comfortably recommend.

--
``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_

Re:Context... by Ronald+Dumsfeld · 2009-09-25 19:18 · Score: 3, Informative

The correct quote is, "Time flies like an arrow, fruit flies like a banana."

--
Where's the Kaboom?
There's supposed to be an Earth-shattering Kaboom.

How will it mark this poem ? by Alain+Williams · 2009-09-25 19:39 · Score: 4, Interesting

Will it decide if the following is well spelled ? If it doesn't like the spelling, will it give it marks for irony ?

My New Spell Checker

Eye halve a spelling chequer
It came with my pea sea
It plainly Marx four my revue
Miss steaks eye kin knot sea

Eye strike a key and type a word
And weight four it two say
Weather eye am wrong oar write
It shows me strait a weigh

As soon as a mist ache is maid
It nose bee fore two long
And eye can put the error rite
Its rare lea ever wrong

Eye have run this poem threw it
I am shore your pleased two no
Its letter perfect awl the weigh
My chequer tolled me sew

(Sauce unknown)

Re:Context... by j-beda · 2009-09-26 03:16 · Score: 3, Funny

My eight year old son has recently been enjoying this type of thing in the English language. He asked me this one: "What's the difference between chopped meat and pea soup?" - "Most people can chop meat, but nobody can pee soup."

Hey, HE thought it was funny.

Re:kairos by rastilin · 2009-09-26 03:18 · Score: 3, Insightful

Although I take no part in this debate, I would ask you not to mistake an appeal to authority as factual knowledge.

I begin to suspect that quoting "logical errors" is a new form of karma whoring. The appeal to authority only means that a person isn't automatically correct simply because they are in a position of power. What you failed to note in your flurry of smugness is that we have a person who actually has first-hand information on the subject. Thus making his perspective, while not automatically right, far more relevant to the subject than that of a thousand slashdotters.

--
How do you kill that which has no life?

Slashdot Mirror

Computers To Mark English Essays

25 of 243 comments (clear)