Slashdot Mirror


Computers Paraphrase English

AhaIndia submits a link to a story discussing computerized paraphrasing of English news articles. This technology, destined to eventually replace most reporters with very small shell scripts, is thankfully still in its infancy.

212 comments

  1. hmm... soudns familiar... by Dorothy+86 · · Score: 4, Funny
    This technology, destined to eventually replace most reporters with very small shell scripts

    This shirt?

    1. Re:hmm... soudns familiar... by Anonymous Coward · · Score: 0

      It's funny to imagine that phrase being spoken with the voice of Comic Book Guy from the Simpsons.

    2. Re:hmm... soudns familiar... by Anonymous Coward · · Score: 0

      mod parent up, it's funny, not redundant

    3. Re:hmm... soudns familiar... by tbone1 · · Score: 1
      Actually, I thought Gannett had already done that for their papers.

      --

      The Independent: Reverend Spooner Arrested in Friar Tuck Incident - ISIHAC, Historical Headlines
    4. Re:hmm... soudns familiar... by Anonymous Coward · · Score: 0

      I for one for one welcome our new shell script overlords.

  2. Automated slashdot? by TwistedSquare · · Score: 3, Funny
    This technology... thankfully still in its infancy.

    So one day instead of complaining against michael and co., everyone will be moaning about someone else's code - seems more appropriate for a nerd site somehow ;)

    1. Re:Automated slashdot? by evanbd · · Score: 1

      At least it can use a spellchecker. And it can probably catch dupes on occasion, too, with some work. I don't know what you'd be complaining about.

    2. Re:Automated slashdot? by Jadsky · · Score: 1

      This would be perfect for Slashdot. It's also a place where multiple readings of the same exact news story appear many times each day. This way, each time a link is submitted, the paraphraser could dupe it automatically and queue it up to go out every few days.

    3. Re:Automated slashdot? by Lord_Dweomer · · Score: 1, Funny
      "This technology is thankfully still in its infancy."

      I think Michael misspelled 'unfortunately'. But what am I saying...god forbid we have a day when scripts take over Slashdot. Of course, they'd probably program them to dupe and put in random M$-bashing statements.

      --
      Buy Steampunk Clothing Online!
    4. Re:Automated slashdot? by dnahelix · · Score: 1

      It might even know the proper use of to/too and your/you're.

      --
      Slashdot Eds Link Anonymous Posts With Logged Posts
      They Are Vermin Feeding On Each Other's Feces.
      I Hate \.
    5. Re:Automated slashdot? by Steve+Franklin · · Score: 3, Insightful

      "It might even know the proper use of to/too and your/you're."

      Yeah, but can it manage to use "There are" instead of "There is" with a plural subject?

      Actually, the long known solution to most of these *oh so difficult* translation problems is to translate everything into a neutral interlanguage like Interlingua and then translate that into other languages, sending the interlingua version along for the ride, thus preventing degradation in further translations. Then all that local linguists have to concentrate on is ONE set of problems: translating their local language into and out of Interlingua, and Interlingua, being tightly defined, is much easier to machine translate into and out of other languages. So...all this lunacy of trying to machine translate Chinese into English, German, Hungarian, Estonian...--you get the picture--is an incredible waste of time and resources and isn't the best way to solve the problem.

      --
      Hic iacet Arthurus, rex quondam rexque futurus.
    6. Re:Automated slashdot? by Anonymous Coward · · Score: 0

      >>>> This technology... thankfully still in its infancy.

      >> So one day instead of complaining against michael and co., everyone will be moaning about someone else's code - seems more appropriate for a nerd site somehow ;)

      Whatever they have, no matter how primitive, if they manage to replace Eugenia, consider it sold!

      Heck, if they just put Eliza telling everyone to behave, it would still be worth my money. :-)

      Happy New Year to everyone! To you, too, Eugenia. To show I'm not revengeful, I wish you get promoted to Manager... and relocated back to Greece! Or better yet, to China... (after all, Chinese is all Greek to me).

    7. Re:Automated slashdot? by crapulent · · Score: 1

      I don't see why you'd need any fancy AI or genetic algorithms to mimic the slashdot submitters. Most of them just copy+paste the first two or three sentences of the article, without adding anything. That could easily be replaced by a perl script in about 20 minutes.

  3. Interesting use of Technology by rkz · · Score: 2, Troll

    Google news already uses a similar technique to decide what to put in the summary beneath the headline, it does not paraphrase but it does actually extract a summary.

    Also if you have Microsoft Word lying about there is a feature called Auto-summary which is suprisingly good, amost as effective as going through a document yourself looking for the main points.

    1. Re:Interesting use of Technology by KRYnosemg33 · · Score: 0

      Don't get me wrong, Google news is great, but I've seen it capture related links and captions from images unrelated to the story! Even the big boys need to improve their tech.

    2. Re:Interesting use of Technology by Tim+C · · Score: 4, Informative

      I've provided search engine functionality to a few sites using Verity's K2 product, which provides a similar piece of functionality. If you (programmatically) ask it to return a summary of each hit, what you get is what it considers to be representative of the document as a whole, not merely the first few lines, or a paragraph, or whatever. It actually works pretty well, but then it should, as (a couple of years ago) it cost almost as much as my house...

    3. Re:Interesting use of Technology by znu · · Score: 2, Interesting

      Mac OS X users can select text and choose 'Summarize' from the Services menu in any Cocoa or Services-enabled Carbon application. Summarization is also available to any application programatically through the Find By Content API.

      --
      This space unintentionally left unblank.
    4. Re:Interesting use of Technology by garyok · · Score: 0, Offtopic

      Here's MS Word's autosummary of the Fellowship of the Ring:

      Frodo! Frodo shuddered. Frodo gasped.
      Frodo remained silent.
      Frodo halted. Frodo asked. Hobbits! Sam! Frodo! Old Gandalf. _Dear Frodo,_
      _GANDALF_.

      Frodo answered. Good night, Frodo! Frodo! '
      Frodo shivered.
      ' answered Frodo. ' cried Frodo. Frodo, Frodo! Frodo actually laughed.

      So there you go. No need to read the book or waste 24.99 getting the DVD. Cool, huh?

      --
      One of the penalties for refusing to participate in politics is that you end up being governed by your inferiors - Plato
  4. fox_news.sh by sinclair44 · · Score: 5, Funny

    #!/bin/sh curl $1 > paraphrase > slant -patriotic -stupid > fox_news_story.txt

    --
    Omnes stulti sunt.
    1. Re:fox_news.sh by Anonymous Coward · · Score: 0

      Do you know the difference between '>' and '|' ??

    2. Re:fox_news.sh by drakaan · · Score: 4, Funny
      perl makestory.pl -slant "liberal dem party-line" -severity "raving" -subject "Cheney Halliburton motives"

      Fair is fair ;)

      --
      "Murphy was an optimist" - O'Toole's commentary on Murphy's Law
    3. Re:fox_news.sh by drakaan · · Score: 3, Insightful

      Well, some of us (me, for instance) listen to fox news and NPR...my own personal take on fair and balanced...and see that the party line is alive and well on both major sides of the political fence. That's part of the reason I'll never be a democrat or a republican (or a libertarian, or any other label you want to stick on a like-minded group of people). The news has information in it. Look for it, compare notes, and make up your own mind what's news.

      --
      "Murphy was an optimist" - O'Toole's commentary on Murphy's Law
    4. Re:fox_news.sh by Anonymous Coward · · Score: 0

      #!/bin/sh curl $1 > paraphrase | fair | balance > /dev/null

    5. Re:fox_news.sh by EvilTwinSkippy · · Score: 2, Interesting
      I'm a big fan of completely blocking out the major new outlets and simply investigating matters on my own. I take a mental highlighter to the actual facts as stated in an article, and disregard the interpretation.

      I have discovered there are very few people actually collecting news. In many cases I boil a dozen or so stories down to a single quote from the same source, or even funnier, one reporter's misinterpretation of another reporter's work. My favorite is when an american reporter writes that "the bomb was detonated from approx 330 feet away." They ripped of someone else's estimate of "about 100 meters."

      You are correct though, anyone who takes what they see or hear at face value is a fool. Regardless of the source.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    6. Re:fox_news.sh by niom · · Score: 4, Funny
      Fair is fair ;)

      Except when immediately followed by "and balanced".

      --
      -- Repeat with me: "There is no right to profits".
    7. Re:fox_news.sh by drakaan · · Score: 1

      Scroll up, and be enlightened. There's news aside from Fox's that's unfair, as well.

      --
      "Murphy was an optimist" - O'Toole's commentary on Murphy's Law
    8. Re:fox_news.sh by Anonymous Coward · · Score: 0

      The news has information in it. Look for it, compare notes, and make up your own mind what's news.

      Actually, an excellent rule of thumb is to pose the following question about every news story and every ad. Assume that everything said is factually correct, but intended to mislead you. Ask yourself what the true story might be if it were consistent with the facts, but contrary to the message.

    9. Re:fox_news.sh by Anonymous Coward · · Score: 0

      this should be modded up! this man is on the money!

  5. Very small shell scripts by bunnyman · · Score: 5, Funny

    Yes, but until it can post duplicate articles with slightly different phrases, it will never replace CowboyNeal!

    1. Re:Very small shell scripts by EvilTwinSkippy · · Score: 3, Funny

      Yes, but a system will not take the place of CowboyNeal until it posts duplicate articles with slighty different phrases!

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    2. Re:Very small shell scripts by matth · · Score: 3, Funny

      After taking the place of CowboyNeal will a system like posting duplicate articles, phrase slightly different? Yes!

    3. Re:Very small shell scripts by EvilTwinSkippy · · Score: 3, Funny

      CowboyNeal's system is posting slighty different phrases. Yes takeing me places!

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    4. Re:Very small shell scripts by Anonymous Coward · · Score: 0

      Do 'slightly different phrases' include misspellings?

    5. Re:Very small shell scripts by mabhatter654 · · Score: 1

      But can it post without RTFM??? well can it? At least /. will be safe.

    6. Re:Very small shell scripts by ahodgkinson · · Score: 1
      How about the following for summarizing international news:

      #!/usr/bin/perl
      @summary = (
      "Major corporation sue its customers over IP rights.",
      "USA sends advisors to foreign country to combat terrorism.",
      "EU negotiations end in disagreement.",
      "Disaster in China kills thousands.",
      "India and Pakistan threaten war.",
      "Another Microsoft security hole discovered.",
      "UN pledges to end hunger and poverty within 10 years.",
      "Government officials freed from corporate corruption charges."
      );
      print( $summary[ int( rand( scalar( @summary ) ) ) ] . "\n" );

      Strictly speaking, one might argue that this is a news generator.

      --
      ---- It won't be as bad as you fear or as good as you hope, but it will take twice as long as you plan.
  6. a new step for natural language processing ? by Anonymous Coward · · Score: 0

    is this a new step for language processing or automato. i believe that the "paraphrasing" will leave somethings to be desired. We will see if we get articles that say stuff like "GGGGunit buys microsoft"

  7. But still.... by AgBullet · · Score: 5, Insightful

    won't you need someone to write the stuff to be paraphrased in the first place?? explain to me how that replaces reporters with small shell scripts.

    1. Re:But still.... by gregfortune · · Score: 1

      I think the joke was directed at "reporters" like those found on /. They would be first in line to be replaced as their entire job seems to be to collect interesting articles and repost. Heck, the shell script would probably get the whole dup problem figured out too ;o)

    2. Re:But still.... by EvilTwinSkippy · · Score: 3, Insightful
      There are reporters? Crap, every other article in my local fishwrap is Rueters, the other half is AP. There are one or two articles for local color, generally homicides or documenting yet more ways our local government is a) corrupt, b) inept, and/or c) playing partisen politics with/against the state goverment.

      By the time it's printed in the "News" its usually pretty old.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    3. Re:But still.... by Anonymous Coward · · Score: 0

      Corporate press release -> summarize -> paraphrase -> broadcast.

    4. Re:But still.... by geoffspear · · Score: 1
      And you think Reuters and AP just have big AI computers sitting around writing those wire reports?

      The fact that your crappy local paper doesn't employ any reporters doesn't mean that there aren't reporters somewhere writing all the original news stories. Those people will not be replaced by shell scripts. If the posting had claimed editors would be replaced, that would be more believable.

      --
      Don't blame me; I'm never given mod points.
    5. Re:But still.... by Black+Hitler · · Score: 1
      If the posting had claimed editors would be replaced, that would be more believable.
      Now there's a prospect I imagine most reporters would get behind 100%.
    6. Re:But still.... by inKubus · · Score: 1

      No Crap.

      It used to be that a reporter was there, got the facts and then got the feeling too. But I guess with all the information these days, there's no way people could do everything. Still, it's just going to seem unhuman when one day a news story is a few lines of XML:

      And the "reporter" is just a software program that turns that into a readable "story". Then you can choose how you want the news displayed, with various schema. Like if you're in a good mood, you can put in the happy schema and it doesn't mention anything about deaths, or you're feeling morbid and you want the names and pictures of the dead bodies you can enable morbid mode.. Everything they can get is encoded in the stream, and you're your own filter.

      I actually kindof like this idea, actually. Because then I can filter my own shit and of course meet people I trust and view their filters and not have to worry about the corporate spin that commercial news will eternally have.

      --
      Cool! Amazing Toys.
    7. Re:But still.... by benjamindees · · Score: 1
      documenting yet more ways our local government is... inept

      They don't have to try very hard. I remember a couple of years ago the local news had a shot of a police car that had run into a postal truck. No comment necessary, just ten seconds worth of footage. That image is with me every time I vote.

      --
      "I assumed blithely that there were no elves out there in the darkness"
    8. Re:But still.... by C10H14N2 · · Score: 1

      Just imagine all /. users replaced by very small shell scripts and you've got it. I'm suspicious the that process was begun long ago.

  8. so, will the /. version... by Anonymous Coward · · Score: 0

    so, will the /. version include a short term memory, a broken spelling checker and witty random comments from a fortune database? and how long before it would be able to pass the turning test passing as a human /. editor?

    (going for humor, but if you see flamebait, go for it.)

  9. The Ultimate Tool For Plagiarism by popo · · Score: 4, Interesting


    All someone has to do now is marry this technology with a term-paper database, and "Hello Original Work!"

    The question will then become, how many different unique "paraphrases" can the system ultimately generate?

    --
    ------ The best brain training is now totally free : )
    1. Re:The Ultimate Tool For Plagiarism by Anonymous Coward · · Score: 0

      It doesn't generate. It just ID's patterns of describing the same event and uses those patterns. It assumes a certain integrity and directness in the source material.

      This technique applied to editorials will result in "bush is great" getting translated to "bush is evil" for the same event.

      Interestingly, you could use this to predict how Al-Jazeera or Reuters would report the same event.

    2. Re:The Ultimate Tool For Plagiarism by EvilTwinSkippy · · Score: 2, Interesting
      Actually you can use topic maps to decompose a body of work into individual statements and then use a set or randomly generated "flavors" to re-constitute the facts into an original work. The rules about what goes where are pretty cut and dry.

      More stuff to help people avoid shitwork, only for humanity to discover our purpose in life IS to do shitwork.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    3. Re:The Ultimate Tool For Plagiarism by KrispyKringle · · Score: 2, Interesting
      This isn't necessarily the big problem it appears. I've heard of many college professors and high school teachers using automated plagiarism detectors in the news, and that strikes me as stupid, as well. I mean, if a student has to write a paper on _The Bell Jar_, I'm sure he can find one online. But in most classes, you expect some level of familiarity with the students, on part of the teacher. If a kid who sleeps in every class and who's comments tend to be off topic or stupid turns in a paper worthy of The Atlantic Monthly, the teacher ought to realize something is up. Sure, it may not be absolute proof of wrongdoing, but it warrants a talk with the student about his erratic performance.

      College courses might be a bit tougher; there are certainly plenty in which the course is simply too large for the professor to know all the students, but in most courses, the subject matter is novel enough that finding a paper online that's relevant should be pretty difficult.

      I went to a high school with quite a lot of cheating (probably at least half the students engaged in it occasionally or more), and it really did get me. The co-valedictorian was this fat bitch who cheated on a regular basis (and even had been caught at it). And even in college I've seen some things that were borderline or worse. But there are better answers to this than ``let's do everything we can to stop cheaters.''

      First, cheating is symptomatic of misplaced priorities and pressures. The students who cheated the most were the ones who didn't really understand why they should go to school.

      Second, as trite as it sounds, you really are only cheating yourself. The kids who cheated the most in my high school didn't get very far (save perhaps for the co-valedictorian). I never cheated, and (not to toot my own horn, of course) I was the other co-valedictorian, I went to the prestigious school, I had the career opportunities, etc. The thing that always struck me as funny was that most of the kids who cheated didn't do very well anyway.

      And finally, even if some people cheat and get good grades, does it really matter? Your grades aren't relative to others, they are your own. Sure, colleges look at what percentile you are in, but I don't think cheating ever helped anyone that much to begin with. And grades themselves, cheating or no, are pretty meaningless; grade inflation and average GPAs vary enough from school to school as to be useless as objective indicators anyway. You hope colleges can see a bit more into the personality of their applicants than simply the GPA (and if they can't, it's the admissions system, not cheating, that's at fault).

      I guess I'm a bit offtopic now. Ah, well.

    4. Re:The Ultimate Tool For Plagiarism by iabervon · · Score: 1

      And, if this technology is sufficient to write good term papers based on online information, what is the point of learning to write term papers? Certainly any students who have access to such technology will have no use for doing it themselves after school, when the technology will be more advanced and more money will be available for it.

      At that point, teachers ought to be teaching students how to get such software to produce the effect they want on the audience. For that matter, they could try teaching and encouraging students to come up with new ideas and conclusions to feed the computer.

      As far as term papers which are meant to reflect not on writing skill but on research skill, they really ought to be replaced with having the students explain the material to the class. Even if the presentation is written by a computer, the student will have to find the necessary input, and have to learn the material well enough to give the presentation sensibly, which is essentially the point of literature research.

      The skill of coming up with a new way to express the same ideas that have been expressed before is, ultimately, useless. At the same time, advances in communications have made it much more possible to find the raw material to come up with new conclusions. Writing unadorned prose will eventually become as insignificant a skill as long division. The available research techniques are also changing drastically. At some point, it will be necessary to ask what the point of term papers is supposed to be, and how best to test those skills.

      Consider this assignment: using this database of past essays from this class, and other similar classes at this school and others as secondary sources, and using the primary sources that these essays cite, find a position or focus on a topic that interests you that is not held by any of the essays, and argue it, arguing against each of the essays which contradicts your position. Simple enough to pose (given such a database and a search engine) and to do, but you're not going to find an essay in the database or elsewhere that does the trick, because you have to contradict every local essay in some way, and any essay not in the database won't reference the necessary essays in the database.

    5. Re:The Ultimate Tool For Plagiarism by SurgeonGeneral · · Score: 3, Interesting

      Yes, we've all heard the arguments against cheating.

      Especially the, 'you're only cheating yourself' one.

      Its irrelevant because this will not affect the way we cheat so much as the way we learn and the way we write. Think about it beyond your personal experience in high school.

      1. On the micro scale, an autosummerize feature like this will allow someone to take another's essay and put their facts into their own words. But I dont see how this makes any difference to the cheater other than saving him an hour. To see this tech as a problem on this level is to ignore the future.

      2. On the medium scale, it will allow someone to take multiple papers, extrapolate all the facts and their sources and then string them together again with their own interpretation. This will allow the learner to come up with a new argument and possibly a fresh insight based on the available information. In this case, it saves the learner a few hours of reading, though he has to do the same amount of thinking and logical reasoning. Is it a shame that the person doesnt have to waste time reading irrelevant information? Still, looking at it on this level is not thinking very deep.

      I take history in university and the essays we have to write are done by data mining books. Lots of books. We have to read large amounts of material in as short a time as possible. We have to find out what is important and what is relevant. Am I really learning how to analyze facts? I dont think so. I am learning how to write university papers and theorize based on incomplete information. I am learning how to make a lot of wasted time look like a lot of work.

      3. The macro scale. What if every book ever written was replicated in full electronically and available for parsing. What if I could extrapolate every fact from every source even remotely relevant to a topic. I'm right back to where I was before : hours and hours of reading. Yet, my argument will be more solid and my information more complete then it ever could be using the outdated method of data mining: looking in the indexes of books. In this case, what am I learning? I am learning how to think. I am learning how to spot holes, inconsistancies, fallacies, and etc. In this case the technology has eliminated cheating altogether because there is no single source to copy from. And if I want to understand how all these facts are related to each other I either have to think about it or read an other authors interpretation of it. (thus I could still cheat in the classical sense)

      4. But lets look at it on one more level, the very tiniest level and the most futuristic. A well constructed paragraph or sentence cant be parsed down, and wouldnt make sense if it was. The facts contained in a paragraph only become important in relation to one another. So in the end, it could just change the way we write. Enough with this puffed up crap, enough with padding your papers - either state whats important or nothing at all. A well constructed essay in the future will be one that cant be "autosummerized" without losing all its intelligability.

      --
      -- "Man is born free, and everywhere he is in chains." Jean Jacques Rousseau
    6. Re:The Ultimate Tool For Plagiarism by geoffspear · · Score: 1
      The point of a term paper isn't to show that you can express the ideas in your paper. It's to force you to actually learn about some topic in enough detail that you can explain it to someone else.

      The ability to do research and learn new things isn't going to be replaced by technology. When everyone's as dumb as you'd like to make them, who's going to write the software that does their thinking for them?

      --
      Don't blame me; I'm never given mod points.
    7. Re:The Ultimate Tool For Plagiarism by thrillseeker · · Score: 1
      if this technology is sufficient to write good term papers based on online information, what is the point of learning to write term papers ... teachers ought to be teaching students how to get such software to produce the effect they want on the audience.

      Since it's so comfortable one wonders why the baby ever leaves the womb.

    8. Re:The Ultimate Tool For Plagiarism by KrispyKringle · · Score: 1

      Your points all are valid, I think, and actually very interesting, but I don't think this means there's no such thing as cheating. If you plagiarize, the issue is not the text you copied, but the ideas. So yeah, perhaps this system would allow us to actually crack down on plagiarists, as well, by detecting copied ideas, even if restated. But I suspect we might find that a lot of papers aren't really as original as the authors thought.

    9. Re:The Ultimate Tool For Plagiarism by Kent+Recal · · Score: 1

      Excellent article.

      Am I really learning how to analyze facts? I dont think so. I am learning how to write university papers and theorize based on incomplete information. I am learning how to make a lot of wasted time look like a lot of work.

      It's all about the figures. They need a way to "objectively" measure your skill level and progress.
      The only way for them to do that is by forcing you to play by their formal rules and measure how successful you were in doing so.

    10. Re:The Ultimate Tool For Plagiarism by Saeger · · Score: 1
      The ability to do research and learn new things isn't going to be replaced by technology.

      Regurgitating boring facts and rote memorization WILL be replaced by technology eventually. A brain-computer interface -- which isn't that far off -- will, in essence, allow some future "Google" to be an extension of your brain's main memory. This still isn't the holy grail, though, because it only decreases access time to huge databases of information, but doesn't do much to decrease the time it takes to fully absorb something, understand it critically, and fork off new ideas (for that we need even more advanced Intelligence Amplication).

      --

      --
      Power to the Peaceful
    11. Re:The Ultimate Tool For Plagiarism by iabervon · · Score: 2, Insightful

      Since the point of term papers is not, in fact, to learn to write term papers, it is likely that, as the production of term papers becomes possible while missing the point, the assignment should be changed to retain the point.

      The ability to do research (of known information, at least) has already been changed by technology. Google, PubMed, and other sites make real literature research possible for high school students with just a web browser, and the kind of slogging through printed books that I learned in high school is now entirely obsolete, like long division. Doing a term paper on the Oneida Community in high school, I was limited to the books in my high school library and town library (and my parent's tableware), and I had to look in card catalogs and chase references to do it. Today, I can just type "Oneida Community" into Google, and I get primary sources, the site's own information, photos, and various essays on the subject. The old skills simply don't produce as good information as is trivially available today.

      Technology replaces the gruntwork in research, and allows a given assignment to take less time finding the information and more time thinking about it. If the point is to teach students to learn new things, shouldn't it be encouraged to eliminate with technology all of the parts which are not part of learning new things, but rather part of demonstrating that you have learned them?

      If you are required to come to a novel conclusion after looking at everything written on your topic, and then argue that position, it doesn't matter how little of the text you personally writing originally; if the result is a logical argument, you must have understood the topic and selected suitable raw material for it.

      I think the real problem with term papers is that you are encouraged to come to the same conclusions that everybody else does, but to put it in your own words. In writing such a term paper, good research will turn up something that is exactly what you want to say. New things aren't part of the assignment at all. The task is essentially to rephrase something that's already well-written, and this task will soon be automatable.

  10. Re:Nice ad... by d3faultus3r · · Score: 1

    What about the Despair calendar motivation: if a pretty picture and a cute saying are all it takes to motivate you, you probably have an easy job the kind robots will be doing soon. Is it really any surprise? After all, that t shirt saying was probably invented by someone who reads /.

    --
    read my blog
    musings on politics and technol
  11. Like Word's AutoSummarize by nstrom · · Score: 0, Redundant

    This reminds me of Microsoft's AutoSummarize feature, which has been a feature since for quite some time (I seem to remember using it in high school with Word 6)... That feature basically tried to keep the first, last, and topic sentence of each paragraph, more or less, and you could give it a percentage value to shrink the text down to. (ie. 50% to excerpt half the original text)

    1. Re:Like Word's AutoSummarize by MikeXpop · · Score: 1
      Mac OS X has this too. Select a paragraph or so of text, go up to the appname menu (the one in bold), go to services, and click summarize. An app will open, and you can drag a bar for the amount to go away. It does't really work that well. For example, here's what it gave the former paragraph (at 75% summary).
      This reminds me of Microsoft's AutoSummarize feature, which has been a feature since for quite some time (I seem to remember using it in high school with Word 6)... That feature basically tried to keep the first, last, and topic sentence of each paragraph, more or less, and you could give it a percentage value to shrink the text down to.
      --
      Etiquette is etiquette. He kills his mother but he can't wear grey trousers.
  12. beta by fuck_this_shit · · Score: 0, Redundant

    it's already being betatested on slashdot, one of the main bugs is that the scripts post the same news multiple times, just paraphrased slightly differently. Did you really believe humans are able to make such an amount of doubleposts? please. be realistic...

  13. Dupe by greenhide · · Score: 5, Informative

    Unfortunately, there isn't yet a way to use computers to detect dupes.

    Or Is there?!?

    --
    Karma: Chevy Kavalierma.
  14. School Reports by gregfortune · · Score: 3, Insightful

    So, will there be difference between paraphrasing and copying now in an educational setting? Seems like this could make a report pretty easy...

    1) Brainstorm some key points/ideas
    2) Have this program data mine for relavent articles online
    3) Feed sections of each article into the program and have a finished paper

    Granted, the tech isn't quite that powerful yet and probably wouldn't do a whole paper, but it sure looks like it could supply several paragraphs of material per page...

    1. Re:School Reports by roninmagus · · Score: 2, Interesting

      I do very much hope so; as a computer science major who hhaaatteess general studies classes, I hope very much that the English/History classes which so graciously waste my programming time with useless writings go down the drain. Of course, my website is entirely such useless writings, so I stand trumped.

      However, I did meet my girlfriend and hopefully future wife in Sophomore English at MTSU. Go figure.

  15. Rethink English ! by Thinkit3 · · Score: 4, Informative

    Lojban is among the more interesting newer languages. It can be parsed just like c! Esperanto is somewhat interesting. English will be regarded in the future as a curious artifact--it was swept along with the technology revolution simply because ASCII didn't include accents and extra marks on letters. Eventually we'll get away from vocalization all together and have purely numerical, written laguages.

    Right now, trying to work with English in computers deals way more with the strangeness of the language than the more interesting issues of cognition that lie underneath.

    --
    -Libertarian secular transhumanist
    1. Re:Rethink English ! by Patrik_AKA_RedX · · Score: 1
      Eventually we'll get away from vocalization all together and have purely numerical, written languages.
      That would be dubbleplus good. Instead of a spell check we would be using a CRC checker.
    2. Re:Rethink English ! by TwistedSquare · · Score: 2, Insightful
      English will be regarded in the future as a curious artifact

      One man's informative is another man's troll... Esperanto was interesting and look where it got. Nowhere. People will speak in what's easiest. English is becoming a de facto standard that will continue to be the most spoken language in the world. People won't use odd designed languages because it will be harder than current languages, which got where they are today though iterative refinement to be the best suited language for us to communicate in.

    3. Re:Rethink English ! by Just+Some+Guy · · Score: 2, Interesting
      Right now, trying to work with English in computers deals way more with the strangeness of the language than the more interesting issues of cognition that lie underneath.

      That's true. Computer languages that don't stick close to "regular" human expression are very popular and growing quickly. Languages that resemble written English are dwindling rapidly.

      After all, code is meant to be written, not read, and programmers should strive to write such that their work can't be understood by anyone not an expert in the language they're using.

      Put another way: as long as I have to fix other people's code, or I want my boss to be able to read my code without me spending an afternoon explaining it to him, I really hope it doesn't look like a string of line noise. English-like constructs may be distracting for some, but they're pretty handy for the rest of us.

      --
      Dewey, what part of this looks like authorities should be involved?
    4. Re:Rethink English ! by Anonymous Coward · · Score: 0

      How did this get modded up?

      English...continue to be the most spoken language in the world? English is not (nor has it ever been) the most spoken language in the world, although it is the language used most on the internet (and with computers in general). It has recently reached the top three "native" languages (how they should be ranked depends on how you count - e.g. does a primarily Spanish-speaking American citizen count as English, Spanish or both? - but the top three are Chinese, English and Spanish).

      Most current human languages are not particularly easy, but they feel easy for people because they learn them from a very young age. Learning languages that are fundamentally different compared to the languages you are already familiar with after an age of about 10 is difficult, but if a new, less ambiguous language were developed and taught to young children, they might well consider it superior to other languages, although I'm not sure this is likely to happen any time soon.

      If you think that human languages naturally evolve toward something "better", you're disagreeing with a huge number of linguists. Human languages change fairly arbitrarily, as can be seen from how isolated communities form their own dialects that reflect culture more strongly than utility.

  16. Re:DUPE by Anonymous Coward · · Score: 2, Funny
    It's not a dupe; it's a computer generated paraphrasing of the earlier story.

    ~~~

  17. Fake literature by MAPA3M · · Score: 4, Funny

    Isn't this the way those trashy love novels are written?

    1. Re:Fake literature by amRadioHed · · Score: 1

      Not quite, but very similar. That is by far the most stunningly dumb book I came across in my stint working at a bookstore.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    2. Re:Fake literature by Lord+Kano · · Score: 1

      Isn't this the way those trashy love novels are written?

      Wow, it must take a hugh shell script to turn "Mary wanted the big strong muscle man to solve all of her problems. Henry, the big strong muscle man, was horny. They had unprotected sex. Everything was perfect from then on."

      Imagine what that could to to a Hello World program.

      LK

      --
      "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
  18. Or games... by A55M0NKEY · · Score: 3, Funny

    Someone set up us the bomb!

    --

    Eat at Joe's.

    1. Re:Or games... by Anonymous Coward · · Score: 0

      Somebody set up us the bomb. body. Sigh, get it right people!

  19. Reference by Epistax · · Score: 1

    ...most reporters with very small shell scripts...

    I know I heard this phrase (loosely) before, but does someone know the name of the reference?

    1. Re:Reference by Bryan_W · · Score: 1

      maybe from this tshirt

    2. Re:Reference by arth1 · · Score: 1
      ...most reporters with very small shell scripts...

      I know I heard this phrase (loosely) before, but does someone know the name of the reference?


      At ThinkGeek perhaps?
      Or one of myriads of signatures quoting this?

      Regards,
      --
      *Art
  20. Re:Nice ad... by danidude · · Score: 1

    Well, after all, they (/. and thinkgeek) are owned by the same group::

    "A month or so later we were Slashdotted. And promptly thereafter ThinkGeek was acquired by the good folks at Andover.Net who have since been acquired by the great folks at VA Software. Andover.Net then became OSDN which is the central entry point for the Open Source community's favorite web sites such as ThinkGeek (hey that's us!), slashdot.org, linux.com, sourceforge.net, and freshmeat.net. Pretty nice company to be amongst, eh? We're pretty proud of it!"

    --
    - no sig.
  21. I bet this will be used to cheat school homework by Via_Patrino · · Score: 1

    I've a lazy friend that when he needs to do some paperwork from school he just copy some text from internet and use auto-shrink resource from M$ word, so it texts become different from other people that ocasionally copy their homework from the same site.

    One thing this tecnology is able to do is improve homework cheating.

  22. Replace reporters?? by dyj · · Score: 2, Insightful

    How is this going to replace reporters? Reporters don't just paraphrase other reports. They actually are supposed to search for stories (hopefully factual!) on their own.

    1. Re:Replace reporters?? by EvilTwinSkippy · · Score: 1
      I think someone just wanted Journalists to know what it feels like to be a tech in this day and age. What they can't get a computer or a trained chimp to do, they will find some guy in another country who will do it cheaper.

      We know we will be in trouble when every commentary article begines with "I am thinking that..."

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    2. Re:Replace reporters?? by tbone1 · · Score: 1
      Until you can get computers to drink on the job, get paid by businesses to write advertisement for them (a la Enron, CART, etc), and fall for any buncomb that someone says in a serious voice, then they won't exactly 'replace' journalists.

      --

      The Independent: Reverend Spooner Arrested in Friar Tuck Incident - ISIHAC, Historical Headlines
  23. Something similiar existed on the Amiga by Serk · · Score: 3, Interesting

    Back in the late 1980's I had a word processor for my Amiga that had a function whereby it would do a global search and replace of every Xth word (User settable) with a synonym from the built in Theasarus... Very handy for those term papers I so hated in high school...

    I'm assuming this (Of course I didn't RTFA) is far more advanced than what we had back then, but the idea for this has been around for quite a while at least...

    --
    Never ask a geek why, just nod your head and slowly back away. -Rob Malda
  24. Possible outcome of computerized paraphrasing by CapnCarrot · · Score: 2, Funny

    AhaIndia submits story discussing paraphrasing of articles. This technology, destined to replace reporters shell, is still in its infancy. Huh, perhaps we'll still need humans after all . . .

  25. The next wave of layoffs... by geekpuppySEA · · Score: 1
    This is just the beginning. One of the coming sectors to feel the layoff-inducing effects of technology will be the wordsmiths: novelists, script writers, reporters, copywriters, editors, speechwriters, and translators. (The plus side of this are all the jobs that no one really wants or respects: some types of marketers, and the customer service reps who email (email only, since voice reco still sucks.)

    As inventors realized over the course of the 20th century that human capital could be replaced by factories and assembly lines, so will computational linguists make it clear over the next one that human language isn't just a biological phenom (that's what current theory proposes) but also a mechanism that is studyable and reproducible.

    It sounds like comp ling stands to be one of the next decade's hot-shit career options (in addition to intellectual property lawyer.) Now if there could only be more than, like, five or six linguistics departments who offered specializations in it, I could have a better selection of where to send my grad school apps! Who the hell wants to live in Tucson?

    (Just kidding, selection committee! Wow I love mariachi music, i'd love to come live in your city!)

    --
    Intelligent Design: because MATH is HARD.
    1. Re:The next wave of layoffs... by Anonymous Coward · · Score: 0

      You obviously don't understand any of the "wordsmith" professions. Putting prose on a page isn't the end all of any of the jobs you listed. Some one has to come up with stories, plots, themes, characters and their traits etc.

      A screenplay doesn't use prose, no one will want to read a comp generated novel, a reporter doesn't just "make up" his stories (unless he works at the Times)...

      The people important enough to employ speechwriters aren't going to understand if a computer generated speech is good or not so they'd need someone to look it over (a speechwriter).

      Those careers are about as vulnerable to this technology as computer programming.

  26. mod abuse?? by Anonymous Coward · · Score: 0, Offtopic
    i wonder if this comment has been modded down for reasons other than the content.

    Such modding is clearly abuse, since mod points are to simply judge the content of the post. If you don't like the poster, or his sig, then ignore it. But it seems there are some vengeful and jealous moderators out there who can't stand to see somone else get modded up for insighftul opinions.

    This clearly needs to stop.

    1. Re:mod abuse?? by Anonymous Coward · · Score: 0, Offtopic

      $100 sez that parent AC is rkz or one of his fellow K/W friends.

    2. Re:mod abuse?? by CrankyFool · · Score: 1

      I think it's mostly been modded down because the link in the sig is fairly solidly abusive, offensive, and misleading. You can't easily ignore something that someone is actively trying to obfuscate, and that link is a good example of such obfuscation. The post includes the sig, and as such the sig affects the post. It's perfectly reasonable to claim the post as a whole is a troll if the sig is egregious enough.

  27. Do you know what reporters DO? by DavidinAla · · Score: 4, Insightful

    For you to say that this technology will someday replace reporters makes me think that you're clueless about what reporters do. Do you realize that the biggest parts of a reporter's job are gathering facts and making judgments about 1) which stories are worth reporting, 2) which are the relevant facts about a story and 3) who's lying and who's telling the truth about a story? The actual writing that you see is many times almost incidental to most of what a reporter does. You might not like the judgments that a reporter makes (and I could agree with that in many cases), but software can't go out into the world and talk to people and use judgment and intuition to find information to write about.

    As an ex-reporter and editor, I find it laughable that anyone might think this technology will replace reporters. It's sort of like suggesting that machines that can read source code and interpret it can somehow figure out what new software people want and then write it. Both possibilities are equally insane.

    1. Re:Do you know what reporters DO? by Anonymous Coward · · Score: 0

      of course not. remember to look who posted this story, michael sims, one of the most incompetent, moronic, and mindnumbingly stupid people on slashdot.

    2. Re:Do you know what reporters DO? by Anonymous Coward · · Score: 0

      There are definitely still reporters like that, but there are a large number who just want to be celebrities, and unfortunately the dumbing down of the media (led by TV) has allowed this type to ruin the name of what was once a laudable profession.

      How many "current affairs" show are based around some anchor who does nothing but introduce themselves after each commerical break?

      "Hi, I'm Bimbo Tryhard".

      Who cares?

      Quality journalism takes time and resources. When all the people want are shots of celebrities, car accidents and accused-but-not-let-tried people trying to leave their houses, where is the motivation to actually take the time to prepare a story.

      Journalism used to have the power to change the world. I reckon they've lost it.

    3. Re:Do you know what reporters DO? by DavidinAla · · Score: 2

      But even BAD journalism requires abilities that software just doesn't have. Software can't have sources and the ability to call them on the phone. Software doesn't have the ability to differentiate between good information and patently false stuff. A bad journalist might write a sensational or even purely false story, but even doing THAT requires abilities that software can't have in anything like a forseeable future.

    4. Re:Do you know what reporters DO? by Linux_ho · · Score: 1

      The fact is that this technology is just replicating what people are already doing. The technology won't replace many reporters. It will replace the cut-n-paste people who have already replaced the reporters. Most "news organizations" are nothing more than AP chop-shops right now. You think there's a lot of fact-checking and analysis going on? I wish that were true, but it's just not.

      --
      include $sig;
      1;
    5. Re:Do you know what reporters DO? by ediron2 · · Score: 1
      DavidinAla asked:
      Do you know what reporters DO?
      This is slashdot.

      This is Michael, Hemos, and Taco we're talking about. What kind of dumbass question is that?!

      Of *COURSE* they don't know. Heh, even avoiding dupes, spellcheck and fact checking are alien concepts...

    6. Re:Do you know what reporters DO? by shaitand · · Score: 2, Interesting

      "which stories are worth reporting"

      With this technology, ALL of the stories could be reported.

      "which are the relevant facts about a story"

      odd, I myself get very pissed about reporters who don't give ALL the facts. If you mean summarizing, that is EXACTLY what this is supposed to do.

      "who's lying and who's telling the truth about a story"

      That's for the reader to decide. A reporter who makes judgements concerning what they are reporting and expresses their view of the subject is a bad one. At least in terms of news, a review of course is another matter since that is it's entire purpose.

      "You might not like the judgments that a reporter makes"

      A reporter shouldn't be making judgements, this is constant, most reporters do and slant the news toward what THEY believe is the truth, letting their own opinion of the matter interfere with the information they provide me to use to form MY opinion. A reporter should be a fact gather and a writer, nothing more. Gather the facts, put as much information about the subject as possible down in as concise a manner as possible SO THAT I THE READER can decide what it means, who is telling the truth and whether or not it's interesting.

    7. Re:Do you know what reporters DO? by DavidinAla · · Score: 2, Insightful

      I'm sorry, but you're SO ignorant about the way the process works that I can't begin to correct all of your misunderstandings. If you really and truly believe that it's even possible to give readers ALL of the available information every single day, you're completely unaware of how much information is out there.

      Do you want to report what is on the menu at every restaurant in town every day? What about an attendance list of who made it to school at every school in town? What about the results of every medical test at every medical provider in town? What? You say those things aren't news under normal circumstances? Well, you've just made a judgment about what should be reported. At the most elementary level -- and as simplified absurd examples -- that's the first step of what a reporter learns to do.

      There is far, far, far too much information to do what you propose. The reporter is a "gatekeeper of information," whether he likes it or not. Someone has to decide what's news and what is going to be included and what is going to be cut. SOMEONE MUST MAKES THOSE DECISIONS. Reporters and editors do it every day. You might not always agree with their decisions -- and I'm a huge critic of many decisions made in news organizations today -- but to say that nobody should be making those decisions betrays a lack of understanding of the volume of information available.

  28. Pair of phrase by Effugas · · Score: 1

    HeySubcontinent's story linkage analyzes the automatic stegoplagarization of documents written in the language derived from Britain. Expected to displace at some point journalists, these hacks presently bash with the force of a small child. Good.

  29. Someone must research a story . . . by kfg · · Score: 5, Interesting

    conduct interviews and generate original copy. These people are called reporters.

    The people who take this copy off the wire and paraphrase it for publication in the local paper are called copy writers.

    This software will reduce the number of copy writers needed, not reporters.

    This is certainly an issue to the copy writers and their families, but overall it's really just a blue collar worker being replaced by a robot issue.

    The idea of a 'style dial' I find a bit more disturbing.

    KFG

    1. Re:Someone must research a story . . . by Andy_R · · Score: 2, Funny

      You get news that isn't just a bunch of paraphrased press releases?

      Man, I gotta find the preferences checkbox for that stuff!

      --
      A pizza of radius z and thickness a has a volume of pi z z a
    2. Re:Someone must research a story . . . by Florian+Weimer · · Score: 1

      This software will reduce the number of copy writers needed, not reporters.

      Just license your content from The New York Times, and you can lay off both copy writers and reports.

      (You can use Google to watch how many online news sites republish this story.)

  30. Generation isn't that easy by Ezubaric · · Score: 4, Insightful

    The poster incorrectly assumes that this could be used to replace reporters. The problem is that computers have a difficult time generating new text. The methods that computers use to evaulate text (as any user of grammar-check would realize) aren't that great.

    In fact, most language models cannot generate even a large portion of English text. Those that do have a good range rarely have good accuracy, because there are many things that we "just don't say that way." This is why when you're talking to a non-native speaker, you often cannot explain why something they said was wrong. This is because there is no real grammar rule against speaking in a given way.

    So if we rule out syntax-based models, that just leaves statistical-based models. I worked in a NLP lab during the summer of 2002, and my prof there said that syntax and statistics are like the two sides of the force. Statistics are quick and easy but are seductive. They corrupt you and leave you unable to really think about the language itself. You only think in terms of bigrams and HMMs.

    So even though these systems are doing well, they are mostly statistical. Thus, it's hard to get incremental improvement. You have to have larger corpora, and larger corpora usually have more errors, thus defeating any advantage you might get by capturing more aspects of a language.

    In my opinion, only with well-developed language models that can effectively generate NL can we get anywhere. Which is what Barzilay is working on, but it's still a long, long, long way off.

    --

    ----------
    I am an expert in electricity. My father held the chair of applied electricity at the state prision.
    1. Re:Generation isn't that easy by ncr53c8xx · · Score: 0
      The problem is that computers have a difficult time generating new text.

      How do you figure? In TAOCP Knuth gives an example of computers that write a western play. If you can create new stories then writing a report based on certain facts is trivial.

  31. The article, summarized by MacOS X by sakusha · · Score: 4, Interesting
    MacOS X has a summarization feature implemented in the Services menu. I decided to summarize the CNet article just to see what I got, and because I like the idea of summarizing an article about summarizing.
    In the famous sketch from the TV show "Monty Python's Flying Circus," the actor John Cleese had many ways of saying a parrot was dead, among them, "This parrot is no more," "He's expired and gone to meet his maker," and "His metabolic processes are now history."
    ...The program gathers text from online news services on specific subjects, learns the characteristic patterns of sentences in these groupings and then uses those patterns to create new sentences that give equivalent information in different words.
    The researchers, Regina Barzilay, an assistant professor in the department of electrical engineering and computer science at the Massachusetts Institute of Technology, and Lillian Lee, an associate professor of computer science at Cornell University, said that while the program would not yield paraphrases as zany as those in the Monty Python sketch, it is fairly adept at rewording the flat cadences of news service prose.
    1. Re:The article, summarized by MacOS X by 2nd+Post! · · Score: 1

      I tried this on the last article that talked about summarizing; It's slightly less relevant in this article because they talk about using multiple sources to cross reference, correlate, and paraphrase rather than actually summarizing.

    2. Re:The article, summarized by MacOS X by sakusha · · Score: 1

      I cut it down to only 3 paragraphs, the shortest version that contained the lead paragraph. The longer summaries did contain that content.

  32. yr comment's a journalism integrity question... by geekpuppySEA · · Score: 2, Interesting
    ...not nec a problem to be solved by the code. Which BTW probably are a leetle more complex than small shell scripts, and see a good textbook like Jurafsky and Martin (pub 2000) for why.

    Re journalistic integrity - There's the possibility that a single entity could issue the release to the wire services, they could relase it in some kind of 'compiled' form (where it's just the syntax/semantic relations.) (How this could be different from how releases are issued now is a good question, but I guess there'd have to be reporters on hand to inquire about details... so maybe journalism might be saved after all... but not if templates for information were used, and the templates themselves needed to fill in the missing gaps...)

    You could imagine how each news outlet could receive the relase, and use their own reconstructive code to flesh out the [NP][VP]{NP] ("who did what to who"* scenario) and then write their own story from that.

    Editing scripts could decide what in the story would be details that would shine damaging light on that paper's politics, and then stuff those details in the 37th paragraph that no one reads, write a potentially-misleading headline that would allow for a reading that would tell its readers the exact slant they want to give the story, and DONE - they've printed the ostensible truth, but since few people are going to read the article, they've done their job and done it well.

    "Wait a minute, isn't that what happens now anyway?" Maybe, but now papers can save that much more on spin-sters' salaries. And then there'd be yet more English majors who can't find a job. Go capitalism, yay. *shudder*

    *it's who. not whom. No one has said whom in english for a century or so, and then only because they 'think' it's correct. Anytime I hear someone saying it for real, I shudder to think that they're so neurotic about their grammar that they use something they've been told is right but have never really heard themselves. None of my linguistics profs ever used "whom", EVER. I think they privately hate the word.

    P.S. This entire post have been wrote by a really good scripts.

    --
    Intelligent Design: because MATH is HARD.
    1. Re:yr comment's a journalism integrity question... by Anonymous Coward · · Score: 0

      P.S. This entire post have been wrote by a really good scripts. ... which apparently doesn't use spell (or grammar) check either.

  33. Hardly news... by JayJay.br · · Score: 4, Informative

    This article posted before already tells us all this, the paper that originated it was mentioned in the comments, and this one is another of a series of papers by this researcher.

    OK, nothing else to see here, move on to the next redundant post (Is that paraphrasing 'dupe'?)

  34. well... by ducomputergeek · · Score: 1

    ...that explains cowboyneal

    --
    "The problem with socialism is eventually you run out of other people's money" - Thatcher.
  35. Posting by Anonymous Coward · · Score: 0

    We can do the same with extra long /. posts.

  36. If I could really write the script to do that... by geekpuppySEA · · Score: 1

    I'd be the richest man to ever appear on "Oprah".

    --
    Intelligent Design: because MATH is HARD.
  37. for i in `echo "for one"`... by Anonymous Coward · · Score: 0

    ...if[false] then kill -9 fi our new interpreted command-line overlords.

  38. CRC? by Thinkit3 · · Score: 1

    Well parsing of course becomes your grammar checking. Spell checking is more lexical analysis--but for numbers that's just seeing what's in range.

    --
    -Libertarian secular transhumanist
  39. It won't. replace beat reporters... by geekpuppySEA · · Score: 1
    but it might replace the ones who just lip-synch the press releases that are sent to them... i.e., the White House.

    Beat reporters on the other hand might be helped, by police reports, witness accounts, etc, but not replaced. That's android territory and we hope, is maybe for the 22nd century, not the 21st.

    --
    Intelligent Design: because MATH is HARD.
  40. NY Times reporters have nothing to fear by Anonymous Coward · · Score: 0

    At least until this starts to work:

    cat /dev/random > NewNewsStory.txt.

  41. Obligatory link by Florian+Weimer · · Score: 1

    For your convenience, here's the link to the original article that requires registration.

  42. And your duty is clear... by Anonymous Coward · · Score: 0

    To build and maintain these shell scripts.

  43. The Ultimate Troll/kiddie tool. by Quirk · · Score: 0

    Trolls will rush to this technology, exploiting it for endless ways to phrase anal sex and slashdot bashing.

    --
    "Academicians are more likely to share each other's toothbrush than each other's nomenclature."
    Cohen
  44. CS Freshman Initiating Machine Traslator Project by Anonymous Coward · · Score: 0

    December 14, Z.Y. Yao, a Computer Science freshman from Fudan University in Shanghai, posted an article "Introduction to BabelCode Project" to his newly set-up project website www.babelcode.org, releasing the theoretical principles and technical specifications for his human-aided machine translation approach. The approach actually enables the human writer to directly author machine-translatable content and guarantees such content be converted to correct, natural and multilingual translation versions, automatically.

    See the English version of the Introduction at
    http://www.babelcode.org/doc/intro.htm

    or PDF:
    http://www.babelcode.org/doc/intro.pdf

    Considered the pioneer if its kind, the project has been receiving both praises and doubts in China's IT media. And Yao is currently developing three demo programs (an input module for English, an output module for Chinese and another output module for German) with his friend Zheng Shao from UIUC. They are calling for volunteer participation in this project.

  45. Bring on the Machines by DumbSwede · · Score: 4, Interesting
    I don't think many people read the article. While Michael suggest this could replace reporters, it is not about summarizing a whole article, but merely paraphrasing individual sentences and elements. This would be useful for checking for plagiarism where one author has merely line by line paraphrased another. Another useful area is in language translation, where the paraphrasing may make the translation more understandable. I don't think todays translation programs allow you to say the the same thing two or three times, but repeat it back differently (paraphrase) if not understood by your listener the first time.

    Of course the time will come when machines summarize articles, and I believe I have seen where this has already been tried with mixed success. It would be kind of neat to see /. use both a summary engine and a paraphrase engine on submitted articles. Then we could have 3 article descriptions: the posters description; a machine summary of the same article; and a machine paraphrase of the original posters summary.

  46. Paraphrase by JediDan · · Score: 2, Insightful

    Would be nice to be able to summarize + paraphrase large articles and documents. Not all of us have the necessary time to read 20+ page documents.

    It won't replace original works, but it could help reduce a lot of extraneous data on the web :)

    --
    - Dan
    1. Re:Paraphrase by arose · · Score: 1

      Open Text Summarizer may be your friend.

      --
      Analogies don't equal equalities, they are merely somewhat analogous.
  47. Computer responds by Anonymous Coward · · Score: 0
    This technology, destined to eventually replace most reporters with very small shell scripts, is thankfully still in its infancy.
    POSSIBLE RESPONSE -
    YES
    NO
    WHAT?
    PLEASE COME BACK LATER
    FUCK YOU, ASSHOLE
    JUST YOU WAIT

    Just you wait.

  48. Maybe you're not sure what linguists do... by geekpuppySEA · · Score: 2, Insightful
    Hey, don't troll this stuff out quite yet - sure it's future ware right now, but think ahead, and ... more to the point, read some about it. There's more to language and computational linguistics than you might think. Just because your (former) line of work stands to be partially replaced doesn't mean that the technology is insane.

    to wit, there are attributes of register, tone, and modality that can be applied not just to individual sentences, but to entire pieces of text that may be able to indicate a piece's slant, political tone, reading level, and (ahem) ability to incite readers to flame.

    Some of the decision making processes you're talking about that go on during editing and truth judgments admittedly will probably not be computerized. But some of them can.

    The point of the responses here are not to relegate journalism or wordsmithy (as it were) to the level of manual labor, as manual labor has been replaced by machines. But the truth is that machines are more complex now and they're ready to take on more complex tasks. Some things about language are very much NOT a mystery. Code isn't either.

    --
    Intelligent Design: because MATH is HARD.
    1. Re:Maybe you're not sure what linguists do... by DavidinAla · · Score: 2, Insightful

      Maybe you're not clear about the difference between a reporter and an editor.

      It's theoretically possible that an editor could be replaced in some instances by software, but not the reporter. The reporter doesn't have anything to start with -- no sentences for software to analyze. A reporter normally starts with some vague thing like a source in the city clerk's office telling him that some bogus expenditures are being put into the sanitation department budget for next year, but nobody really knows what's going on. It's about rumor and bits and pieces of evidence picked up almost from the wind. The reporter has to follow up on lots and lots of little wisps of nothing and figure out which ones are worth checking out and maybe writing about.

      Software cannot do that. Until there is really perfect AI software -- which I think is so unlikely as to preclude reasonable speculation for the purpose of this conversation -- reporters won't be replaced by software.

    2. Re:Maybe you're not sure what linguists do... by Saeger · · Score: 1
      Until there is really perfect AI software -- which I think is so unlikely as to preclude reasonable speculation for the purpose of this conversation -- reporters won't be replaced by software.

      Whatever helps you sleep at night I guess.

      AI and IA is an inevitability in our lifetime as long as ancient exponential trends continue on track.

      --

      --
      Power to the Peaceful
  49. CS Freshman Proposes Human-Aided MT Method by Anonymous Coward · · Score: 0

    December 14, Z.Y. Yao, a Computer Science freshman from Fudan University in Shanghai, posted an article "Introduction to BabelCode Project" to his newly set-up project website www.babelcode.org, releasing the theoretical principles and technical specifications for his human-aided machine translation approach. The approach actually enables the human writer to directly author machine-translatable content and guarantees such content be converted to correct, natural and multilingual translation versions, automatically.

    See the English version of the Introduction at
    http://www.babelcode.org/doc/intro.htm

    or PDF:
    http://www.babelcode.org/doc/intro.pdf

    Considered the pioneer if its kind, the project has been receiving both praises and doubts in China's IT media. And Yao is currently developing three demo programs (an input module for English, an output module for Chinese and another output module for German) with his friend Zheng Shao from UIUC. They are calling for volunteer participation in this project.

  50. I wonder, sir, by Anonymous Coward · · Score: 0

    are you a Negro? I get the distinct impression that you're an angry white Liberal Negro. Thoughts? Conjecture? You know, I always laugh about silly shit like this because I've read almost identical quotes during the Reagan administration, the [Theodore] Roosevelt and Wilson administrations. Even Taft and McKinley.

    1. Re:I wonder, sir, by Teh+Bungi · · Score: 0

      The fact that you use the word "Negro" so freely shows how un-enlightened you really are. Sir... go fuck yourself with a soldering iron.

  51. CS Freshman Proposes Human-Aided MT Approach by Anonymous Coward · · Score: 0

    December 14, Z.Y. Yao, a Computer Science freshman from Fudan University in Shanghai, posted an article "Introduction to BabelCode Project" to his newly set-up project website www.babelcode.org, releasing the theoretical principles and technical specifications for his human-aided machine translation approach. The approach actually enables the human writer to directly author machine-translatable content and guarantees such content be converted to correct, natural and multilingual translation versions, automatically.

    See the English version of the Introduction at
    http://www.babelcode.org/doc/intro.htm

    or PDF:
    http://www.babelcode.org/doc/intro.pdf

    Considered the pioneer if its kind, the project has been receiving both praises and doubts in China's IT media. And Yao is currently developing three demo programs (an input module for English, an output module for Chinese and another output module for German) with his friend Zheng Shao from UIUC. They are calling for volunteer participation in this project...

  52. 20 word summary by 20wordsummary · · Score: 0

    Software that can paraphrase statements in online articles is in preliminary stages of development, for applications not limited to translation.

  53. Typical /. story.. maybe they need the engine? by mattr · · Score: 4, Insightful
    Slashdot needs to implement another new editorial policy: if you have nothing intelligent or really funny/biting to say, don't! An interesting topic with a another half-assed presentation.

    Obviously this is a developing field. The best models seem to use phrases from the original text, anyway the Mac OSX example above shows that it is useful to users willing to take it with a massive grain of salt, even if we are not into full computational sentience yet.

    When it works even a little better it will replace all those awful grade school teachers who assign paraphrasing as a homework assignment. The reporters who might have been replaced by it will have already lost their jobs, except for the ones in AhaIndia of course who will paraphrase for the rest of us, usually at a marginally better level than the machine.

    The research is interesting - and I'd like to understand Barzilay's notation is that APL or calculus of statement? - in the paper (pdf) I found on google. Also see the papers on her site.

    Of course structured text is easier, and news stories are known to have most of the meat in the beginning, but this is great stuff.

    One interesting older system is ThoughtTreasure which was built to understand a story and answer questions about it. The author also did work on news analysis ("NewsForms") too. There are tools out there, I've been making a survey myself too. If anyone has information about practical NLP tools for real world tasks please post.

  54. Newsreaders != Reporters by Saeed+al-Sahaf · · Score: 0
    This technology, destined to eventually replace most reporters with very small shell scripts, is thankfully still in its infancy.

    I know you probably mean "newsreaders", the helmet-hair-headed idiots that are found in most newscasts. These are not reporters.

    --
    "Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck
  55. CS Freshman Proposes Human-Aided MT Approach by Anonymous Coward · · Score: 0

    December 14, Z.Y. Yao, a Computer Science freshman from Fudan University in Shanghai, posted an article "Introduction to BabelCode Project" to his newly set-up project website www.babelcode.org, releasing the theoretical principles and technical specifications for his human-aided machine translation approach. The approach actually enables the human writer to directly author machine-translatable content and guarantees such content be converted to correct, natural and multilingual translation versions, automatically....

    See the English version of the Introduction at
    http://www.babelcode.org/doc/intro.htm

    or PDF:
    http://www.babelcode.org/doc/intro.pdf

    Considered the pioneer if its kind, the project has been receiving both praises and doubts in China's IT media. And Yao is currently developing three demo programs (an input module for English, an output module for Chinese and another output module for German) with his friend Zheng Shao from UIUC. They are calling for volunteer participation in this project.

  56. It's unlikely to catch on... by ChunKing · · Score: 3, Insightful

    The main problem is that languages, especially English, are so idiomatic that mechanical translators will be a too much of a disadvantage - take the Babelfish translator for instance.

    Furthermore, the English language is so flexible that just about any word can arbitrarily substitute for anything else - for instance, take 'bad' meaning 'good'.

    It would be impossible to program a machine to be able to understand the full spectrum of idiomatic phrases but the future may lie in employing neural net technologies so that computers can do some limited learning.

    --
    cogito ergo sig...
    1. Re:It's unlikely to catch on... by anubi · · Score: 1
      This could be entertaining though...

      Imagine a report being fed in that all water served in a trendy restaurant required the approval of the chef.

      But the text rendered for the public view clearly states that "water served there had been personally passed by the chief cook.".

      I have heard all sorts of stories of foreign lands where concepts were almost accurately translated, with hilarious results. Maybe someone has a link to a few lists. It might be very interesting to revisit those days of humor.

      --
      "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

  57. This could be good by fdawg · · Score: 0, Offtopic

    This technology, destined to eventually replace most reporters with very small shell scripts, is thankfully still in its infancy.

    Considering how much bad news there is today, it would be nice to have a completely unbiased point of view. I dont trust CNN as they seem to be driven by politics behind the cameras as well as in front, dont trust MSNBC (or whatever its called) because you dont really know who is paying who to cover the story, and FOX is..well...FOX. Everyone has a slant, itd be nice to have something free of this. This is why I like slashdot; even if you have a slant, your story is treated like everyone elses.

    1. Re:This could be good by Anonymous Coward · · Score: 0

      And, of course, a computer-generated summarization of a group of facts collected by biased humans will magically be free of those biases because the bot that generates it is guaranteed to be free of the biases it's human creators held?
      Don't count on it, dawg. We're human. The things we do, the things we create, are going to be affected by our pre-conceived notions about things (whether those notions are about how what we create should look, or about how the people who receive them are going to perceive them and are calculated to generate a particular perception, or whatever) -- we can minimize them but we can't completely eliminate them (and probably shouldn't try to) without removing what makes them valuable in the first place. You want "unbiased" reporting? Go check it out for yourself! Think the number of people gassed in China was 10X the number China reported? Go count 'em! Think Land Rovers are just as prone to flipping-over as a Jeep? Buy one of each and drive like a maniac! Just don't be too surprised if your "unbiased" results conflict with someone else's "unbiased" results of the same "investigation"!
      Your best bet is to check out the reporting from a number of different sources and compare them for inconsistencies -- sooner or later (my bet is, if you're reasonably intelligent and you're reasonably diligent about not unduly applying your own biases to it, most likely sooner) you'll be able to figure out which sources to generally distrust and which to generally trust (and on which topics; Regis Philbin may be quite authoritative when it comes to entertainment gossip, but his accounting of tech information might not be up to the same standards...or maybe it is -- could that have been some of my bias rearing it's ugly head?!?!)

      Long story short: Unless you've got deep pockets and a lot of free time on your hands, you've got to trust somebody to "bring you the news"

    2. Re:This could be good by fdawg · · Score: 1

      Strong words for someone who posted AC. Its apparent you missed my point. Then again from your retort, I doubt you're capable of forming any determined conclusion about anything.

  58. Maybe none of are sure what Ashcroft does... by geekpuppySEA · · Score: 1
    (Rumor? Evidence picked up on the wind? Sounds like a job for the eyes and ears of the Justice Department.)

    No, no, I'm in agreement. The investigation part, insurmountably a human task. The writing, though? Probably somewhat automatable.

    Am I correct in assuming that many reporters would love it if their editors were replaced by robot masters, then?

    --
    Intelligent Design: because MATH is HARD.
    1. Re:Maybe none of are sure what Ashcroft does... by DavidinAla · · Score: 1

      Reporters would typically be happy if editors didn't exist, because they tend to believe their copy doesn't ever need ANY changes. ;-)

  59. Columbia News Blaster by Richard+Allen · · Score: 3, Informative

    I believe this was covered in a related Slashdot before regarding to this site: http://www1.cs.columbia.edu/nlp/newsblaster/

    Here is a quote from their site:
    Columbia Newsblaster is a system to automatically track the day's news. There are no human editors involved -- everything you see on the main page is generated automatically, drawing on the sources listed on the left side of the screen.

    Every night, the system crawls a series of Web sites, downloads articles, groups them together into "clusters" about the same topic, and summarizes each cluster. The end result is a Web page that gives you a sense of what the major stories of the day are, so you don't have to visit the pages of dozens of publications.

    Newsblaster is an academic project from the Natural Language Processing group at Columbia University's Department of Computer Science. It is designed to demonstrate the Group's technologies for multidocument summarization, clustering, and text categorization, among others. It is funded under DARPA TIDES and KDD and has been operational online since September 2001.

    Current and future enhancements include international perspectives, multilingual capability, and tracking events across days.

    1. Re:Columbia News Blaster by Richard+Allen · · Score: 1

      Yes, here is the link:

      http://slashdot.org/article.pl?sid=02/03/15/155524 2&mode=thread&tid=149

    2. Re:Columbia News Blaster by Frisky070802 · · Score: 1

      Yeah, I saw a demo of this, and was pretty impressed, though eventually I decided that paraphrasing wasn't nearly as interesting as simply identifying the big news... so I went back to Google News. Now I just read CNN, the New York Times (print), and Slashdot, and I figure that between the three, everything's covered.

      --
      Mencken had it right. So glad that's old news.
  60. CS Freshman Proposes Human-Aided MT Approach by Anonymous Coward · · Score: 0

    December 14, Z.Y. Yao, a Computer Science freshman from Fudan University in Shanghai, posted an article "Introduction to BabelCode Project" to his newly set-up project website www.babelcode.org, releasing the theoretical principles and technical specifications for his human-aided machine translation approach. The approach actually enables the human writer to directly author machine-translatable content and guarantees such content be converted to correct, natural and multilingual translation versions, automatically.

    See the English version of the Introduction at
    http://www.babelcode.org/doc/intro.htm

    or PDF:
    http://www.babelcode.org/doc/intro.pdf

    Considered the pioneer if its kind, the project has been receiving both praises and doubts in China's IT media. And Yao is currently developing three demo programs (an input module for English, an output module for Chinese and another output module for German) with his friend Zheng Shao from UIUC. They are calling for volunteer participation in this project.

    ###

  61. Re:bbc_news.sh by tealover · · Score: 1

    #!/bin/sh curl $1 > paraphrase > slant -liberal -mindless > bbc_news_story.txt

    --
    -- You see, there would be these conclusions that you could jump to
  62. CS Freshman Proposes Human-Aided MT Method by Anonymous Coward · · Score: 0

    December 14, Z.Y. Yao, a Computer Science freshman from Fudan University in Shanghai, posted an article "Introduction to BabelCode Project" to his newly set-up project website www.babelcode.org, releasing the theoretical principles and technical specifications for his human-aided machine translation approach. The approach actually enables the human writer to directly author machine-translatable content and guarantees such content be converted to correct, natural and multilingual translation versions, automatically.

    See the English version of the Introduction at
    http://www.babelcode.org/doc/intro.htm

    or PDF:
    http://www.babelcode.org/doc/intro.pdf

    Considered the pioneer if its kind, the project has been receiving both praises and doubts in China's IT media. And Yao is currently developing three demo programs (an input module for English, an output module for Chinese and another output module for German) with his friend Zheng Shao from UIUC. They are calling for volunteer participation in this project.

    ---

  63. CS Freshman Proposes Human-Aided MT Approach by Anonymous Coward · · Score: 0

    December 14, Z.Y. Yao, a Computer Science freshman from Fudan University in Shanghai, posted an article "Introduction to BabelCode Project" to his newly set-up project website www.babelcode.org, releasing the theoretical principles and technical specifications for his human-aided machine translation approach. The approach actually enables the human writer to directly author machine-translatable content and guarantees such content be converted to correct, natural and multilingual translation versions, automatically.

    See the English version of the Introduction at
    http://www.babelcode.org/doc/intro.htm

    or PDF:
    http://www.babelcode.org/doc/intro.pdf

    Considered the pioneer if its kind, the project has been receiving both praises and doubts in China's IT media. And Yao is currently developing three demo programs (an input module for English, an output module for Chinese and another output module for German) with his friend Zheng Shao from UIUC. They are calling for volunteer participation in this project.

    +++++

  64. Creativity by dacarr · · Score: 1

    Silliness aside about an Apple 2 being able to gather the news for us and feed it, the thing about wordsmithery is that there is a certain amount of creativity that needs to go into it. Otherwise you have the literary equivalent of the Backstreet Boys and such. Not a good mix.

    --
    This sig no verb.
  65. I'm a sexist, so what? by Lord+Kano · · Score: 2, Funny

    From the article:
    The researchers, Regina Barzilay, an assistant professor in the department of electrical engineering and computer science at the Massachusetts Institute of Technology, and Lillian Lee, an associate professor of computer science at Cornell University, said that while the program would not yield paraphrases as zany as those in the Monty Python sketch, it is fairly adept at rewording the flat cadences of news service prose.

    Two women came up with this! Why doesn't it surprise me in the least that women are officially researching ways to automate the process of saying the exact same thing in an infinite number of different ways?

    LK

    --
    "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
  66. The reporter comment was a joke by Anonymous Coward · · Score: 0

    It was apparantly a reference to a t-shirt on thinkgeek (see the fp). I try to give the editors the benefit of the doubt before railing on them.

  67. THANK GOD by the_2nd_coming · · Score: 1

    now maybe we can get some real news that covers all sides...oh wait...there will still be Editors..DAMN

    --



    I am the Alpha and the Omega-3
  68. Hollywood by wampus · · Score: 1

    Now, correct me if I am wrong, but hasn't Hollywood beem using this system for some time now? If a movie isn't a direct rip of something that was made in the past, then it takes familiar characters and tosses them in a blender with a dash of CG effects and frappes until smooth.

    Television uses this system, too. The formula there seems to also involve borrowing a successful British TV show's concept, just to keep things a little fresher.

  69. an ambiguous language? by Thinkit3 · · Score: 1

    You're confusing the issue. No programming language is ambiguous. The English language is ambiguous. Using "integer" instead of "int" isn't what I'm talking about.

    --
    -Libertarian secular transhumanist
    1. Re:an ambiguous language? by Anonymous Coward · · Score: 0

      What about Java.

      System.out.println( "Age: " + 12 );

      vs

      System.out.println( 2 + 12 );

      In the first sentence, + means "concatenate", while in the second sententce, it means "add".

  70. you really are only cheating yourself by themusicgod1 · · Score: 1

    sure.

    ok...i have never cheated. ever.

    i'm now on academic probation and fear that this may be my last semester of university. sure, i could be smarter(my iq is only around 120), and i could go on provigil, ritalin or speed to sleep even less...but i've really allready been studying 12-16 hours every day...and the 65 average is just barely within my reach.

    why am i complaining here? because the program i am in is an _easy_ one... in my university you need a 85% average to just stay in the electrical engineer program... i could never do that and this university is actually one of the most lenient in the contry...

    in other words...when the differece is between a 4.9 gpa and having my marks...id suggest for anyone else to cheat, if they think that they can pull it off.

    its evolution baby. survival of the fittest. and i can tell you right now that i am not the fittest...and if you were to cheat, and me not...this gives you that much more of an edge that you could survive that much longer on... get good marks, succeed, at any cost short of not learning anything(after all, knowledge is power)

    --
    GENERATION 26: The first time you see this, copy it into your sig on any forum and add 1 to the generation.
    1. Re:you really are only cheating yourself by KrispyKringle · · Score: 1
      OK, so you're going to argue that there is no morality, that we all have free reign to do what we must to get ahead? I can't argue this position; if you don't feel that there is some imperative to do what is right--say, the greatest good for the greatest number--I certainly can't convince you otherwise. Or perhaps you don't feel that cheating harms anyone (and you may well be right; I acknowledged that most of those who cheated never took anything away from me, those conceivably they did take things away from some students who didn't cheat but also underachieved).

      But I doubt you truly feel this way, or else you yourself would cheat; it's not really that hard (especially in a major like EE where there really isn't enough variety in the answers to homework and tests that your distinct writing style stands out).

      I sympathize with your predicament (and I admit my grades sunk quite a bit from what I was used to when I went to college), but I stand by what I said. Cheating, of the teen-age sort we talk about most frequently, doesn't get you anywhere. However, expanding the argument, we might conclude that there are serious issues of cheating (a number of famous historians have been discovered to be plagiarists recently; a number of corporate executives have been discovered to be completely falsifying financial and performance related information recently; etc) that are largely irrelevant to technology-based fixes like automatic plagiarism detection.

      Anywho, good luck in school. And really, a 120 IQ doesn't sound all that bad (isn't 100 average?). If there's anything I can say, it's that your goals may not match those you are trying to achieve, anyway. I know that I wouldn't particularly enjoy my success if I simply cheated to get there. And money, as Pink Floyd sang, isn't all it's cracked up to be anyway.

  71. Markov word chains by austad · · Score: 1

    This reminds me of when I used to work for Marketwatch.com. I was bored one day, and wrote a program called ThomCalandra.pl. It used Markov word chains to generate new news stories based on Thom Calandra's previous articles and new news stories coming in. Just glancing over the story, it looked legit, but then it would say things like "Deutch Bank filed for bankruptcy" and things like that which were totally false. It was entertaining though, especially when you fed it a mixture of articles by Thom Calandra, and Smoove B or Herbert Cornfeld from the Onion. Then you'd get things like:

    "Baby, you are so beautiful, I'm going to make sweet love to you all night long, and the Euro is increasing in value against the dollar."

    or

    "The muthafucking DJIA has dropped another 10%, and those bitches from accountz recievable are responsible."

    --
    Need Free Juniper/NetScreen Support? JuniperForum
  72. US NEWS NETWORKS ALREADY DO THIS by Anonymous Coward · · Score: 0

    Here's the US news agency formula :

    1) Have reporter attend whitehouse briefing

    2) Take notes

    3) Regurgitate content to news anchor in live shot

    4) NEVER ask any questions that would challenge the message of the day

  73. Obligatory Simpsons Quote by disntrstd · · Score: 1

    Automated Smokey The Bear: "Only who can prevent forest fires?" Bart: *presses the "You" button* Automated Smokey The Bear: "You have pressed 'You,' referring to me. That is incorrect. The correct answer is 'You""

  74. Relief at last? by Tandoori+Haggis · · Score: 1

    Couldn't come sooner! I'm tired of those damn "don't I look wonderfull with my new outfit today" no-brain news presenters. BBC News 24 interviewing an astronomer ref Beagle 2 introduced him as an ASTROLOGER! For F***s Sakes!!! There we have it folks. Theres been a failure to communicate 'cause Beagle 2's birthsign coincides with Rus*el Gr*nts bad dose of piles! Bring on the automatic presenters and lets dispense with those vain overpaid faggots. " Theres been a quake in Irqustanoble, er does my bum look big in this?" Typical reports contain 1% substance 3% hype chasing spin and 96% filler - hmm what OS does that remind me of?

    --
    My hyperlinks aren't worth the paper they're printed on.
  75. Old news! by Alsee · · Score: 1

    Slashdot has been using this system to generate its articles for a while now. Obviously it's still loaded with bugs.

    -

    --
    - - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
  76. Obligatory Thinkgeek link by InsomniaCity · · Score: 0

    Go away before I, replace you with a very small shell script!

    http://www.thinkgeek.com/tshirts/frustrations/374d /

    --
    You cant make anything foolproof, they'll only invent better fools.
  77. Umm; by monomania · · Score: 1
    "...destined to eventually replace most reporters with very small shell scripts..."

    Shouldn't that read "very small shill scripts"?

  78. Interlingua, or Lojban? by tepples · · Score: 1

    Interlingua, being tightly defined

    By "Interlingua" do you actually mean the Interlingua of IALA, or do you refer to "Lojban", a more precise interlanguage defined specifically for machine comprehension?

    1. Re:Interlingua, or Lojban? by Steve+Franklin · · Score: 1

      Yes, I was specifically referring to IALA Interlingua. And no, Interlingua is not as tightly defined as Lojban (Loglan). But Lojban is fairly difficult to learn and is not instantly recognizable. Perhaps a good compromise would be the Interlingue (formerly Occidental) of the Interlingue-Union. This is similar to Interlingua but derives its vocabulary more precisely, as does Esperanto for that manner. The advantage of an interlanguage that is easily recognizable (at least to speakers of European languages--most Europeans, Australians, North and South Americans, many Indians, and a good part of Anglophone and Francophone Africans) is that once it occurs to everybody that all this international babble doesn't make much sense in the 3rd Millennium AD, the solution is obvious and right at hand.

      --
      Hic iacet Arthurus, rex quondam rexque futurus.
  79. This is news? by Anonymous Coward · · Score: 0

    This is news? We're supposed to get worked up about what two ding-bat women do, once they're finished playing leap-frog in graduate school? Where's the practical application? Do these idiots even have to provide such a justification before they get REAMS of funds to do their silly things - funds that often come out of TAXPAYERS's pockets?

    Who puts these stories on Slashdot anyhow? Are Malda and Hemos on holidays?

  80. ah the possibilities... by AngryShroom · · Score: 1

    Term paper obfuscation! Technology at its finest.

    --
    "The greatest tragedy in mankind's entire history may be the hijacking of morality by religion." - Arthur C. Clarke
    1. Re:ah the possibilities... by Anonymous Coward · · Score: 0

      I'll be happy when technology will stop people from posting exactly the same sort of crap that 100 people have already said on a Slashdot story....

    2. Re:ah the possibilities... by AngryShroom · · Score: 1

      And you read 100% of all posts before posting right? People who think they have some kind of ownership or special bond with slashdot really need to get a life. The system is designed to filter out the crap. If I post redundant crap so be it. Why are you reading score 1 posts anyway?

      --
      "The greatest tragedy in mankind's entire history may be the hijacking of morality by religion." - Arthur C. Clarke
  81. Wake me up when... by Megane · · Score: 2, Funny

    ...when we can replace upper level management with small shell scripts.

    --
    #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
  82. Not news... by Jayfar · · Score: 1

    Autonomy has been been offering similar products for 4 or 5 years now and, IIRC, they have a number of Fortune 500 companies among their customers. Their work is based on Baysian algorithms. They used to have a free desktop app incorporating some of this technology, but withdrew that a few years ago. I'd suspect they probably have patents out the wazoo on this stuff, which might come back to bite others using Baysian techniques.

    http://autonomy.com/Content/Press/FactSheet

    "Autonomy Corporation was founded in 1996 by Dr. Michael Lynch using a proprietary pattern matching technology that was the result of Cambridge University research on the probability theorems of an 18th century mathematician, Reverend Thomas Bayes."

  83. Word Summary by cornjones · · Score: 1

    Hey, I went and played w/ this feature of word. Here is the summary of the article. hmmm... maybe if we set up an auto summary more people would RTFS?

    Anyway, here it is:
    Now, computers can play along

    Computers can't do nearly that well at paraphrasing. Now, using several methods, including statistical techniques borrowed from gene analysis, two researchers have created a program that can automatically generate paraphrases of English sentences.
    The program gathers text from online news services on specific subjects, learns the characteristic patterns of sentences in these groupings and then uses those patterns to create new sentences that give equivalent information in different words.
    Then the computer sought clusters of sentences that had similar words or phrases.
    Testing for statistical evidence that expressions were paraphrases, the system compiled templates or patterns that formed the backbones of equivalent sentences. Barzilay said the system did well at paraphrasing short articles but bogged down as the articles grew longer and the text more idiosyncratic. When the researchers tested their paraphrasing rules by using articles on violence in the Middle East published after they had developed their system, the program was able to paraphrase 61 percent of the sentences in articles with 10 or fewer sentences. Fernando Pereira, chairman of the computer and information science department at the University of Pennsylvania, said the authors were wise to focus on news articles. This type of information seeds a database of paraphrases that can then be used to generate new sentences. The paraphrasing program might one day be useful in machine translation, said Kevin Knight, a senior research scientist at the Information Sciences Institute of the University of Southern California. Pereira said that the paraphrasing work had given him pause.

  84. less reporters is a good thing by Anonymous Coward · · Score: 0

    just let the vast majority that bias their article in one way or another be offshored to india also.

  85. Re:bbc_news.sh by Bombcar · · Score: 1

    #!/bin/sh curl $1 | paraphrase | mispel -slashdot | slant -page-hits -linux | troll --max-troll=50 > slashdot_story.txt

  86. *WHEW* - Still in infancy by ToadMan8 · · Score: 1

    Boy, I'm glad that computers don't have their (hands?) in reporting news; it'd be terrible to get rid of all that slant in the media this way and that. I mean who wants fair, equitable stories?! You read the NYT to ra ra for the Bleeding Heart shit, or if you're a heartless republican the Journal is for you. Now how would they sell if they just told the facts as they were and left interpretation up to the readers?!

    Well, at least Slashdot will always be biased, thank god for that.

    --
    I haven't posted in so long, my sig is out of date.
  87. Think of the uses for assignments... by electrichamster · · Score: 1

    As far as I can see, this system is best at rewording things, and the best place to use that ability would be in writing essays.

    Want to plagiarise something without getting caught? Feed it through the handy-dandy-rewordatron and you're away.
    Hell you could feed an entire essay through the damn thing.

    It's a scary thought, but one that I as a lazy student quite like the idea of ;)

  88. AS OPPOSED TO THE BBC? by Anonymous Coward · · Score: 0

    Or maybe we should all just settle for Al-Jezeera or those French faggots that write "cannon fodder" on their gay-wad photographer vests.

  89. prove it by Anonymous Coward · · Score: 0

    you conservative ultra-rights make me puke.

  90. very small shell script paraphasing by grikdog · · Score: 1

    I'll bet it's not as cute as Linda Vester, though.

    --
    ``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_
  91. This is kent brockman reporting by loosewing · · Score: 0

    This is kent brockman, you may remember I used to report live on how the little guy lost his job to technology and how I appeared to care.

    Well today its my turn to lose my job to small scripts.

    This reporter DOES NOT welcome our new script overlords.

    [and now back to weather with wendy]

  92. I used the OS X Summarize function... by constantnormal · · Score: 1
    ... (in the Services menu) to summarize the referenced article:
    The program gathers text from online news services on specific subjects, learns the characteristic patterns of sentences in these groupings and then uses those patterns to create new sentences that give equivalent information in different words.

    The researchers, Regina Barzilay, an assistant professor in the department of electrical engineering and computer science at the Massachusetts Institute of Technology, and Lillian Lee, an associate professor of computer science at Cornell University, said that while the program would not yield paraphrases as zany as those in the Monty Python sketch, it is fairly adept at rewording the flat cadences of news service prose. Give it a sentence like "The surprise bombing injured 20 people, 5 of them seriously," Barzilay said, and it can match it to equivalent patterns in its databank and then produce a handful of paraphrases. For instance, it might come up with "Twenty people were wounded in the explosion, among them five in serious condition."


    The OS X feature appears to operate not so much by restating in different words, but by identifying redundancies and eliminating them. It works very well.

    It appears to me that it's editors that are in jeopardy, not reporters.

  93. Not ambiguous. by Thinkit3 · · Score: 1

    The type of the first argument will tell you exactly what the "+" will do. The English problem is best represented by "right". Get in the right lane. There is no way to know whether to get in the lane opposite the left, or choose the correct lane.

    --
    -Libertarian secular transhumanist
  94. It's globalisation (Re:Rethink English !) by sadiklis · · Score: 1
    Esperanto was interesting and look where it got. Nowhere. People will speak in what's easiest.

    Wrong.

    People learn those languages that make the biggest amount of valuable information available to them.

    English is my third language by acquisition while clearly the first one in terms of importance now.

    And i find it to be the weirdest of the three.

    For example, the spelling system is just silly. For example, why are there five ways to write "k" (click, kick, suck, schedule, iraq)?

    And while we are at it, why not use "f.e." instead of "e.g."?

    And why such an irrelevant six-billionth of the whole deserves to be honored by the capital leter ("I")?

    I kould go on but i'm a bit busy katching up with my skedule. And BTW i like the spelling reform that KDE krowd is doing ;-)

    Rest assured that the above sentence reads much weirder to me than the "proper" english spelling but that's because i've got used to it during the years of usage. I remember my first English (BTW isn't it about time to relabel it "Earthish"?) classes... i was outraged when introduced to the weirdness of that spelling nonsence and it took years to cool down and get used to it.

    In case you are wondering, my languages are (by sequence i've learned them): 1. Lithuanian (so called "native" one (i'd just label it "local")), 2. Russian (i used to be in Soviet Empires' claws), 3. English (computers, internet, huge amount of books being published, etc... plenty of reasons but certainly not the quality of the language).

    And the bottom line is: I'm ready to "waste" yet another huge amount of time learning one more language artificial this time if i see the long term benefits ("long term" stands for - centuries and millenias not just years) of an introduction of such a language globally. But for now i'd just like to see the whole EU switching to English as a primary means of communication (i find talks about "EU-wide information society" to be silly without an idea of the common communications protocol (language) being promoted as well. f.e. - how do you imagine a US-like-dynamic EU-wide flow of (skilled) workforce if there is no common language being used in workplace?)

    1. Re:It's globalisation (Re:Rethink English !) by TwistedSquare · · Score: 1
      For example, the spelling system is just silly. For example, why are there five ways to write "k" (click, kick, suck, schedule, iraq)?

      And while we are at it, why not use "f.e." instead of "e.g."?

      And why such an irrelevant six-billionth of the whole deserves to be honored by the capital leter ("I")?

      I imagine almost every language has redundancy in it's alphabet. In French for example, ce and se would be pronounced the same. Over time this will probably be reduced, as I said, iterative refinement is the process used inherently in languages over the generations.

      I think a little bit of history is not out of place in lanuages. English was strongly affected by Latin (the alphabet for a start!) so why not keep a couple of Latin abbreviations in. i.e. and e.g. are not hard to remember once you learn them and given their (incredibly) common use.

      Finally, I (Je/Ich/whatever) is pretty much the most important concept in a lanuage as it is one of the most frequently used words. So I don't see having it capitalised as being that odd...

  95. English quirks by sadiklis · · Score: 1
    I imagine almost every language has redundancy in it's alphabet.

    Some silly redundancy? Yes. But the amount of it in English is just crazy in comparison to the other two languages i know (lt and ru).

    I think a little bit of history is not out of place in lanuages. English was strongly affected by Latin (the alphabet for a start!) so why not keep a couple of Latin abbreviations in. i.e. and e.g. are not hard to remember once you learn them and given their (incredibly) common use.

    I think it's bad when history stands in a way of logic and convenience. I just wanted to point out that wasting time learning some extra quirk of a language is a negative rather than positive thing. If i want to abbreviate "for example" i'm incilned to write it as "f.e." and that's it. One more example of such an annoyance is a.m./p.m.

    Finally, I (Je/Ich/whatever) is pretty much the most important concept in a lanuage as it is one of the most frequently used words. So I don't see having it capitalised as being that odd...

    Just for a record: neither lt nor ru capitalize "I".

    Anyway this "I" and "e.g." thing is pretty irrelevant in contrast to the spelling madness.

  96. But I doubt you truly feel this way by themusicgod1 · · Score: 1

    "there is no morality,"
    or perhaps there are other things in the world than a human idea by that very name...and that it may not be the most important or at all important. i don't think i even have to go that far, however. wouldn't a sheer "i will not impose my [elitist] morality on you, feel free to cheat if you think you can get away with it" work?
    "free reign to do what we must to get ahead?"
    i think the idea of freedom is a hack and fraud, but this may be besides the point.
    "Or perhaps you don't feel that cheating harms anyone"
    lots of things carry risk, this one in specific carries risks both to the person involved, the community(as the cheater may not know needed information, and pseudoinnocent bystanders(students like me)... but i expect a level of risk:productivity assessment from everyone, or at least encourage it.

    "said. Cheating, of the teen-age sort we talk about most frequently, doesn't get you anywhere. " i think 0.5% made the difference between 20,000$ worth of scholarships and nothing in scholarships for a student 2 years ago...(2 students had high 90s averages. one hit the jackpot multiple times, the other... is treated like a normal everyday student) i actually liked him much better, was less involved in football than in math and Computers...but the both of them were doing so many things on so many fronts, getting extreemly high marks, being way over involved, volunteering way too much, etc...its too bad there had to be a loser in that case. i at least hope the second guy got into university(with his marks i can see it).

    (isn't 100 average?) yes

    while money isn't all its cracked up to be, starvation/poverty sucks.
    Anywho, good luck in school thanks. semester starts on the 5th... i have a lot of ground to cover, although i'm starting to get that neo-at-the-end-of-the-first-matrix-flick-feeling dealing with this pre-calc math that ive kept myself busy with.

    --
    GENERATION 26: The first time you see this, copy it into your sig on any forum and add 1 to the generation.