Slashdot Mirror


An App to Boil Down Online User Reviews

An anonymous reader writes "Is this a glimpse at the future of the Semantic Web? A new startup named Pluribo has developed a technology that can auto-summarize user reviews on the internet. It is a Firefox extension that can take a webpage filled with reviews and condense it down into a couple of sentences. Currently, it just works with Amazon electronics, but the potential seems incredible. Ars Technica took an in-depth look."

82 comments

  1. Quick. by Slashdot+Suxxors · · Score: 5, Funny

    Somebody fix this so it work's on /. Maybe then I'll RTFA.

    1. Re:Quick. by Anonymous Coward · · Score: 0

      A new startup named Pluribo has developed a technology that can auto-summarize user reviews on the internet.

      Awesome! I could really use this because my attention span is so sh....I don't feel like finishing this comment anymore. Bye.

  2. First Post by Harmonious+Botch · · Score: 5, Funny

    I for one welcome our hot grits pouring overlords.

    Imagine a Beowulf cluster of Al Gores in Soviet Russia releasing Duke Nukem Forever, you insensitive clod!

    In Korea, only old Natalie Portman must be new here.

    All your Linux are belong to us Sharks with frickin' laser beams attached to our heads.

    1)Stephen King is dead

    2)BSD is dying

    3)Profit!

    There. Fixed that for you.

    1. Re:First Post by svank · · Score: 1

      1)Stephen King is dead

      2)BSD is dying

      3)???

      4)Profit!

      There. Fixed that for you.

    2. Re:First Post by echolock · · Score: 1

      Imagine a Beowulf cluster of Al Gores in Soviet Russia releasing Duke Nukem Forever, you insensitive clod!

      Yes, but does it run Linux?

      (Sorry) :)

  3. Here, let me test by Anonymous Coward · · Score: 4, Funny

    Enter> The Wicker Man, 2006

    Result: "Sucks monkey balls"

    Hey, it really does work!

    1. Re:Here, let me test by xstonedogx · · Score: 1

      Enter> Anything with Nicolas Cage

      Result: "Sucks monkey balls"

      Hey, works on more general queries too!

  4. How about a comment synopsis generator by Falstius · · Score: 1

    Can we apply this to Slashdot comments? Please? Please? Pretty Please??
    I'd love to get a one page summary of all the informative, insightful and interesting comments.

    1. Re:How about a comment synopsis generator by Harmonious+Botch · · Score: 1

      I tried that. See 'first post' above. It got modded flamebait.

    2. Re:How about a comment synopsis generator by Rob+Kaper · · Score: 4, Funny

      I'd love to get a one page summary of all the informative, insightful and interesting comments.

      [url=http://slashdot.org/~Rob+Kaper]Here you go[/url].

    3. Re:How about a comment synopsis generator by Rob+Kaper · · Score: 4, Funny

      Joke's on me, this time, I guess. *sigh*

    4. Re:How about a comment synopsis generator by Anonymous Coward · · Score: 0

      Okay, here goes:

      Microsoft sucks! Linux^HGNU/Linux is so awesome, I want Linus Torvalds and Richard Stallman to have my babies. OSS ^H FOSS ^H FLOSS ^H OMGWTFROTFLMAOSS is always better than the proprietary alternative, regardless of reality. Why don't you RTFA? In Soviet Russia, article reads you! The Republicans suck. The Democrats suck. McCain sucks. Obama sucks. The RIAA sucks. The MPAA sucks. You suck. Your opinions, ideas, and family are all horrible, stupid, and idiotic because I disagree with you, and obviously my opinion is better because I regard myself as an intellectual, well-read person, even though I never even read the article in the first place and am just spewing bull out of my ass. Why don't you RTFA? In Soviet Russia, article reads you! Imagine a Beowulf cluster of Soviet Russias! In Soviet Russia, Soviet Russias Cluster Beowulfs! Hey, look, another Twitter sockpuppet account! Ruby is better than Java because you use Java (see a few sentences above for my reasoning). But can Rails run Crysis? In Soviet Russia, Crysis runs on YOU! 1. Run Crysis 2. ??? 3. Profit! Why don't you RTFA? ..........

      fr1st p0st

    5. Re:How about a comment synopsis generator by SeaFox · · Score: 2, Funny

      You could just browse at 4 and apply an extra -2 modifier to all Funny comments.

    6. Re:How about a comment synopsis generator by xant · · Score: 4, Informative

      Man, you guys really don't know about Alterslash yet?

      --
      It's rare that you're presented with a knob whose only two positions are Make History and Flee Your Glorious Destiny.
    7. Re:How about a comment synopsis generator by crenshawsgc · · Score: 1

      That's what I do. Unfortunately, that means "funny" posts not even "funny" enough to get upmodded once, end up cluttering my views.

    8. Re:How about a comment synopsis generator by Dishevel · · Score: 1

      Try not reading at -1? That might help.

      --
      Why is it so hard to only have politicians for a few years, then have them go away?
    9. Re:How about a comment synopsis generator by Anonymous Coward · · Score: 0

      Cool site, but an unfortunate name. I have grisly visions of nerd fetuses... or maybe that's just the recent Ctrl-Alt-Del dross...?

    10. Re:How about a comment synopsis generator by Anonymous Coward · · Score: 0

      this has been done by http://www.reviewgist.com/

  5. One problem by Rob+Kaper · · Score: 4, Informative

    The application seems to assume that the best summary is the one with the most correlation to the other posts, in other words: the most common viewpoint. While that may work fine for user reviews, in most cases the viewpoint of the masses is usually not the best.

    1. Re:One problem by spoco2 · · Score: 1

      But seeing as though it is specifically targeted at user reviews, and currently just Amazon Electronic ones, it would seem to be a great way to summarize the lot.

      I mean, what do you do when you're looking at user reviews of products? Look for what the common threads are, or focus on a few outliers?

    2. Re:One problem by Vectronic · · Score: 4, Insightful

      ...or focus on a few outliers?

      Yes, probably more than half the time actually, because 90% of the reviews are about as accurate as Slashdot comments, or as accurate as that percentage I just made up.

      When it comes to hardware, I may work a bit better, however with the diversity of hardware that exists, that one comment that you missed saying "but don't EVER purchase this and install it if you have an [Insert Product]!" might be exactly what you need to know and the rest is just fluff. The same goes for a lot of software, and you might miss out on that "great find" by that one guy that said "its ok, but it's less efficient/configurable/easy and more glitchy/resource hog than [Product X]"

    3. Re:One problem by billcopc · · Score: 4, Insightful

      Precisely: I like reading about the edge cases, because Joe Random is a freaking moron when it comes to electronics.

      The whole "me too" effect is a huge part of today's marketing. That's why everyone and their mother wants a freaking iPod/iPhone.

      --
      -Billco, Fnarg.com
    4. Re:One problem by aztracker1 · · Score: 1

      That's why I tend to actually read the comments on newegg. Some of the lower feedbacks are usually from ranting idiots, and bring things down.. this can really be the case when there are only a handful of reviews, and one asshat didn't know that you have to pay for XM as a subscription, beyond buying the receiver... (actually, my favorite sidebar app, the lower ranked XM one was ranked down because people didn't realize it only worked for people with xm subscriptions/accounts)...

      Actually, I really just wish more sites had meta-modding for reviews.. that would kick ass.

      --
      Michael J. Ryan - tracker1.info
    5. Re:One problem by linzeal · · Score: 3, Interesting

      That is why geology geeks use different phones that may not be all fancy and la dee da but they can be dropped from moving vehicles, be submerged in water/beer and still work to call into town for more supplies/beer. I do not understand the fascination with cell phone internet access. The majority of use I see for it is people in bars looking up statistics to win arguments or endless texting and picture sending from giggling girls. I have a GPS navigation system in the car as well as a laptop for full blown internet access when I need it. 95% of the time I have no desire or reason to be online when I am away from home as I have 10's of thousands of textbooks and reference books that I can use on my ebook reader. Thank you cheap SD cards.

    6. Re:One problem by Phurge · · Score: 1

      so you have a gps, a laptop, an ebook reader and a phone. Do they all fit into your pocket at the same time? (and still have room for a camera & mp3 player)

      --
      I'll see your hokum and raise you a boondoggle.
    7. Re:One problem by linzeal · · Score: 1

      No they fit in my pack though. I usually backpack with a PV charger, a phone and an ebook reader. About 2k books including herbals, astronomy and trail guides. If I am going in the city I go with a phone, an ebook reader and a moleskin notebook. I have never found the ability to use the internet outside the home, school or job useful in my studies, job or life. People are far too connected and produce too little original work.

    8. Re:One problem by Anonymous Coward · · Score: 0

      "in most cases the viewpoint of the masses is usually not the best."
      Yes. Look at Digg.com, for instance.

  6. I approved it. by jez9999 · · Score: 3, Informative

    I approvied the Pluribo extension for being pushed to public. All I can say is that it gets most of its data from the Pluribo server, and does very little client-cide besides display the data. At the moment it is *extremely* limited, literally to a handful of products. Apparently their system doesn't do the scanning of comments automatically otherwise it would work for everything, whereas it actually has to query the Pluribo server to get results.

    1. Re:I approved it. by drinkypoo · · Score: 2, Insightful

      Well, you're not going to download a thousand product reviews in a couple of seconds, let alone summarize them. On the other hand, I would love to have this kind of functionality available on my own website, perhaps through a drupal module - a comment summary block would be dandy. Heck, you could even have summaries by tag - just summarize all the content tagged with "sco" for example, and find out that they are litigious bastards and that their unix sucked. I wonder if anyone is working on a Free/Open engine for this kind of thing.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    2. Re:I approved it. by jorgevillalobos · · Score: 1

      Pft... wake me up when it can play free music.

      I have no shame.

  7. Okay, now here's a request: by Penguinisto · · Score: 4, Interesting

    ...is there any way to have it filter out the obvious astroturfers and trolls?

    Seriously, any big-name product or service will have a coterie of fanboys (or paid astroturfers) who will praise something no matter what, and a flock of trolls who will point out everything wrong with it, no matter what.

    ...now how do you filter those out?

    Do that, and it'd be one hell of an advancement in filtering. :)

    /P

    --
    Quo usque tandem abutere, Nimbus, patientia nostra?
    1. Re:Okay, now here's a request: by ken4516 · · Score: 1

      Not sure if it takes the rating of the review into account when filtering, but what you are talking about may be possible through this. If few people find the review helpful then it is weighted less or thrown away. Finding astroturfers would be harder since they might now be reflected as such in the review ratings.

    2. Re:Okay, now here's a request: by Dachannien · · Score: 4, Interesting

      I was going to post something similar. I was apartment shopping earlier this year, and the amount of astroturfing and astrotrolling* was incredible.

      Any filter that decreases the amount of information that I can use to evaluate the "truthiness" of a review is a bad thing. What's more, if filters like this catch on, people will be selling FEO (filter engine optimization) services to game the filters with their astroturf, and then the reviews will become completely useless.

      * In case I just made up a word, what I mean by "astrotrolling" is people who post shit about a product to get people not to buy it because they have a separate axe to grind against the seller. In the case of apartments, it's often poor tenants who tore up the apartment/broke the lease/got evicted and still amazingly expected their security back.

    3. Re:Okay, now here's a request: by fuzzlost · · Score: 3, Insightful

      Hello Farenheit 451. At least, that's what it sounds like to me. I don't know the exact quote, but it goes along the lines of; "There were so many novels, soon people just began reading the Reader's Digest version, just a snippet of all the classics. Soon that became too much, and so all that was left was flashy magazine articles." If someone can find the exact quote, that would be great.

      That is what this article seems to me. It seems that people don't want to invest the time to actually learn or research something, and so they'd prefer the condensed version, the sound bytes, if you will.

    4. Re:Okay, now here's a request: by Penguinisto · · Score: 1

      Err, what does online reviews filtering have to do with reading a book?

      Hell, I pretty much do this unconsciously anyway when evaluating something, for things that I am familiar with (e.g. computer parts, electronics, etc).

      Also - last I checked, Powell's Books hasn't gone out of business or become a glorified magazine stand. Folks on the train haven't stopped reading books and novels (of an amazing variety judging by last night's commute home).

      It's a question of priorities - I have zero problems with curling up and reading a nice long novel - I do hate having to weed through a pile of user-generated reviews built with various stages of literacy and agendae.

      /P

      --
      Quo usque tandem abutere, Nimbus, patientia nostra?
  8. A reflection of the times. by plasmacutter · · Score: 5, Insightful

    This is a serious reflection of our current times, where people's eyes gloss over if the concept at hand is not condensed into a convenient sound-byte.

    I suppose you could call it the bleeding edge where complacency meets the loss of freedom and the fall of darkness where critical thought once stood.

    Now there is enough probable demand to launch a startup designed to remove what minimal labor people are interested in dedicating to the quality of even their leisure time.

    I'm sure many fantasize about strangling people this lazy/complacent, but honestly if they're unconscious enough not to care about their own toys, do they really possess a "life" for you to take from them?

    --
    VLC FOR MAC IS DYING! IF YOU DEVELOP, PLEASE SAVE IT!!
  9. I'm to lazy to read TFA by mrroot · · Score: 4, Funny

    can someone boil it down to a couple sentences for me?

    --
    I Heart Sorting Networks
    1. Re:I'm to lazy to read TFA by rm999 · · Score: 1

      Good point, I'm surprised Slashdot doesn't do that for us. Here ya go:

      "A new startup named Pluribo has developed a technology that can auto-summarize user reviews on the internet. It is a Firefox extension that can take a webpage filled with reviews and condense it down into a couple of sentences."

  10. It's not summarization. by melted · · Score: 3, Interesting

    It's heavily templatized generation of language based on the automatically extracted sentiment data. The important difference here is that the language of the summary does not include phrases from the original user reviews. While this is a new twist on the old problem, automatic extraction of evaluation criteria and sentiment analysis in product reviews are not new. Heck, even Microsoft has a working system for that (electronics only):

    http://search.live.com/products/?q=nuvi%20350%20GPS%20-%20Asian%20American%20(City%2FVehicle%2C%203.5%22%20LCD)&p1=%5BCommerceService+scenario%3D%22reviews%22+docid%3D%222BECBBF6F17C98618C2E%22+p%3D%2220df8fe62a9b4e9490993ff7b91032af%22%5D&wf=Commerce&FORM=ENCA

    See the bars on the left, and be sure to click through to the individual sentences. It's spooky how accurate that thing seems to be.

    The problem with all these systems is that they're heavily domain dependent. You will use different language to write a review of a book than for kitchen appliance. In fact, you may even use different language from different kinds of books or different kinds of kitchen appliances. Worse yet, some things are notoriously difficult to accurately measure sentiment on. Once innuendo and sarcasm become frequent, all hope is lost - you need strong AI to figure that out.

    This is not to say these systems are useless - to the contrary, they are very useful in their respective domains. This is just to say that the only new thing I see here is the generated blurb.

    1. Re:It's not summarization. by rm999 · · Score: 2, Funny

      I think that Microsoft live product already has strong AI; it knows how to brown-nose so Microsoft doesn't kill it:

      "Windows? Millennium Edition, or Windows Me, is the home operating system for PC?s that brings the richness and convenience of the digital world to your home. Windows Me is designed specifically for the home PC user. It represents the first major milestone towards advancing the vision of the Windows Division and further simplifying the computing experience for consumers. Windows Me delivers in 4 key areas. It?s one of the best in digital media, it improves user experience, enhances home networking, and delivers a rich internet experience. Advanced help functions make it the most trouble-free operating system for the home. System Restore lets users easily return their systems to a working state, and System File Protection safeguards key system files. Help and support resources are easily accessible from a single location, and AutoUpdate lets users easily schedule automatic updates from the Windows Update site. For a high-powered operating system for your home computer, go with Microsoft?s Windows Millennium Edition."

    2. Re:It's not summarization. by Pollardito · · Score: 1

      Worse yet, some things are notoriously difficult to accurately measure sentiment on. Once innuendo and sarcasm become frequent, all hope is lost - you need strong AI to figure that out.

      that's why Amazon is easy, you have the stars rating to go by. i think they're just marrying the fact that you mention a particular aspect of the product to your overall rating of the product, and in the end they can make a nice graph saying what the average rating is for a review that mentions a particular aspect.

    3. Re:It's not summarization. by Anonymous Coward · · Score: 0

      Actually we evaluate every sentence in the review for sentiment and for features. When you look at the user interface the first 'feature' is all reviews, and uses the review rating to sort the reviews. If you click through to a specific feature then the sentences are grouped by sentiment.
      The GP is quite right in saying that complex language is very hard to interpret. Luckily most reviewers are not Shakespeare. It is very interesting to see the variation between very concrete domains ( like electronics ) and much softer ones ( like books and movies ).

  11. Mac OS X Summary Service by Penguin+Follower · · Score: 3, Informative

    Built into Mac OS X is a Summarize Service (click on the application's menu and point to "Services" and choose Summarize) that is pretty cool. Unfortunately, Firefox was never coded to work with that OS X native service, so I had to copy and paste the text of that Ars Technica article into Text Edit and then use that service. But here is the resulting summary (I had to do as short a summary as possible):

    While summarizing other types of non-review content isn't on the company's roadmap for now, Chakrabarti said the company is listening to user feedback. He also admitted that, "in many ways, summarizing facts is actually a lot easier than summarizing opinions." Pluribo's current focus on user-generated reviews and opinions is certainly a useful one, but I can see all kinds of other consumer applications for things like long Wikipedia entries and even articles like the one you're reading.

  12. Err, which times? by Penguinisto · · Score: 1

    Dude, even back in history the sound bite, even if it makes no sense nowadays ("Tippecanoe and Tyler Too!") has been prevalent.

    Also, "our current times" include things that simply didn't exist before: near-universal literacy, 24/7/365 media, and a desire to do something with one's time off other than simply sit around in a half-drunken stupor somewhere while catching up from the work-week's exhaustion. The latter part was pretty much what the vast majority of humanity did with what little leisure time they got.

    /P

    --
    Quo usque tandem abutere, Nimbus, patientia nostra?
    1. Re:Err, which times? by plasmacutter · · Score: 1

      Also, "our current times" include things that simply didn't exist before: near-universal literacy, 24/7/365 media, and a desire to do something with one's time off other than simply sit around in a half-drunken stupor somewhere while catching up from the work-week's exhaustion. The latter part was pretty much what the vast majority of humanity did with what little leisure time they got.

      /P

      ah, so the world sucked worse then for joe everyman (so we should just bend over and take it is that right)?

      --
      VLC FOR MAC IS DYING! IF YOU DEVELOP, PLEASE SAVE IT!!
    2. Re:Err, which times? by Penguinisto · · Score: 1

      Who says you have to? It can be a switch, an option.

      --
      Quo usque tandem abutere, Nimbus, patientia nostra?
  13. Ugh. The opposite of what I want. by Anonymous Coward · · Score: 5, Insightful
    Ugh.

    This sounds like the opposite of how to get useful information out of reviews, and more like the "consumer products" equivalent to the automatic resume scanner.

    You know the resume scanners I'm talking about -- the ones that circular-file the candidate who took three years off work to get his Ph.D. in cognitive science (and whose thesis is a perfect fit for your business plan), preferring, instead, the guy who listed "20 years PROLOG, PL/1, BASIC, C, 10 years C++, 5 years Java, MCSE, A+", because obviously the second guy triggers more buzzwords. Because the HR drone won't understand any of the resumes, he/she just picks whichever one the scanner selects, and that's typically the one with the fewest career gaps and the most buzzwords. ("But that other Ph.D guy only has one or two languages, this guy has six! And that Ph.D guy's been out of work for three years, so obviously nobody would hire him!")

    Ten reviews reading "Works. Fast, cheap, lightweight" and three reviews reading "Doesn't work" don't tell me anything, other than that the product might have reliability issues.

    One review reading "Didn't work the first time. The manual doesn't mention that you have to make sure the jumper is in the correct position first, and then it works. I own an XYZ-123 and this new product was at least as fast, but at about half the price. Weighs about a pound." tells me everything I need to know -- that the three people who claimed it didn't work almost certainly didn't know how to configure it correctly, and that the first seven reviewers never had a problem because they weren't part of the edge case.

  14. 24/7/365?? by pbhj · · Score: 1

    Do you mean 24/7/52? Or perhaps 60/24/365? But 24/7/365 seems a bit redundant?!?

    I bet I made a mistake in this post, tis the law of slashdot!

    1. Re:24/7/365?? by Omestes · · Score: 0, Redundant

      86400/1440/24/7/365.2425/52?

      --
      A patriot must always be ready to defend his country against his government. -edward abbey
    2. Re:24/7/365?? by Anonymous Coward · · Score: 0

      Wow... A mod that fits...

  15. This is not the Semantic Web by redfood · · Score: 2, Informative

    The idea behind the Semantic Web is that content providers tag content with semantic information to allow the creation of an ontology so that programs can easily use the for reasoning. (see http://en.wikipedia.org/wiki/Semantic_Web) The is an example of text summarization which is a classic natural language processing (NLP) task.

    1. Re:This is not the Semantic Web by Anonymous Coward · · Score: 1, Insightful

      The idea behind the Semantic Web is that content providers tag content with semantic information to allow the creation of an ontology so that programs can easily use the for reasoning.

      The idea behind the real world is that content providers will put bullshit in their tags, so that your hopelessly naive tools will instead create a fauxtlogy so that programs can easily duped into misleading customers to purchase worthless products.

      The is an example of text summarization which is a classic natural language processing (NLP) task.

      The Semantic Web is an example of great ideas for doctoral theses that invariably end up being corrupted by greedy idiots, which is a classic marketing task.

    2. Re:This is not the Semantic Web by alefq · · Score: 1

      If you mean "tag" as the kind of a "post-it", like the tag for a picture or a file, then you are only talking about a little part of semantic web. Actually, semantic web is more complex than just tags joined to "build an ontology". It is based on several ontologies combined, and most of all, the use of a language that allows me to stablish semantic relations, which will allow me to do semantic queries like "what are other books this authos has written and are related to other investigations I'm doing now". If you want to give a try to a Semantic Desktop, the Nepomuk project http://nepomuk.semanticdesktop.org/xwiki/bin/view/Main1/ (IBM, HP, SAP, Mandriva among others, are participating) is building a very interesting solutions, PSWE is the desktop application (Eclipse RCP based) and have Nightly builds http://dev.nepomuk.semanticdesktop.org/download/, KDE is also working on having a semantic desktop, with colaboration from Nepomuk. One of the authors of Nepomuk, Leo Sauermann, did Gnowsis http://www.gnowsis.org/ from where some ideas were taken.

  16. Keywords in Context by The+Raven · · Score: 2, Interesting

    I'm concerned how well the applet properly discerns the meaning of words in context. For example, just because I mention that a product is 'a portable laptop' does not mean I am impressed with it's size or weight... it's just the category the product falls in. But judging from the screenshots in the article, this exact error was committed by the plugin.

    Reading natural language is hard, and I'm of the opinion that a Firefox plugin just won't cut it for understanding the nuanced opinions given by reviewers.

    --
    "I will trust Google to 'do no evil' until the founders no longer run it." Hello Alphabet.
    1. Re:Keywords in Context by plasmacutter · · Score: 1

      Reading natural language is hard, and I'm of the opinion that a Firefox plugin just won't cut it for understanding the nuanced opinions given by reviewers.

      now toss it some ebonics or wow-ese and watch it go nuts..

      --
      VLC FOR MAC IS DYING! IF YOU DEVELOP, PLEASE SAVE IT!!
    2. Re:Keywords in Context by biggahed · · Score: 1

      As I'm working on a really similar app for my college final project i got really interested in this plug in and grabbed it instantly to see what magic was behind this.

      fetchSummary: function(doc) { // Extract the Plid

                      var plid = pluribo.extractPlid(doc); // Launch XHR to get Summary

                      if (plid) {

                              var req = new XMLHttpRequest();

                              req.open('GET', 'http://'+pluribo_server+'/xhr/topic/widget/'+plid+'/', true);

                              req.onreadystatechange = function (aEvt) {

                                      if (req.readyState == 4) { // Render the Pluribo Panel (if successful)

                                              if (req.status == 200) { pluribo.renderPanel(doc, req.responseText); } // Otherwise Fail Silently

                                              else { /* alert("Pluribo: error loading summary\n"); */ }

                                      }

                              };

                              req.send(null);

                      }

      But as of the above, theres not much to look at as all the magic happens server side.
      So you're right... A simple fx plug in wont do it.
      From their page:
      "Languages and software

      Nearly all of our code is written in Python, a wonderful and elegant language. We use Boto to access AWS from python. Our NLP algorithms get help from NLTK, WordNet, and SciPy. Most of our servers run Ubuntu Linux. Our API is built with Django and Apache Lucene."

      Which makes me happy as I'm also using most of this stuff, and so now I know I'm on the right path.
      Now if only I could get these guys to share some love...

  17. It might do that, even with Slashdot. by myCopyWrong · · Score: 1

    Astroturf is repetitive, so it will get listed in one place. You won't be able to eliminate the talking points but you won't have to read them 50 times to find one genuine opinion. Both will be weighted as original content. You will then have to use your brain as you do now. At the same time, you can eliminate distractions like "hot grits". It took me a while to realize this but now I see that it is true.

  18. I'm too lazy to read the summary by Anonymous Coward · · Score: 0

    Could you boil it down to a word?

    1. Re:I'm too lazy to read the summary by dfm3 · · Score: 3, Funny

      Could you boil it down to a word?

      Forty-two

    2. Re:I'm too lazy to read the summary by DamonHD · · Score: 1

      !

      --
      http://m.earth.org.uk/
  19. Spam reviews... by dten · · Score: 1

    Unfortunately, the product will never be able to filter out useless spam reviews that artificially inflate product ratings, such as "OMFG THIS IS THE BEST!!!!" (entire review) (especially for products that aren't even released yet) or paid site-fillers like "this is good I'm so glad [this site] made it available".

    I expect most media product reviews will summarize to something like "This is the best ever, if that's your kind of thing."

    1. Re:Spam reviews... by Klaus_1250 · · Score: 1

      If they would build in a SPAM-filter, to filter them out, it would even be more useful. Personally, those spam reviews irritate the hell out of me.

      --
      It only takes one man to change the Wisdom of the Crowd to Tyranny of the Masses.
  20. Comment removed by account_deleted · · Score: 3, Insightful

    Comment removed based on user account deletion

  21. Most of us have already figured this out... by funkify · · Score: 1

    Most of us have already figured this out, but apparently some still haven't...

    The way to properly get information from online reviews is to sort from lowest rating to highest, skip the trolls, and read until you get to the fanbois.

    1. Re:Most of us have already figured this out... by dbcad7 · · Score: 1

      Sometimes the bad reviews are helpful.. For example, I put together a system on the cheap for my brother. I used solid and affordable components... now when I bought the case, it should have been just fine for the other components, and the majority of reviews were great, and it was dirt cheap.. but there were some bad reviews where people said the power supply was bad, or wouldn't boot... long story short (too late) I also ended up being one of the lucky people where the power supply was crap... the minute it didn't boot, I immediately ordered a decent power supply.. and everything was fine.. If I had only relied on the middle or best reviews, I would have been wondering if it might be something else like the motherboard.

      --
      waiting for ad.doubleclick.net
  22. MacOS X: How to Summarize the Contents of Document by mattkime · · Score: 3, Interesting

    This document explains how to use the Summarize services available in Mac OS X applications.

    If you have a long document, you can use the Summarize service to get a summary of the contents. For example, use this to get a short version of a long page on a Web site.

    To get a summary of a document, select the text and choose Services from the application's menu, then choose Summarize.

    If the application you are using doesn't support services, copy the text to a TextEdit document to get a summary.

    Note: The information is this document comes from Mac OS X Help, the help system included with your computer. It is based on Mac OS X 10.1.2. If a different version is installed on your computer, choose Mac Help from the Help menu. Updated and expanded information may also be available in other Knowledge Base documents.

    http://docs.info.apple.com/article.html?artnum=61336

    (but i know this feature was in OS 9 or earlier)

    --
    Know what I like about atheists? I've yet to meet one that believes God is on their side.
  23. love the idea of this... by plasticquart · · Score: 1

    Would be very nice if it could process the reviews on Newegg. Some items have tons of reviews.

  24. Re:Ugh. The opposite of what I want. by WhatAmIDoingHere · · Score: 1

    So, use this plugin to get your choices from 50 items down to 5-10, and actually read the reviews for those 5-10 items. The best of both worlds.

    --
    Not a Twitter sockpuppet... but I wish I was.
  25. Soon to be banned by Snaller · · Score: 1

    By people who think it's slimy for a program to manually scan a lot of pages you aren't going to read anyway.

    --
    If Google really cared they would fix Android Chrome to reflow text, instead of discriminating
  26. Re:MacOS X: How to Summarize the Contents of Docum by Anonymous Coward · · Score: 0

    (Score:2) by plasmacutter (901737) This is a serious reflection of our current times, where people's eyes gloss over if the concept at hand is not condensed into a convenient sound-byte.I suppose you could call it the bleeding edge where complacency meets the loss of freedom and the fall of darkness where critical thought once stood.Now there is enough probable demand to launch a startup designed to remove what minimal labor people are interested in dedicating to the quality of even their leisure time.I'm sure many fantasize about strangling 24/7/365??

    (This Slashdot Page)

  27. Several summarizer tools already there by ruphus13 · · Score: 3, Informative

    that do essentially the same thing with text summary (similar to Mac OS X's summarizer). There's an Open Source project that does this too called Summarizer - http://sourceforge.net/projects/summarizer

  28. astroturfers and trolls by bcrowell · · Score: 3, Interesting

    I run a site that catalogs free books, and accepts user-submitted reviews. (See my sig.) It's a constant source of amazement to me what a low level of morality (and intelligence) some authors have. They'll add their book to the catalog even though it's not free. (The site's UI tells them very clearly that it has to be free online in order to be listed.) Then they'll post their own "review" of the book, which reads exactly like a dust-cover blurb rather than a review. Then I check the email address they used to sign up on the site, and it's the same as the email address of the author of the book -- this despite the fact that the button they had to click on to submit their review was labeled I am not the author, and have no personal, professional, or business relationship with the author. I am submitting my review..

    About 50% of the reviews I get are like this, and I have to delete them by hand. I don't actually get that many reviews submitted, which is a good thing in a way, because if the site was really busy I'd never be able to keep up.

    I don't think there's any way of solving this problem, since the internet was designed for anonymous use, and even if it was technically feasible to verify identities on the internet, I wouldn't want to do it. Amazon tries fairly hard to deal with this problem. These days they won't let you submit reviews unless you've bought something from them, which is probably a reasonable way to stop sock puppets. They also try to get you to build up a reputation for your online persona, even if it's not publicly tied to a meatspace identity. That doesn't really work that well, though. For instance, there are certain people on amazon who submit something like ten reviews per day, 365 days per year -- obviously they're not really reading all those books. I also don't see any way to stop the phenomenon of the author getting his friends, family, and grad students to write good amazon reviews of his book.

    Because of all this, I'm suspicious of any statistical method of analyzing user-submitted reviews. You just have no way of knowing which reviewers are honest. You really have to look at the individual reviews and see if what they say makes sense. Ebay feedback is an example of how silly this can all get, even in a community where people really do have long-term online identities that they have an interest in maintaining good reputations for. What the heck does it tell you if the seller has 99% positive feedback? Absolutely nothing. You have to read the 1% negative reviews and try to evaluate whether they sound reasonable.

  29. A Nanna Mass Cowterd. by Anonymous Coward · · Score: 0

    Do we really want the internet to represent the people's opinion the same way Congress represents the people's opinion? Let some software compile it and dictate what others read?

    Whomever wrote this piece-of-hunk-of-crap really... sucks.

  30. State of research in the area by Anonymous Coward · · Score: 1, Interesting

    NIST runs yearly evaluations regarding automatic summarization. Some information about that stuff is available at http://www.nist.gov/tac/ (used to be http://duc.nist.gov/).

    There are two main approaches: the domain-dependent template filling plus text generation, or the domain independent statistical sentence extraction. And either way, the quality of the generated summaries is far far away from what a human can write.

    While machine-learning-powered research systems are much better than Word or OSX summarization, the way to go is still long...

  31. Methinks it would make it worse by Moraelin · · Score: 3, Interesting

    Since they explicitly mention Amazon, heh, my experience with Amazon's user reviews has been pretty bad to start with. Caveat: it's not about electronics, but I do buy games and the occasional DVD movie off Amazon.

    My impression is that the amount of fanboyism, astroturfing and bullshit is... epic. Monumental.

    E.g., read some reviews for a game that's not released yet. My favourite example was Gothic 3, when it wasn't even in beta yet, or even alpha. The only thing anyone had were some screenshots of what the graphics engine can do. That's it. Nobody had anything playable yet, probably not even the devs.

    Well, people were already writing reviews in which it's the greatest game ever, and the gameplay rules, the graphics are the best since Michelangelo, etc.

    When released, the game was a buggy mess that didn't even vaguely resemble those "reviews". The graphics had some major glitches. Quests could be broken because the NPC had fucked off, and I know someone who encountered that right in the freaking intro. The game had a nasty memory leak, where eventually it would start to barely crawl and eventually crash... often while saving, leaving you with a corrupt and unusable saved game. Gameplay too was a broken fuckup: e.g., combat was a broken whoever-hit-first-wins affair, because then the other would be continuously interrupted and unable to hit back or change weapons or whatever. Even a flea could probably kill you, if it hit first. Etc.

    Most of that stuff _still_ hasn't been fixed, after more than a dozen patches and the publisher giving up on it.

    But, of course, going by the user reviews, you'd think it's the greatest game ever.

    Now as a human, you can filter out the blatant bullshit, see which reviewers better reflect your taste and didn't post too much bullshit before, etc. I'm skeptical that a program can be too good at doing the same.

    But I have an even worse fear: that once people figure out that they only need to game a program, and how, we'll see even more fanboyism, astroturfing and bullshit. Plus an army of sock-puppets to mod each other up, if the bot takes that into account. Basically, think about all the link farms and link spam on the net to game Google's page rank. Now think the same for a bot aggregating reviews. I find that scary.

    So, no, I don't want it on Slashdot too. Basically, would you really want 300 goatse links, just so the bot includes it in the digested version?

    --
    A polar bear is a cartesian bear after a coordinate transform.
    1. Re:Methinks it would make it worse by WNight · · Score: 1

      It's a good idea, but just needs to be under your control. At that you could have it apply negative weights to votes from people who voted those articles up. They'd then push those down, out of your way.

      This is only less than useful because it'll be a one-size-fits-all solution and, if YouTube comments are any gauge, most people aren't as discerning as you.

      P.S. Yes, Gothic 3 stunk. Unplayable at release, barely playable weeks later with patches. 2+ minute load times (quick-load, hah!) on modern hardware, constant stun-locks when fighting boars/etc, broken quests... Ugh. Not to mention that it's Gothic 3(!!). You've saved the world twice before. So why are you level 1 at the start, having to suck up to itinerant villagers and hunt wildlife? Slight continuity fault...

  32. whatcouldpossiblygowrong, perfecttrap by rootpassbird · · Score: 1

    someone is summarizing for me according to his algorithm which i have no clue about as an end user.

    --
    Hackers have long memories. It works both ways.
  33. Birth of a new word: "astrotrolling" by onitzuka · · Score: 1

    Just googled it and it looks like we got to see the birth of this new term! http://www.google.com/search?q=astrotrolling

  34. Rollerball by Anonymous Coward · · Score: 0

    Anyone else is getting memories of the original Rollerball movie (the remake sucked donkey balls) where the AIs had summarised all the human literature and books were no longer available?
    If anyone laughed at that concept, start trembling.

  35. Useless Chaff by Jekler · · Score: 1

    The submitter commented that this seems to have promise. How so? It's obviously just a filter that's specific to the comment format of Amazon Electronics. It's not like this plugin does any type of natural language processing. The person who developed the extension obviously noticed a pattern that's specific to how people comment at that specific web site. If that's the case, the application has no promise for any wider application. If that's not the case, this person has done some world class work in breaking the barriers between humans and machine, which means they can understand us now!