Slashdot Mirror


AOL Introduces Neural-Net Content Filtering

An unnamed reader writes: "I thought this was kind of interesting. AOL has implemented a new form of parental controls, using neural net AI instead of hand-picked "lists". They seem to be willing to accept that no automated solution is infallible, and offer end-users the ability to vote to block or unblock sites. If there is an acceptible solution to parental filtering (not mandatory filtering, mind you. This scenario leaves it up to the parents), the seeming efficiency of neural net ai (at least, as efficient as the input) coupled with end-user's ability to influence the filter state seems to be it. The company that developed the AI in partnership with AOL (RuleSpace) doesn't appear to have much to say on the internals. Anybody know any AOL users who have tried it yet? If the market is pushing towards optional filtering, what would make for a better solution?"

24 of 156 comments (clear)

  1. Vote to *unblock* a site... by Genom · · Score: 3

    How exactly would this work for AOL users?

    If the site is blocked, there's a good chance they won't see it. If they don't see it, they can't decide whether it's "indecent" or not. Therefore, they won't vote to unblock it.

    Now conceivably, you could turn *off* these filters - but would the standard AOL web filters still be in place?

    Would you get a truly *uncensored* view of the web with the filters turned off, or would you simply get a larger subset, lacking what AOL's execs/censors have decided that you don't need to see?

    No thanks - I'll stick with a direct connection where *I* control what I can see, and what my family can see. I don't want my 'net experience "filtered" by a company's views - or even worse, by some glorified hivemind's views.

    ---
    "Everything is objectionable to someone, and sheeple are easily swayed to the views of someone with conviction. Therefore, they will vote in the manner proscribed to them by those with conviction. Without an opposing viewpoint, there becomes a monopoly on public opinion."

    1. Re:Vote to *unblock* a site... by MAXOMENOS · · Score: 3

      Here's how something like this might work:

      Timmy (yes, little Timmy) tries to look at a web page on sexual reproduction, from a science web site. The web site is blocked. Timmy emails AOL and tells them that www.encyclopedia.com/sex-education/ is unfairly blocked. AOL sends this to RuleSpace. RuleSpace looks at the site, figures out that it's not porn, wonders how the hell the AI flubbed that up, unblocks the site.

      Alternatively, Billy (Timmy's pal) tries to go to www.moreporn.com and finds it blocked. Billy complaines to AOL, who sends the complaint to RuleSpace. RuleSpace laughs their asses off.

      You can vote something as censorable similarly. Joe Camel sees www.hotbabeslickingeachother.com even though the filters are on. Pleased to see the website (but dismayed that it got past parental controls), Joe Camel complains to AOL, who complains to RuleSpace. RuleSpace wonders how the hell this site got past the AI and adds it to the list of blocked sites.

      Alternatively, Aunt Gwen who still lives in the 1890s could look at a site that depicts a couple kissing in public and complain to AOL that it's pornographic. AOL sends this on to RuleSpace. RuleSpace laughs their asses off.

      ObJectBridge (GPL'd Java ODMG) needs volunteers.

  2. they need multiple communities of censors by astrashe · · Score: 3

    This is almost there. The problem with this system is that different would be censors have very different ideas about what to suppress.

    Some parents will want to suppress homophobic hate speech, and other parents will want to suppress discussions about evolution.

    Instead of one big mass of rules, they need to make it possible for spliter groups of parents to "fork the rules", or to start out from scratch with a new set of rules. That way concerned parents can pick the censorscheme that fits their own biases best.

    As long as none of this is compulsory, I think it's probably a reasonable approach.

  3. Re:One possible solution? by Samrobb · · Score: 4

    I've thought about this and discussed it in the past. In order for this sort of thing to work, I think you need to make a couple of assumptions:

    1. The data format for filter lists must be standardized and flexible enough to permit anyone to easily assemble and disseminate a list. A simple text file would probably do, as that can be "published" via HTTP or downloaded and "installed" by dropping it into a directory.
    2. Another data format item... I'd think this is the sort of thing that XML would be overkill for, but supporting an XML format in addtiion to a plain-text format would allow for greater flexibility. For example, a list of sites could be broken down by general content, publisher, etc., in order to allow additional filtering (allow "Child-Friendly" sites, except for those run by The Mouse).
    3. Filtering is opt-out by default. By this, I mean that if a site does not appear in any of your subscribed lists, it gets blocked. Unfortunately, this is the only real way (short of analyzing content in real-time) to ensure that various types of web spammers, particularly pornographers, can't fo an end-run around the filtering. Note that I don't think this should be the only mode of operation; just that the default settings should be "disallow access unless approved by me or someone I have decided to trust".
    4. Multiple subscriptions are going to be the default. While I'd be quite happy suscribing to a list of Christian-friendly sites, it's unlikely that other interests of mine (/., for example) would end up on those lists. This could certainly be handled in part by a "local" list.
    5. Adding new sites to the "local" list should be trivial for someone who has permission to do so. Editing a text file and restarting your browser doesn't cut it; seeing a "Access to this page has been blocked... add it to the local list?" in your browser would.
    6. Whatever proxy is handling the filtering should be able to handle differing collections of access lists based on user. If Junior logs in, I want to ban potentially dangerous sites. On the other hand, I've got to deal with those buggers daily, so when I'm logged in, I should be able to access them.

    Comments? Or has someone already gone and registered a Source Forge project for this?

    --
    "Great men are not always wise: neither do the aged understand judgement." Job 32:9
  4. Will Disney get blocked? by JJ · · Score: 4

    If the neural net takes linkages as input then Disney could find themselves on the list of blocked sites, a somewhat ironic development to say the least. (If you dont understand this, then you obviously haven't poked around these sites very much.)

    --
    So long and thanks for all the fish . . . !!!
  5. How it probably works by scruffy · · Score: 3
    Here is an educated guess as to how this works.

    First, you create a training set by having people categorize a lot of web pages as porn, mature, PG-13, or whatever. By a lot, I would guess that around 1,000 to 10,000 web pages would be sufficient. Then you make a list of the words from each web page in the training set, maybe also keeping track of how many times each word appears (this is called the "bag of words" representation in the literature). Now, you train your learning algorithm (neural nets in this case) to correctly categorize the training set. You use standard experimental procedures to tune your learning algorithm and to confirm its accuracy.

    The whole basis of this technique is that certain combinations of keywords are more likely in porn web pages than in, say, safe sex web pages. One reason why this approach should continue to work is that web pages intentionally put various keywords into the web pages so that you can find them using any standard search engine. If they try to fool this technique, they also risk fooling anybody trying to search for these pages.

  6. Who trains the Neural Net? by Stavr0 · · Score: 4

    Is this going to be one giant Neural Net for all AOL? Or is it going to be community-based? I don't necessarily want to filter content the same way as, say, the Bible Belt users.
    Also, who gets to train the filtering decisions? Can the Slashdot or Everything2 model be applied here? That would mean that all users would have to sit down and go thru all decisions made by the engine and vote aka train the NNet. If there's only one engine that applies to the whole userbase, it'll fail, because it will filter too much for some, too little for others.
    ---

  7. Skynet anyone? by redsmoke · · Score: 4

    On October 23, 2001 Aol's Nueral Net AI(ANNT) comes online.

    October 25, 2001 the Nueral Net AI gains self awareness.

    October 27, 2001 AOL Executives desperately try to shut down the ANNT but can't seem to find the any key.

    October 28, 2001 In retaliation the ANNT begins the launch sequence of all of Aol's secret Nuclear weapons and launches a attack on Microsoft. Microsoft launches a retaliation on Aol with their secret Nuclear weapon stash. The two major Computer Monopoly's are destroyed.

    October 29, 2001 The first wave of the giant minature space penguins begin to take over all the computers.

    November 1, 2001 Linus Torvalds is crowned king of the world.

  8. Yeah, this will work..... by nuintari · · Score: 4

    Yet another "advanced" pr0n filter that no adult can seem to break, but every horny fifth grader on the planet will no the workarounds for in under 2 days.

    --

    --Nuintari

    slashdot : where an opinion can be wrong.

  9. Re:AI on the net huh? by jovlinger · · Score: 3

    There already is an automated system. The suck/rule-someter using google is one such instance. Actually, the hot-oral-sex-ometer might be a good way to screen against porn. Figure most porn site will link to other porn sites, so the links ought to be fairly accurate.

  10. YECIAFV (Yet Another CIA FLUENT Variant) by joq · · Score: 5

    FLUENT, enables an operator to search stored documents in a language s/he doesn't understand by using his or her own language for queries. (read on and here)

    Funny how we Americans are such tightwads when it comes to sexual content. After visiting Europe last year I saw people were a slightly bit more laid back, even though pornography is shown on television just about every night. Wow I'm surprised Parents all over the USA aren't condemning Europeans for being sexually free.

    Here's a suggestion for some parents: How about talking to your kids before placing mental handcuffs on them?

    I wonder if AOL has taken the time to filter regular expressions such as pr0n/s3x/etc. Then I also wonder how are kids doing homework on "sexual reproduction" or "sexual organisms" are going to fair when using AOL. What I'm waiting to see, is who is going to be the first to open online "concentration camps" AOL-TW or MS


  11. AOL violates patents. by Tsujigiri · · Score: 3

    Great. It looks like AOL are implementing a combination of Everything2 and the Slashdot moderation.

    • www.xxxbustybabes4you.com (Score:1, Overrated)
    • www.timewarner.com (Score:5, Informative)

    Get those patent lawyers ready...umm...you did patent moderation, didn't you...well...at least we can get 'em with Everything2...can't we?

    "I'll take the red pill, no, blue. AAAHHHHHHHHHHHHHHHH........"

    --

    "I'll take the red pill. No! Blue! AAAaaaahhhhhhhhh"
    - Monty Python meets the Matrix

  12. Full Circle by Greyfox · · Score: 4
    Time was when you wanted pr0n you raided dad's pr0n stash. Then the Internet came along and you could get all the pr0n you wanted the the click of a button. Now it looks like we're completing the circle, making it much more likely that raiding dad's stash will once more be the preferred method of obtaining pr0n. Or dad's web cache.

    Of course, pitting parents against children in access control battles over the computer will always almost always result in one victor -- the children. Unless the parent is an IT security consultant, the children seem to inevitably know more about the computer.

    --

    I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

  13. Parents will vote in/out sites? by FunkyChild · · Score: 5

    "What are you doing, honey, and what are those groaning noises coming from the computer?"

    " *cough* Just ..er.. exercising my vote to ..er.. make the internet a better place for our children..."

  14. Re:ARG! Why not go the easy way? by Christianfreak · · Score: 3

    You're over-simplifying the issue here: the Internet isn't controled by one body, it has no centralization so who would make the law? And if somehow every one of us could agree on such a law (all over the world) then how would it be enforced? Take places like Geocities for example, they have a rule against porn sites but there is still porn there. When they get taken down they just create a new account. Such a law would be far too easy to get around and then we're back to the same problem we have now.

    Its sad but its so easy to see people's attitudes when it comes to issues like this. "My kid saw some naked girl! Its the governments fault!" The answer is good resposible parenting, not more laws and not blaming the government for not doing more. They've done what they can, they can't very well take away the of people to see it if they want too. We need better parents.


    "One World, one Web, one Program" - Microsoft promotional ad

  15. Voting goes to a review board... by khym · · Score: 3

    The voting feature that is mentioned doesn't get automatically processed by the system, but instead goes to a human review board; if the review board agress, they presumeably either add the site to some type of "override" list, or tell the engineers to tweak the AI code. The AI itself is supposed to understand words in the context that they're used; for example, the article claims that the page "The Art of Oral Sex" was blocked while "Is Oral Sex Safe?" wasn't blocked.


    Suppose you were an idiot. And suppose that you were a member of Congress. But I repeat myself.
    --
    Give a man a fire, and he'll be warm for a day, but set him on fire, and he'll be warm for the rest of his life.
  16. The best filter is your own two eyes... by RobinH · · Score: 4
    If the market is pushing towards optional filtering, what would make for a better solution?

    IMHO, a small child should not be left unattended for long periods of time on the internet. The best filtering is for you to watch your kids and see what they're viewing. This goes for television too.

    When your kids are older (i.e. teenagers) just make sure they understand what you approve of them looking at on the internet. At some point you have to trust their judgement of what's right/wrong. You still need to monitor, but don't put automated filtering there, because that just shows that you don't trust them. There are lots of ways to check what they're viewing without having a screen pop up saying that your parents have blocked this site because it contains objectionable content, when all they were reading were some /. postings.

    --
    "I have never let my schooling interfere with my education." - Mark Twain
  17. neural nets by bluesninja · · Score: 3

    That's i guess a decent solution for once. Not ideal though. It means that all AOL-ers are now governed by the will of the "average" AOL member. That's a scary thought.

    Ideally, instead of voting sites up or down, why not just let everybody host their own list? Instead of "voting" for the privelege of allowing your kids to view a site, just let them. And vice versa.

    /bluesninja

  18. One possible solution? by pjdepasq · · Score: 4
    A number of years ago ('96?) I was at a Software Development Expo in Washington DC. Tim Berners-Lee was the keynote the day I went and he presented what I thought was an interesting solution.

    Essentially there would be local communities (churches, schools, etc) who made restricted lists available via the browser to anyone that "agreed" with their standards. That is, if you are a parent and you like the standards your church sets, you "subscribe" or download (or whatever) the church's list of "bad" sites.

    In this scheme there's little to no mandating of someone else's standards (what AOL deems inappropriate), and you can decide what's right for your family, situation, children, morals, etc.

    I've never heard more about this scheme but I am interested in it (though I have no kids to patrol). One of these days when I get done with my PhD I might try to implement this solution and see how it works out.

  19. Really a neural net? by HongPong · · Score: 4

    It sounds like just a voting system on websites. I read the article, but I don't get where the neural net part comes in. Also, does it apply to whole domains or subdomains, directories?

    --

  20. I don't know about you... by NineNine · · Score: 3

    ...but I'm not a big fan of filtering software.

  21. That's helpful indeed by ishrat · · Score: 3
    Even though our upbringing may be ideal but a certain age and the human mind work towards finding out things for themselves. And the banned items give the most excitement and thrill of life, and as such the upbringing comes to naught. The internet being so private unleashes the desire to take a peep into the other world, without the risk of the parents ever finding out.

    This kind of automated filter along with the manual control will certainly come in handy, although I would like to advice parents to leave some scope for indulgence so that the children are not desperate enough to fid other more complex ways of satisfying the quest for information in those fields. And it all wears of in time because the novelty of the situation no longer exists.

    --

    There's always sufficient, but not always at the right place nor for the right folks.

  22. Worrisome by RareHeintz · · Score: 4
    What worries me about this "innovation" from AOL is the potential for large subsets of users to get together to ban political speech that they don't like - for example, by voting down the Village Voice site or gwbush.com or the RNC website.

    Some things (and not all that many) are best done by majority vote, but some things are better left to individual discretion.

    OK,
    - B
    --

  23. query by Publicus · · Score: 3

    Does anyone know what end user sees when a site is blocked by AOL? For example, if my parents are being over-protective and I want to look at some nudity, what message do I see when I try? "Sorry, your parents don't want you to see this kind of thing" or "This site has been blocked?"

    Also, was anyone confused by this line?:
    "This (AOL filtering technology) is (only) that good," Nunberg said.

    Should those parens be taken as brackets? If not, how would that have sounded in the interview? :)

    --

    My Karma was at 49, then they switched to words. All that work for nothing!