Poison Attacks Against Machine Learning


Posted by timothy on Sunday July 22, 2012 @01:33AM from the think-zippy-the-pinhead dept.

mikejuk writes "Support Vector Machines (SVMs) are fairly simple but powerful machine learning systems. They learn from data and are usually trained before being deployed. SVMs are used in security to detect abnormal behavior such as fraud and credit card use anomalies, and even to weed out spam. In many cases they need to continue learning as they do the job, which raises the possibility of feeding them data that causes them to make bad decisions. Three researchers have recently demonstrated how to do this, using a minimum of poisoned data to maximum effect. They found that their method had a surprisingly large impact on the performance of the SVMs tested. They also point out that it could be possible to direct the induced errors so as to produce particular types of error. For example, a spammer could send some poisoned data so as to evade detection for a while. AI-based systems may be no more secure than dumb ones."
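
To make the idea in the summary concrete, here is a minimal sketch of training-set poisoning. It is not the researchers' method and it does not use an SVM: a 1-D nearest-centroid classifier stands in for the learner, and the data points, the labels "ham"/"spam", and the names `train`, `predict`, `clean`, and `poison` are all made up for illustration. It only shows how a couple of mislabeled points accepted during retraining can move a decision boundary.

```python
from statistics import mean

def train(samples):
    """Fit a 1-D nearest-centroid classifier; returns (ham_mean, spam_mean)."""
    ham = [x for x, label in samples if label == "ham"]
    spam = [x for x, label in samples if label == "spam"]
    return mean(ham), mean(spam)

def predict(model, x):
    """Assign x to whichever class centroid is closer."""
    ham_mean, spam_mean = model
    return "ham" if abs(x - ham_mean) <= abs(x - spam_mean) else "spam"

# Hypothetical "spamminess" scores with trusted labels.
clean = [(1.0, "ham"), (2.0, "ham"), (3.0, "ham"),
         (8.0, "spam"), (9.0, "spam"), (10.0, "spam")]

# Attacker-supplied points: spam-like scores carrying a "ham" label.
poison = [(9.0, "ham"), (9.0, "ham")]

clean_model = train(clean)
poisoned_model = train(clean + poison)

target = 6.5  # a message the attacker wants delivered
print(predict(clean_model, target))     # classified with the clean model
print(predict(poisoned_model, target))  # classified after poisoning
```

Two poisoned points out of eight are enough here to drag the "ham" centroid toward the spam region, so the target message changes class. The paper's attack is more sophisticated (it chooses the poison points to maximize the damage), but the mechanism, retraining on data the attacker can influence, is the same.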

82 comments


Why solely the link to "i-programmer.info"? by Anonymous Coward · 2012-07-22 01:42 · Score: 5, Informative

Why the hell is the only link in the summary to that rather useless "I Programmer" website? The summary here at Slashdot is basically the content of the entire linked "article"!
Here is a much more useful link for anyone interested in reading the actual paper: http://arxiv.org/abs/1206.6389v1
1. Re:Why solely the link to "i-programmer.info"? by Anonymous Coward · 2012-07-22 01:47 · Score: 0
  
  Because the guy that posts these likes spamming links to his own site.
  http://developers.slashdot.org/story/12/07/21/2040257/html5-splits-into-two-standards
2. Re:Why solely the link to "i-programmer.info"? by mbone · 2012-07-22 01:48 · Score: 2
  
  The "original article" has a link to the arxiv preprint at the bottom.
3. Re:Why solely the link to "i-programmer.info"? by Anonymous Coward · 2012-07-22 02:40 · Score: 0, Troll
  
  For the wonderful low prices of $7USD, I will touch your sex.
4. Re:Why solely the link to "i-programmer.info"? by Anonymous Coward · 2012-07-22 03:08 · Score: 0
  
  Why the hell is the only link in the summary to that rather useless "I Programmer" website? The summary here at Slashdot is basically the content of the entire linked "article"!
  That's easy: timothy doesn't proofread, much less edit.
5. Re:Why solely the link to "i-programmer.info"? by Lisias · 2012-07-22 04:17 · Score: 0
  
  For just $93USD more, I'll leave your sex in place when I retrieve my hand.
  
  --
  Lisias@Earth.SolarSystem.OrionArm.MilkyWay.Local.Virgo.Universe.org
6. Re:Why solely the link to "i-programmer.info"? by EdIII · 2012-07-22 05:54 · Score: 0
  
  For just $93USD more, I'll leave your sex in place when I retrieve my hand.
  So... for a hundred dollars you will not rip off my johnson? Sounds quite reasonable. Do you happen to work for Viacom?
7. Re:Why solely the link to "i-programmer.info"? by justforgetme · 2012-07-23 18:59 · Score: 1
  
  Yes, but the sentiment stands. Why base the story on a story about an article and not on the article itself?
  Bone idleness? Idiocy? Hunger? Gas?
  
  --
  -- no sig today
Try this on humans by s_p_oneil · 2012-07-22 01:46 · Score: 4, Interesting

Universities should run a number of psychology experiments to see how this can be done to human intelligence to see how susceptible it is compared to AI. Or you could just study people who tune in to .
1. Re:Try this on humans by s_p_oneil · 2012-07-22 01:47 · Score: 1
  
  Sorry, Slashdot stripped out my "insert questionable media outlet here" message. I previewed it a bit too quickly.
2. Re:Try this on humans by Yaa+101 · 2012-07-22 03:27 · Score: 2
  
  It is already known that human brains make up whatever is missing from the info they are presented with.
  With people you only have to withhold info to get them to make bad decisions.
3. Re:Try this on humans by sg_oneill · 2012-07-22 03:48 · Score: 1, Insightful
  
  When you think about it, what's going on here is inducing mental illness in "thinking" machines.
  We already know how to induce mental illness in humans. Religion and war.
  
  --
  Excuse the Unicode crap in my posts. That's an apostrophe, and slashdot is busted.
4. Re:Try this on humans by Lisias · 2012-07-22 04:19 · Score: 1
  
  Universities should run a number of psychology experiments to see how this can be done to human intelligence to see how susceptible it is compared to AI. Or you could just study people who tune in to .
  They're still busy trying to understand Milgram's results.
  
  --
  Lisias@Earth.SolarSystem.OrionArm.MilkyWay.Local.Virgo.Universe.org
5. Re:Try this on humans by marcosdumay · 2012-07-22 04:23 · Score: 3, Insightful
  
  You mean propaganda and social pressure.
  Religion and war are just consequences of those.
  
  --
  Rethinking email
6. Re:Try this on humans by Anonymous Coward · 2012-07-22 04:37 · Score: 0
  
  This could explain religion.
7. Re:Try this on humans by ceoyoyo · 2012-07-22 05:14 · Score: 1
  
  They have. Google Milgram experiment.
8. Re:Try this on humans by Runaway1956 · 2012-07-22 09:49 · Score: 1
  
  GP is probably on a crusade to stamp out religion and war. Your definitions will have zero impact on his views.
  
  --
  "Windows is like the faint smell of piss in a subway: it's there, and there's nothing you can do about it." - Charlie Br
9. Re:Try this on humans by Decker-Mage · 2012-07-22 13:37 · Score: 1
  
  That was what came to mind immediately after just reading the summary here. Unfortunately the cure, open-mindedness, frequently sets up the 'victim' to the disease. Sad.
  
  --
  "[I]t is a wise man who admits the limits of his knowledge or skill, and that pretending either causes harm." --Terry Go
Propaganda by mbone · 2012-07-22 01:47 · Score: 5, Insightful

On this side of the human / AI line, we call this propaganda. It has historically proved very effective, especially if you can control all of the "training data."
1. Re:Propaganda by betterunixthanunix · 2012-07-22 01:53 · Score: 1
  
  Historically? Just what do you think D.A.R.E. is?
  
  --
  Palm trees and 8
2. Re:Propaganda by MightyYar · 2012-07-22 02:28 · Score: 1
  
  D.A.R.E. would be a pretty poor example - it has never been found to be effective.
  
  --
  W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
3. Re:Propaganda by betterunixthanunix · 2012-07-22 02:55 · Score: 2
  
  That depends on your definition of "success" -- D.A.R.E. has been overwhelmingly successful at convincing people that some drugs should be illegal. See, for example, the large number of people who are convinced that cocaine, heroine, and methamphetamine are evil and must be banned (and never mind that two of the three drugs are legal by prescription).
  
  --
  Palm trees and 8
4. Re:Propaganda by peragrin · 2012-07-22 03:25 · Score: 1
  
  I don't believe drugs are bad, but the use of some drugs should be tightly regulated.
  The average person is really bad at self medication. Either going way too far or too little. Drugs with side effects trigger attachments. Caffeine is just as dangerous as Alcohol in that respect. Some people really can't handle their caffeine very well either. Go to a coffee stand (or at work) and watch some people with their hands shaking so hard they can't hold the coffee in the cup.
  That is a sign of a drug addiction beyond the persons ability to control.
  Prescribed drugs can be abused but at least someone is trying to limit the effects
  
  --
  i thought once I was found, but it was only a dream.
5. Re:Propaganda by mbone · 2012-07-22 03:37 · Score: 1
  
  Historically? Just what do you think D.A.R.E. is?
  amateurs
6. Re:Propaganda by LateArthurDent · 2012-07-22 04:30 · Score: 2
  
  the average person is really bad at self medication.
  And why is it our job to protect them?
  Boxing is extremely dangerous. If two people make the choice to get in the ring, we may think that's unwise, but it's their decision. If you make the decision to do something that will harm you, you may be an idiot, but I don't have the moral right to stop you through means other than making an argument to try to change your mind.
  When you get into things that have the potential of harming others, then that's another story. You're free to drink alcohol and use whatever other drugs you want to. You're not free to drive on public roads under their influence.
7. Re:Propaganda by betterunixthanunix · 2012-07-22 04:38 · Score: 5, Insightful
  
  Drugs with side effects trigger attachments. Caffeine is just as dangerous as Alcohol in that respect
  Except that "attachments" are not dangerous. Coma and death are dangerous, brain damage is dangerous, liver damage is dangerous, and the typical doses of alcohol are frighteningly close to such adverse effects -- whereas the typical dose of caffeine is nowhere near that point.
  
  Go to a coffee stand (or at work) and watch some people with their hands shaking so hard they can't hold the coffee in the cup.
  Which may be scary, but is not a sign of any permanent damage to that person's mind or body. Caffeine withdrawal is tough, but it is not life threatening, and a person who is committed to it can get through the symptoms at home (maybe with the help of close friend) in less than a week. Alcohol withdrawal, on the other hand, can be so dangerous that it requires medical supervision.
  
  That is a sign of a drug addiction beyond the persons ability to control.
  Yet the drug abuse and dependence treatment programs that emerged from clinical psychology (read: science) are based on teaching people how to take control and avoid harmful behaviors.
  
  Prescribed drugs can be abused but at least someone is trying to limit the effects
  Really? A typical Adderall prescription (d,l-amphetamine salts) is for 10-20mg, two-three times per day, for a month. That is well above a lethal quantity, and a person could easily give themselves brain damage by taking a large fraction of their month's supply. People who abuse Adderall and related medicines (other amphetamines, Ritalin, etc.) can have psychotic episodes; see, for example, this recent NY Times article (sorry for paywall) about prescription stimulant abuse among high school and college students:
  
  https://www.nytimes.com/2012/06/10/education/seeking-academic-edge-teenagers-abuse-stimulants.html?_r=1&hp
  
  It's not just psychiatric drugs; prescription opiates are also readily abused, and people get high by using the prescribed amount of those drugs. Some pharmaceutical opiates are more potent than heroin, and abuse is an ever-present concern with those drugs; Rush Limbaugh abused prescription opiates:
  
  http://www.cbsnews.com/2100-201_162-1561324.html
  
  Here is the problem with the war on drugs: recreational drugs need not be any more dangerous than prescription drugs. Pharmaceutical methamphetamine is safer than "truck stop" methamphetamine, not because it is a different drug, but because the production is much better controlled. Many of the dangers of recreational methamphetamine stem from the adulterants that are left over from poor production techniques.
  
  So in a sense, I agree with you: we need better regulation. That means legalizing recreational drugs, and requiring that legal sources adhere to standardized and regulated production and distribution methods (I do not think anyone can argue that a 14 year old should be buying recreational drugs). When someone buys cocaine, they should not have to worry about what is mixed into the drug; when someone buys MDMA (ecstasy), they should not worry about having actually received methamphetamine mixed with caffeine (a well known trick on the black market). There will still be problems with abuse, but when someone visits their doctor, they should be able to tell their doctor what drugs they have been taking, and in what doses -- which is basically impossible if you are buying some mystery powder in an alley somewhere.
  
  --
  Palm trees and 8
8. Re:Propaganda by betterunixthanunix · 2012-07-22 04:43 · Score: 4, Interesting
  
  I disagree; D.A.R.E. has been overwhelmingly successful at convincing people of the legitimacy of the war on drugs and the paramilitary police that were created in the name of that war. Hardly anyone questions the fact that we have soldiers (but with "POLICE" or "DEA" written on their uniforms) attacking unarmed civilians just to serve an arrest warrant. Hardly anyone questions the fact that the executive branch of government, through the Attorney General's office, now has the power to make and enforce drug laws, without democratic action. Hardly anyone questions the fact that the DEA, supposedly a law enforcement agency, has so much signals intelligence capability that the dictators of some nations have tried to demand the DEA's help in spying on political opponents.
  
  How many propaganda programs have been so successful at convincing people that this sort of unwinding of a democratic system is the right thing to do?
  
  --
  Palm trees and 8
9. Re:Propaganda by inasity_rules · 2012-07-22 04:46 · Score: 1
  
  I deliberately quit coffee every 4 months or so. Then when I start again it is so much more effective. Quitting isn't that hard, given I drink more than 7 cups a day normally..
  
  --
  I have determined that my sig is indeterminate.
10. Re:Propaganda by betterunixthanunix · 2012-07-22 04:49 · Score: 1
  
  I found that quitting coffee came with headaches and tiredness for a day or two -- not the worst thing in the world (people go through worse with tobacco) but not something to shrug at.
  
  --
  Palm trees and 8
11. Re:Propaganda by adri · 2012-07-22 06:31 · Score: 1
  
  Because you don't live in a world where individuals' actions have no effect outside of the individual.
  If two people decide to get in the ring and box, and suffer brain damage in the long term, so be it. What effect could it have?
  If a hundred thousand pairs of people decide to get in the ring and box, what kind of long term effects will that have on the people around them? Would there be an increase in accidents? A decrease in critical thinking? What kind of effects would it have on their planning and execution skills? What about those families whose fathers/mothers/daughters/sons are suffering from boxing effects and what stresses/effects does it have on them?
  Done at a large enough scale, _everything_ has an influence on society as a whole.
12. Re:Propaganda by Anonymous Coward · 2012-07-22 07:46 · Score: 0
  
  "That is a sign of a drug addiction beyond the persons ability to control."
  Which results in zero* harm anyway.
  *Negligible
13. Re:Propaganda by Rhalin · 2012-07-22 08:13 · Score: 1
  
  the average person is really bad at self medication.
  And why is it our job to protect them?
  Boxing is extremely dangerous. If two people make the choice to get in the ring, we may think that's unwise, but it's their decision. If you make the decision to do something that will harm you, you may be an idiot, but I don't have the moral right to stop you through means other than making an argument to try to change your mind.
  When you get into things that have the potential of harming others, then that's another story. You're free to drink alcohol and use whatever other drugs you want to. You're not free to drive on public roads under their influence.
  I'm unfamiliar with a theory of social morality that supports the line of reasoning you start from. Could you point me towards more information on this that is supported by contemporary social theory? Preferably grounded in a processural approach?
  Thanks!
14. Re:Propaganda by colinrichardday · 2012-07-22 08:28 · Score: 1
  
  heroine
  So they object to novels with female protagonists? :-)
15. Re:Propaganda by colinrichardday · 2012-07-22 08:39 · Score: 1
  
  Caffeine is just as dangerous as Alcohol in that respect.
  I can see it now: MACC, Mothers against caffeinated coffee/cola.
16. Re:Propaganda by Anonymous Coward · 2012-07-22 09:58 · Score: 0
  
  the average person is really bad at self medication
  Patients who are on doses of narcotics that are high enough that addiction is a concern are frequently given use of a mechanism to (within limits) control the dose, because doing so reduces the likelihood that the patient will become addicted. I.E. the patient, despite being out of his head from pain and narcotics, is better able to get the dosage right than is the attending physician.
17. Re:Propaganda by Anonymous Coward · 2012-07-22 14:49 · Score: 0
  
  Just injecting them in the privacy of your own home.
18. Re:Propaganda by Smauler · 2012-07-22 16:09 · Score: 1
  
  I've recently (the last 6 months or so) been on and off of tobacco, ie. smoke about 20 a day for a week, stop for 3 or 4 days, smoke for a week, stop again, etc. I've been a smoker for almost 20 years. This isn't because I want to quit - I don't, I enjoy smoking. I think the physical dependencies are completely exaggerated...
  I have a much bigger physical craving for alcohol after not drinking for a while, to the extent I deliberately don't drink a lot of the time. Cocaine's not too bad, but it's insidious - I used to be a weekend user, and found that sometimes I wasn't looking forward to the weekend, I was looking forward to the cocaine. I slowed down a bit after noticing that. Mephedrone I went a little silly on some weekends, when it was legal, because it was gorgeous, but it made the next few days feel dull as hell. When they made it illegal, I quit, because when something is illegal, it's generally cut to crap, and dosage was quite important for me with it - too much, and you end up talking complete crap constantly, if you're not careful. Cocaine you can regulate better, ie. you know how high you are more easily (though it's often cut with other uppers, just to make it more difficult).
  I've stopped illegals now, not for moral or self-preservation issues, but for practical ones - I go out less, and if I get caught again I'll be in deep shit.
Will AI's become too smart for us? by k(wi)r(kipedia) · 2012-07-22 02:01 · Score: 1

The security implications aside, one problem I see is a possible arms race between the poisoners and the AI designers. The only way for the designers to win is to build tests that are less tolerant of the poisoned data. This is good if AI systems are built to interact only with other AI systems. But what if humans are the end users?
At some point, the increase in data precision will come up against the natural imprecision of human users. Fewer humans will be smart enough to pass the Turing test. A practical example: I've noticed how Google's recaptcha puzzles have become more difficult. I now need to magnify the page view in order to make out some of the letters.
1. Re:Will AI's become too smart for us? by mbone · 2012-07-22 02:54 · Score: 2
  
  I have this mental image that in the future not everyone will be able to pass as human (i.e., routinely solve captchas), and the ones who can may be able to rent out that service to those who can't.
2. Re:Will AI's become too smart for us? by Gaygirlie · 2012-07-22 03:48 · Score: 2
  
  I have this mental image that in the future not everyone will be able to pass as human (i.e., routinely solve captchas), and the ones who can may be able to rent out that service to those who can't.
  The good thing is that us non-humans can then travel all around the world really cheap. I, personally, belong in healthcare products as a natural Fleshlight-substitute!
3. Re:Will AI's become too smart for us? by betterunixthanunix · 2012-07-22 05:08 · Score: 1
  
  I have a mental image of a future without captcha, where we rely on things like HashCash instead -- slowing down spammers, rather than defeating them entirely.
  
  --
  Palm trees and 8
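
For readers who have not met HashCash, a rough sketch of the proof-of-work idea the comment above refers to follows. This is a simplified illustration only: real Hashcash uses SHA-1 and a specific stamp format, whereas this sketch assumes SHA-256 and a made-up `resource:nonce` stamp. The names `mint`, `verify`, and `leading_zero_bits` are hypothetical.

```python
import hashlib
from itertools import count

def leading_zero_bits(digest: bytes) -> int:
    """Count how many zero bits the digest starts with."""
    bits = 0
    for byte in digest:
        if byte == 0:
            bits += 8
            continue
        bits += 8 - byte.bit_length()
        break
    return bits

def mint(resource: str, bits: int) -> str:
    """Search for a nonce so that SHA-256(stamp) starts with `bits` zero bits.

    Expected cost is about 2**bits hash evaluations for the sender.
    """
    for nonce in count():
        stamp = f"{resource}:{nonce}"
        if leading_zero_bits(hashlib.sha256(stamp.encode()).digest()) >= bits:
            return stamp

def verify(stamp: str, resource: str, bits: int) -> bool:
    """Checking a stamp costs a single hash for the receiver."""
    digest = hashlib.sha256(stamp.encode()).digest()
    return stamp.startswith(resource + ":") and leading_zero_bits(digest) >= bits

stamp = mint("alice@example.com", 12)
print(stamp, verify(stamp, "alice@example.com", 12))
```

The asymmetry is the point: minting costs thousands of hashes per message while verifying costs one, so a bulk sender pays far more than a casual one. It slows spam down rather than classifying it, so there is no learned model to poison.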
works on people too by circletimessquare · 2012-07-22 02:15 · Score: 1, Troll

it's called propaganda
see: Fox News

--
intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
1. Re:works on people too by Toonol · 2012-07-22 05:50 · Score: 1
  
  Your comment is amusing, because by singling out Fox News, you're demonstrating that you're a victim of very successful propaganda.
2. Re:works on people too by circletimessquare · 2012-07-22 06:24 · Score: 1
  
  because I point to a source of propaganda can only mean I am a victim of propaganda?
  
  --
  intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
3. Re:works on people too by colinrichardday · 2012-07-22 08:49 · Score: 1
  
  He's saying that if you believe that Fox is the only source of propaganda, then you are a victim of the other sources of propaganda. Your citing Fox may not be singling them out, but just an indication that you believe that they are the worst in this regard.
4. Re:works on people too by gl4ss · 2012-07-22 23:42 · Score: 1
  
  no, it's more like a guy down the street yelling that the end of the world is nigh and you believing him despite fox news(the main source) telling otherwise..
  
  --
  world was created 5 seconds before this post as it is.
GIGO by Anonymous Coward · 2012-07-22 02:34 · Score: 0

Well, duh ... leave the learning on and the GIGO rule is active! Leave the learning off and people will figure out how to be ignored by it.
Nothing new here at all.
Known problem, known solutions by Kanel · 2012-07-22 02:43 · Score: 4, Interesting

There's already a whole subfield of machine learning which concerns itself with these problems. It's called "adversarial machine learning".
The approaches are very different from usual software security. Instead of busying oneself with patching holes in software or setting up firewalls, adversarial machine learning redesigns the algorithms completely, using game theory and other techniques. The premise is "How can we make an algorithm that works in an environment full of enemies that try to mislead it?" It's a refreshing change from the usual software-security paradigm, which is all about fencing the code into some supposedly 'safe' environment.
1. Re:Known problem, known solutions by Anonymous Coward · 2012-07-22 04:27 · Score: 0
  
  Also, this very same effect they mention happens in human intelligence...
  The first spam we ever saw, we likely thought was real to some extent... then we got wise to spam... the spammers changed their MO and some people were caught by surprise by the new approach... until they wised up...
  AI does not somehow exclude all the bad parts of real intelligence (and how it can be affected).
2. Re:Known problem, known solutions by betterunixthanunix · 2012-07-22 05:18 · Score: 1
  
  "How can we make an algorithm that works in an environment full of enemies that try to mislead it?"
  This sounds like it is closely related to secure multiparty computation, where the goal is to correctly compute some function on multiple parties' inputs without revealing those inputs. This has been researched since the 1980s, and there have been numerous results on feasibility and impossibility, as well as several practical systems (including at least one that was used in the real world). It is likely that both approaches can be used to solve the same set of problems, but that the machine learning approach is more natural for some problems and MPC is more natural for others.
  
  --
  Palm trees and 8
3. Re:Known problem, known solutions by node636 · 2012-07-22 06:32 · Score: 1
  
  Agreed. There already exist plenty of simple methods for identifying and removing 'bad' data. Currently they're usually applied to a static data set before sending it to the machine. It should be simple to implement algorithms that perform this computation at run time.
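
One way to read "perform this computation at run time" is an online outlier filter placed in front of the learner. The sketch below is an assumption about what such a filter could look like, not something from the paper or the comment: it keeps a running mean and variance (Welford's update) and rejects points more than `threshold` standard deviations from the mean. The class name, parameters, and sample stream are hypothetical.

```python
import math

class OnlineOutlierFilter:
    """Reject points far from the running mean before they reach training."""

    def __init__(self, threshold: float = 3.0, warmup: int = 5):
        self.threshold = threshold  # allowed distance, in standard deviations
        self.warmup = warmup        # points accepted unconditionally at start
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0               # running sum of squared deviations

    def accept(self, x: float) -> bool:
        if self.n >= self.warmup:
            std = math.sqrt(self.m2 / (self.n - 1))
            if std > 0 and abs(x - self.mean) > self.threshold * std:
                return False        # rejected points do not update the stats
        # Welford's incremental update of mean and squared deviations.
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)
        return True

stream = [10.0, 11.0, 9.0, 10.5, 9.5, 50.0, 10.2]
flt = OnlineOutlierFilter()
kept = [x for x in stream if flt.accept(x)]
print(kept)
```

A filter like this only catches crude poison. An attacker who nudges the data a little at a time stays inside the threshold and drags the running statistics along, which is roughly why the adversarial machine learning work mentioned above redesigns the learner instead of relying on a pre-filter.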
4. Re:Known problem, known solutions by Anonymous Coward · 2012-07-22 20:11 · Score: 0
  
  How can we make an algorithm that works in an environment full of enemies that try to mislead it?
  That sounds like a question that could drive internal security, regulatory compliance and consistency inside a corporation's firewalls.
Throwing sand into the gears of MI by Anonymous Coward · 2012-07-22 02:47 · Score: 0

I wonder how long it will take for Machine Intelligence Sanding to be incorporated into a sci fi flick:
"What are you doing? I didn't even know you liked to fish."
"Whenever I order weapons I also order something from an unrelated site. Besides, a box of streamers might come in handy"
Thanks for your order! Bass Pro Shops
Not very practical by ceoyoyo · 2012-07-22 02:58 · Score: 3, Insightful

So if you know the algorithm and training data, and you can feed the system new data with manipulated labels then you can confuse it. It's a little early to panic about your spam filter. Hopefully everyone realizes that if you let the spammers tell your computer what is and is not spam, they can cause it to let their spam through.
1. Re:Not very practical by Kjella · 2012-07-22 03:19 · Score: 1
  
  So if you know the algorithm and training data, and you can feed the system new data with manipulated labels then you can confuse it. It's a little early to panic about your spam filter. Hopefully everyone realizes that if you let the spammers tell your computer what is and is not spam, they can cause it to let their spam through.
  Well I assume that's why the spam/not spam buttons are there in my webmail reader, that somehow this goes into a form of feedback system. I'd not be surprised if spammers send spam to themselves, then flag it as not spam in order to confuse the system. Or signing up for stuff legitimately, then flagging it as spam anyway. Anything to increase the noise floor so they have to back off on filtering or lose genuinely wanted mail.
  
  --
  Live today, because you never know what tomorrow brings
2. Re:Not very practical by Anonymous Coward · 2012-07-22 03:33 · Score: 0
  
  Hopefully everyone realizes that if you let the spammers tell your computer what is and is not spam, they can cause it to let their spam through.
  I can guarantee you they won't. You could say the same thing about most malware. Surely nobody would be dumb enough to run CutePuppy.jpg.exe they downloaded from an unknown site in Romania, but people actually are that dumb. When it comes to computing, 99% of the public employs zero thought. There simply isn't any, beyond "click what pops up on my screen".
3. Re:Not very practical by Sqr(twg) · 2012-07-22 03:43 · Score: 1
  
  I doubt they would spend energy on this. Setting up fake mail accounts costs time/money, and even though the spammers as a collective might benefit from attacking the spam filter, it is more profitable for the individual spammer to use those accounts for sending spam.
  Also, a support vector network could easily learn that the "not spam" flag from certain users actually means the opposite.
4. Re:Not very practical by Anonymous Coward · 2012-07-22 04:01 · Score: 0
  
  Most spam comes from fake accounts. It costs almost nothing for them to set some up and use them to game feedback channels. Good filters learn how much trust to put into various feedback channels.
5. Re:Not very practical by ceoyoyo · 2012-07-22 05:48 · Score: 1
  
  I doubt it. Google's spam filter seems to work just as well as my local one, and spammers are definitely not managing my spam/not spam button.
  If that were the case though, it's an excellent reason not to use spam filters that spammers control.
Not new? by TheRealMindChild · 2012-07-22 03:10 · Score: 1

I know that email spammers have been exploiting this to make Bayesian filters less effective for the past decade.

--

"When life gives you lemons, don't make lemonade. Make life take the lemons back!" -- Cave Johnson
SVM != AI by SpinyNorman · 2012-07-22 03:38 · Score: 3, Informative

Support Vector Machines are just a way of performing unsupervised data partitioning/clustering. i.e. you feed a bunch of data vectors into the algorithm and it determines how to split the data into a number of clusters where the members of each cluster are similar to each other and less similar to members of other clusters.
e.g. you feed it (number of wheels, weight) pairs of a lot of vehicles and it might automatically split the data into 3 clusters - light 2-wheeled vehicles, heavy 4-wheeled ones, and very heavy 4-wheeled ones. If you then labelled these clusters as "bikes", "cars" and "trucks" you could in the future use the clustering rules to determine the category a new data point falls into.
This isn't Artificial Intelligence - it's just a data mining/classification technique.
1. Re:SVM != AI by lorinc · 2012-07-22 04:23 · Score: 1
  
  SVMs are primarily a classification technique that has been extended to clustering, regression, structured output learning (such as ranking), and so on. So yes, the max margin principle has been used in basically all the areas of machine learning.
  How do you argue machine learning is not AI? You know the vast majority of researchers and publishers in the ML field consider it to be AI.
  
  --
  Video of some good progressive thrash music
2. Re:SVM != AI by tommeke100 · 2012-07-22 05:10 · Score: 5, Informative
  
  Wrong. SVM is a supervised learning technique. It looks like you're talking about K-means clustering which is unsupervised.
  The difference between supervised and unsupervised is that in the first you use both features and outcome in your training of the system, where the unsupervised will just use the features. So supervised uses both X and Y to learn (if X are the features and Y is the class/cluster), whereas unsupervised will just use X.
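
The X versus (X, Y) distinction above can be shown in a few lines. This is a toy sketch under stated assumptions: a nearest-centroid fit stands in for a supervised learner (it is not an SVM), a bare-bones 1-D 2-means stands in for unsupervised clustering, and the vehicle weights, labels, and function names are invented for illustration.

```python
from statistics import mean

X = [1.0, 2.0, 3.0, 8.0, 9.0, 10.0]  # features (say, weight in tonnes)
y = ["bike", "bike", "bike", "truck", "truck", "truck"]  # outcomes

def fit_supervised(X, y):
    """Uses features AND labels: one named centroid per known class."""
    return {label: mean([x for x, l in zip(X, y) if l == label])
            for label in set(y)}

def fit_unsupervised(X, iters=10):
    """Uses features only: 1-D 2-means; the clusters come out unnamed."""
    centroids = [min(X), max(X)]
    for _ in range(iters):
        groups = [[], []]
        for x in X:
            nearest = 0 if abs(x - centroids[0]) <= abs(x - centroids[1]) else 1
            groups[nearest].append(x)
        # Keep the old centroid if a cluster happens to be empty.
        centroids = [mean(g) if g else c for g, c in zip(groups, centroids)]
    return centroids

print(fit_supervised(X, y))
print(fit_unsupervised(X))
```

Both find the same two groups on this easy data, but only the supervised fit knows which group is "bike" and which is "truck". That dependence on labels is also what a poisoning attack on a supervised learner exploits: corrupt Y (or inject X with a chosen Y) and the learned boundary moves.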
3. Re:SVM != AI by Anonymous Coward · 2012-07-22 06:11 · Score: 0
  
  Well, if you listen to Marvin Minsky, then ML is not AI...
4. Re:SVM != AI by Anonymous Coward · 2012-07-22 07:20 · Score: 0
  
  A SVM can be considered a trainable building block to build a simplistic AI but I would not call a simple SVM an AI on its own. It's far too dumb even if very useful for building trainable classifiers that can adapt to the problem at hand.
  To build an AI you would probably need some trainable machine learning tools to parse the raw sensorial input of your system and to cast it into a higher level representation (e.g. semantic image segmentation of matrix of pixels or semantic parse tree for text snippets). Then use that higher level representation to take decisions on what to do next.
  IMO AI requires some kind of embodiment of the system into an evolving yet actionable environment: a sensorimotor loop + some kind of reinforcement learning to interact with the environment and adapt to new situations based on delayed action feedbacks. The environment could be physical (e.g. if the AI controls one or several robots) or digital (e.g. if the environment is the internet or a simulated universe as in games).
  Many ML researchers don't bother with embodiment / reinforcement learning part while they are probably aware that their contributions could indeed be useful one day to build some kind of AI.
5. Re:SVM != AI by SpinyNorman · 2012-07-22 08:34 · Score: 1
  
  It depends on what level of (artificial) intelligence you're talking about. If it's amoeba level intelligence, then maybe ML can achieve similar results, but if it's rat or human level intelligence then obviously not.
  I think most people take AI to mean something that could minimally pass a Turing test, not a silicon slug.
6. Re:SVM != AI by Anonymous Coward · 2012-07-22 10:26 · Score: 0
  
  Well, if you listen to Marvin Minsky, then ML is not AI...
  http://en.wikipedia.org/wiki/AI_effect
Inflammatory self-aggrandizing self-advertising by fygment · 2012-07-22 03:47 · Score: 2

From the article, if you have access to the training data and know the learning algorithm, you can game the machine learning (SVM, not AI) system. How is that anything but self-evident non-news?!

--
"Consensus" in science is _always_ a political construct.
Shhhh.... by ibsteve2u · 2012-07-22 04:33 · Score: 2, Interesting

Stop talking about how easy it is to poison data collection efforts; you're going to kill the golden goose of those who insist that analyzing social data can allow you to pinpoint psychopaths and other "problematic" individuals before that goose ever takes to the air (on the wings of "black budget" funding, no doubt).

--
Orwell: "In a Time of Universal Deceit, telling the Truth is a Revolutionary Act"
And what about 'poisoned' neurons/nodes? by garyebickford · 2012-07-22 04:48 · Score: 1

A couple of commenters have noted that there is a branch of research related to defending against this - according to one it's called "adversarial machine learning". I've been casually wondering for some time about a related question, which is very relevant to the questions of using the various 'bottom up' AI systems like SVM and neural nets as models of human intelligence and of various complex adaptive systems ('living systems') including economies and polities (and evolutionary biology for that matter). If we look at these systems (both the real world ones and the mathematical models) as decision convergence models, what is the effect of nodes that make errors once, occasionally, frequently, or continuously? And how does a successful neural network that is dealing with a continuously changing environment accommodate an element/node that provides, for example, randomly varying responses? What about a node that 'purposely' provides poisoned responses - like a secret agent putting false data into the news? In a machine, those things may be manageable by simply starting over, but in a continuous system like a real brain, that is not an option.
I learned a while back that in the human brain, when a neuron's output signals become ignored (the output from its axons becomes weighted so low that it has no influence on the 10,000 other neurons it is talking to), it dies. The brain seems to act very much like a republic of cantankerous, disagreeable citizens arguing at many different levels (and with shifting alliances). But if one continuously shouts "We're all gonna dieeee!!!", pretty soon nobody listens any more.
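One way to play with the "nobody listens any more" effect is the classic weighted majority scheme: every node votes, and a node's weight is cut each time it turns out to be wrong. The sketch below is an illustration only, not a model of a brain or of an SVM; the three nodes, the truth sequence, and the halving factor `beta` are all invented for the example.

```python
def weighted_majority(rounds, truths, beta=0.5):
    """rounds[t] is a tuple of +1/-1 votes, one per node; truths[t] is the right answer.

    Returns the final weights and the number of wrong group decisions.
    """
    weights = [1.0] * len(rounds[0])
    mistakes = 0
    for votes, truth in zip(rounds, truths):
        # Group decision: sign of the weighted vote
        score = sum(w * v for w, v in zip(weights, votes))
        guess = 1 if score >= 0 else -1
        mistakes += guess != truth
        # Any node that voted wrongly has its influence multiplied by beta
        weights = [w * beta if v != truth else w for w, v in zip(weights, votes)]
    return weights, mistakes

truths = [1, -1, 1, 1, -1, 1, -1, -1]
flaky_rounds = {2, 5}    # rounds where the "occasionally wrong" node errs (made up)

rounds = []
for t, truth in enumerate(truths):
    honest = truth                                  # always right
    flaky = -truth if t in flaky_rounds else truth  # wrong now and then
    poisoned = -truth                               # always wrong, like a planted agent
    rounds.append((honest, flaky, poisoned))

final_weights, group_mistakes = weighted_majority(rounds, truths)
print(final_weights, group_mistakes)
```

In this toy the always-wrong node only tips the group decision while its weight is still high; after a few rounds its vote is too small to matter. A node that lies selectively, only when it counts, would be much harder to discount this way.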

--
It's easier to be a result of the past, but more fun to be a cause of the future! http://www.spacefinancegroup.com/
Minimum poisoned data by PPH · 2012-07-22 05:28 · Score: 0

The Texas board of education has a pretty good handle on the minimum amount of poisoned data it takes to affect learning.

--
Have gnu, will travel.
Cat and Mouse 2.0 by WOOFYGOOFY · 2012-07-22 06:00 · Score: 1

Cat and Mouse 2.0. Nothing new here.
Some restrictions may apply by Anonymous Coward · 2012-07-22 06:10 · Score: 0

From the paper:
"...we assume that the attacker knows the learning
algorithm and can draw data from the underlying
data distribution. Further, we assume that our attacker
knows the training data used by the learner;"
They characterize these assumptions as "unrealistic", which I think is about right in a real world setting.
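To see why those assumptions matter, here is a minimal sketch of what an attacker can do once they hold. It is not the paper's method: the SVM is swapped for a one-dimensional nearest-centroid classifier, the search is plain brute force over a grid of candidate points, and all data and names (`best_poison`, `train_set`, `val_set`) are invented for illustration.

```python
def centroid(points):
    return sum(points) / len(points)

def train(data):
    """data is a list of (x, label) pairs with label in {-1, +1}."""
    neg = [x for x, y in data if y == -1]
    pos = [x for x, y in data if y == +1]
    return centroid(neg), centroid(pos)

def predict(model, x):
    c_neg, c_pos = model
    return -1 if abs(x - c_neg) < abs(x - c_pos) else +1

def error_rate(model, data):
    return sum(1 for x, y in data if predict(model, x) != y) / len(data)

def best_poison(train_data, val_data, candidates, label):
    """Try each candidate point with the given label; keep the most damaging one.

    This only works because the attacker is assumed to know the learning
    algorithm (train) and the training data, and can draw validation data.
    """
    best_x, best_err = None, -1.0
    for x in candidates:
        err = error_rate(train(train_data + [(x, label)]), val_data)
        if err > best_err:
            best_x, best_err = x, err
    return best_x, best_err

train_set = [(-2.0, -1), (-1.5, -1), (-1.0, -1), (1.0, 1), (1.5, 1), (2.0, 1)]
val_set = [(-1.8, -1), (-1.2, -1), (-0.5, -1), (0.5, 1), (1.2, 1), (1.8, 1)]

clean_err = error_rate(train(train_set), val_set)
poison_x, poisoned_err = best_poison(train_set, val_set, range(-20, 21), label=+1)
print(clean_err, poison_x, poisoned_err)
```

Take away either assumption and the search has nothing to evaluate against, which is roughly why the authors call the setting unrealistic.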
Comment removed by account_deleted · 2012-07-22 08:23 · Score: 1

Comment removed based on user account deletion
Comment removed by account_deleted · 2012-07-22 08:43 · Score: 4, Informative

Comment removed based on user account deletion
Is the future like the past? by scruffy · 2012-07-22 15:45 · Score: 1

I'm not sure why this would be surprising. ML algorithms work best if the future behaves like the past, i.e. if it has the same probability distribution as the training data. Some algorithms can handle slow changes if they can continually get new training data, but large changes are a problem.
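A toy illustration of that point, with made-up numbers: a threshold classifier is fitted on "past" data, then scored on "future" data whose inputs have drifted by a small or a large amount. The data, the drift sizes, and the helper names are assumptions for the sketch, not anything from the paper.

```python
def fit_threshold(data):
    """Put the decision threshold halfway between the two class means."""
    neg = [x for x, y in data if y == -1]
    pos = [x for x, y in data if y == +1]
    return (sum(neg) / len(neg) + sum(pos) / len(pos)) / 2

def accuracy(threshold, data):
    correct = sum(1 for x, y in data if (1 if x > threshold else -1) == y)
    return correct / len(data)

def shift(data, delta):
    """Simulate drift: every input moves by delta, labels stay the same."""
    return [(x + delta, y) for x, y in data]

past = [(-2.0, -1), (-1.5, -1), (-1.0, -1), (1.0, 1), (1.5, 1), (2.0, 1)]
threshold = fit_threshold(past)

small_drift = shift(past, 0.5)
large_drift = shift(past, 2.0)

acc_small = accuracy(threshold, small_drift)        # old model, slightly changed world
acc_large = accuracy(threshold, large_drift)        # old model, very different world
acc_retrained = accuracy(fit_threshold(large_drift), large_drift)  # model refitted on new data
print(acc_small, acc_large, acc_retrained)
```

Of course, the retraining step that rescues the model here is exactly the door a poisoning attacker walks through.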
this is not ai by gl4ss · 2012-07-22 23:41 · Score: 1

it's just an elaborate filter program, which is far away from real AI.

--
world was created 5 seconds before this post as it is.
1. Re:this is not ai by k(wi)r(kipedia) · 2012-07-23 00:02 · Score: 1
  
  Real AI doesn't exist yet (and I'm not sure it will).
in other words by minstrelmike · 2012-07-23 06:01 · Score: 1

In other words, artificial intelligence is just as limited and varied as regular ole human intelligence.
Jeez. Who'd a thunk it?