Slashdot Mirror


AI Hears Your Anger in 1.2 Seconds (venturebeat.com)

MIT Media Lab spinoff Affectiva's neural network, SoundNet, can classify anger from audio data in as little as 1.2 seconds regardless of the speaker's language -- just over the time it takes for humans to perceive anger. From a report: Affectiva's researchers describe it ("Transfer Learning From Sound Representations For Anger Detection in Speech") in a newly published paper [PDF] on the preprint server Arxiv.org. It builds on the company's wide-ranging efforts to establish emotional profiles from both speech and facial data, which this year spawned an AI in-car system codeveloped with Nuance that detects signs of driver fatigue from camera feeds. In December 2017, it launched the Speech API, which uses voice to recognize things like laughing, anger, and other emotions, along with voice volume, tone, speed, and pauses.

SoundNet consists of a convolutional neural network -- a type of neural network commonly applied to analyzing visual imagery -- trained on a video dataset. To get it to recognize anger in speech, the team first sourced a large amount of general audio data -- two million videos, or just over a year's worth -- with ground truth produced by another model. Then, they fine-tuned it with a smaller dataset, IEMOCAP, containing 12 hours of annotated audiovisual emotion data including video, speech, and text transcriptions.
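The two-stage recipe in the summary (train first on plentiful machine-labelled data, then fine-tune on a small hand-labelled set) can be sketched with a toy model. This is not Affectiva's network or data: the 1-D logistic model, the `teacher` rule, the cut-offs of 0.4 and 0.5, and every name below are assumptions chosen only to make the idea concrete.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def mean_log_loss(params, data):
    # Average cross-entropy of a 1-D logistic model on (x, label) pairs.
    w, b = params
    eps = 1e-12  # keeps log() away from zero
    total = 0.0
    for x, y in data:
        p = sigmoid(w * x + b)
        total -= y * math.log(p + eps) + (1 - y) * math.log(1.0 - p + eps)
    return total / len(data)

def train(params, data, lr, steps):
    # Plain full-batch gradient descent, starting from the given parameters.
    w, b = params
    n = len(data)
    for _ in range(steps):
        errors = [(sigmoid(w * x + b) - y, x) for x, y in data]
        w -= lr * sum(e * x for e, x in errors) / n
        b -= lr * sum(e for e, _ in errors) / n
    return w, b

# Stage 1: many examples, labelled by a crude "teacher" rule (not by people).
def teacher(x):
    return 1 if x > 0.4 else 0

machine_labelled = [(i / 100, teacher(i / 100)) for i in range(100)]
pretrained = train((0.0, 0.0), machine_labelled, lr=1.0, steps=500)

# Stage 2: a few hand-labelled examples; the true cut-off here is 0.5.
hand_labelled = [(0.1, 0), (0.3, 0), (0.45, 0), (0.48, 0),
                 (0.52, 1), (0.55, 1), (0.7, 1), (0.9, 1)]
fine_tuned = train(pretrained, hand_labelled, lr=0.5, steps=200)

print("loss on hand labels before fine-tuning:", mean_log_loss(pretrained, hand_labelled))
print("loss on hand labels after fine-tuning: ", mean_log_loss(fine_tuned, hand_labelled))
```

The second call to `train` starts from the pretrained parameters rather than from zero, which is the whole point of the approach: the cheap labels get the model roughly right, and the scarce, higher-quality labels correct it.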

51 comments

  1. Cortana and Siri by Zorro · · Score: 2

    Anytime Cortana or Siri pops up and gets in the way there will be anger!

    1. Re:Cortana and Siri by Anonymous Coward · · Score: 0

      Seriously - or any time you use a voice audio menu system.

    2. Re:Cortana and Siri by syn3rg · · Score: 1

      Fracking toasters...

      --
      The contents of this message have been doubly encrypted by ROT13
    3. Re:Cortana and Siri by Anonymous Coward · · Score: 0

      You mean people leave those enabled!?

    4. Re:Cortana and Siri by Anonymous Coward · · Score: 1

      Every time I get one of those worthless automated "assistants" instead of live customer support there will be anger. I guess the next step is having the automated "assistant" (it's not AI, sorry) determine the reason that I'm angry directly due to the automated "assistant" itself always being programmed to offer only simplistic choices that have nothing to do with my issue and wasting my time instead of getting a live person on the line. If the issue were as simple as the ones proffered by the automated "assistant", then I wouldn't have needed to call customer service in the first place.

  2. So what? by Anonymous Coward · · Score: 0

    Anger is a pretty primitive and obvious emotion.

    Maybe stop calling me twice a week to offer me fake hotel vacations and I wouldn't be so angry. Telling me to press 2 to stop hearing these calls, but calling me for a different product around the same time of day is a clear FTC violation.

    1. Re: So what? by Anonymous Coward · · Score: 0

      Fuck you!

    2. Re: So what? by Anonymous Coward · · Score: 0

      That's nice. Now you're just saying stuff because you can, damn it

  3. That's easy by Anonymous Coward · · Score: 0

    "i'm always angry"...

  4. It's quick, but not quick enough? by froggyjojodaddy · · Score: 1

    1.2 seconds sounds kinda long for anger detection

    I can detect anger in someone's voice practically immediately, even before they've finished the first word because as a human, I use a number of other clues e.g. facial contortion, body positioning, finger pointing etc.

    1.2 seconds to detect a change in pitch, volume etc. seems too long and I think that's the overall problem with artificial intelligence or machine learning - they're great for massive data sets that have common patterns (or used to build patterns) but not so good at immediate analysis of something new.

    1. Re: It's quick, but not quick enough? by Anonymous Coward · · Score: 0

      I think a second or so is fine for an AI. A human on the other hand can probably predict anger. For example if you say something that is likely to irritate, you can probably anticipate the reaction while the words are leaving your mouth

    2. Re:It's quick, but not quick enough? by Anonymous Coward · · Score: 0

      Ok, let companies install a camera to monitor your facial cues then.

    3. Re: It's quick, but not quick enough? by Anonymous Coward · · Score: 0

      Not allowed - I always keep my face where the camera is unable to see it

    4. Re:It's quick, but not quick enough? by alvinrod · · Score: 1

      I can detect it even quicker than that. After being out all hours of the night drinking with my friends, I already know that my girlfriend is going to be angry before I even talk to her. It's like the pre-cogs from Minority Report or something.

    5. Re: It's quick, but not quick enough? by bob4u2c · · Score: 1

      I just wear a Mike Myers mask, and I'm smiling the whole time underneath.

    6. Re:It's quick, but not quick enough? by mnemotronic · · Score: 1

      Q: How are you with detection without visual clues? Like over an average poor cell phone connection with lots of latency. Visual contact is, IMHO, an extremely important source of perception for humans. I suppose that's why we're such crappy internet communicators.

      --
      The Russians have won. They have made the world a cesspool of distrust, greed, fear and hate.
    7. Re:It's quick, but not quick enough? by Anonymous Coward · · Score: 0

      Sure, but only after we work out how much they are going to pay me for it.

      I would not mind SELLING my data to corporations, but I won't GIVE it to them.

  5. Maybe they should consider the fact by bobstreo · · Score: 1

    that I have a resting bitchy voice. Especially when not talking to a human that speaks English.

  6. "ground truth" by XXongo · · Score: 1

    " the team first sourced a large amount of general audio data ... with ground truth produced by another model."

    So, actually, the program wasn't detecting anger. The program was modelling what a different program detected in the signal.

    1. Re:"ground truth" by ceoyoyo · · Score: 1

      You neglected the next part, where they fine-tuned it using hand-labelled data. If you're training a system that learns (and that includes people) and you've got an automatic system that performs okay, it's often a good idea to do a first round of training on the automatic results. Then you come along with a smaller, higher quality training set to boost performance over what the existing automatic system can do.

      And yes, the term "ground truth" is usually used in a stupid way.

  7. When my wife gets angry by fluffythedestroyer · · Score: 1

    I can see the future, know what will happen to me, and know when she's angry, and it's a lot faster than 1.2 seconds. In fact, 1.2s is what it takes my wife to hit me or give me that death glare stare.

  8. I can code AI by fluffernutter · · Score: 1

    I can write AI:

    If volume_before * 1.5 < volume_now:
    then ANGRY!
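For anyone who wants to run the joke, here is a minimal runnable sketch of the same rule. The function name `sounds_angry` and the `ratio` parameter are made up for illustration; the 1.5 factor is the one from the pseudocode above.

```python
def sounds_angry(volume_before, volume_now, ratio=1.5):
    """Return True when the current volume exceeds the earlier volume by `ratio`."""
    return volume_before * ratio < volume_now

if sounds_angry(volume_before=1.0, volume_now=2.0):
    print("ANGRY!")
```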

    --
    Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
  9. My view on AI by fluffernutter · · Score: 1

    Here is my new view on AI: I think there are a bunch of people out there in industries that did not previously work with computers. Now they are applying common programming tests to variables that mean something in their world and it seems so magical that they call it AI.

    --
    Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
  10. I test for this with old man profanity by xxxJonBoyxxx · · Score: 2

    If I'm on a call with an automated tree (and I'm sufficiently alone), I often let loose a string of angry "old man" profanity while it's listening just to see if I get auto-routed to the agent. Hasn't happened too often, but it happens (most often with airlines/creditcards).

    1. Re: I test for this with old man profanity by Anonymous Coward · · Score: 0

      All I know is I was never surprised when my mom yelled at me

    2. Re:I test for this with old man profanity by Anonymous Coward · · Score: 0

      angry "old man" profanity

      goddamn nigers are taking over the country

  11. Even German? by Ogive17 · · Score: 1

    Unless Germans are angry 100% of the time and it's hard coded in the logic.

    --
    "Action without philosophy is a lethal weapon; philosophy without action is worthless."
  12. Oh come on. by Anonymous Coward · · Score: 0

    Any house pet can do better than that. You could probably train head lice if you put an artificial head with loudspeakers below them for picking up the vibrations.

  13. dogwhistle by Anonymous Coward · · Score: 0

    Now everyone knows you're a racist.

  14. It's about empathy towards computers by turp182 · · Score: 1

    Now they can sense the anger you have towards them for whatever reasons and say "No, please don't throw me out the window" while you are throwing it out the window (or smashing it with a hammer).

    --
    BlameBillCosby.com
  15. It's for IVR trees by rsilvergun · · Score: 1

    The idea is that they detect when you're angry and move you along to a rep faster. Yes, this means savvy folks will call in already angry, but honestly, if you're savvy and being forced into an IVR you're probably already angry anyway, since you're calling for a rep to do something you couldn't do online.

    --
    Hi! I make Firefox Plug-ins. Check 'em out @ https://addons.mozilla.org/en-US/firefox/addon/youtube-mp3-podcaster/
    1. Re:It's for IVR trees by froggyjojodaddy · · Score: 1

      IVRs are one of those antiquated things that I can't believe still exist. Worse are the 'voice recognition' ones. I have an English accent but live in Canada and I have all but given up on those travesties

    2. Re: It's for IVR trees by Anonymous Coward · · Score: 0

      I have issues sometimes when the filter filters out my voice for being too deep. Nothing really works, even with a human on the other end.

      Phones are just a shitty way of communicating with people you don't know when you care about accuracy.

  16. I don't think it means what you think it means... by Anonymous Coward · · Score: 0

    What will we call AI when we finally get it?

  17. This isn't AI by sdinfoserv · · Score: 2

    Stop calling everything a computer does "AI". 15 years ago, in 2004, I was an IT Director at a large call center that did both inbound (skills based routing) and outbound (predictive dialing). One of the features of our telephone switch back then was real time monitoring that could detect when someone would get agitated or use a "bad" word (like swearing). When pre-specified thresholds were reached or certain words were used, the system would call a supervisor and allow the supervisor to "ghost" (listen but not be heard), "whisper" (coach the agent without being heard by the caller), or take over the call. The terminology 15 years ago was real time monitoring with language recognition heuristics. It worked great then and it was commercially available; it wasn't "AI"...
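A rule-based monitor of the kind described here might look roughly like the sketch below. The word list, the 0.8 threshold, and the function name are all hypothetical and are not taken from any real telephone switch; the point is only that a person chose every rule by hand.

```python
# Hypothetical hand-written rules: a flagged-word list and an agitation threshold.
FLAGGED_WORDS = {"damn", "hell"}
AGITATION_THRESHOLD = 0.8

def should_alert_supervisor(transcript_words, agitation_score):
    """Alert when a flagged word is heard or the agitation score crosses the threshold."""
    heard_flagged_word = any(
        word.lower().strip(".,!?") in FLAGGED_WORDS for word in transcript_words
    )
    return heard_flagged_word or agitation_score >= AGITATION_THRESHOLD

print(should_alert_supervisor(["Damn!", "Where", "is", "my", "refund?"], 0.3))
```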

    1. Re:This isn't AI by r2kordmaa · · Score: 1

      Everything that gets done with neural networks is called "AI" these days. That's how it is, so get used to it, nobody is going to change it just because a few people keep calling "Stop". It's even somewhat apropos, it works based on (sometimes failed) training and pretty much nobody can adequately explain how, kind of like natural intelligence in humans.

    2. Re:This isn't AI by ceoyoyo · · Score: 2

      You've identified the difference: "The terminology 15 years ago was real time monitoring with language recognition heuristics."

      Heuristics are a set of rules used for decision making. In the context of algorithms, those heuristics are designed by a human and programmed into the system.

      "AI" is a nonspecific term, but if it means anything it means a system that learns from experience. Specifically, it does not use preprogrammed heuristics.
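To make the contrast concrete, here is a toy sketch in which the cut-off is picked from labelled examples instead of being typed in by a programmer. The feature (a loudness ratio), the example data, and the name `learn_threshold` are assumptions for illustration; real systems learn far richer models than a single threshold.

```python
def learn_threshold(examples):
    """Pick the cut-off with the fewest errors on (feature, label) pairs.

    An example is predicted "angry" when its feature is >= the cut-off.
    """
    candidates = sorted({x for x, _ in examples})
    best_cut, best_errors = None, None
    for cut in candidates:
        errors = sum((x >= cut) != bool(label) for x, label in examples)
        if best_errors is None or errors < best_errors:
            best_cut, best_errors = cut, errors
    return best_cut

# Hypothetical labelled calls: (loudness ratio, 1 if a person marked the caller as angry).
examples = [(1.0, 0), (1.2, 0), (1.4, 0), (1.9, 1), (2.3, 1), (2.8, 1)]
print(learn_threshold(examples))
```

Change the labelled examples and the cut-off changes with them, with no edit to the code. That is the sense in which the rule is learned and not preprogrammed.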

  18. Bruce Banner by bob4u2c · · Score: 1

    Did they test Bruce Banner, he's always angry? Hulk Smash!

    Also does it detect passive aggressive anger? What if I yell "I LOVE YOU" at a pet, vs I whisper "I'm going to put you in the microwave and set it on high for 4 minutes, ohh yes I am, such a bad doggie you are"? What is the algorithm keying on; volume, facial expressions, changes in skin tone, words spoken? And all they did was get close to what a human could do. Come on, I thought computers were faster. Get it down to 0.000001s and I'll be impressed.

  19. Wells Fargo peeps don't need software to do this. by TigerPlish · · Score: 1

    Wells Fargo customer service reps lately just assume everyone is pissed at them these past few weeks, especially yesterday and today.

    [stewie] Where's my money?! *WHAM!* Where's my money?! [/stewie]

    --
    The "Civilized World" jumped the shark ca. 1973.
  20. German by cascadingstylesheet · · Score: 1

    can classify anger from audio data in as little as 1.2 seconds regardless of the speaker's language

    It was just a coincidence that the German speakers were angry 100% of the time ...

  21. What's its false positive record like? by mark-t · · Score: 1

    Can it tell the difference between a raised voice because of excitement or strong feelings about a matter and a voice raised in actual anger?

    1. Re:What's its false positive record like? by Anonymous Coward · · Score: 0

      Can it tell the difference between a raised voice because of excitement or strong feelings about a matter and a voice raised in actual anger?

      The time between it deciding you are angry and it actually making you so will likely be small enough that its record will be pretty good.

  22. No, it's recognizing arousal by maiden_taiwan · · Score: 1

    Not possible. If someone screams "Fuuuuuuuck!!!" at the top of their lungs, there is no way AI can distinguish whether it's anger, pain, frustration, surprise, or even joy, because the source signal may be identical for all of them. At best, this system is detecting high arousal and possibly unpleasant mood.

    1. Re:No, it's recognizing arousal by djinn6 · · Score: 1

      It'll probably also fail on people who are angry, but aren't shouting it. E.g. "I've said everything that can be said. You will refund me, or you will see your entrails hanging out of your body by tomorrow. Have a good day sir."

    2. Re:No, it's recognizing arousal by maiden_taiwan · · Score: 1
  23. too bad by zlives · · Score: 1

    too bad it can't smell the fart in its general direction.

  24. How could it tell? by hoggoth · · Score: 1

    "SUPPORT. HELP. HUMAN. OPERATOR. GET ME A FUCKING HUMAN BEING YOU GODDAMN PIECE OF SHIT! "

    processing... processing... processing... anger detected 37% probability

    (im not yelling slashdot im not yelling... ok i am but its on purpose let this post go through...)

    --
    - For the complete works of Shakespeare: cat /dev/random (may take some time)
  25. I'm sorry Dave by goombah99 · · Score: 1

    This conversation can serve no further useful purpose. goodbye.

    --
    Some drink at the fountain of knowledge. Others just gargle.
  26. AT&T Or Time Warner by sycodon · · Score: 1

    That dumb butch they have doesn't understand shit.

    Then they go all fucking stupid pretending computers make some kind of bepop noise when thinking.

    Stupid fuckers.

    --
    When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
  27. Mod this to the moon by Anonymous Coward · · Score: 0

    To the crescent moon, Alice.

  28. scripted by Anonymous Coward · · Score: 0

    Now everyone knows you're an NPC.