Slashdot Mirror


Microsoft's Acoustic Caller ID Patent

theodp writes "A new patent granted to Microsoft Tuesday for automatic identification of telephone callers based on voice characteristics covers constructing acoustic models for telephone callers by identifying words or subject matter commonly used by callers and capturing the acoustic properties of any utterance. Not only that, it's done 'without alerting the caller during the call that the caller is being identified,' boasts Microsoft in the patent claims."

34 of 185 comments (clear)

  1. Only Innovation: Real Time versus Offline? by Anonymous Coward · · Score: 3, Interesting

    The only difference here (aside from what agencies have been doing since the 1960's) is that this analysis seems to be done in real time, rather than offline? I mean, haven't monitoring people been able to tell who is speaking based on sound synthesis since forever?

    1. Re:Only Innovation: Real Time versus Offline? by dreamchaser · · Score: 2, Funny

      I dunno how useful this is. I usually just recognize the voice myself. Our wetware has some wonderful capabilities.

    2. Re:Only Innovation: Real Time versus Offline? by LiquidCoooled · · Score: 3, Insightful

      It might not be useful in a home environment, but how about in an office where after the initial greeting the customer details are popped on-screen without you typing anything?

      --
      liqbase :: faster than paper
    3. Re:Only Innovation: Real Time versus Offline? by Ctrl-Z · · Score: 2, Insightful

      Isn't that why they ask for my account number?

      --
      www.timcoleman.com is a total waste of your time. Never go there.
    4. Re:Only Innovation: Real Time versus Offline? by Nasarius · · Score: 2, Insightful

      I have. As I remember, it's one of the least painful parts of working tech support.

      --
      LOAD "SIG",8,1
    5. Re:Only Innovation: Real Time versus Offline? by omeomi · · Score: 3, Funny

      Isn't that why they ask for my account number?

      Good Lord, no. They ask for your account number just to irritate you because both you and the person you're talking to know damn well you had to key in your account number just 2 minutes ago.

    6. Re:Only Innovation: Real Time versus Offline? by Yoozer · · Score: 4, Insightful

      Me: "Thank you for calling, may I have your store's telephone number, area code first please?"

      You're the kind of rude, arrogant bastard that loses companies customers.
      Yeah, but only if he adds the insulting part to the call or changes tone with "your phone number?" (but that could indeed benefit from a "please").

      Part of the problem is that when you call any company, you generally get a barrage of useless information such as "This is Quux speaking, Foo and Bar suppliers, we now have a wonderful FooBar for only $39.95, how can I help you?" - which is swiftly (and rightfully) ignored because it's just noise; it does not solve the customer's problem. Make enough calls like that and they become a protocol.

      By trying to cut short and asking the phone number and area code, the call is made longer; for the customer it belongs to the last part of the first sentence and is therefore ignored, hence people will ask for it again. People generally want to identify themselves first after hearing contact has succeeded. To put it in computer terms; the AC is trying to cram the payload in the header of the packet and is then pissed off that the protocol doesn't support it.

      More effective would probably be "Thank you for calling, I'm Anonymous Coward, how can I help you?" - the latter part of this sentence ("I'm Anonymous Coward, how can I help you") will be ignored, but at least that information is completely irrelevant to the rest of the call. Then ask for important information, since that's actual payload.

  2. Can they detect how pissed off i am? by grahamsz · · Score: 2, Interesting

    Anecdotally I feel like some companies answer the phone quicker if you talk to their automated system in an irate and condescending manner. Could just be me though :)

    1. Re:Can they detect how pissed off i am? by qbwiz · · Score: 3, Interesting
      --
      Ewige Blumenkraft.
  3. Why? by Aoreias · · Score: 4, Insightful

    What's the purpose of caller ID after I've picked up the phone? I'm not going to talk to some challenge response bot if I'm someone who needs to be IDd and screened anyway.

    --
    We've upped our standards. Up yours.
  4. NOW I get it... by Penguinisto · · Score: 4, Funny
    ...they're looking to patent-troll the CIA!

    Brilliant!

    /P

    --
    Quo usque tandem abutere, Nimbus, patientia nostra?
  5. How is this different from existing voice recog? by Ngarrang · · Score: 2, Insightful

    I read the patent, but I guess I don't get it. How is what Microsoft claiming to do different from existing voice recognition systems?

    You have to train current voice systems so they recognize your voice pattern (or, acoustic ID) and translate it to text or action. Take that and add a system that keeps profiles for a more advanced version of caller ID. It seems like a natural evolution of the technology.

    --
    Bearded Dragon
  6. Verification of identity by Nymz · · Score: 4, Interesting

    What's the purpose of caller ID after I've picked up the phone?

    If someone had acquired some of your personal information, and then tried to impersonate you, an automated voice recognition system could be useful by raising an alarm, or at least giving a percentage of how much their voice is like yours.
  7. Err by OverlordQ · · Score: 3, Insightful

    Wont this most likely violate wiretapping laws in two-party states?

    --
    Your hair look like poop, Bob! - Wanker.
  8. For real? by Anonymous Coward · · Score: 2, Funny

    "Developers, Developers, Developers. I love this company, yeah" ** Sounds of flying chairs **

    Welcome to Microsoft patented caller Identity v1.0 beta
    Caller Identified: It's Steve... again
  9. Worse than IP broadcast TO THE WORLD! by Anonymous Coward · · Score: 2, Funny

    I had no idea someone I might call might be able to indentify me.

  10. Patenting intelligence by Cafe+Alpha · · Score: 2, Interesting

    The sort of processing this patent covers is something that hasn't been possible until recently, but I think, in principle, is something absolutely necessary for robust AI, and that is doing recognition simultaneously on both low level features and high level features of data and on intersections of the two.

    By "high level" I mean things like word choice, language etc. By low level I imagine they mean things like the specific resonance characteristics of a voice. In voice there are intermediate levels of features too, such a the characteristics of phonemes.

    The upshot of this is that just as algorithms and hardware begins to reach a level of power necessary to show intelligence, it will be impossible to do so without stepping on patents.

    We will have patents on a machine not being stupid.

  11. Comment removed by account_deleted · · Score: 2, Interesting

    Comment removed based on user account deletion

  12. Maybe... by theantipop · · Score: 2

    /. should just put an RSS feed to newly issued patents on the front page. Would cut down on the number of stories per day though.

  13. Wiretapping law by w9ofa · · Score: 3, Insightful

    It is my understanding that recording a telephone conversation is against the law in most states, without notifying the other parties on the line.

    Thus, a practical device for this patent would most likely be illegal.

    1. Re:Wiretapping law by VisceralLogic · · Score: 2, Informative

      If it processes it in real time, it doesn't need to record it, really. Just pass through in and out.

      --
      Stop! Dremel time!
  14. they think they can ... by twitter · · Score: 2, Funny

    ... but it works as well as their speech recognition. Between this, face recognition and kill drones OBL will be found and exterminated early and often. I hope it's not me next.

    The programmers put in an Easter egg, just for you. Whenever Twitter says "shit" into his cell phone, the official Microsoft transcript has "M$".

    --

    Friends don't help friends install M$ junk.

  15. Thus proving the rule by killmenow · · Score: 4, Funny

    To patent anything, follow these steps:

    1. Choose something already being done in the real world, anything really
    2. describe it with maximum verbosity
    3. add "on the Internet" at the end

    Tada! PATENT!

  16. Prior Art - Invented in 1968 by sk999 · · Score: 2, Insightful

    Inventors: Arthur C. Clarke and Stanley Kubrick

    First publication: 2001 A Space Odyssey (Released 1968). Heywood Floyd checks in to the space station:

    Female voice: "Thank you. You are cleared through Voiceprint Identification."

    http://www.imdb.com/title/tt0062622/quotes

  17. Actually, NSA will have prior Art by WindBourne · · Score: 2, Funny

    NSA has had real-time voice ID since before '96 and possibly longer. How MS got this patent is beyond me. Our system is soooooooo broken

    --
    I prefer the "u" in honour as it seems to be missing these days.
  18. So ... by Shadowlore · · Score: 2, Interesting

    According to this:
    Not only that, it's done 'without alerting the caller during the call that the caller is being identified,'

    They are describing a means to RECORD callers without their knowledge, and hence without their consent. So would this software be illegal in some jurisdictions? You bet yer ass it would be.

    Wonder how it handles people who say "uhm" or "uhh" a lot. ;)

    --
    My Suburban burns less gasoline than your Prius.
  19. Sneakers by Loconut1389 · · Score: 3, Funny

    My name is Werner Brandes, my voice is my passport. Verify me.

  20. Re:How is this different from existing voice recog by Alien+Being · · Score: 2, Informative

    "How is what Microsoft claiming to do different from existing voice recognition systems?"

    Existing voice recognition systems might be more acurately called speech recognition. They don't recognize the voice (who is speaking); they recognize the speech (what is being said). They can be categorized as speaker dependent or speaker independent.

    Speaker dependent speech recognition (type 1) requires complex training by each user. It needs to know all the ways a person pronounces every possible phoneme. During use, it must be given the name of the speaker and a sound sample. It gives back the name of the phoneme. 2 inputs, 1 output.

    Speaker independent speech recognition (type 2) is able to identify individual phonemes as spoken by a wide variety of speakers. 1 input, 1 output. That's what I would imagine is the important first step of what MS is claiming to do. Once a phoneme or two has been identified, the name of the phoneme and the captured sound sample can be fed to the type 1 algorithm and it would be able to output the name of the speaker.

    Functionally it's different than existing "voice recognition" systems, but I seriously doubt it worthy of a patent.

  21. Without the caller's knowlege? OMG! by TheTranceFan · · Score: 2, Funny

    ...it's done 'without alerting the caller during the call that the caller is being identified.

    ...Sometimes...when the phone rings...
    ...I answer it...and just listen...
    ...I hear the caller's voice and identify them by their voice...
    ...Then hang up without saying anything.

    How insidious!
    What. Is. The. Difference.

    1. Re:Without the caller's knowlege? OMG! by complete+loony · · Score: 2, Funny
      Them: "Is [insert partners name here] home?"
      Me: "Oh, hi [insert partners' friends name]. I'll go get her."
      Them: "How'd you know it was me?"

      Sheesh do anything with computers or on the internet and you can patent it.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
  22. Re:"Can you guarantee my safety?" by FSWKU · · Score: 2, Funny

    "The hot chick with the Uzi... can I get her phone number?"

    --
    "So after all this, you make my case for me. To end this stalemate, you must die..."
  23. Sounds cool, but... by fyrewulff · · Score: 2, Insightful

    I have Caller ID so I know who's calling BEFORE I pick up the phone, not afterwards.

    --
    "We need to get over this notion, that, for Apple to win... Microsoft must lose." - Steve Jobs, 1997
  24. 4th Amendment? by ivanmarsh · · Score: 2, Insightful

    Should I even ask? Does the 4th Amendment mean anything anymore?

    Cops bust a guy for video taping them and charge him with wiretapping and Microsoft is going to be recording my voice and compiling a profile of me and that's okay?

    Words I'm guessing it will be looking for by default: bomb, liberal, weed, nuke, bush, 1st Amendment.

    My tinfoil hat is starting to look stylish.

  25. Haven't I done this before? by gringer · · Score: 2, Funny

    A method and apparatus are provided for identifying a caller of a call from the caller to a recipient. A voice input is received from the caller, and characteristics of the voice input are applied to a plurality of acoustic models, which include a generic acoustic model and acoustic models of any previously identified callers, to obtain a plurality of respective acoustic scores. The caller is identified as one of the previously identified callers or as a new caller based on the plurality of acoustic scores. If the caller is identified as a new caller, a new acoustic model is generated for the new caller, which is specific to the new caller.
    Hrm, sounds familiar for some reason. Oh, wait... there's a phone call. I'll just go and pick it up.

    me: hello?
    caller: Hello, I'm Suzi Cheatem from Dewey, Cheatem, and Howe. I was wondering if you'd like to answer a few questions about your behaviour while using the Internet.
    I think hrm, this sounds like one of those annoying telemarketers
    me: Sorry, I'm not interested in speaking to telemarketers
    caller: It seems like you have identified me from a previously identified acoustic model. I'm afraid I'm going to have to tell Microsoft that you have stolen their idea. You can expect a bill from them within two weeks.
    <hangs up>

    Gosh, those telemarketers get stranger every time they call me.
    --
    Ask me about repetitive DNA