Microsoft's Acoustic Caller ID Patent

← Back to Stories (view on slashdot.org)

Microsoft's Acoustic Caller ID Patent

Posted by samzenpus on Wednesday June 13, 2007 @12:46PM from the who-do-you-sound-like-today dept.

theodp writes "A new patent granted to Microsoft Tuesday for automatic identification of telephone callers based on voice characteristics covers constructing acoustic models for telephone callers by identifying words or subject matter commonly used by callers and capturing the acoustic properties of any utterance. Not only that, it's done 'without alerting the caller during the call that the caller is being identified,' boasts Microsoft in the patent claims."

8 of 185 comments (clear)

Min score:

Reason:

Sort:

Only Innovation: Real Time versus Offline? by Anonymous Coward · 2007-06-13 12:49 · Score: 3, Interesting

The only difference here (aside from what agencies have been doing since the 1960's) is that this analysis seems to be done in real time, rather than offline? I mean, haven't monitoring people been able to tell who is speaking based on sound synthesis since forever?
Can they detect how pissed off i am? by grahamsz · 2007-06-13 12:49 · Score: 2, Interesting

Anecdotally I feel like some companies answer the phone quicker if you talk to their automated system in an irate and condescending manner. Could just be me though :)
1. Re:Can they detect how pissed off i am? by qbwiz · 2007-06-13 13:00 · Score: 3, Interesting
  
  It could be true, too.
  
  --
  Ewige Blumenkraft.
Verification of identity by Nymz · 2007-06-13 12:58 · Score: 4, Interesting

What's the purpose of caller ID after I've picked up the phone?

If someone had acquired some of your personal information, and then tried to impersonate you, an automated voice recognition system could be useful by raising an alarm, or at least giving a percentage of how much their voice is like yours.
Patenting intelligence by Cafe+Alpha · 2007-06-13 13:07 · Score: 2, Interesting

The sort of processing this patent covers is something that hasn't been possible until recently, but I think, in principle, is something absolutely necessary for robust AI, and that is doing recognition simultaneously on both low level features and high level features of data and on intersections of the two.

By "high level" I mean things like word choice, language etc. By low level I imagine they mean things like the specific resonance characteristics of a voice. In voice there are intermediate levels of features too, such a the characteristics of phonemes.

The upshot of this is that just as algorithms and hardware begins to reach a level of power necessary to show intelligence, it will be impossible to do so without stepping on patents.

We will have patents on a machine not being stupid.
Comment removed by account_deleted · 2007-06-13 13:07 · Score: 2, Interesting

Comment removed based on user account deletion
'without alerting the caller....' by Anonymous Coward · 2007-06-13 13:36 · Score: 1, Interesting

The keywords being:

'without alerting the caller during the call that the caller is being identified'

Don't we have laws against doing stuff with voices without informing people first? And since when is sampling audio, and then converting part or all of the audio to a format based on, and unique to the original, not an act of recording?
So ... by Shadowlore · 2007-06-13 14:42 · Score: 2, Interesting

According to this:
Not only that, it's done 'without alerting the caller during the call that the caller is being identified,'

They are describing a means to RECORD callers without their knowledge, and hence without their consent. So would this software be illegal in some jurisdictions? You bet yer ass it would be.

Wonder how it handles people who say "uhm" or "uhh" a lot. ;)

--
My Suburban burns less gasoline than your Prius.