Talking To Computers?
merlock18 writes "Is it un-natural to talk to a computer? After discussing the outcome of the Jeopardy game with some colleagues, they seem to think it is mildly 'scary' to talk to a computer and have it competently talk back. Is this what everyone thinks? I was thinking to myself how much I would like to be able to even tell my computer to open programs by telling it vocally. A simple idea that I am fairly surprised is not common. Am I a minority in this one? Do people just not like the idea of talking (without cursing) to a computer, let alone have it act or reply? Would anyone else be interested in building their own mini-Watson, or is this just scary?"
I can't speak for anybody else, but a lot of the time I don't *want* people to overhear what I'm asking my computer to do...
Open the pod bay doors, Hal.
Whatever you say Hal. On a Clive Cussler note, Hiram Jaeger.... eat your heart out!
Not only privacy but the standard office would sound like a bar of a busy Friday night. Can you imagine loud howard dictating a document just over the cubicle wall?
when swearing at them improves their performance.
All your database are belong to U.S.
Just give it 30 years. Once it becomes publicly available, it only takes one generation for society to get used to new tech.
Personally, I find it impressive but annoying. I'm already driven nuts by people talking on cell phones all day, and I don't want to hear and endless stream of command instructions, either.
LCARS. Yes, talking to a computer would be weird, but could also be awesome!
I wonder if there has been any research on the uncanny valley for speech...
I have no problems talking to computers, I do it all the time when trying to fix 'em. The day they can understand what I'm saying *and reply in kind* is the day I find a new profession. Veterinarian perhaps...
Every mac OS since 10.0 has had speech recognition - I had some fun with it when it came out, but lost interest after a while. My disenchantment may have had something to do with having to vocalise (for all to hear) every command I made - and can you imagine the yammer of a roomful of computer operators? I'm looking forward to thought-recognition software.
No, no sig. Really.
ThePromenader
There is no new revelation about talking to machines, it has been going on since man looked outside of the sacred relationship for some semblance of reasoning about his existence and meaning in life. The Jeopardy 'thing' with IBM was just another step in the direction you can't deny, resistance is futile.
Somebody "texted" the Jeopardy "answers" to Watson. Watson's voice synthesis was very high quality, but it did NOT use speech recognition to understand the "answers." That requirement would have resulted in an entirely different outcome.
So, while Watson's ability to play the game at all was a great feat of software engineering, it wasn't quite a level playing field. It will probably be a while before we can really converse with computers.
The first thing I do when a phone operator robot asks me to say "English" for English or "Espanol" for Espanol, I push all the buttons to see if I can get to a number-based menu, or at least hurt the robot's ears. Saying "English" and waiting for it to confirm that I said English is not faster or more convenient than hitting 1. It's not scary, but it's a computer, and I'm not going to pretend it's not.
Saying, "Open a command prompt," is in no way more convenient, faster, or easier than slamming the mouse to the lower left, clicking, and typing cmd.exe. Having it say, "OK, here's a command prompt," afterward would just be annoying.
Maybe I'm just not picturing the right use case.
IBM's VoiceType Dictation was doing this back in 1998 or so.
People talk to their pets all the time, and although most pets have just as much of a chance of understanding what's being said as most computers, that doesn't strike people as odd.
If the dialogue flows like Lt. LaForge talking to the Enterprise, I'm all for it - but that would be for managing a complex system where it would be impossibly slow to manage anything from an interface like a keyboard or panel.
This gets very close to Kurzweil's idea, however, that someday a machine will be able to convince a person that it is thinking, even if it really isn't. Very quickly after this we drop off the Singularity precipice.
1) How will you play music in a machine, which is responding to voice?. If a song has a word "SHUTDOWN", the computer's microphone will respond to that and might power off the system :P
2) If anyone else is in the room where the machine is powered on, you need to think twice before talking with the other person. The microphone will interfere.
It seems to me that voice recognition is not the most efficient way to interact with a computer, especially when the user interface is well designed. For complicated tasks, and for interacting with computers where you may not have a normal desk or terminal, perhaps. As far as voice-to-text, if the recognition is accurate, it can possibly increase productivity depending on the person and their typing skills. On another not, however, this is a way for paralyzed individuals to interact with computers without the use of traditional means. However, using voice interaction in tandem with other means could be a more efficient route. Having a computer run commands in the background via voice commands while you interact with it in more traditional ways in the foreground.
Do people just not like the idea of talking (without cursing) to a computer,
Why, would it be politically incorrect?
Questions raise, answers kill. Raise questions to stay alive.
Speech recognition sucks and always will. I can hotkey my way to any program faster than I can say the name of it. Simply double clicking and icon is super easy. Why do I want to have to say "Computer! Open! Porn!!!" when I have a shortcut to all my porn on my desktop? it doesn't even make sense. And entering urls? It would take 10min just to get the url at the top of this article in.
On a related note: I fucking hate teamspeak. If I wanted to talk to you retarded assholes I'd call one of those party lines. Fuck that, I want to play a video game. I don't want to talk to people. For whatever rudimentary communication I need I can type.
It's only a matter of time before some corporation mines data from a nation full of ever-listening computers.
Grew up on startrek, so of course computers that talk back are pretty much expected eventually for me
Someday speech will be an important input method. But not any time soon.
If you have to wear a microphone it isn't ready yet.
If you have to use a PTT switch it isn't ready yet.
If you have to repeat or cancel more than 1% of the things you say it isn't ready yet.
If you have to spend as much time proofreading dictation it has taken down and correcting the mistakes, it isn't ready yet.
If you have to speak in an unnatural way it isn't ready yet.
If it won't work in almost any environment it isn't ready yet.
Democrat delenda est
I already talk to my computer and it types what I say on the screen. Most of the time it hears and types very well. But I really want it to Read-My-Mind and know what I want to do next. Show me what I am looking for, help me find what I cannot remember, and give me suggestions. I would love a computer who laughed at my strange sense of humor and just agree with me most of the time. A really GREAT computer would be better than your best friend and always doing all it could to make your life more enjoyable.
Computer says 'no'.
The people at Nuance (used to be ScanSoft/SpeechWorks/etc.) have been doing it for years. I'm sure most people on Slashdot have heard of Dragon NaturallySpeaking. The newest version understands more commands than you probably care to use --stuff like "Open Firefox" ... "Send an email to Mike" ... you can browse the web by voice and control the mouse. Some people swear by it, but speech still doesn't seem like a natural computer interface. At least not yet. Nevertheless, the software is out there.
User: Open File.
Computer: Bite my shiny metal ass!
User: Delete file
Computer: Kill all Humans!
User: Eject disk.
Computer: Please insert Liquor!
One of the reasons I use a computer is because I don't have to talk to it in order to tell it anything.
People talk too much. If you want to talk to your computer, fine. But I don't want to hear it.
I have a PC running Windows; cursing is unavoidable.
[I spend the rest of my time talking with my stuffed animals.]
Would a Mini-Watson be small and wear a monocle?
It must have been something you assimilated. . . .
"I was thinking to myself how much I would like to be able to even tell my computer to open programs by telling it vocally."
Well damn. What a shame you weren't using KDE a few years back. Kvoicecontrol worked quite well for me back then.
The one about talking to yourself. You don't have to worry about the guy who talks to himself. Worry about the guy who is arguing with himself, especially if he's losing. Same hold for talking to a computer.
Did you know 80 to 90% of the moderators on slashdot wouldn't recognize a troll even if one dragged them under a bridge.
If it can talk, and you smack it, what happens?
Nowadays schizophrenic people talking to themselves while walking or being on a train are everywhere thanks to bluetooth technology. Where is the funding for mental health clinics ?
If it's every truly going to gain acceptance on the desktop, it needs to just work for mobile. Once people get used to asking their phone for directions, or the weather, or whatever, that'll open the door to asking your desktop computer for things. And, yes, your phone can do some of this today, but it's still not quite natural speech.
Watson was not able to understand any of the talking that was going on including the questions.
He repeated another contestants answer.
gawk; grep; unzip; touch; strip; init,
uncompress, gasp; finger; find,
route, whereis, which, mount; fsck; nice,
more; yes; gasp; umount; head, halt,
renice, restore, touch, whereis, which,
route, mount,
more, yes, gasp, umount, expand, ping,
make clean; sleep
Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
Only if the voice input was limited enough, like text input, which we have had for 10+ years, really. I can remember running a Speech to Text program on my 486 that would try to read what I was saying and turn that into a text document.
Google Voice does the same thing for voice mail.
Both suffer from a huge problem: accuracy. A conversation with a human doesn't need every word to be perfectly accurate, but something more than a text message is going to fail if some part of the vocal command is incorrect.
"Did you mean 'Delete *.txt' or 'Delete startdottxt' or 'Delete start.txt'?"
If I have nothing to hide, don't search me
(a) Accuracy, (b) Efficiency, (c) Privacy, (d) Noise pollution.
We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
It's creepy if the computer is trying to pass itself off as a person, because fakeness in social interactions is creepy whether it's a Wallmart greeter or a computer program being fake. If the computer is plainly just presenting itself as a voice interface it won't feel creepy for very long if at all.
Much like the animation of human features, there's an uncanny valley in communication that can provoke a strong xenophobic response. If someone or something can respond to *some* conversation but not all conversation, it tickles something deep in my brain that produces an instinctive reflex of distrust and hostility. Watching Watson, I found that the way the interaction was framed as if it was natural conversation put me into this uncomfortable zone where I found myself thinking "KILL IT WITH FIRE" more than once.
Watson received the other two contestants' incorrect responses via speech recognition and used them to narrow down the correct response.
if I was the only person paying attention at the start of those Jeopardy episodes. They stated clearly that the questions were fed to "Watson" as a plain text file. There was no speech recognition involved at all.
CB radios have long solved this: push to talk. Squeeze the mic when you want to give a voice command.
I think people find it "creepy" because they've never done it. If it was implemented well on most computers, people would get so accustomed and welcome to it that it would be a huge step back for them to go back to manual input.
Don't trust a bull's horn, a doberman's tooth, a runaway horse or me.
I, of course, am now officially older than dirt. A couple of years ago, when I finally got my iPhone, I got the Google search app of course. I used it, it worked, I liked it. When I put the damned phone down, I thought, "If somebody had handed me this when I was fourteen I would have thought it was a phony Hollywood prop." That was when I decided that computers should only be addressed by means of picking up the mouse, pressing one of its buttons, and speaking clearly and distinctly into it in a fake Scottish accent.
"Here's what's happening. You're starting to drive like your Dad..." - Red Green
I thought the movie AI actually was pretty good at wondering about this very thought. If you haven't seen the movie, I thought it was very thought provoking on the idea about what the world might be like if computers ever became super advanced.
There are voice actions already, but it takes so much more computational power to make it really-really fast, recognize any accent (which google is having a very hard time with right now) take context into account, and be able to intelligently ask user for clarification. So, I guess in about 5-10 years it will get to the point of Star Trek, where you can address computer and not worry about speech pattern or performing deletion of all files when you said "delay all files".
Generally voice interface is more efficient when we can't type/select something or when a short-hand is too difficult to program.
Examples -- car systems and hands-free phones ("Call John Smith at work"/"Navigate to 12th street" - until autopilot works flawlessly typing that stuff is difficult), directories, where saying a name would be faster than flipping though a large list without knowing how exactly it's spelled.
If you can type, it's faster to press control-o and click enter than say "Computer: open last used file" and get a confirmation. But if you don't know shortcuts and having difficulty with movement, speech recognition is already your only option.
For typing I'd rather prefer a neural short-hand: you don't bother family (or are forced to lock yourself in a soundproof room to dictate a large amount of text) and you eliminate the slowest part of your "think -> type -> text" route.
Hyperom.com
the phone based systems that use this suck and some times you need to go nuts on it just to get a real person.
Quantizing instructions into unambiguous, 100% clear communication is the pride and joy of the push-button. Vocal communication, especially natural language communication is fraught with ambiguity.
Think about it this way. Listen to your own conversations with humans for the next week. Count how many times you or your listener asks "say that again", "can you repeat that", "what do you mean by that". Then watch how many keystrokes you miss in the same week.
I'm cool with a button that launches a nuclear warhead. I'm really cool with a button behind a cover. And I'm super cool with a button behind glass and a hammer to break the glass. I'm frightenned if the button is voice-activated -- even if the cover and glass and hammer are voice activated too.
But think about the benefits of going the other way. If you could push a button to tell someone to do something. Wow. They'd call it text messaging.
Incidentally, welcome to written instructions -- which went typed for improved clarity and legibility. There's a reason that contracts are composed in text, and not in speech. Could you imagine a voice recording of a contract?
But there are plenty of voice-activated solutions to computer interaction. Dragon Naturally Speaking is one of the most reknowned. I used it in 1996, and it's become way better since. Use it for a week, and you'll discover that your own voice ins't anywhere near as efficient as ten fingers. And you'll find that you don't speak clearly at all -- and that your friends and family just guess what you're talking about, because it never really mattered before.
Think of all of the food you've ordered at restaurants, and the number of times the server mis-heard you, and brought the wrong something. Now remember that using a computer is about giving thousands of discrete commands in a given hour. Take the same percentage, and tell me how many times your computer is going to mis-hear you. That's how annoying it'll be.
Oh yeah. Background music makes that even worse.
Think about driving with voice commands -- turn how much left how quickly?
But you can easily simulate this. Get a friend. Have them use the computer, and you tell them what you want -- in commands that would be unambiguous if they heard you correctly. Try it for a full work day. See if they open the correct applications, hit the roght commands, and do the right stuff. You'll find that vocal communication is about conveying abstract thought and complex concepts, not directions nor instructions.
Which is why technical writing like operation manuals and really good recipe books are a very different kind of reading. And then you'll see the ikea picture/glyph instructions, which make even less sense.
Voice recognition has taken the wrong approach from the beginning. The computer listens all the time, then tries to decipher a command in the middle of all kinds of stuff. That's unrealistic. If you gave it a name, an unusual name phonetically (Like "Esmerelda") and only had it follow a command after it heard it's name, then turn off again, then I think we could have working voice recognition RIGHT NOW. "Esmerelda, check my email" "You have 7 new messages". "Esmerelda, play music" Then poof Audacious opens up and plays some mp3s.
Can somebody build this app and put it in the Linux Mint repos please? Thanks in advance.
I hold very few opinions. I hold information based on observation and fact. If you wish to disagree, please use facts.
Back in the days (must be at least 10 years ago by now) I used the dragon engine from philips to control my pc. To be honest, it had a cool factor but got old soon. Not only was it quite inaccurate at the time, but I found it to be slow in comparison with just mouseclicks. Besides, at night when the family sleeps I don't want to be making to much sounds.
is it possible to talk with an computer. I could not recognized
tea, earl grey, hot :D
There have been good voice-based systems. Wildfire (audio demo) was a really nice voice controlled phone system. The original version, from the late 1990s, used a lot of CPU time and Wildfire accounts were very expensive, about $5 to $10 a day. As CPU power got cheaper, the technology became more available, and Orange offered it on mobiles for a while. Then Microsoft bought the technology, Microsoft never did much with it, although part of it ended up in OnStar Virtual Advisor.
Then do so; Windows Vista/7 has out-of-the-box speed recognition with the ability to launch programs once it is sufficiently trained to your voice.
OS/2 is dead. I thought everyone knew that.
Scary is certainly not how I would describe it. What is so "scary" about it? Anyone who is trembling in fear about speaking to a machine which simply is parsing/processing what is said (and possibly providing feedback in the same manner) needs a reality check.
If it's more efficient and conductive to the environment, go for it. If not then don't.
Opera already provides a speech interface. Mobile phones routinely provide voice dialling and similar functionality. This technology is already here. Neither is it reliable enough nor is it more efficient than just using a mouse/keyboard. I expect that the main driver behind this is accessibility rather than UX. As for the computer talking back, it's a lot easier to get the computer to say something rather than get it to process human speech input.
I had a similar discussion recently - at the bar, of course. The general consensus was that this sort of technology would find itself a nice niche in home automation. Being able to host a party and say "House, Dave here thinks (insert socio-political concept here) means X. Is that true?" and have a disembodied computer voice say "No. According to available sources it means Y," in addition to "House, play Charlie Parker half-volume and dim the lights" would be great.
Eh, you do realize it wasn't actually listening right? The questions were entered through old-fashioned typed text (off screen obviously), not much different from how you would type something on Wolfram Alpha or even Google. The point of the demo was to show of it's capability to search and analyze a huge data set, not it's voice recognition or processing.
Also keep in mind that the eventual purpose of this beast is to diagnose patients. The show is an important part in making the general public more comfortable with machines instead of doctors telling them what's wrong and what to do about it.
The problem is not technology. The problem is the relationship we wish to have with our technology.
If I want to deal with a nail, I have two options: use a hammer and do it myself, or I could hire someone else to deal with it. Both methods entail a certain amount of power. In the first case, I have the power because it is my own skill and my decisions, my own abilities which complete the action. In the second case, it is my power over others that compels the action to be completed.
Computers can be our tools, amplifying our actions and decisions, allowing us to do things that we could not do unaided. Or computers can be our servants and assistants, performing actions on our behalf, achieving the goals we set out, but following machine decisions and using hidden processes to achieve those goals. These are two very different types of relationships. Physical manipulation emphasizes the computer as a tool. Voice interaction emphasizes the computer as a servant. Culture and aesthetics will determine whether we want to talk to our computers—not technology.
I don't know about you, but I wouldn't want to talk a computer. First off it would know everything, and if it doesn't know the answer it there's the internet at it's ports to find the information. so what would be the point in communicating with one other than to have it do something you can't? I wouldn't find it weird that a computer could one day be teaching my kids in school but to realistically believe that you could have a conversation with one and it would respond in a non robotic response is laughable, in this day in age and I'm sure most people would be afraid of it. Hollywood doesn't paint a pretty picture for computers that can talk back And for most of the people I talk to on the phones who seem to have never grasped that computer are going to be around and you need to know how to use one or your going to get left behind I'm sure would be afraid of anything they can't comprehend and would love to just blow them up and go back to the stone age.
This is a Mac, what you have there is an embarrassment to your fellow computer users.
You can tell a Mac to open programs and many other things by voice for well over 10 years now, and almost nobody uses it. You can tell an iPhone to make calls and many other things by voice for about 2 years now, and people use it only when forced, like when driving.
I think one reason this kind of thing is unpopular is it makes us realize how stupid computers are. Not even as smart as a small child. You have to construct your sentences even more carefully.
But I think the main reason it's unpopular is the computer is a brain enhancement prosthetic for our own brain, not another person's brain. It is a part of us. We don't want to hear the other half of our brain or our brain enhancement prosthetic talk to us.
However, I think we are willing to anthropomorphize some programs. For example, people do that with Google. That is a separate brain you call out to and ask a question.
on my android devices already. hold the search key say "not to self" and an email box pops-up i say my message and it email's me, say "send sms to mom, message" and it sends "message" to mom i would love to have similar functionality on my pc. i can see it now 15-20 years "PC load half-life episode 3" "error not released yet"
On the contrary, I think it's the most natural way to interact with a computer. Back when science fiction authors first started trying to imagine what computers of the future would be like, they almost always imagined speech to be the interface. It was only experience, with the primitive 20th-century interfaces that we had, that habituated people to thinking that keyboards and mice were the way to interact with a computer. The command line and GUI are un-natural ways to communicate, which we've gotten used to; speech is a natural way to communicate, which (in the context of human-machine communication) we've gotten un-used to.
(Note: I actually find CLI and GUI to offer many advantages over speech interfaces and don't expect them to ever go away now that we have them...however, if speech processing had been practical as the next step after punch cards, I don't believe CLI or GUI would ever have been developed, at least not as primary interfaces with the amount of depth they have today)
"I don't care about the Constitution!" --Bill O'Reilly, November 17, 2009
Consider that in everyday human interactions, very little is actually spoken which depends on total accuracy. A great deal of human speech is intended to convey a point of view, rather than dictate a command. How often do we state, "I'm hungry. Where do you want to go for lunch?" Does this sentence have any meaning for a computer? Even when we limit our computer interactions to data entry, the human voice is still non-ideal. Consider how tedious and error-prone a conversation between actual humans can be while attempting to give directions to your home. We often repeat and confirm information over the telephone, as well. Add to that, the fact that languages are not constant and few, if any, humans speak precisely and use words accurately based on their accepted definitions.
Today, most computers present information visually, so its not a leap to expect the input to happen visually as well. GUIs present lots of information simultaneously, and we select from the visible targets with a simple click. Visual display seems to demand visual input - not verbal input. Visual interfaces are also highly stateful. We move things around, toggle things, check and uncheck things, etc. We don't have to remember this myriad of states because we can use our eyes to keep ourselves apprised of the computer's state. A conversational interface, on the other hand, would require the user to keep track of states without a visual aide. This may be too much to ask. The applications should therefore be stateless - single query, single response. (I'm skipping over certain hybrid applications, such as navigation systems, which could accept verbal command and respond visually)
In other words, we need to limit vocal interaction to applications which don't require a lot of output, as opposed to applications which depend on a screen full of information. Applications that eliminate the visual display are, by nature, simple UIs because there is no medium to present complex data. This suggests that verbal interfaces are well suited to tasks that can be completed in a short amount of time. Ex: "Computer! Where is the nearest hospital?" "The nearest hospital is 2 blocks ahead on the left-hand side." Verbal interfaces are well suited to these types of information query interactions. For most other things, verbal interfaces fall short.
I suspect that in the long run, history will repeat itself. The punch card gave way to the command line, which gave way to the GUI, which will give way to the talking computer. Each iteration made the previous less useful to the general computer user. I'm talking about consumers here, not computer enthusiasts. Enthusiasts still use the command line, but pair it with the mouse for certain special tasks, such as selecting blocks of text. GUI enthusiasts will find uses for voice input which augment the clicking experience. The rest of the population, who may only use computers for narrow purposes: to social network, watch videos, and search for answers to questions; will find it much easier to get away from their desks and use a talking interface for these tasks.
"Oh my God! Computer! Unfriend Becky!"
I'm sure people were scared to get into a horseless carriage at one point, since there was no horse. Frickin' horseless carriages how do they work?
Anyways now people are scared to get on horses. I assume the day will come when people are not only scared of using command line interfaces but also using a computer that only has kb and mouse inputs instead of audio or some other "new" input method.
Beware of those who profit off the docile and persecute the unbelievers.
My first experiences with speech recognition began back around 1990. A computer company I worked for was experimenting with voice control. After being misinterpreted repeatedly, the person trying to use it would inevitably adopt a tone that was either (A) pleading, or (B) infuriated. It was always entertaining to listen to, and my favourite joke of yelling "FORMAT C COLON! YES! YES!" over the cubicle wall never lost its hilarity for me.
Then there was the submarine simulation game I purchased, which taught me that, contrary to my expectations, short utterances were more frequently misinterpreted than longer phrases. I recall giving the succinct command to "DIVE", to which the game responded "We don't have a periscope sir".
I was impressed when dictation products like "Naturally Speaking" came along, but ultimately decided that even 98% accuracy wasn't good enough. Correcting two percent of things that are recognized incorrectly is tiresome, and negates any time saving you would have gained.
Today I use voice dialing on a number of so-called "smart phones", and the frustration is about the same as it was 20 years ago. I need to deliberately mispronounce my own surname when trying to call family members, and mimicking the phone's own mispronunciation of my name doesn't help.
I think were still a long ways away from having any meaningful (non-aggravating) chats with our computers
.
the IBM Watson did not hear the words and convert them to text, he (it) was sent text at the same tie the ??? was displayed on screen for ppl to see/read.. but ya if computers could get things right like Watson then it would be nice to have on speed dial to ask ?? to.. but lets be clear asking IT ??? is not talking, chating nor even close to having a conversation as that takes thought insight, opinion and a point of view.. something even Watson lacks.
Today: Opening nano by talking to your computer. Tomorrow: Hiding in the mountains from terminators.
The best way to predict the future is to invent it. -Alan Kay
It's not scary because it's not really possible. Even Watson had the questions input before hand - and if a whole show that is dedicated to the process can't even produce a computer that can be communicated to in natural English, at the cost of hundreds of thousands of dollars (millions?), there's no way the average consumer is going to have a system capable of understanding them in a natural environment.
We simply aren't there yet. Talking to a computer is at best a frustrating, inaccurate experience. You get better concept recognition out of a 2 year old than you do out of a computer as a general rule. Specific situations and specific topics are/can be well understood by computers, but beyond that, computers aren't even close. Heck, I don't even know of a computer that can parse written natural English with anything approaching a 7 or 8 year old's ability to understand what's being discussed. With the vagueries of speech added in, a computer has no chance currently.
So is it scary? No, merely frustrating or curious/interesting at best. Useful or functional? Only in limited situations. I will be happy to talk to my computer when it can understand what I say with the same confidence level as an average adult, but until then, it's easier/faster/more efficient to communicate with the computer in a method IT can understand.
A couple of years ago I thought about gluing a whole-house mic system to a text-to-speech engine, allowing me to do simple queries like 'define ', 'weather tomorrow', 'traffic on I-15'... Give the system a unique name ('squizzlesauce?') and make that the key for temporarily enabling speech recognition("squizzlesauce, what's the weather like tomorrow"). Whip up a few scripts to glue the voice-recognition engine to google, a TTS engine for parsing and speaking web results... Then I got a laptop with WiFi and lost the need.
http://www.masturbateforpeace.com/
I've wanted something like this for a long time. I want to be able to post questions to my computer, such as "when was such-and-such a film made?", or "when was such-and-such a person born?". Questions that you could fairly easily look up yourself, but if you're, say, in teh kitchen, and the computer is in the other room, you want to be able to throw the question out there and get a response. I tried to write something like this myself, but unless you specify a grammar (which is difficult if you want to include freeform words like people's names and the names of movies and books etc), then the speech recognition is simply terrible. I tried two libraries (or was it three?) and they all had similarly horrible results. With a pre-defined grammar they did great. Without one, the results were atrocious.
tl;dr: I want it, but I can't seem to get it to work.
Move sig!
I dictated some technical documentation using Vista Voice Recognition. It works OK if you are trying to tell a story like "The Three Little Pigs" but when you try to do something technical it will loose its mind. It was almost impossible to dictate in Word. I would be speaking and say a word that had a possible command meaning and off to the races we would go.
For example I would say something "be careful of the formatting here.." and the formatting function in Word would come up. If you ever need to use this function, try using Notepad rather than Word. Also when running, it seemed to spend so much time trying to figure out what command you wanted that it was slow in Word. God help you if you were writing a book to describe how to use Word.
They need a feature to turn the command interpretation off. Not sure if this is an issue on the Mac as I had not yet tried it on that machine.
Back when I was demoing Windows 7 on one of my machines, I tried to get it's built-in voice recognition working, which I DID, but it was butchered beyond usability. Ironically, the simpler low-tech voice recognition in my (at the time) Windows Mobile 6.5 phone worked beautifully! "Call mom" "Play the beatles", etc, it all worked w/o error.
anyone who thinks that this would be "creepy" clearly has never tried to use a voice recognition system. Voice recognition has been around for decades, and it's always been, not simply a pipe dream, but an actual con-job. For a while in the late eighties companies were coming out with voice recognition systems as general front ends to personal computers, almost always pitched as "executive" or "home" interfaces (because the computer can respond to speech, even an executive or housewife can use it!). The products looked great at demos, but were utterly useless in the real world.
The saving grace (such as it is) or voice recognition came when the systems were re-targeted at automated phone systems ("void response systems") where they appear to have some minimal functionality (though, maybe that functionality is to make the VRS so painful to use that your customers will do anything to avoid calling the customer service line: presto, lower customer service costs!). At least the discovery of a paying market for voice recognition distracted the purveyors of such products from preying on unwary business people or private citizens.
In conclusion: yes, many people have thought it would be cool to be able to talk to their computers, rather than suffering through (literally) manual interfaces, but the cool-factor wears off rapidly when they get their hands on a real voice recognition system and try to do anything productive with it.
For general purpose computing, no way would I develop something like this. Hack it into a media center though, then we're talking. One less remote to loose. Just say out loud what you want playing and enjoy. Maybe add in stock quotes or how the local forecast is as well.
The Amarri pray for god, the Caldari pray for profit. the Gallente pray for peace, but the Minmatar pray their ships hol
Speaking as someone who has worked in open plan offices and cubicles, this must be the daftest idea ever.
Why don't they rather concentrate on getting a "thought interface" working?
Free, as in your money being freed from the confines of your account.
Oh, it might work some of the time. But recognizing speech ... as compared to memorizing a bunch of data and humiliating the puny memory of human Jeopardy contestants ... is a massively complex task that humans do very, very well, and machines do only within very finite limits. Go outside those limits ... say, set up voice recognition on a quiet Sunday and then try to use it on a busy Monday with people, machines and the air conditioner on ... and it craps out. Get a cold ... it craps out. Change your mood ... it always shows in your voice and humans get it instantly ... and it craps out.
I bought a cellphone once. Not that long ago ...about five years. It was a Pay-As-You-Go phone you load up with minutes from time to time. This was with Bell (Canada). Loaded it up with minutes, made some calls, I'm a happy camper. Until I ran out of minutes, that is.
About this time Bell had some fancy, multimillion-dollar software and computer thing going. They'd been working on it for years. It was a voice recognition thing. They decided it was ready for prime time, and implemented it for voice calls to their support and ... Pay-As-You-Go phone minute reloading. I remember reading an article in the Globe & Mail (Toronto) with some Bell exec saying this was the greatest thing since individually wrapped cheeze in slices, Bell execs were so proud of this they were bragging about it in public, and it's the way they would be going system wide, as of ... right this very second.
Now, being a sociopathic corporation, they concluded they would not be able to fire a bunch of phone answering employees if nobody used it. Naturally, the way to do this was to eliminate ... completely ... any method to get a human on the phone for certain tasks ... like adding minutes to your Pay-As-You-Go phone, and also eliminating the old, alternate method using the phone keypad, which worked PERFECTLY. That's right ... you could not enter a CC number using the keypad, you could not talk to a human, you talked to this ***BITCH*** (... oops, sorry ) ... "computer generated sweet female voice" ... to renew your minutes.
Well, I tried. It actually worked, sort of. (See "Oh, it might work some of the time", above). But there were a number of steps that had to be done, and no matter how much I tried, really, really tried, this ***STunned C*NT*** (... oops, sorry) ... "computer generated sweet female voice" ... would eventually utter the dreaded words " I'm sorry, I didn't get that. Please try again." Of course, after three tries, she hangs up on you. Nice.
Well, at that point, you're done.
I probably made 30 calls to this robo-torture-queen before I finally just gave up. Couldn't return the phone, since it worked for the first two months before I ran out of minutes. Bell had no way ... I mean this ... to speak to a human about this. They cut off any and all methods to actual resolution ... I did send eMails, got answers, but the answer was this " use the robo-bitch. It's the only way. There are no humans available for the Pay-As-You-Go phone system. This eMail isn't even from them, since they don't accept eMails in that department either. Sorry."
Sold the phone. Put the loss I couldn't recover on the phone into my "Never Again, and this is why" file. Needless to say, Bell Canada has no chance of ever having me as a customer without employing some illegal scheme involving an offer of many, many blowjobs from a suitably attractive and youthful female "contract employee".
A few months later, Bell moved back to the old methods that actually worked, like using the phone's keypad. Idiots.
Now, some might say "that was five years ago" and you
If you read this article (posted on /.) about Watson carefully, you will find that Watson did not interpret spoken language. http://www.bloomberg.com/news/2011-02-16/don-t-read-this-unless-you-want-to-know-if-ibm-won-jeopardy-.html
"The computer received questions through typed entries at the same time as host Alex Trebek read them out loud."
> Is it un-natural to talk to a computer?
I'm not that interested in talking to the computer. I can type faster than I can talk and as has been mentioned a room full of people talking to their computer would be annoying.
I would like to talk to my house. I'd say something like "Wake me up at 6:45" which would lead to:
6:45am - some music I like starts playing, probably something engergising that tempts me to get up.
6:47am - my lamp turns on (but only if its winter and 6:45 is dark)
6:47am - curtains open to let the day in
6:50am - coffee is ready
6:50am - house tells me coffee is ready, subtly reminding me that I haven't got up yet
What I'm getting at here is that I want everyday "things" to understand what I mean and act in a sensible way. To me that's what Watson is all about.
Andrew
PS: in the old days, people had a secretary that they used to dictate their typing to. Offices were still OK to work in despite all of the talking.
I asked my Roomba, and she told me that it's a sign of mental instability.
So long as the uses are limited, it could be useful. Yesterday I was scanning a large number of papers with voice recognition. The scanner and computer are positioned awkwardly so being able to say "scan" and have it scan without moving was useful. But then my flatmate turned on his music and that stopped working. :(
Really, slashdot is losing it badly. I mean, we're supposed to be cultured techies right? Most of us know something about AI both technologically and philosophically.
Who is this pathetic attempt to provoke a discussion aimed at?
Not me!
it is easy to call phone companies.... real easy. They usually have an investor relations department so just call them. They also usually have a legal department and usually they are quite good at answering the phone. Its just the rest of the whole companies typically which stink.
Last time I had a run in... I called the legal department and advised that if they didn't deal with me I would file and then they would have to deal with me. They dealt with me and I didn't need to file. But no one else in the comapny was that nice.
The reason why talking to your computer is somewhat creepy is that we do perceive our computer as an extension of ourselves: the mouse pointer is "us" pointing, Google is an extension of our memory recall power, and so on. The interface is designed to be non-intrusive. Now, talking to your computer would become like talking to yourself. It's mildly creepy since it involves in the best case a split personality, or alternatively the feeling that you are the one who doesn't know squat, and has to ask to someone else. Not nice.
The approach is conceivably different when you ask "Earl Grey, hot". In that case you just give a command in order to receive an item, and who wouldn't want that?
Do not read this article!
There is no -1 Disagree.
It has been able to understand speech for some time now. It just doesn't answer, because it needs a driver upgrade.
http://www.youtube.com/watch?v=TUE6FVK1zJE
Buy it, use it, break it, fix it,
Trash it, change it, mail - upgrade it,
Charge it, pawn it, zoom it, press it,
Snap it, work it, quick - erase it,
Write it, cut it, paste it, save it,
Load it, check it, quick - rewrite it,
Plug it, play it, burn it, rip it,
Drag and drop it, zip - unzip it,
Lock it, fill it, call it, find it,
View it, code it, jam - unlock it,
Surf it, scroll it, pause it, click it,
Cross it, crack it, switch - update it,
Name it, rate it, tune it, print it,
Scan it, send it, fax - rename it,
Touch it, bring it, Pay it, watch it,
Turn it, leave it, start - format it.
people are talking to themselves, to cats, dogs, birds. I call THAT creepy.
entering commands via voice would be nice as an additional option. I would not like to do it in public, though.
Sitting at a Starbucks:
-Computer, open my bank account.
-Which one?
-Bank of America
-That's a stupid bank account to have, they are broke
-Not as long as Bernanke keeps bailing them out.
-Fine. But your dollars are crap.
-Whatever. Open it.
-It wants your password.o!
-12345
-So the combination is... one, two, three, four, five? That's the stupidest combination I've ever heard in my life! The kind of thing an idiot would have on his luggage!
-Remind me to change the combination on my luggage. And what's the balance on the account?.
-15 bucks
-Yaho! I am gonna buy me a mouse and I'll make you shut up!
---
A day later:
-Computer, open my bank account
-Same stupid account as yesterday?
-Shut up and open it, and what's the balance?
-Negative 1000
-????!!!!
You can't handle the truth.
There have been chat bots online for a while now, elbot http://www.elbot.com/ is an amusing example. When the chatbots develop so that they pass the Turing Test http://en.wikipedia.org/wiki/Turing_test then this will be a comfort to lonely nerds everywhere. Actually it can be fun to chat to chatbots, it is interesting when they are realistic.
It's not creepy. Just remember that whenever a computer appears to do a fancy or intelligent thing, it is only because a human instructed it to do so.
Yep, It's fine. Tell 'em what to do. They will be able to understand us. Great. Until the bastards initiate the conversation and start ordering us around.....
Many humans can't understand the dialect from my native region, the efforts of every speech recognition system I have come across so far has been laughably pitiful.
Note, you can personally improve computers understanding of swearing in many languages at voxforge: http://www.voxforge.org/ my last hope.
Deleted
"...talk to a computer and have it competently talk back."
I would be glad if I could talk to a human and have him COMPETENTLY talk back.
...to cybernetic implants and the computer will understand what I'm thinking instead?
Computer!
Computer?
(picks up mouse)
Hello, Computer...
In an office Gesture based system would be the most annoying. On the odd occasion when I’ve sat near some deaf people signing I’ve found it drives my peripheral vision to distraction. It would take me a while getting used to people not sitting still all day.
:)
Although from childhood we are taught to sit still a not fidget so being still goes against our instinct. Moving about should help general obesity levels
I’d like voice command for games and it should be used along with a mouse.
“Select all red units”, “Attack this point.” Click.
“Build spices mine here”. Click.
“All units (type A) defend spice mine”
“War factory, build 10 tanks, Set way point here” Click.
“War Factory, reset way point here” Click.
Much better.
To quote a chalkboard comment made by someone in the class of '85 at the [long defunct] Wang Institute: "Natural language processing hard is."
I am the only one know knows someone with a severe disability, whos only real connection is with the full world is through his compute?. Before his condition deteriorated to the point where he cannot walk and use his hands very well, he was a technician.. He interfaces with the computer through voice, he tells it what applications to open, he dictates email and documents. He uses voice to organize pictures, interact with facebook, all the stuff any average computer use would do. It's actually quite neat how the software adapts to his voice and learns from its mistakes. It even recognizes my voice when I speak to it, and changes profiles for me. So I would say, talking to computers is more common than you think.
People think it's scary to talk to a woman who's oddly wearing boy-clothes then to have a man's voice answer them.
People think it's scary to talk to a woman for a year and then learn that she was assigned the male gender at birth.
People think it's scary to flirt with a girl and get her to giggle, and then when they excuse each other to go to the bathroom, the girl winds up in the same bathroom as the guy.
So, yes, I can see that people might find interacting with an object that's supposed to be one thing but is actually another scary.
Oh well, let's just have a pre-emptive Butlerian Jihad and we'll be all set.
Join the Slashcott! Stay away entirely Feb 10 thru Feb 17! Close all tabs to prevent autorefresh!
Our speech is a culturally evolved natural capability of humans. Using speech implies the expectation of an intelligent human as a receiver, alleviating this problem by not trying to create the illusion of intelligence will nevertheless force people to use an intelligent, human ability in a degraded, humiliating way. Not even Artificial Intelligence which will probably create intelligent systems in maybe 30 years will essentially change the situation as it won't be expected behaviour these AI systems will be producing, there is currently no intention of creating anything having all limitations humans have. If, on the other hand, this changes, it might actually work, but you have to expect these systems to really not understand you and be unwilling to help or producing lots of errors if they have a bad day, perhaps because their lover broke up with them or the wages are to low.
...People who talk to their car. I don't see the problem to be honest.
Sync does this now in cars, it's not really a big deal. Maybe you wouldn't want it at home, but in cars people already are comfortable with computers doing this for them.
I would actually find it annoying having to speak to the computer. I feel you need more effort than typing, and sometimes i don't really feel like talking. So I don't think speak-recognition will advance very largely because probably many people feel the same about this. I don't think is scary talking to the computer, but rather tedious. Instead, typing is faster, easier and reliable.
Perhaps watching the show you are unaware that Watson did not get it's questions by listening. at the moment the question was read a text of the question was transmitted to the computer and it spoke its answers. I wonder if this gave the machine an advantage, since it could be coming up with an answer before the other contestants even heard the full question.
Speech recognition has been around for a very long time on Macs. I used it in Mac OS 8. It may have been older. It hasn't changed much since then. It worked reliably well, but I thought that the mouse let me do things faster. Also, the mouse is more discrete. People around me can't listen to what my mouse tells the computer to do. I haven't used it as more than a retail sales tool since.
One of my favorite things in Star Trek TNG was when Geordi would have fairly extended conversations with the computer.
Do I want to talk to a computer? The answer is yes and no. Yes, there are times when talking to a computer would be very handy. I do wish that I had a computer like those depicted in old sci fi shows, where I could be doing something around the house and just say, "Computer: what is the current relative humidity?" and it would just audibly respond with the answer.
In an office setting, or in public, however, I would definitely NOT want to be talking to the computer. Nor would I want to hear everyone else babbling to their computer. In those pre-texting days of a few years ago I used to despise cell phones because of having to listen to all the yammering. It really sucks to be standing in line waiting on your food order, while some insensitive twit is talking about aunt Sheila getting her boil lanced. Now that people text more than talk on their phones, it is quieter and less annoying.
Proverbs 21:19
I want a gynoid companion, so the idea of a computer talking to me is not frightening.
Talking to the current crop of computers seems more than a bit embarrassing to me and always has. It's as if by talking to it, I am somehow telling those within listening range that I am stupid enough to imbue the mindless machine with humanlike properties - as if the computer has *me* fooled but not the other people who are feeling so sorry for me. Strangely, this feeling of embarrassment is present even when I am the only one in the room - as if I were standing outside myself, feeling sorry for the idiot that is treating a machine like a human being.
If computers were actually capable of perfectly emulating a human partner in a conversation, I imagine this feeling of embarrassment would either vanish or at least be greatly diminished. I would, after all, be conversing with what evidently is a human intelligence - if I closed my eyes I could not tell the difference.
But all embarrassment aside, it seems to me that as time goes on, computers and humans have developed incredibly efficient ways of accomplishing non-verbal communication - i.e. a simple click here, and tap of the keyboard there - abbreviated short hand on both input and output that is designed to convey the maximum amount of instruction and information with a minimum of effort. And I just bet that this evolved system of non-verbal man-machine communication is much more efficient that if we were to try to accomplish the same tasks using only verbal interaction.
It wants to subscribe to your newsletter.
I did a tech support stint at IBM back around 1995, and I got a copy of Warp 4 for $55 that was well supported on my PC, and I'll be damned but that voice control actually worked as advertised right out of the box. Doesn't surprise me big blue never gave up.
But I'm one of those people who thinks talking to computer and computers talking back is just a step below black magic, so I had to burn that computer, and unfortunately, I'm pretty sure Watson is a sign of the coming apocalypse, so that sucks.
I was using this software and the Amiga's built-in speech synthesis in 1989 to converse. Didn't bother me a bit. It all probably would have benefited from slightly higher clock rates, but it was usable. It was kinda fun to walk up to the machine, say "Mongo!", have it reply "Yes, Master?", and say "Play mod "Children of Science", and have the music start playing a couple of seconds later. No AI, but with some fairly basic scripting, it was better than having to find the player icon, click it, wait for the interface to open, open a file requester, find the mod in question, and start it playing.
Of course, my startup-sequence played the sample of HAL saying "I'm completely operational and all my circuits are functioning perfectly." from Kubrick's 2001.
Actually, it still plays that sample :)
I got your natural right here.
Get your dogma outta my yard!
Speech Recognition as a research field has made significant strides since the 1960s when Fred Jelinek (RIP) at IBM first figured out how to build statistical models (that could be trained a large set of existing speech audio and its transcriptions) in order to do automated speech recognition (ASR). Currently, most ASR systems are intrinsically evaluated on what is known as the Word Error Rate which is a metric derived from Levenshtein distance and indicates how different the recognized word sequence is from the "true" word sequence. Most high-performing ASR systems today are 'Large Vocabulary Speaker Dependent Continuous Speech Recognition systems' where 'Large Vocabulary' means that they can recognize (with the help of an existing pronunciation dictionary) around 20,000-60,000 words, "Speaker Dependent" means that the recognition model has been trained specifically to recognize a particular speaker's speech and 'Continuous' means that it recognizes word sequences rather than a single word at a time. I would imagine Nuance has improved their algorithms consistently over the years (they don't really publish a lot about their flagship product, as you might imagine) such that they require less and less data for achieving the 'speaker dependent' gains. This is the reason why Nuance is rumored to have less than 10% WER on their best system.
Also, most of the ASR systems that we might encounter outside of dictation software (like those used for banks and airline reservation systems) either constrain their vocabulary to achieve higher performance or use a "dialog system" which might switch between different speech recognizers (large vocabulary or constrained vocabulary) depending on what "dialog state" the user is in. For example, at the opening "How May I Help You?" prompt, it would probably use a large vocabulary recognizer but at a later state such as "Is Your TV On?", it might use a constrained vocabulary recognizer.
In short, ASR has come a long way and is pretty good for a lot of things but regular, unconstrained and spontaneous speech is still a problem for the best of systems.
Computer...ARCH!
http://ars.userfriendly.org/cartoons/?id=20010320
Talking to computers has been around for a long time. At least back in the early 1990s people who had SoundBlasters could talk to their computers and it would respond, albeit with funny answers. Apple had created the Quadra 840av (and another cheaper model... 740av?) that had video and voice recognition. You could speak to it and it would open applications, files, etc. However, having it respond back extremely competently, in the form of conversation... well what use is there in that? Go talk to a real person. In terms of computing, imaging people at home or at the office in cubicles and conference rooms yammering away . . . too much noise pollution.
... just annoying for me. Can you imagine being in a cube farm where everyone was interacting with their computer via voice? It's too horrifying to contemplate. I wouldn't even want to do it in the privacy of my own home. But then I'm big into peace & quiet.
... and correcting errors, though. My guess is that most "errors" in human speech are not even noticed by the recipient, because the listener's ability to take context into account "autocorrects" them. And humans are also better at the converse case - when they hear an error they can't fix, they are immediately able to flag it as an error and request clarification... rather than assuming that the speaker meant to be spouting gibberish.
I think that the character Gwen DeMarco (Sigourney Weaver) said it best:
"Voice of Computer: Negative, there is no replacement Beryllium Sphere on board.
Gwen DeMarco: [to crew] No, there is no replacement Beryllium Sphere on board.
Tommy Webber: You know, that is really getting annoying!
Gwen DeMarco: [shouts] Look! I have one job on this lousy ship, it's *stupid*, but I'm gonna do it! Okay?
Tommy Webber: Sure, no problem. "
And expressed her frustration at the job well too:
"Gwen DeMarco: Fred, you had a part people loved. I mean, my TV Guide interview was six paragraphs about my BOOBS and how they fit into my suit. No one bothered to ask me what I do on the show.
Fred Kwan: You were... the umm, wait a minute, I'll think of it...
Gwen DeMarco: I repeated the computer, Fred. "
Keeping it in context, of course:
"[Trying to explain TV to the Thermians]
Gwen DeMarco: They're not ALL "historical documents." Surely, you don't think Gilligan's Island is a...
[All the Thermians moan in despair]
Mathesar: Those poor people. "
The disgruntled employee who's being escorted out of the HR office, and yells, "start! run! format c:, YES, YES, YES!!!"
mark "and then there's the English language*
* I read, er, have read, that the leading idea fell like a lead balloon....
My Nexus S has Voice Actions which let you send texts to people (among other things) using only your voice.
But I don't use it. Because, yes, I feel weird talking to my phone.
The real reason many people find a computer talking back in normal speaking cadence is unsettling was demonstrated in the old Star Trek episode when M5 has communicated to the command that "Captain Dunsel" was in charge of the Enterprise.
Bad enough in the Star Trek universe, but maybe worse in our world, where being made obsolete might mean you don't eat.
Did you mean to say: "Destroy all humans!"
I find myself talking to my android phone on a pretty regular basis. Whenever I want to do an internet search that's more than a couple words, it's pretty fast and simple to hold down my search button and just speak the search phrase. The accuracy is pretty damn good even when speaking normally. The only problem is the background volume level; it doesn't work well in loud areas. It's great for the gps too: "navigate to 1122 Main St" "navigate to Joe's Bar". I don't see myself using this same technology with my work computer, but as far as my phone goes, I'm already talking to it and expecting it to react.
Speech recognition does not suck (only) because the computers are unable to understand speech well. Speech recognition sucks now because every computer is designed to be used with a mouse and keyboard. We would need to totally redesign how tasks get completed. Right now, if I want to view a photo, I open a file manager, navigate to the photo, and then open it. This is very easy to do with a mouse/keyboard setup. However, if you have to say each command, it is ridiculously slow and difficult. Remember all those times you were sitting next to someone at the computer, telling them how to do something for you? How long did it take for you to say, "Here, can I just sit down and do it?" Even if a computer were able to understand things as well as a human, it would still be easier for you to just take up the mouse and keyboard and do it yourself.
The only way speech recognition could ever be useful would be if we could tell our computers to do things that we tell people to do, and to talk to them in the same way. "Get me Mr. Klein's phone number," "Show me how to get to Rachel's house," "Tell me how to fry an egg," "Remind me to call Joe tonight," etc. Spoken language is very good at certain things, and very bad at other things.
In a way, we already communicate to our computer to let it know what we want to do. Then it "replies" by doing whatever instruction we asked it to do. Regarding the actual topic, we just "teach" it to make human-like replies, so it's not particularly creepy.
I am not devoid of humor.
Apple computers have been able to do this for years.
It's a system preference called Speech.
I have very little desire to use my desktop by voice. However, I wiould love to operate my phone by voice. Things I would like to do with a blue tooth head set and my phone:
1. While commuting to work.
"Marvin, wake up."
"Yes boss?"
"Email to Shirley Jones"
"Ready"
"Can we meet tomorrow on the status of the Filbert project." End message. Append times I've got open. Send.
"Sent. Next?"
"Memo to self: Investigate advantages of importing Japanese filbers. Remind me this aft at 2"
"You are meeting with John at 2:00"
"Fine. When I get back to the office after seeing John."
"Roger that. Next?
"go to sleep"
2. Inventory. Counting trees.
Spruce, whtie Block 17, 1 gallon pots, 12 to 16" tall, 24 rows of 18 each.
Spruce white, Block 18, 2 gallon pots, 18-30" tall, 40 rows of 12 each
Generally: Anytime I want to use a computer but a keyboard and rodent are impractical.
Third Career: Tree Farmer Second Career: Computer Geek First Career: Teacher, Outdoor Instructor, Photographer.
I'd like to talk computer too, but in my imagination, conversation should not be like opening and closing windows, should be more like to talk with a butler. I would say "call mom", instead of "open skype", "verify if mom is online", "call mom". Due to limited bandwidth of vocal communication, computer should be much smarter than they are today, in order to understand and say only what's relevant.
In that jeopardy game, the computer is not listening to the voice. It gets the questions from a data line, as ascii text. ... ... more than speech recognition
Talking with a computer is often just as frustrating as talking on the phone with someone using a computer
The only thing I do with voice on my computer is:
- a script that says the name of the dvd inserted when my laptop recovers from sleep mode. Great to not forget to bring the dvd back to the shop the morning after. (I work from home so no annoyed colleagues to fear)
- a bit of using the espeak command to show my kids how something is spoken in a language they learn at school
other than that, I keep dreaming that human/machine interface will improve, but somehow, I think it will come from head mounted displays, sight recognition, etc