A New Paradigm For Web Browsing
dsaci points out a New York Times article about how surfing the web may change to a more graphics-based endeavor. With the advent of devices like the Wii and the iPhone, the capability to directly control objects on a screen is becoming a popular and affordable technology. That, combined with immersive interfaces such as Piclens, could be the future of web browsing. Quoting:
"'I've wondered for a long time why the computer interface hasn't changed from 20 years ago,' said Austin Shoemaker, a former Apple Computer software engineer and now chief technology officer of Cooliris. 'People should think of a computer interface less as a tool and more as a extension of themselves or as extension of their mind.' Voice, too, is finally beginning to play a significant role as an interface tool in a new generation of consumer-oriented wireless handsets. Many technologists now believe that hunting and pecking on the tiny keyboards of cellphones and P.D.A.'s will quickly give way to voice commands that will return map, text and other data displayed visually on small screens."
Dragon on a reasonably powerful PC might work, but until you can nail 110% correct recognition, in a crowded area, in a shitty little mic on a 400 MHz ARM processor, don't bother. You don't want to start arguing with your PocketPC about traffic and directions: No, I said Springfield, not Slingblade! *crash*
The keyboard works, 100% of the time. Its easily understood. Its robust. It fails gracefully - you immediately see if you've made a mistake before submitting a command.
I want to delete my account but Slashdot doesn't allow it.
Hopefully "they" also develop good image to speech technologies, or are they forgetting that there are many visually impaired Internet users?
As long as the extra flashy junk doesn't impede my ability to get useful information from a website, I will be fine with it. There have been so many sites that don't seem to understand this though (yahoo maps is a great example, among many many sites. The original "low bandwidth" version is still more useful than their "new bling improved" version, even over a high speed connection). Ebay is headed down the path of "bling overload" too. What bothers me is when a site adds rotating blinking things without considering, "what improvements does this give us or the user trying to use our website?"
I Am My Own Worst Enemy
I certainly don't want to be on a bus or plane with dozens of people all yakking commands to their devices, nor do I necessarily want to display to the world what commands I am giving to my device. Voice control is nice in certain circumstances, but until they give me a direct neural interface I want keys and/or stylus and/or cursor control and input options.
I can think faster than I can speak unambiguous commands. Using a combination of keyboard shortcuts, extended mouse buttons and mouse gestures I can browse fast enough that the bottleneck is almost always reading comprehension. This is also much less tiring than speaking. A better solution might be a combination of eye tracking and brainwave monitoring, but that's still far too unreliable.
"'I've wondered for a long time why the computer interface hasn't changed from 20 years ago,'
OK, playing a little devil's advocate here. Perhaps the building bricks of computer interfaces and their basic interaction mechanics haven't changed because they are all right as they are now.
We have developed an interaction language that allows us to express interaction proposals and allows the users to understand those proposals and, therefore, to interact successfully with our systems. Why should we change that if it is working?
Change for change's sake, when we have an established language does not sound sound... I don't see no one complaining that we've been calling chairs "chairs" for so many years...
mod me up scottie!
voice recognition as it is today is painful.
"Computer, start, programs, Mozilla, fire fox , double you, double you, double you, dot, google, dot, com, search field, violent, asian, porn. I'm feeling lucky. click"
its a slow, painful, annoying as hell process that brings you back to the keyboard and mouse once the novelty has worn off, and only leaves the user feeling ripped off for wasting so much money on a fancy new inferior interface.
voice recognition won't be useful until it is intelligent. I should only have to say "Computer, google porn" and get my results. I shouldn't have to explain to my computer step by step how to open a freaken browser.
-I only code in BASIC.-
Here's an exercise for those who believe voice commands are the way to go for small electronics. Every time you use your cell phone, iPod, PDA or GPS, say each command out loud before entering it. See if you can keep this up for a full day.
It's because it works like it is. And the "new" ways of controlling aren't advantages, they are just ways of fixing the disadvantages of small displays and small devices lacking (working!) methods of cotrolling like mouses, trackballs and so on.
So I think it's safe to say that the number of people who do not see any benefit from graphical windowed environments is infinitesimally small, even among hardcore *nix hacker types.How old are mouse gestures, out of interest? Most people who use them seem to think they're a step forward, and they've only been a mainstream concept for a few years, though I'm sure they've existed for far longer than that as a research concept or whatever.
I wish people could learn to think of their computers more as "just a tool". Half the time I see people having problems with computer usage, it's because they're expecting the thing to read their mind. I have to explain to them just how dumb a computer is, and that you really have to tell it what to do because it's just a machine.
(The other half, of course, is due to shitty software.)
Voice recognition still sucks badly, even after a lot of time investment into it.
Maybe if someone got around to fixing that somehow, then we would consider, you know, using it.
I'm not at all suggesting we give up that line of research, just suggesting we put the horse before the cart here.
Or at least don't lie and say "will quickly give way to voice commands" and call it what it is. Those people want it to happen, and there is nothing wrong with that! Each tech has people that would prefer it over others. To each their own!
But to out right lie and say that it will happen 'quickly' is just embarrassing for your career as a technologist.
I think one of the hinderances to practical voice recognition has been the telephone paradigm (described in the book "Being Digital" by Nicholas Negroponte) where the computer is supposed to understand anything that anyone says at any time. What might work for voice recognition is for the user to have a custom chip that will allow a device to be configured to understand that specific user. Move the chip to a new device and that device will understand you perfectly.
What might also work is if the user trains himself/herself to speak in a way that the computer can consistently recognize, much like the user of Palm's Graffiti handwriting system learned to write in a way that the PDA could consistenly understand. With training, speaking that would could become second nature, much like typing has become for many users.
I've wondered for a long time why the computer interface hasn't changed from 20 years ago
Because it works.
Whereas all the attempts at shifting the paradigms to an extension of your soul (or whatever), just result in unusable exercises in masturbation (and not the kind the internet was invented for).
Remember how Flash was going to be the future of the web? Yeah.
sic transit gloria mundi
I'm icon-impaired. Seriously. My mind cannot make the subconscious connection between an icon or graphic and what said graphic is supposed to represent. Over the years, I've forced myself to recognize a floppy disk as "save," and a printer as "print". The rest mean nothing to me. When I use OpenOffice or any other graphic-intensive program, I must either (1) memorize various keyboard shortcuts, or (2) hover over the toolbar icons to find the one I want. For obvious reasons, my editor of choice is one that doesn't require me to decode icons. Nearly every graphical "decode" operation requires conscious thought as well as a process of elimination to narrow down the choices to a set of possibilities from which I will (hopefully) select the correct one. Many times I'm wrong.
Almost everything I do is on the CLI. I've been programming for nearly two decades, and I have no problems selecting textual tokens out of a field of similar-looking text. But give me a set of small, information-deprived graphics to decode, and I fall flat on my face.
I can't be alone in this. Surely others have this same cognitive disability.