RSI, WIMPs and Pipes; What Next?
Tetard asks: "Long live the pipe! Since the `|' was invented by Doug McIlroy in 1973, has there ever been a more effective way of reusing tools and connecting data ? The mouse is a device of the Beatles era; Rather than try and provoke nostalgia in the older ones among us, I'm asking myself, as are others: when we don't try to reinvent the wheel, or at least improve it, why must we try and copy it every time ? Xerox PARC exposed us to WIMPs and we haven't done better: some innovation, some plastic surgery -- but no "paradigm shift" -- where's the
creative destruction that will take us further ? Graphical component programming is turning us into click-happy bonobos^H^H^Hchimpanzees, as we fail to find new ways to manage and connect richer data streams. My web designer friends are damaged for life because of mice, and yet we persist... Where do we go from here ? If we ever invent the graphical pipe, let if have keyboard shortcuts." Yes, you've probably seen a similar question to this run by Ask Slashdot before, but this time I'm wondering if maybe we need new input devices before the WIMP paradigm is replaced with something better. Might any of you have ideas on what form these input devices might take?
For those interested, here are the previous stories that have handled this type of question:
So what it will take to break us out of the WIMP box (or prison, depending on your bias), maybe new input devices would do it, but quite frankly, I wouldn't be surprised if a 3D interface might be another route (it would possibly spark interest in designing a new input device that would work better with 3D interfaces, or maybe data-gloves could serve this purpose?). Going on a limb, maybe this guy might just be the ticket.
We're still using *wheels* on our vehicles! I've got a nearly brand new car, and it's using 5,000 year old technology to get around? From Detroit to Stuttgart, automakers are clearly devoid of imagination!
Get over it -- sometimes it makes sense to stick with what works. A love of change for the sake of change is the one thing that unites geeks and fresh-out-of-school MBA's.
-b
If I wanted a sig I would have filled in that stupid box.
I'm serious.
No, really.
Nothing gives me hand pain as quickly as using mice, especially ones with that wheely thing. Keeping my hands in good form over the "home row" of my keyboard -- and away from the mouse -- has virtually eliminated pain from my computing life. I spend half of my waking hours in Emacs, and I have come to love (and depend on) its there's-a-key-for-everything nature.
Dump the mice. Keep Emacs.
Easy, automatic testing for Perl.
Nope. It will have a role in niche applications, but I don't see it ever being the dominant method of interaction. A couple of reasons:
Imagine a large room full of office workers. Now, imagine the same room with every worker talking to his wordprocessor or spreadsheet, trying to make him or herself heard over all the others, getting irritated and fatigued because of the constant noise of everybody else talking to their computers.
Imagine trying to do some work in an airport or on an airplane. Now, imagine trying to do the work using your voice _without_ other people hearing the budget details for your company or hearing the steamy endearments you will be mailing to your spouse.
Imagine talking. Now, imagine constantly talking all day, every day. Some actors and singers get permanent damage to their vocal cords - and they've had professional coaching and access to medical services. It could become RSI for your throat.
/Janne
Trust the Computer. The Computer is your friend.
Add a touch screeen so that when it comes to window selection less fine manipulation, you can use the large muscles of the arm and shoulder, then use the mouse for the finer that you cannot get with a touch screen.
The more important aspect is the comfort and the breaks. No matter what mechanism is used, trauma can accumulate over time. You need time to allow the body to recover from that trauma.
Fight Spammers!
Use the right tool for the right job. That's the UNIX/Linux philosophy. If you think that the WIMP interface is cramping your style with respect to whatever task you are trying to accomplish, then you are using the wrong tool.
With regards to the command line or WIMP interfaces being old, and not particularly forward looking, you are also missing a fundamental point: A graphical "pipe" isn't innovative either. You're simply shoehorning two paradigms together, and even worse, two totally incompatible paradigms at that. The pipe is a useful metaphor and operator for stream-oriented I/O. The WIMP is useful for (obviously) visually oriented information, and its designed for a completely different purpose than the pipe. The WIMP is designed to allow humans to manipulate data and abstract objects in a visual manner. The pipe is designed to allow users to allow the computer to do the same, without intervention.
If you want an innovative computing interface, worrying about streams, or visual representations of data is a waste of time. You're going to have to come up with something totally new. One good example is the use of sound to communicate the health and performance of networks or systems.
The real problem with computer software advancement is that its all very firmly based in 2D land. WIMP environments are about as eficient as you can get. Even assuming that you can wrap your work habits around something as 'next-gen' as 'The Brain', you're still stuck in 2D land.
The real advance that will open innovation (real innovation and not some corporation's twisted idea of it) is the beginning of a 3D workstation environment.
We already have the primatives for this kind of environment in games like Unreal, Q3A, and Black and White. Assuming that we implement a 'graphical pipe' that will work for a truly 3D application system, ie: Allow 3D applications to pass information back and forth between each other semi-effortlessly, this will ignite a new 'interface revolution' similiar to what we experienced as a result of Xerox's early WIMP system and the first versions of Apple's MacOS.
Once programs and applications can truly be represented as 'objects' in a 3D environment, we'll end up with something like the 'God' interface in Black and White, where processes are represented by animated people and files are represented as other objects. Tasks best handled in 2-D such as composition, coding, or painting will still be easy to handle, but tasks best performed in 3-D such as file management, database management, and even some advanced programming tasks like linking and compiling files, will take place in a representational environment. Imagine opening up your HDD and pouring objects into it, then sorting them into containers based on type, as you would sort files into directories.
Eventually, I see us moving into something like Stephenson's 'Street' metaphor for shared environments.
Along with these advances, will come new interfaces. I think that eye-tracking cameras have the biggest potential, but we keep coming back to the data-glove in one form or the other. I know CADesigners who still have an old Nintendo powerglove hacked for basic 3D manipution tasks. We're also probably going to see a renaissance of 'body tracker' devices that will track human motion via sonar or laser. Any one of these has the potential to vastly reduce RSI injuries.
The real trick in jumping from 2D to 3D is reverse compatibility. All the shells I've seen that attempt 3D interaction do it badly. Even then, they fail completely when faced with most of the tasks we do on a daily basis, like write or paint. I think we're going to have to use 'easels' or something similiar. In Cowboy Bebop, Radical Edward's computing environment is shown as a multitude of 2D windows hovering around her in 3D space. This wouldn't be that hard to do, really.
Navigation will be the true challenge for any 3D application designer. It will be that itch to scratch that will spawn new, inventive input and coding ideas.
The next Slashdot story will be ready soon, but subscribers can beat the rush and slashdot the links early!
Great idea, but it doesn't get us out of the WIMP paradigm. You're just replacing your mouse with a more efficient type of pointer.
Really, that's the problem in a nutshell. We are so used to the WIMP interface, that the best we can vizualize is an improvement of the WIMP system. Until we can come up with a totally different metaphor for interacting with our computers, we won't see WIMP go away. Personally, I think it will take a "mad genius" type to break out of the WIMP paradigm and move us forward.
This
To improve on the classic WIMP interfaces, I immediately would exclude anything that had to be physically connected to the user. That eliminates data gloves as a significant mainstream input device.
Gloves work well technically (at least from what I've seen of them), but are fairly inconvenient to use. I suspect that some basic speech recognition will be mainstream in the not-too-distant future, because processing power is cheap enough to handle simple speech without impacting performance. Maybe eye tracking will be used some as well, but eyes tend to wander.
So I think the biggest trend in interface design over the next few years is going to be a return to simplicity. Fewer clicks, fewer mouse movements, and greater use of predictive interfaces - where the interface guesses what you'll do next based on experience and learning, and has it ready for you just in case. The Mac-style version of the UI (but not necessarily the Mac itself) will probably be the dominant strain overall - Microsoft is converging in that direction now as well as evidenced by the Luna interface in XP. I think mainstream mice return to two buttons (from the currently popular multi-buttoned models), or maybe even one. And mouse gestures will be used more instead of clicking in some cases.
Mice themselves I can see working by gyro rather than by ball or light sensor, which would allow a mouse to be held and moved within 3 dimensions (even if it only tracks in two), rather than skated over a desk in two dimensions. It's potentially a more natural hand motion. Keyboards probably won't change much due to inertia. They haven't really changed in basic layout for over a hundred years.
Of course, I may just be blowing smoke, since my intuition is no better than anyone else's. But the Slashdot crowd is very different from the "mainstream" - we are typically more forgiving of a complex interface and would trade off in favor of power over simplicity. So we may not be the best people to forecast.
-- Josh Turiel
"2. Do not eat iPod Shuffle."
Almost nobody talks about the fact that the WIMP interface was made possible by the hardware made available at the time (mouse, high-quality graphics) and until new hardware comes along, there will be no "paradigm shift".
There have been a lot of interesting research (the Oberon system, Xerox's Information Visualization research, 3D windowing systems) but none is compelling enough for us to switch.
A no-compromise 3D system (plain screen with 3D capabilities without the need for special glasses or head gear) could make it possible for researchers to come up with something new. Otherwise, forget it.
We can expect nothing but some clever modifications here and there (such as the mouse gestures in Mozilla and Opera) unless we get new, fantastic hardware.
The mouse is a device of the Beatles era
And the steering wheel is a relic of the Jazz age. I don't plan on giving up either of them any time soon.
For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
No, I'm serious. Computers should be working for us, not against us. Today, we don't give a computer a task and ask it to complete it, we instead worry about interfacing with it. The interface should be natural and change depending on what you are doing. Let's go through this using Star Trek (TNG) as our baseline:
Computers should have voice recognition for general tasks such as environment management and collabertion. The crew uses this to dim the lights for example.
Computers should have keypad interfaces for general reading and writing. You'll notice the workstation on Picard's desk for exactly this purpose.
Computers should have remote/networked interfaces for passing sneaker-net info. This means datapads and remote voice uplinks. (Hold on captain, I'll email the info to your chair! Yeah, right.)
Computers should be able to present information in the most efficient way possible. This doesn't mean the prettiest, but instead it means dynamic graphical/audio interfaces. These interfaces must be easy to create and use. This means that the readouts might one moment be tracking the progress of Warp plasma then next moment display starcharts of the ship's course.
All of this makes the Star Trek interface very useful and is quite possible with current technology. However, there is one thing that really makes it easy to use:
Computers must be able to automatically analyze, process, and display any information. Filters should be able to be applied automatically. Artifical intelligence should be able to translate speech from any language to any language and add new patterns without reprogramming. Answers to theoriretcal questions should be able to be answered given a current set of information on a subject. Information should be able to be retrieved and stored no matter what the format and new formats should be able to be analyzed and added to the database.
This is much more difficult due to the fact that computers today have a very monolithic nature. Pipes are a good example of dynamic linkage of programs, yet at the same time are very primitive. They require a great deal of operator understanding of the information in order to use.
What if we could build layers of software on top of layers of software, which are then again used to build greater layers of software? This would require that we accept standard functionality at each level of the layering process and then allow people to write ever simpler code due to this great deal of layering. Why should anyone be required to rewrite a quicksort once it has been written?
The first and best key we have today is the Java class file format. Class files are allowed to layer on top of others by nature. The computer can pull apart the structure and investigate the usefullness of a class for a specific function. With enough layers, the computer can make extremely intelligent decisions automatically. Witness JINI.
So here is my challenge to every programmer/newbie/manager/hardware designer in existance. Stop trying to focus on how we can optimize or tweak each individual computer function so that you gain temporary marketshare, outperform your competitor, or push your political agenda. Instead, begin pushing for hard standards in the industry. Don't worry if they have the "highest performance" or if they have the "coolest widgets". Instead worry if they have the best design, if they are technically "right" for the task at hand. Stop worrying about making languages that allow you to produce specific functionality in fewer lines of code, and worry about producing the highest level of quality.
Many people will find this a hard pill to swallow, but the end result will most certainly be worth it.
Javascript + Nintendo DSi = DSiCade
Pipes seem necessarily linked to the command line.
It's not even so much the command line as it is the pervasiveness of ASCII text for information storage throughout the system. Almost every program currently available for MacOS or Windows would have to be changed to start storing their files as ASCII text rather than as custom binary formats.
As XML conversion continues, this may become more feasible. However, few programs will use XML as their native format for efficiency reasons. Programs will need to have options for XML input and output.
This leaves aside the fact that pipes are a programmer-only feature, which no one else wants or could possibly use.
Tim
I've thought about this myself. By tracking both ends of the wand in 3 dimensions, it is possible to implement a joystick with 5 degrees of freedom (unfortunatly detecting rotations about the axis of the wand seems difficult to me). Be great for games, not sure how usefull it would be for office work.
This seems a bit like asking what it would take to replace the current way of driving a car (steering wheel, gas and pedal brakes, etc.) with something better. But the interface between humans and automobiles is pretty much a solved problem, and nobody seems to spend much time speculating on what a paradigm change in automobile control would be like.
Oh yeah? Two words: cruise control. It completely redefined the "car interface". How about two more: intermittent wipers. True, the inventor got shafted by Detroit and had to fight tooth and nail for years to get his due, but he too changed the "car interface" dramatically.
There's a curious assumption which I've seen repeatedly-- namely, that a paradigm shift in human/computer interaction would be a good thing. Why, exactly?
Simple: because the quantum increase in computer access that was engendered by the WIMP interface isn't by any stretch of the imagination the endpoint of interface evolution. Want an example? Don Hopkins has been pushing his concept of Pie Menus for about 15 years now, and has implemented them everywhere he can find an amenable display system (starting with (*shudder*) X10 and including MS-Windows!). If you think you know how user interfaces should work and you haven't read any of Don's exhortations on the human-factors improvements inherent in non-linear menus, you need to get with the program.
There's a time and a place for diplomacy, and this isn't it.
<div diplomacy="off"> That's a dumb idea. </div>
My attention wanders and consequently my eyes do as well. I blink subconsciously. You can't change these things, and it's stupid to try. Don't make an input system that fails when they happen.
I think Douglas Adams does a good job of describing the failings of this type of input system:
That's not even taking into account the way the eye inherently jitters, according to the other replies. Even without that, this wouldn't work out well except for extremely disabled people who have little other choice and aren't likely to complain about something that makes it at all possible for them to use a computer.
Alias|Wavefront has been using their version, called "marking menus" in PowerAnimator and Maya for a couple of years. They fruiggin' rock!
By customizing the Marking Menus a little but, you can drive PowerAnimator with a 3 button mouse and the Control, Shift and Alt keys. Way cool.
It doesn't mean much now, it's built for the future.
The problem with the Star Trek paradigm is that is fake! As in Hollywood fake. Dramatic licence is used all over to make the show flow smoothly.
Many people have tried to build systems like Star Trek, and in fact you can get a lot closer than you many be aware. The problem is that in real life, that interface *sucks*!
In real life it is painfully annoying and insanely slow to tell a room/computer to do something.
We just don't realize how good a user interface a lightswitch is.
The Star Trek data pads are touch sensitive; we have had touch sensitive technology as long as the mouse; but the mouse is what we use because of its acceleration property.
I have built and analyzed [CMU,MIT, Microsoft et al] many systems designed to solve the exact problem you are talking about.
And although the technology isnt up to the task, you can evaluate it as if it is by using humans to "Wizard of Oz" the test. Turns out [in my and others opinions] that the whole idea is dysfunctional and annoying.