Slashdot Mirror


Designing a Practical UI For a Gesture-Based Interface

An anonymous reader writes with a link to an intriguing account of the challenge of designing a close-range, hand and finger-based gesture recognition interface using 3D cameras. Things like this look good in science-fiction, but it's hard to create a gesture-based system that makes sense to the user and rejects gestures not meant for the computer.

44 comments

  1. Test it on Italians by Overzeetop · · Score: 5, Funny

    You're going to have to test it on Italians for gesture rejection. I can't imagine what kind of havoc could be created if the interface over-saw even the most mundane of conversations.

    --
    Is it just my observation, or are there way too many stupid people in the world?
    1. Re:Test it on Italians by Anonymous Coward · · Score: 0

      Italians are amazing people; they can change gears, smoke a cigarette, and flip you off all at the same time.

    2. Re:Test it on Italians by K.+S.+Kyosuke · · Score: 1

      You're going to have to test it on Italians for gesture rejection. I can't imagine what kind of havoc could be created if the interface over-saw even the most mundane of conversations.

      We already know where this is going to lead:

      For years radios had been operated by means of pressing buttons and turning dials; then as the technology became more sophisticated the controls were made touch-sensitive - you merely had to brush the panels with your fingers; now all you had to do was wave your hand in the general direction of the components and hope. It saved a lot of muscular expenditure of course, but meant that you had to sit infuriatingly still if you wanted to keep listening to the same programme.

      --
      Ezekiel 23:20
  2. I hate gestures. by Anonymous Coward · · Score: 0

    The beauty and the promise of devices like Kinect, and the power of modern computers, is the potential for an interface with almost no gestures. I know skeuomorphism has taken a lot of hits here lately, but this is the worst yet. They've made a model of a bookshelf in beautiful 3D, and the first thing you'll want to do is pluck a book from the shelf and peel the covers apart. Instead, you'll have to poke, and jab, and pinch, and drag. If you want a 3D gesture-based interface, bang it out Kinect style, or iOS style, with beautiful-looking icons to double-tap, or double-punch, or whatever. Make it look like a traditional interface, because doing the opposite will only make me sad when I discover that it is still just a traditional interface.

  3. If Microsoft is involved... by TWX · · Score: 2

    ...I can think of a particular gesture that will be very common, but most likely won't be recognized as it'll be provided after the system isn't working...

    --
    Do not look into laser with remaining eye.
    1. Re:If Microsoft is involved... by vlm · · Score: 2

      American Sign Language? Oh wait you mean something else.

      Seriously whats wrong with ASL? Large installed base, lots of training, could be pretty darn useful as a translator when someone without ASL "talks" to someone with ASL.

      My guess is MS would try to embrace extend and extinguish ASL... make it just like ASL but upside down and backwards, just to make it hard for everyone.

      --
      "Science flies us to the moon. Religion flies us into buildings." - Victor Stenger
    2. Re:If Microsoft is involved... by Anonymous Coward · · Score: 0

      Patent each and every gesture!

    3. Re:If Microsoft is involved... by Anonymous Coward · · Score: 0

      My guess is MS would try to embrace extend and extinguish ASL... make it just like ASL but upside down and backwards, just to make it hard for everyone.

      Australian Sign Langauge?

    4. Re:If Microsoft is involved... by JasterBobaMereel · · Score: 1

      If you mean the finger then it is not as widely used or recognised outside the USA ... ...many gestures are very localised and do not travel very well to new cultures

      --
      Puteulanus fenestra mortis
    5. Re:If Microsoft is involved... by SuricouRaven · · Score: 1

      It wasn't. Due to the international success of US-made entertainment media, their expressions and gestures have been spreading for some time. The finger is certainly recognized here in the UK.

  4. Take a look at the Leap by gentryx · · Score: 2

    I just got my development kit from Leapmotion. Take a look at their 1m video. Pretty impressive, I would say. Also, the resolution of their sensor (1mm) seems to be much higher than that of what cameras can do today.

    --
    Computer simulation made easy -- LibGeoDecomp
  5. Looked good? by Anonymous Coward · · Score: 0

    The exaggerated gestures looked completely silly in Minority Report. Who in his right mind would want a computer like that?

    1. Re:Looked good? by Anonymous Coward · · Score: 0

      Agreed. Only Tom Cruise can make those "gestures" look good.

      That being said, your typical alpha-male MBA or ad agency guy would look (think he does) like a cool guy using it. Or Chuck Norris.

  6. what finger am i holding up by Anonymous Coward · · Score: 0

    take a big guess

  7. more challenging...practicality. by argStyopa · · Score: 2

    The biggest challenge is explaining who/how/why anyone would NEED a gesture-based interface.

    Sure, we all thought Minority Report looked cool with the flying window-thingy, but honestly, Mr Cruise could have 'flipped through' his data far, far more easily sitting at a desk with trivial motions of a mouse instead of giant arm-wavings and calisthenics.

    Plus, then he'd get to SIT, which is a little more conducive to surviving the 10+ hour days most of us spend staring at a little computer screen.

    --
    -Styopa
    1. Re:more challenging...practicality. by Jmc23 · · Score: 1

      Perhaps these UI's aren't meant for low-level troglodytes like yourself? Perhaps they're made for people who can move their bodies and spend most of their time not being worker slaves?

      --
      Don't complain about syntax, grammar, or spelling. There is no.hell like input on android.
    2. Re:more challenging...practicality. by hey · · Score: 2

      I dislike gesture UIs. Not just because they are new. The main reason is that if you are a newbie there are no hints about what to do. With shortcut keys you could always use the menus instead if you don't know the keystrokes. But Windows, Blackberry, Android, iPhone, all have different gestures its easy to get lost if you use more than one platform.

    3. Re:more challenging...practicality. by citizenr · · Score: 1

      Plus, then he'd get to SIT, which is a little more conducive to surviving the 10+ hour days most of us spend staring at a little computer screen.

      This is actually a minus, and a big one at that. Standing up desk is a LOT healthier.

      --
      Who logs in to gdm? Not I, said the duck.
    4. Re:more challenging...practicality. by Anonymous Coward · · Score: 0

      I'm trying to imagine the support calls.

      "All right. Now put up two fingers and wave them at the screen to move your file to the other monitor."
      "Oh, it rebooted? No, you were supposed to point your hand away from the screen, not at it."

      Gestures are a retarded interface for anything that can easily be fitted with a keyboard and mouse.

    5. Re:more challenging...practicality. by Richy_T · · Score: 1

      Agreed. The mousewheel has become ubiquitous because it out-competed the effort required to twitch the mouse to move the pointer over to the scroll bar.

      Aside: Have any comedies spoofed gesture interfaces? It seems like it would be a good target for a physical comedian (Rowan Atkinson, Lee Evans or Ryan Stiles for example).

    6. Re:more challenging...practicality. by Anonymous Coward · · Score: 0

      I would learn to use it since my right hand is experiencing significant pain from mousing and typing all day.

      Standing up to use gestures would help to relieve that pain and I could get more exercise, if not cardio. Just getting to move around more would be a relief.

    7. Re:more challenging...practicality. by Anonymous Coward · · Score: 0

      Yo, you must have never played with a Kinect/Xtion, Bee...
      Try processing with SimpleOpenNI lib for a quick entry, or
      see here for some examples: http://www.openprocessing.org/search/?q=kinect (or look online for tuts).
      That's nice for prototyping as well, btw.

      C U in minority report 2.0

  8. As an ASL Interpreter... by bigattichouse · · Score: 4, Informative

    I interpreted ASL in educational settings (High School, Freelance, University, Public, and even elementary.) for something like 6 or 7 years.

    My arms were ripped, and you could expect to burn several hundred calories (EASY) during a day of doing that. Also, I had learned the stretch properly thanks to some Aikido training.. and I still had some bad habits that caused me repetitive stress problems.

    Gestures are a novelty, and a lot of work for the user... I think there will be many blind alleys before they become natural.

    Some problems/ideas I see:
    1. Exhaustion - you waste a lot of energy
    2. "Namespaces" - you can make two gestures at once - geez... so you have a left hand gesture that tells the computer to listen (the ASL "Attention" one handed would work) + a command - maybe even "against" that hand. Its like a salute with your left hand vertical moving away from your face.
    3. Facial expressions are a HUGE part of ASL, probably not even considered. "WH" questions get eyebrows scrunched, other queries eyebrows up, puffed cheeks and all kinds of things...
    4. Security - I defy you to sign EXACTLY like someone else... It's possible, and easy in a mocking sense (High schoolers) - but I imagine a door that could see you carrying groceries and unlock combined with voice recog., or other simple things would be useful.

    --
    meh
    1. Re:As an ASL Interpreter... by bigattichouse · · Score: 1

      Well.. ok.. attention would be both hands... but I meant it in context of using it to communicate with a computer.

      --
      meh
    2. Re:As an ASL Interpreter... by Anonymous Coward · · Score: 0

      I interpreted ASL in educational settings (High School, Freelance, University, Public, and even elementary.) for something like 6 or 7 years.

      My arms were ripped, and you could expect to burn several hundred calories (EASY) during a day of doing that. Also, I had learned the stretch properly thanks to some Aikido training.. and I still had some bad habits that caused me repetitive stress problems.

      Gestures are a novelty, and a lot of work for the user... I think there will be many blind alleys before they become natural.

      Some problems/ideas I see:
      1. Exhaustion - you waste a lot of energy
      2. "Namespaces" - you can make two gestures at once - geez... so you have a left hand gesture that tells the computer to listen (the ASL "Attention" one handed would work) + a command - maybe even "against" that hand. Its like a salute with your left hand vertical moving away from your face.
      3. Facial expressions are a HUGE part of ASL, probably not even considered. "WH" questions get eyebrows scrunched, other queries eyebrows up, puffed cheeks and all kinds of things...
      4. Security - I defy you to sign EXACTLY like someone else... It's possible, and easy in a mocking sense (High schoolers) - but I imagine a door that could see you carrying groceries and unlock combined with voice recog., or other simple things would be useful.

      5. When in doubt, applying nikkyo is sometimes impractical.

    3. Re:As an ASL Interpreter... by Jmc23 · · Score: 1
      It's a computer. Your interaction with it is limited. You do not tell the cursor to move by giving it directions in english to move so and so pixels in the x direction and in the y direction. So why the hell would you communicate in ASL with it???

      All of your problems exist only because you are trying to use the wrong tool for the job.

      The HUGE advantage this would have is that it could be an avenue for teaching people how to move their bodies properly. RSI happens when you don't know how to move properly, that's why the damage occurs.

      Anyways, they still are coming about it the wrong way. I won't say why, I still have some hope that my system will be finished before someone comes up with the same idea. It'd be nice to be known for something and I'm extremely suprised nobody has implemented it before.

      --
      Don't complain about syntax, grammar, or spelling. There is no.hell like input on android.
    4. Re:As an ASL Interpreter... by JasterBobaMereel · · Score: 1

      Don't use ASL it is unlike English specific to USA/Canada and West Africa is not mutually intelligible with most other sign languages and is largely unknown outside the Deaf community

      There is no deaf sign language that is reasonably universal like English is in spoken languages so using ASL as a base will alienate many of your potential users ...

      The biggest issue is "Gorilla Arm" if you are making a lot of small precise gestures with arms out you get tied extremely quickly, which is why most sign languages avoid this, but most mockups of gesture systems show exactly this type of gesture

       

      --
      Puteulanus fenestra mortis
  9. Too bad, so sad. by AndyKron · · Score: 1

    Sorry, but whatever idea you have was patented already.

  10. How do you do it when dreaming? by TheRealMindChild · · Score: 1

    I would imagine the most intuitive interface is what you do to move objects when dreaming. For instance when I want to pull an object toward me, I focus on it, then tug an invisible rope quickly as to whip it back. I also push my hand outward holding my palm out as to say "stop" to push it away. Somewhere in our heads we already know what will work

    --

    "When life gives you lemons, don't make lemonade. Make life take the lemons back!" -- Cave Johnson
    1. Re:How do you do it when dreaming? by Jmc23 · · Score: 1

      Ah young padawan, do you not realize that the energy and the ritualistic movement that can generate it are two seperate things? There is no spoon.

      --
      Don't complain about syntax, grammar, or spelling. There is no.hell like input on android.
    2. Re:How do you do it when dreaming? by MickLinux · · Score: 1

      Aah, young Jmc, do you not recognize that Matrix and Star Wars are two different things?

      There is no force. No matrix either.

      --
      Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
    3. Re:How do you do it when dreaming? by Anonymous Coward · · Score: 0

      I have never, ever done this in a dream. On the other hand, an interface based around stomping on the brake pedal and having it not work would be pretty ideal for me. Or punching really slowly.

    4. Re:How do you do it when dreaming? by Jmc23 · · Score: 1
      Force is the nerve impulse. Matrix is the mind construct.

      Never heard of metaphors?

      Any one who has enough control in their dreams to do what the parent can would understand. Perhaps you need to educate yourself in lucid dreaming?

      --
      Don't complain about syntax, grammar, or spelling. There is no.hell like input on android.
    5. Re:How do you do it when dreaming? by MickLinux · · Score: 1

      I've done t before, but it's not as interesting as uncontrolled dreaming. I've also done multi-path dreaming: multiple dreams at the same time; but only once. I've also dreamed specifically in color once or twice.

      --
      Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
    6. Re:How do you do it when dreaming? by Jmc23 · · Score: 1
      Not as interesting? Lucidity is separate from control. Control in the hypnogogic state can be as boring as your imagination. Lucidity in deep rem is anything but un-interesting though requires extreme emotional control to maintain

      Lucidity at night is directly correlated to awareness during the day.

      --
      Don't complain about syntax, grammar, or spelling. There is no.hell like input on android.
    7. Re:How do you do it when dreaming? by Anonymous Coward · · Score: 0

      Oh, in that case the multipath dreaming was also lucid dreaming, but it was tied to me being extremely exhausted, iirc.

  11. What about intent recognition by DrXym · · Score: 1

    I'm wondering when the first device will appear which shows GUI elements even before the user has touched the screen. i.e. it senses the user moving their hand close to some portion of the screen and a hidden GUI associated with that area pops up in time to be there when they touch it. It'd have to be done carefully to avoid frustration but I think it'll come in time.

    1. Re:What about intent recognition by Anonymous Coward · · Score: 0

      hand jerking up and down == next picture ?

  12. Optional by gmuslera · · Score: 1

    As extra, optional, and not intrusive input device could be good, games are a good example, information consuming activities could be other. But for most content creation (music, and paint excluded) probably would be bad.

  13. I think there is a good solution to this problem by Omnifarious · · Score: 1

    It's the solution humans use in gesture-based communication. Eye contact. The computer needs an obvious eye, and the software needs to be able to tell when the user is looking at it. Bonus points if the eye can blink or move or otherwise express the computer's understanding of the gesture.

  14. Re:I think there is a good solution to this proble by Anonymous Coward · · Score: 0

    That would probably work. Face tracking is doable now and should be a subset of the capabilities of the hardware needed for a gesture interface in general.

  15. Hitchhiker's Guide to the Galaxy. by tragedy · · Score: 1

    Have to quote this from _The Hitchhiker's Guide to the Galaxy_ :

    A loud clatter of gunk music flooded through the Heart of Gold cabin as Zaphod searched the sub-etha radio wave bands for news of himself. The machine was rather difficult to operate. For years radios had been operated by means of pressing buttons and turning dials; then as the technology became more sophisticated the controls were made touch-sensitive--you merely had to brush the panels with your fingers; now all you had to do was wave your hand in the general direction of the components and hope. It saved a lot of muscular expenditure, of course, but meant that you had to sit infuriatingly still if you wanted to keep listening to the same program.

    The fundamental problem of gesture recognition has been recognized for quite some time now.

  16. need virtual keyboards by Anonymous Coward · · Score: 0

    The gesture stuff is nice, but we still need to type words to communicate and I'd like to see some kind of new creative keyboard interface that will make it more portable. Why not use this 3D gesture recognition to recognize how we move our fingers, to allow us to type virtually without a physical keyboard?