Slashdot Mirror


Statistics For Data Entry: The Brave New Step

A reader writes:"First there was Dasher, a novel application of statistical theory that lets free texts be written using only a pointing device. Dasher works by predicting the continuations of the text being written, based on what has been written so far; there is a probability associated with each offered continuation and the presentation is designed to make it easier to choose more probable continuations. A big advantage of statistics-based interfaces is that they automatically enforce correctness, because correct strings are more probable than incorrect ones. Now the same approach has been extended to writing maths. Apropos is a Javascript application (it supports IE6 & Firefox) to create mathematical expressions. It represents the math using MathML, the official XML spec for mathematics. It is definitely clunky when compared to Dasher, but better than MS Equation Editor etc. It is interesting to consider if this approach can be extended to other XML vocabularies (for example, a model for HTML that suggests the markup as you go along - a properly trained one will make it harder to create pages with blinking text, loads of images etc.), or to formal languages other than XML (e.g. programming languages). Stochastic modeling can also be used as a basis for speech recognition, with the recognizer using the model to choose a continuation when the speech signal is ambiguous or indistinct."

8 of 121 comments (clear)

  1. Like t9 by xabi · · Score: 4, Interesting

    It seems to be the same concept as t9.

    --
    Check populicio.us
  2. Old technology by Inigo+Soto · · Score: 4, Interesting

    That is hardly news. Mobile phone interfaces have been offering this kind of interfaces for years. True, they are useful, but nothing new here

  3. It sounds good .. in theory by Anonymous Coward · · Score: 5, Funny

    "You appear to be writing a letter, and here's what you're probably going to say..."

  4. Riiiiiiiiight by $RANDOMLUSER · · Score: 4, Funny
    "Twas brillig, and the slithy toves
    Did gyre and gimble in the wabe:
    All mimsy were the borogoves,
    And the mome raths outgrabe."

    It knew he was going to say that.

    More likely, it's going to predict that someone's going to say "Let's circle back and touch base tomorrow".

    --
    No folly is more costly than the folly of intolerant idealism. - Winston Churchill
  5. Why this isn't the same as T9. by pkhuong · · Score: 4, Informative

    Having used both dasher and T9, it seems to me that t9 only takes into account the keystrokes entered for each word. It then correlates them to a dictionary. Dasher, on the other hand, is based on markov chains (yes, like those word/text generators), and thus takes into account the last [n] characters. That makes it much more accurate, and, interestingly enough, should make it particularly well-suited to editing programs in most mainstream languages, since they have a lot of noise words and frequently used sequences.

    --
    Try Corewar @ www.koth.org - rec.games.corewar
  6. Quick test by potifar · · Score: 4, Insightful
    MathML was never really intended for writing by hand, and even if Apropos makes it easier, I can't see myself switching from (La-)TeX anytime soon. I can enter extremely complex mathematical expressions at least 20-30 times faster by typing them in TeX than I ever could do clicking around an interface like Apropos.

    MathML is a good idea in theory, but until there are good tools for writing and editing MathML, there will be very few people using it (either for publishing or for archival purposes.)

  7. Failures of inattention by hussar · · Score: 4, Interesting

    As other posters have noted, this sounds a lot like T9, which is used in cell phones for predictive text entry. T9 is a great utility, but it has happened that what I am writing is less predictable or the there is a more often used combination of letters that results from the keys I have hit. If I don't pay attention, I get the wrong word.

    I can't help but think of someone entering a mathematical equation and concentrating more on his idea than what is being written to the screen. Due to this inattention, the equation doesn't work, he figures he's just wrong, and spends hours/days to find the point at which the computer put in its prediction and not what he thought he entered. Worst case, he could abandon what would have been a great idea.

    Or, imagine this applied to writing computer programs. Say for example, you are writing a program to calculate the correct distance the probe should hold above the atmosphere so it doesn't burn up. Your cube mate distracts you briefly, and...

    --

    Bureaucracy loves company.
  8. Dasher and stats rock by palad1 · · Score: 4, Funny

    I did a quick test run of Dasher instead of RTFA, and as far as I understand, it works by presenting the most statistically-probable letter in the middle of the input area.

    So, by dragging a perfectly horizontal line with my mouse cursor, I was able to create the most statistically-probable sentence.

    Here goes, for Science:

    Kennedy insider&xeathGhed a noviceable. Punt.uetGrance beganic or Central believe t, space ship,' Alice, it is deleasantB.Carzone.That's luJbi

    Conspiracy theorists, area51 nuts and cypherpunks are going to be thrilled!