I don't think it's clear, but it seems obvious. Is this anything more than a verbose description of a server sending OOO emails with some specific technologies mentioned?
I live on Niuatoputapu, Tonga. 1-3 months to the next town (whenever the boat comes). I often have to clean the sparkplug in my generator before booting.
My net connection is 14.4 dialup that cuts out every five minutes... long enough to load Slashdot and POP email.
I think it is widely recognized that you need to take coarticulation and _meaning_ into account when converting between speech and text.
You argued in another post for models of 4+ phonemes. Why we don't see this is because it's not a huge theoretical leap from triphones (thus boring researchers) and there are computational/storage/training efficiency requirements to consider. This is why one doesn't record an exhaustive library of every possible utterance in the first place. I think once you get to 7-phones, you may be better off trying to understand the phrase from higher level of abstraction.
Have we correctly identified the right compact expression of speech? I doubt it. Getting speech stuff to work involves a lot of tweaking that is theoretically ungrounded. Tweaking in a methodical and science-biased way _is_ engineering, however.
BTW, I seem to remember a prof saying that X-ray cinematography more-or-less proved the existance of vocal tract target configurations in speech, which correspond to phonemes. Not to mention that you can encode a message in IPA and have it understood by someone else. Even if they're not totally correct, phonemes may be a sufficient basis for building speech systems.
"Votester"? What does this application have to do with voting?
I could see "Protester", and then you'd have an accurate, descriptive name which fulfills your Rob Schnider-esque desire to end every P2P app with "-ster".
I don't think it's clear, but it seems obvious. Is this anything more than a verbose description of a server sending OOO emails with some specific technologies mentioned?
I live on Niuatoputapu, Tonga. 1-3 months to the next town (whenever the boat comes). I often have to clean the sparkplug in my generator before booting.
My net connection is 14.4 dialup that cuts out every five minutes... long enough to load Slashdot and POP email.
I think it is widely recognized that you need to take coarticulation and _meaning_ into account when converting between speech and text.
You argued in another post for models of 4+ phonemes. Why we don't see this is because it's not a huge theoretical leap from triphones (thus boring researchers) and there are computational/storage/training efficiency requirements to consider. This is why one doesn't record an exhaustive library of every possible utterance in the first place. I think once you get to 7-phones, you may be better off trying to understand the phrase from higher level of abstraction.
Have we correctly identified the right compact expression of speech? I doubt it. Getting speech stuff to work involves a lot of tweaking that is theoretically ungrounded. Tweaking in a methodical and science-biased way _is_ engineering, however.
BTW, I seem to remember a prof saying that X-ray cinematography more-or-less proved the existance of vocal tract target configurations in speech, which correspond to phonemes. Not to mention that you can encode a message in IPA and have it understood by someone else. Even if they're not totally correct, phonemes may be a sufficient basis for building speech systems.
Ha ha ha ha ha
ho ho ho ho ho
hee hee hee
hum
phew.
guess you had to be there.
"Votester"? What does this application have to do with voting?
I could see "Protester", and then you'd have an accurate, descriptive name which fulfills your Rob Schnider-esque desire to end every P2P app with "-ster".