A "high justification"? How about speech recognition that actually works?
Training speech recognisers requires data. The biggest reason why speech recognition has improved in the recent years: lots of data.
Speech recognition in the cloud has given companies like Apple and Google a reason/excuse to gather masses of training data. They have put it to good use: speech recognition is much better than it was. If you like speech recognition, use it, meanwhile donating your data and helping the rest of us. If you don't, don't use it. As long as users are aware of this, I don't really see the problem.
Really? Have you tried Gnome 3.4? I installed it two weeks ago (on Ubuntu), and I haven't looked back. Maybe you've been bitten by previous versions? It seems GNOME 3 is like KDE 4: they need a couple of versions to get it right. (And an additional couple of versions to get the mindshare back.)
I love how GNOME3 gets out of your way while you're doing useful things, and how at the same time any context switch (starting a new application, or going to an existing window) is just one key press away. Even the colours are well thought-out: all context-switching stuff is in black, content is mostly bright.
The way Gnome3 handles IM is really exemplary: messages turn up at the bottom of the screen (in black). You can ignore them, or mouseover and type a reply. As soon as you move your mouse away, the conversation disappears and you can go on with what you were doing.
Does anything else beat this in terms of productivity?
I'm surprised that there seems to be this Slashdot groupthink against new desktop environments on the basis that they are different from Windows or Gnome2. Surely most people posting here have an IQ that allows them to learn a new interface? And actually come up with meaningful arguments?
Disclaimer: I'm not a patent lawyer, but I do know a thing or two about speech recognition. I've only read the summaries of the patents, but they don't seem to cover anything that Siri or any other sensible speech recogniser system does.
Patent 7,266,496 (from 2007) is about a complete speech recogniser on a chip. This couldn't be further from Siri, which sends the audio data to the cloud to be recognised. The four "modules" that the patent covers are bog-standard. Patent 7,707,032 (from 2010) describes a silly way of doing speech recognition (by comparing with individual training samples) and is unrelated to any modern commercial speech recogniser.
This is just some Taiwanese university hoping for two minutes of fame and a settlement. They seem to be getting their fame, but they're not going to see any money.
A big factor is how well the programmer knows their stuff. I basically run the same test, accidentally, when I ran a course on a university computer science curriculum. Graduate students had to program a small speech recogniser in C++, Java, or Python. I measured the speed of the entries on a standardised task. My reference version, in C++, was fastest, no surprise there. Second and third were students' programs in Java and Python (!). The very slowest program was extremely convoluted C++ code, and a factor of 20 slower than the fastest C++ version. The Java programs showed less variance: they were bland, but all reasonable. The entries in Python were concise.
My conclusion is that the performance advantages of C++ are easily offset by its ability to confuse the programmer. C++ is arcane and complicated; Java is more limited; Python is a lot cleaner. If you know exactly what you're doing, then C++ can be the fastest option, but in real-life situations, with deadlines and average programmers, don't go there.
This paper shows why psychologists should not touch computers, let alone write papers about it.
Hyperlearning may be a cause of schizophrenia in people, but this paper shows nothing of the sort. The learning rate in artificial neural nets determines a step size for the optimisation of a function. You need small steps because at each step the learning algorithm (gradient descent, or error backpropagation in this case) assumes that the function is linear. So neurons in the human brain assume piecewise linear behaviour in their neighbours? Of course not. The authors are just clueless about the mathematical model that they use.
You may not realise that speech recognisers need training data. And there is no data like more data. A year ago someone from Google told me that they trained their recogniser on 1000 hours of voice searches. If every utterance is a couple of seconds long, that's a lot of recordings. When you do a voice search, you can select from a number of recognition hypotheses. This is how they get transcribed data.
They also need to train on your voice specifically before you get decent recognition performance. I found that after a while my phone became surprisingly good at decoding my speech.
I do agree the privacy aspect is a concern, but in this case at least you benefit personally from Google storing your data.
I'm doing my PhD on speech recognition. I think (and hope!) it's neither dead nor fully developed. Currently, changes of environment screw speech recognisers up. Different speakers, background noise...
A trick that I heard has been used for subtitling television broadcasts is to have someone re-speak the words (which is not that hard). You could play the audio recordings on your headphones while repeating them into a microphone. If you're in a quiet room and the recogniser is trained on your voice, that may get you most of the way. You'll still want to correct transcriptions manually.
I don't know of any good trained open-source speech recognisers. There are open-source back-ends like Sphinx or HTK (which I sort of work on) but you need massive transcribed training corpora to train a speech recogniser. This is expensive which I guess is why open-source speech recognition hasn't taken off. In the speech recognition group at my university, most people use Linux, and I don't think anyone actually uses a speech recogniser in their daily work.
The education system, I'd say across the world is completely outdated and is a perfect example of a government run system.
Let me guess. Your world is constrained to North-America?
Even with all the technological advances available to schools, we still use the 17th century lecture style instruction method across the globe. We cram 30 students into the room with 1 teacher, and force everyone to learn at one pace: from the smartest to the dumbest.
In countries like the Netherlands and Germany, there are three or four different tracks for students aged 12-18. Around 15% follow the "pre-university" track in the Netherlands. It worked well for me, for exactly the reasons you give.
You wont get this though. Because we live in a world that demands "social justice" aka: forcing the smartest to be clumped in with the dumbest and the laziest.
I don't know where you get your idea of social justice from. Social justice would be to mix rich and poor in a classroom.
2 - Republicans don't go to war more then Democrats. Both parties voted to go to war. People seem to forget that polls showed that US citizens, as well as many of the world supported going into Iraq immediately after 9/11 on a false premise that Saddam had ties to 9/11. Bush pushed for diplomacy and intel. That intel concluded that Saddam had no ties to 9/11. A warmonger strikes while the iron is hot, not pushes for diplomacy for a few more years.
I can't stand to see such blatant deception moderated so highly. Bush and his cabinet pushed for war, and manipulated intelligence to make it look more desirable. No one ever suggested that there was a link between Saddam and 9/11; rather, Bush's administration manipulated evidence to falsely suggest that Saddam had weapons of mass destruction.
You may be one of these people who pay attention to the facts. However, many Americans at the time didn't, and believed that Saddam was behind the attacks. See this article in Political Science and Politics, 2004.
Iran is more like pre-Reformation Europe--a civilization whose people are growing more advanced, leading to tensions with a medieval theocratic regime.
The issues that the Iranian government is known for abroad are not necessarily interesting to normal Iranians.
My Iranian friend happened to be there during the British hostage thing.
The Iranians saw their government stance merely as a silly show-off.
The Iranians I know are unhappy with the low economic growth and high unemployment rates rather than theological tensions.
You are misinformed. You probably don't have any experience with Boost, nor do you get your facts right.
Boost consists of different parts, and for using boost::shared_ptr you need only a couple of headers. There is even a tool that extracts the necessary bits for you.
Its bloated , has a wierd syntax that differs from the C++ norm and doesn't solve any problem that isn't already solved or could be done quite easily by standard C++ anyway. boost::shared_ptr has been in standard library implementations for a couple of years now as std::tr1::shared_ptr. It will most likely be included in the next C++ standard. Apparently the C++ standard committee did think that shared_ptr solves problems that the old C++ standard does not.
I don't see much explanatory value in talk about "power users".
That I am an expert on speech recognisers does not make me want to manipulate zillions of settings when I'm burning a CD.
I have better things to do.
KDE is not the desktop of choice for "power users", but for people with too much time on their hands.
Come to think of it, that's exactly the psychological profile of the average Slashdot reader!
Has anybody thought it through? It's not even a matter of security, just plain utility. You may need to think again.
Keep in mind that some huge fraction of Americans never intend to get an Internet connection. Don't just dismiss that many people as idiots, either. In the Netherlands and Scandinavia over 90% of the population under 55 use the Internet at least once a week.
You imply that Americans are naturally less inclined to get an Internet connection (you use the word "idiots").
It is more reasonable to assume that America is somewhat behind, and will catch up.
And how would you like it if your C++ compiler or GIMP or Photoshop or 3D Studio Max was a web application? Most people have computers only to use the Internet and maybe to write the odd letter, not for their "C++ compiler or GIMP or Photoshop or 3D Studio Max".
AFAICT, subpixel rendering is not disabled, only the 5-tap filter that's supposed to reduce colour fringes.
See http://www.grc.com/cttech.htm.
Apparently this is one of the things Microsoft has patented, and I haven't seen any "prior art" for this specific technique.
In my humble opinion disabling the filter is not much of a loss as it just makes fonts look fuzzier.
The text people have to read is given. I.e. the orthographic transcription is available. It is possible to bootstrap a speech recognition system from these transcriptions. It will not be particularly good, though.
The more important problem is that current speech recognisers do not generalise well. If you train only on read speech, the performance on spontaneous speech will most likely be horrible. Transcribing spontaneous speech, however, takes enormous amounts of time. And it is not the kind of job you want to do for more than ten minutes. So I don't see how a good speech recogniser can be produced without money. I'm afraid this effort is going to lead nowhere, much though its purpose is to be applauded.
The IFA Dutch Open Source Corpus is way too small for producing a speech recogniser. The best systems produced at the institution where I am studying are trained on about a thousand hours of speech.
Android 4.0 runs fine on the Desire S with the ROM from HTCDev.
The surprising thing here is that HTC has actually supported the phone for longer than Cyanogenmod.
A "high justification"? How about speech recognition that actually works?
Training speech recognisers requires data. The biggest reason why speech recognition has improved in the recent years: lots of data.
Speech recognition in the cloud has given companies like Apple and Google a reason/excuse to gather masses of training data. They have put it to good use: speech recognition is much better than it was. If you like speech recognition, use it, meanwhile donating your data and helping the rest of us. If you don't, don't use it. As long as users are aware of this, I don't really see the problem.
"If you have something that you don't want anyone to know, maybe you shouldn't be doing it in the first place" - Eric Schmidt
You may be interested in the context of that quote.
http://blogs.gartner.com/jeffrey_mann/2009/12/18/googles-eric-schmidt-needs-media-training-not-a-privacy-spanking/
Really? Have you tried Gnome 3.4? I installed it two weeks ago (on Ubuntu), and I haven't looked back. Maybe you've been bitten by previous versions? It seems GNOME 3 is like KDE 4: they need a couple of versions to get it right. (And an additional couple of versions to get the mindshare back.)
I love how GNOME3 gets out of your way while you're doing useful things, and how at the same time any context switch (starting a new application, or going to an existing window) is just one key press away. Even the colours are well thought-out: all context-switching stuff is in black, content is mostly bright. The way Gnome3 handles IM is really exemplary: messages turn up at the bottom of the screen (in black). You can ignore them, or mouseover and type a reply. As soon as you move your mouse away, the conversation disappears and you can go on with what you were doing. Does anything else beat this in terms of productivity?
I'm surprised that there seems to be this Slashdot groupthink against new desktop environments on the basis that they are different from Windows or Gnome2. Surely most people posting here have an IQ that allows them to learn a new interface? And actually come up with meaningful arguments?
I know, I must be new here...
Disclaimer: I'm not a patent lawyer, but I do know a thing or two about speech recognition. I've only read the summaries of the patents, but they don't seem to cover anything that Siri or any other sensible speech recogniser system does.
Patent 7,266,496 (from 2007) is about a complete speech recogniser on a chip. This couldn't be further from Siri, which sends the audio data to the cloud to be recognised. The four "modules" that the patent covers are bog-standard. Patent 7,707,032 (from 2010) describes a silly way of doing speech recognition (by comparing with individual training samples) and is unrelated to any modern commercial speech recogniser.
This is just some Taiwanese university hoping for two minutes of fame and a settlement. They seem to be getting their fame, but they're not going to see any money.
it makes me literally trip over my own fingers.
Some hate it when people use "literally" when they mean "figuratively".
Not me. I picture you literally tripping over your fingers. Which body parts were you playing your bass with before that happened?
My conclusion is that the performance advantages of C++ are easily offset by its ability to confuse the programmer. C++ is arcane and complicated; Java is more limited; Python is a lot cleaner. If you know exactly what you're doing, then C++ can be the fastest option, but in real-life situations, with deadlines and average programmers, don't go there.
This paper shows why psychologists should not touch computers, let alone write papers about it. Hyperlearning may be a cause of schizophrenia in people, but this paper shows nothing of the sort. The learning rate in artificial neural nets determines a step size for the optimisation of a function. You need small steps because at each step the learning algorithm (gradient descent, or error backpropagation in this case) assumes that the function is linear. So neurons in the human brain assume piecewise linear behaviour in their neighbours? Of course not. The authors are just clueless about the mathematical model that they use.
You may not realise that speech recognisers need training data. And there is no data like more data. A year ago someone from Google told me that they trained their recogniser on 1000 hours of voice searches. If every utterance is a couple of seconds long, that's a lot of recordings. When you do a voice search, you can select from a number of recognition hypotheses. This is how they get transcribed data.
They also need to train on your voice specifically before you get decent recognition performance. I found that after a while my phone became surprisingly good at decoding my speech. I do agree the privacy aspect is a concern, but in this case at least you benefit personally from Google storing your data.
I don't know of any good trained open-source speech recognisers. There are open-source back-ends like Sphinx or HTK (which I sort of work on) but you need massive transcribed training corpora to train a speech recogniser. This is expensive which I guess is why open-source speech recognition hasn't taken off. In the speech recognition group at my university, most people use Linux, and I don't think anyone actually uses a speech recogniser in their daily work.
The education system, I'd say across the world is completely outdated and is a perfect example of a government run system.
Let me guess. Your world is constrained to North-America?
Even with all the technological advances available to schools, we still use the 17th century lecture style instruction method across the globe. We cram 30 students into the room with 1 teacher, and force everyone to learn at one pace: from the smartest to the dumbest.
In countries like the Netherlands and Germany, there are three or four different tracks for students aged 12-18. Around 15% follow the "pre-university" track in the Netherlands. It worked well for me, for exactly the reasons you give.
You wont get this though. Because we live in a world that demands "social justice" aka: forcing the smartest to be clumped in with the dumbest and the laziest.
I don't know where you get your idea of social justice from. Social justice would be to mix rich and poor in a classroom.
Combining of C++ and Python works really smoothly with Boost.Python. C++ objects can look and feel like Python objects, and vice versa.
I can't stand to see such blatant deception moderated so highly. Bush and his cabinet pushed for war, and manipulated intelligence to make it look more desirable. No one ever suggested that there was a link between Saddam and 9/11; rather, Bush's administration manipulated evidence to falsely suggest that Saddam had weapons of mass destruction.
You may be one of these people who pay attention to the facts. However, many Americans at the time didn't, and believed that Saddam was behind the attacks. See this article in Political Science and Politics, 2004.
That summary is over-concise. The 1979 Islamic Revolution was a result of growing unease with the US-backed monarch, installed through the 1953 coup when the elected prime minister didn't want to share the country's oil with the West. Say what you will, but under the Islamic government education improved, with female literacy improving tremendously.
The issues that the Iranian government is known for abroad are not necessarily interesting to normal Iranians. My Iranian friend happened to be there during the British hostage thing. The Iranians saw their government stance merely as a silly show-off. The Iranians I know are unhappy with the low economic growth and high unemployment rates rather than theological tensions.
I think the difference between KDE and Gnome can be explained sufficiently well by two screenshots, taken from random places on the web.
Copying a CD with KDE
Copying a CD with Gnome
I don't see much explanatory value in talk about "power users". That I am an expert on speech recognisers does not make me want to manipulate zillions of settings when I'm burning a CD. I have better things to do. KDE is not the desktop of choice for "power users", but for people with too much time on their hands.
Come to think of it, that's exactly the psychological profile of the average Slashdot reader!
The main developer of FreeType decided to disable the filter in September. The Novell deal was later and had nothing to do with this.
AFAICT, subpixel rendering is not disabled, only the 5-tap filter that's supposed to reduce colour fringes. See http://www.grc.com/cttech.htm. Apparently this is one of the things Microsoft has patented, and I haven't seen any "prior art" for this specific technique. In my humble opinion disabling the filter is not much of a loss as it just makes fonts look fuzzier.
What kind of society do you live in where "fuck" denotes violence? How does your partner feel about that?
The text people have to read is given. I.e. the orthographic transcription is available. It is possible to bootstrap a speech recognition system from these transcriptions. It will not be particularly good, though.
The more important problem is that current speech recognisers do not generalise well. If you train only on read speech, the performance on spontaneous speech will most likely be horrible. Transcribing spontaneous speech, however, takes enormous amounts of time. And it is not the kind of job you want to do for more than ten minutes. So I don't see how a good speech recogniser can be produced without money. I'm afraid this effort is going to lead nowhere, much though its purpose is to be applauded.
The IFA Dutch Open Source Corpus is way too small for producing a speech recogniser. The best systems produced at the institution where I am studying are trained on about a thousand hours of speech.