Slashdot Mirror


User: bedahr

bedahr's activity in the archive.

Stories
0
Comments
14
First seen
Last seen
Profile
(view on slashdot.org)

Comments · 14

  1. Re:Open Source, or Microsoft-Owned? on Open Source Speech Recognition · · Score: 1

    You would only need to change the modelmanager (modelmanager.cpp) to use different tools.

    It wouldn't be that hard I guess....

    -- bedahr

  2. Re:Project's webpage in English? on Open Source Speech Recognition · · Score: 1

    Have you downloaded the software?

    There is a work-in-progress article about setting up simon on our project wiki: http://simon-listens.org/wiki

    As it is still far from complete here the key points:

    * Download and compile simon (this can be tricky on windows).

    * Simon will present you with a setup-type wizard when you first start it (you have to compile it first).

    * The wizard will prompt you for paths and settings.

    * You can then create a new language model / open an existing one (if you speak english you could try out the voxforge model) and specify commands.

    Please try it out and contact me if you have any questions.

  3. Re:Open Source, or Microsoft-Owned? on Open Source Speech Recognition · · Score: 2, Informative

    This software's license most obviously violates requirements 1, 2, and 3. These are perhaps the most important provisions of the definition and form the basis for the power of calling a license an open source license. By not adhering to this definition when calling licenses and software "Open Source" you dilute the power the terms carries. Simply calling something 'open source' because they allow you to look at the source code is something we should avoid because 'Open Source' requires freedom not just source.

    Simon does not violate this description in ANY way.

    HTK is not redistributed with simon so simon itself complies exactly with what you are writing.

    Simon does not depend on the HTK toolkit. It simply uses it to compile / maintain the model. If you have compiled the model already (simon explicitly asks if you have done that already when starting the first time) you can specify the path to it.

    Simon will then just use the model and can still start programs, type text, etc.

    There is absolutely no need for the HTK toolkit. Simon is also useful without it.

    Is e.g. X.org not open source because it has the means to put non-free software to use to make it even more powerful? (e.g. the nvidia driver)

    Simon itself stands under the GPLv3.

    -- bedahr
  4. Re:Reason we're not there yet... on Open Source Speech Recognition · · Score: 1

    Well simon is not that far off of that idea.

    The real recognition is "outsourced" to juliusd which communicates with simon over tcp/ip.

    The main simon program is just to maintain the language model / train it / etc.

    In my opinion exposing the recognition results over e.g. dbus would be a better way than to quadruple the efforts by splitting this (HUGE) task to gnome, kde, xfce, window, etc.

    -- bedahr

  5. Re:been playing with it on Open Source Speech Recognition · · Score: 1

    Yes that would be the type of thing that can be done already quite reliably...

    -- bedahr

  6. Re:Project's webpage in English? on Open Source Speech Recognition · · Score: 1

    I don't have a windows machine here to play with...

    So sadly there are no official windows binary (no binaries at all for that matter) until now.

    (The alpha version is not really targeted for the end-user)

    -- bedahr

  7. Re:Open Source, or Microsoft-Owned? on Open Source Speech Recognition · · Score: 5, Informative

    Simon is in no way connected to Microsoft.

    Simon does NOT contain the HTK toolkit - it meerly executes commands.

    HTK is free of charge and open source (in the strict sense of you-can-look-at-the-code). It is, however, not "free".

    We are aware of that and have not packaged any parts of HTK for the release - you have to download it yourself if you want to modify the model from within simon.

    It is not optimal, but we don't have the knowledge and / or manpower to code up something similar in a reasonable timeframe. And after all, it isn't that big of a deal, is it?

    -- bedahr

  8. Re:Open Source? on Open Source Speech Recognition · · Score: 1

    Sphinx is just an engine - isn't it?

    Simon takes the julius engine and uses the recognition results to do something useful.

    Please take a look at the screenshots at the sourceforge page (mentioned in the article).

    -- bedahr

  9. Re:Project's webpage in English? on Open Source Speech Recognition · · Score: 5, Informative

    We are sorry that there is no international homepage for this yet.

    BUT: you are strongly encouraged to contact me with any questions: grasch < at > simon-listens.org

    -- Peter

  10. Re:Which languages are supported? on Open Source Speech Recognition · · Score: 1

    The language has nothing to do with the software.

    But it has everything to do with the model. You'd just need for exmpaple an Italian language model. (Sure the ui /should/ probably translated as well but that has nothing to do with the recognition).

    Simon doesn't even include a language model - it does, however include the means to create one.

    -- Peter

  11. Re:Are they productive? on Open Source Speech Recognition · · Score: 3, Informative

    You might want to have a look at the voxforge project

    And this doesn't require changes in the algorithm - just in the model.

    -- bedahr

  12. Re:Are they productive? on Open Source Speech Recognition · · Score: 1

    This is exactly what we are going for.

    Our training persons have spastic disabilities.

    -- bedahr

  13. Re:been playing with it on Open Source Speech Recognition · · Score: 2, Interesting

    Actually you don't need to get your hands dirty for writing your own grammar. Simon includes a complete grammar module with ways to compile the grammar, edit the sentence structures, import them from written texts (by looking the words up in the dictionary), etc.

    -- bedahr

  14. Re:been playing with it on Open Source Speech Recognition · · Score: 5, Informative

    This is actually the simon approach does: the magic keyword is "simon". "simon Firefox" for example. -- bedahr