Crowd-Source Translation Software For Free Content?
yahyamf writes "I have a lot of free educational content in the form of audio lectures and text, which I'd like to translate into as many languages as possible. I would also like to transcribe the audio and create audiobooks from the text. There are already several volunteers willing to contribute, but I need some web-based software to manage all the work. Facebook is already doing something like this, but it is only for their content. I've also looked at Damned Lies, which is part of the Gnome project, but it doesn't seem to handle audio. Are there any other open source translation projects out there that I can customize and build upon?"
Are they your lectures and who owns the copyright on the lectures? Does the university or do you? Since your work product was for hire . . .
Beer is proof that God loves us and wants us to be happy.
I've fallen behind in my web 2.0 buzz words. What the hell's a crowd source? I was thinking someone or something that draws crowds like Obama or double jointed Swedish twins. Unenlightened minds want to know!
You're looking for open source software that can combine both those into something effective? If you don't mind the translated audio being practically useless, then you might be able to find something.
My hovercraft is full of eels.
I'm pretty sure that the writer of TFQ is looking for software to coordinate a human speech-to-text effort (i.e., manage volunteer accounts, serve audio clips for transcription/translation, receive results files from them, and so forth), not speech-to-text software.
He is, in essence, looking for an audio equivalent of the interface used by the Distributed Proofreaders project. With, perhaps, a side of translation mechanisms similar to the ones used on Ubuntu's Launchpad or equivalent. Neither is particularly exotic technologically.
Such a setup is more or less prosaic in CS terms; no major breakthroughs need to be made, but it would constitute a somewhat specialized flavor of content management system. I honestly don't know if anything of the sort exists.
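To give a rough idea of how prosaic the core is, here is a minimal in-memory sketch in Python of the coordination part: register clips, hand them out to volunteers by language, and collect the results. Every name in it (Task, Coordinator, add_clip, next_task, submit, progress) is made up for illustration and does not come from any existing project; a real system would still need accounts, storage, and a web front end.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Task:
    clip_id: str                    # hypothetical id of a short audio segment
    target_lang: str                # language the volunteer should produce
    assignee: Optional[str] = None  # volunteer currently working on it
    result: Optional[str] = None    # submitted transcription/translation


@dataclass
class Coordinator:
    tasks: list = field(default_factory=list)

    def add_clip(self, clip_id, target_langs):
        """Create one task per target language for a clip."""
        for lang in target_langs:
            self.tasks.append(Task(clip_id, lang))

    def next_task(self, volunteer, langs):
        """Assign the first unclaimed task in a language the volunteer knows."""
        for task in self.tasks:
            if task.assignee is None and task.target_lang in langs:
                task.assignee = volunteer
                return task
        return None  # nothing left for this volunteer

    def submit(self, volunteer, task, text):
        """Store a result, but only from the volunteer who holds the task."""
        if task.assignee != volunteer:
            raise ValueError("task is not assigned to this volunteer")
        task.result = text

    def progress(self):
        """Return (finished tasks, total tasks)."""
        done = sum(1 for t in self.tasks if t.result is not None)
        return done, len(self.tasks)
```

Usage would look something like: create a Coordinator, call add_clip("lecture01_part1.ogg", ["en", "pl"]), then have each volunteer call next_task with their name and languages and submit the text when done. The file name is just a placeholder.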
That's actually a great idea. Just include this link with a copy of the English language versions:
http://www.lmgtfy.com/?q=learn+to+speak+english
Problem solved!
As our way of thanking you for your positive contributions to Slashdot, you are eligible to disable Slashdot 2.0.
Hello,
At transposh we aim to create such a project, one that will enable crowd-sourced translation of websites (and hence of your scripts); no audio is planned, though.
Currently we have a WordPress plugin, but a generic plugin is being written. Everyone is welcome to help.
Ofer
This doesn't handle audio, nor does it even seem to be up, but it seems kind of like what you want:
http://blogoscoped.com/archive/2008-08-04-n48.html
The people over at BOINC have software called Bossa for distributed thinking projects (crowd sourcing). I am not sure of the current status of the project, but I have heard of at least one group that is trying to implement it.
This signiture copied from somewhere.
I'm a freelance translator and I'd like to warn you about the most serious pitfall of crowdsourcing - the quality. I've seen the Facebook translation into my language (Polish) and it's terrible. There are other projects done this way and most of them are of extremely poor quality.
The problem is this: if you want quality content, you need professionals to do the job. They don't necessarily have to be paid professionals (translators) - maybe just people from your field who wish to contribute for some reason or other. But in crowdsourcing you have to expect a lot of poor translations, and you have to introduce some form of quality control - best would be to hire editors, but maybe some kind of voting system would do.
Just don't let your content be translated without QA, because you won't sell much of it.
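As a rough illustration of the voting idea, here is a small Python sketch: reviewers vote for one of several candidate translations of the same passage, and a candidate is accepted only if enough people voted and a clear majority agrees. The function name pick_translation and the thresholds (at least 3 votes, at least 60% agreement) are arbitrary assumptions, not taken from any real project; anything that fails the check would go to a human editor.

```python
from collections import Counter


def pick_translation(votes, min_votes=3, min_share=0.6):
    """Return the winning candidate id, or None if support is too weak.

    votes: list of candidate ids, one entry per reviewer vote.
    min_votes / min_share: hypothetical thresholds, tune to taste.
    """
    if len(votes) < min_votes:
        return None  # too few reviewers so far
    winner, count = Counter(votes).most_common(1)[0]
    if count / len(votes) >= min_share:
        return winner
    return None  # no clear majority: escalate to an editor
```

A None result is the useful part: it tells you which passages still need an editor's eye instead of silently publishing a contested translation.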
It handles texts, not audio, but Open Source Mission's Gospel Translations might be a useful model. They work with publishers/rights-holders (if any) to get the right to post works, then coordinate translations to a huge variety of languages. Once a translation is done, they post/host it for free. The translations are developed using a Wiki. Their focus is on Christian works, but I think the approach would work for any literature you want widely distributed in a variety of languages.
- David A. Wheeler (see my Secure Programming HOWTO)
Indeed. I hope you don't mean that in a pejorative sense? When TFQ is asking about translation, it's perfectly appropriate for professionals in the field to chime in with their insights and expertise.
There was an article recently in the Japan Times about a project at the University of Tokyo to build a very similar system, though it is apparently just for texts being translated into Japanese. For the curious: http://search.japantimes.co.jp/cgi-bin/ek20090422a1.html. I don't agree with some of the pronouncements in the article (understanding the nuances of the source text and accurately conveying those in a fluently written target text does indeed take some skill, whereas the article and even the project name Minna no Honyaku suggest that 'anyone can translate!'), but the project itself looks interesting. The project site is http://trans-aid.jp/ (Japanese only).
Perhaps the TFQ submitter could contact Professor Kyo Kageura, mentioned in the article, to find out more about the Minna no Honyaku system? It's basically crowdsourcing for translation projects that don't merit the time, money, and quality of professional translation, which kinda sounds like what they're looking for.
Cheers,
"What in the name of Fats Waller is that?"
"A four-foot prune."
See also http://www.meedan.net/
Also, Google has a translation widget that might be a reasonable stop-gap measure.
http://translate.google.com/translate_tools?hl=en