Universal Free Dictionary
Zdenek Broz writes "The all free dictionaries project focuses on maintaining free dictionaries (now more than 90 with more than 3,300,000 translations). We are designing a new system which will unite them all into one universal dictionary for all languages. The universal dictionary will be soon available for free under GPL."
This seems to be entirely useful. The proposed design looks like they will have a quick dictionary lookup on a word in the language being used with a definition, and cross-reference to the same word in other languages. That could be entirely useful, and anyone who enjoys Engrish might wish to help add that module (mostly for fun), but it looks like this project might actually take some of the mystery out of translation. Perhaps Engrish is going to be a thing of the past?
The dangers of knowledge trigger emotional distress in human beings.
Klingon? Jive? 1337 5P34K? Pig Latin?
I'm all for this but dictionary.com,Babelfish, and google meet my dictionary needs.
-Teiresias
Why not just contribute to Wiktionary?
Or if they don't like the possibility of vandos, why not fork Wiktionary?
Le français vous intéresse?
Great! Now I can add all of my typos and misspellings to the dictionary and the slashdot spelling weenies won't be able to say anything.
"I use a Mac because I'm just better than you are."
I think a multi-language Urban Dictionary for slang would be far more useful.
http://www.wiktionary.org/ has been doing this for a long time, what's wrong with them?
Aren't there much better licenses for dictionaries than the GPL? Creative Commons comes to mind. What does the Guttenberg project use?
Moderation Totals: Flamebait=2, Troll=1, Redundant=1, Insightful=6, Overrated=1, Underrated=1, Total=12. (not mine)
So, I wouldn't be able to translate "blue jeans" from another langauge? This really sucks, because on of my High School spanglish teachers taught us that it translated to "bluyins" in Spanish, and I've really never trusted that...
Perhaps they should wait until they have a more robust system before making these types of announcments?
Why don't more people put syllable boundary markers in their phonetic transcriptions?
Me lost me cookie at the disco.
The flaw I see is that English is used as the base language. That creates a severe problem. English is not neutral. It's severely screwed up, as languages go, and is very hard to learn. There are multiple meanings of most of the words in the english language. Oh, sure, it's easy to use english, if that's the language the developers speak. But I do not think it best. Esperanto would be a good choice. Each word has basically one meaning. It has few grammar exceptions. Lots of translations into esperanto end up being more accurate than translations into other languages for multiple reasons.
But definitely, English is the opposite of a good choice.
Drop me a line at:
Key ID: 0x54D1D809
So what are you doing to find the similarities between languages? It would seem that if I searched for an Italian word, I would get the Latin root and them the related languages. This is more than a dictionary this is 65,000 years of human history, if you so allow it!
Oh, and IMNAL - I am not a linguist.
/usr/bin/grep -i -E meaning life.txt
All your noun are belong to us!
sigs, as if you care.
it would be *very* cool if there was a decent way to handle different "levels" of each language.
For example, in quebec we use the word "char" for your car... "j'vais prenez le char ce soir", i'm going to take the car tonight.
this isn't *good* french, but it's good quebec slang. it's how people actually speak. however you wouldn't use it if you were trying to write a cover letter, but you might use it if you were writing an email to a french friend. A dictionary where you could specify "speaking" vs "writing", or even "polite" vs "friendly", some way of really characterizing the KIND of translation you want.
expressions too... sometimes expressions can be directly translated, other times you'll sound like an idiot if you just use the same phrase you would have said in english. Something that recognizes common phrases and gives corresponding expressions in another language would be incredibly useful.
I guess what I'm getting at is it's annoying when you look up a word in a translation dictionary and get like 4 or 5 choices but you have no idea what the difference between them is, or it gives you a word that actually is correct, but is so rarely used that when you say it people look at you funny.
And for a convenience, it will automatically correct your spelling as follows:
Actually what is needed is a phrase translator (like those electronic pocket thingies you can buy ..they're great ..especially the ones that will actually say out the phrase for you) .. Rather than just translating words .. a dictionary that translates actual common sentences would add tremendous value. This is important because a dictionary doesnt tell you diddly on how to construct a meaningful sentence let alone help you understand common idioms.
Idiot - noun
1 usually offensive : a person affected with idiocy
2 : a foolish or stupid person
3 : Luigi Dipthong, who insulted me deeply by saying that I had completely misdrawn the control unit of my favorite processor.
There are translation for some of the words, for exemple House:
* Arabic: (bayt)
* Basque: etxe
* Breton: ti m
* Catalan: casa
* Chineses:
* Czech: dm m
* Dutch: huis n
* Esperanto: domo
* Estonian: maja
* Fijian: vale
* Finnish: talo (1, 2), house (3)
* French: maison f
* Frisian: hûs n
* Galician: casa f
* German: Haus, n
* Greek: oiko, spiti (modern Greek)
* Hebrew:
* Hungarian: ház
* Indonesian: rumah
* Italian: casa
* Japanese: (, ie), (, tatemono)
* Latin: domus f
* Low Saxon: Huus n
* Malay: rumah
* Persian: (xne)
* Polish: dom m
* Portuguese: casa f
* Romanian: cas f
* Romanica: casa f, domo m
* Slovene: hisa f
* Spanish: casa f
* Russian: m
* Swedish: hus n
* Turkish: ev n
My city: Barcelona.
... and it said, "Something that happens to most Japanese men before sex"?!?!
Free Firefox news reader.
Glossaries like these have their uses, and I sometimes use them myself when I'm reading something and don't know a word, but good dictionaries go way beyond these. To begin with, you often can't adequately translate a word from one language with a single word from another language. It often takes at least a phrase, and sometimes there isn't any straightforwad translation and a fairly elaborate explanation is necessary. Furthermore, especially if you're going into the language you don't know well, it is often necessary to have information about the grammar of the word in order to be able to use it properly. What case does the object of a verb have to be? Which conjugation does a verb belong to?
The other major limitation of simple glossaries like these is that they don't work very well for languages with complex word-formation where the citation form is not easily obtained from the inflected forms. For instance, in English it isn't a big deal to look up a plural noun because in almost all cases you just remove s or es, so someone who reads, e.g. trapezoids doesn't need to know very much in order to guess that it is a form of trapezoid and look it up under trapezoid. However, there are languages in which words have hundreds or thousands of forms and in which it is quite difficult to figure out what to look a word up under. Creating dictionaries for such languages that can be used by inexpert users is a long-standing problem for which electronic dictionaries offer a solution, but such dictionaries won't be simple glossaries; they will be databases with morphological analyzers as front ends. I've got a paper about this problem in Athabaskan languages here.
Now, if I can just find someone to extract this stupid fish...
Some of the texts on the Free Dictionaries project are listed as being licensed under the GPL. Can you mingle public-domain text with GPL'ed text?
This is a matter of practical concern. I'm overseeing a project which is digitizing copyright-expired dictionaries of the early Germanic languages. Some of the texts on my site are in German, and I'd like to use the GPL'ed Free Dictionaries German-English word list to add a feature to my project which allows you to click a German word to get a translation for that word.
Question 1: Are there provisions of the GPL which would prevent the a GPL'ed dictionary from being intermingled in this matter with existing public domain texts?
Another problem. The texts in my project contain many rare German words relating to Iron Age technology which are unlikely to be in the Free Dictionaries list, so I'd like to add my own supplemental list of words.
Question 2: Can I assign my supplemental word list to the public domain, or do I have to license it under the GPL as a modification to the original word list?
We haven't even gone into Eubonics, or the difference between the Queen's English and American English. English not only borrows words and phrases from other languages, it borrows grammar, and oddly. This is where all the exeptions come from.
Would that be two, to, or too? Would you use there or their? Most Americans can't spell "their", without their word processor. These things make a language hard. Shall I talk about am/is/are/was/were? I mean, come on. How many different ways can you bastardize "to be?" "To be, or not to be," was dead on. If English weren't your native tongue you'd be having troubles with it too.
Come to think of it, how many times this week have I heard some stupid question like, "You be goin' to tha sto(re)? Pick me up some smokes, aight?" Explain that one to a non-native. Heck, explain it to your (failing) English teacher. He/She is baffled, too. Again, I must point to Eubonics. I've actually had to interpret English to another English speaker! I'd say the primary function is not satisfied by English in it's current (screwed up!) state.
Drop me a line at:
Key ID: 0x54D1D809
I've been looking for something like this for the linux system in my wife's classroom. Not because of anything that special about the way definitions are developed, but just because it can be downloaded and used offline. (don't ask me why they can't run the network to the classroom, but the haven't). This could become one of the most popular programs there after Connect-4.
Foreign words do not always map onto each other, so there's going to be problems with nuance. It's probably redundant in saying this, but many words have no English equivalent - the German unheimlich for instance. Even the French uncanny doesn't quite do it justice.
I hope the English on the dictionary is better than the English on the homepage eg "There will be always the backbone description" and "will lead contributors to translate English words into other language." (mis)
Si tacuisses philosophus mansisses. If you had kept quiet, you would have remained a philosopher.
...SCO sues the operators of GPL Dictionary for using words that were originally included in source code allegedly developed by SCO...
Soon to be integrate with wearable PCs, then roam anywhere and have tiny headset translate what foreigners say to you. Speaker will talk back at them so they understand what you say as well!
I suggest you read Slashdot
Just today I was trying to prototype some ideas on a dictionary idea I had.
:-)
My idea was to create a tutorial for learning AMSLAN (American Sign Language) and to use texts from Project Gutenburg or other public domain works. One (of several) problems though is that English is rife with homographs... words that are spelled alike but have different meanings. In the case of sign language, a sign for a bow in a little girl's hair might appear extremely odd if the sign for bow of a ship popped up in automatic substitution.
Dealing with the homographs is a problem, but I see that this site's plan already takes a stab at dealing with such things (in their cat example). I'd love it if they went to the trouble of also including a bit of AMSLAN (either in animations or static pictures) as that might inspire me with some help in the solution.
Ideally, my desire is to get an automated library that could read a text (possibly read by the human sorting the homographs). And allow a user to listen to the reading and watch the text (while learning English), listen to the reading only (if hearing impaired), watch a silent sign language presentation with subtitles (to learn sign language), or watch a silent presentation through signing (if reading in silence is preferable).
Just kind of bizarre that the idea struck the same day as this article appeared, I thought.
One dictionary to rule them all!
"There is no flag large enough to cover the shame of killing innocent people."--Howard Zinn
1337 5P34K 1s F0r 0ld p3opl3
what would be the best is they publish it as a web service and then people like you and me make clients to access them from applications running in the system tray.
Having taken Chinese I this term, I have learned that there's a whole lot more to a language than just vocabulary. In order to be a useful English-killer or monolinguism-killer, a language site needs to have information on how to pronounce words, how to write all of the glyphs used in the language (which might not be important if it uses a Latin-based script and so does your native language, but a lot of languages don't), and some idea of how to construct a useful sentence. (Word order, how to conjugate verbs/decline nouns, use of measure words/particles/prepositions, even synonyms and homonyms..) Also useful would be free media in the language -- TV shows, music, menus, newspapers -- but I know this would be very difficult to host effectively. My Chinese textbook's name translates as "Chinese Listening, Speaking, Reading and Writing". Developing skill with all of these facets of a language is part of gaining facility with it.
:).
I would love a free-content languages database, full of audio samples of native speakers and grammar rules, but this isn't quite there yet. I do hope something like it gets off the ground, though, because monolinguism is a terrible disease in a global community
Ethan
Why Q Entries in FAQ use Structured Differently English.
Are maintainers FAQ also architechts Backbone of English?
Am I Possible Detecting liability project here?
"Sometimes you've got to kick at the darkness till it bleeds daylight" Bruce C0ckburn
If it lists GPL as a word I'd be very concerned about the accuracy of the dictionary.
Maybe they could improve their design by adding context to the english words, or maybe to ALL words in ANY language? For example (incorrectness possible, I'm not an arab speaker):
h or ical)
foundation/base(engineering,housebuilding,metap
= al Qaeda(engineering,housebuilding,metaphorical)
the loo/the sit(colloq.,sanitary)=
al Qaeda(colloq.,sanitary)
a foundation(organisation,group)=
Al Qaeda(organisation,group)
Al Qaeda(terror,name)=
Al Qaeda(terror,name)
I'm still trying to figure out what people mean by 'social skills' here.