Open dictionaries are excellent examples of open knowledge projects. Whether monolingual or bilingual, and whether dealing with definitions, etymology, translation or pronounciation – they can often be large, collaborative undertakings.
Dictionary databases have a wide variety of potential applications – from education and research to machine translation and integration with software applications and services.
We’ve listed several open dictionary projects and packages on CKAN:
These include:
- http://ckan.net/package/read/freedict
- Currently offers 69 bilingual dictionaries released under the GPL.
- http://ckan.net/package/read/xdxf
- Currently includes over currently 308 dictionary files in various languages published in XML format. All material is under the GPL.
- http://ckan.net/package/read/apertium-all
- Offers a variety of dictionaries with over 20 different language pairs. Material is under the GPL, the GFDL and the Creative Commons Attribution-Sharealike license.
- http://ckan.net/package/read/wiktionary
- The Wikimedia Foundation’s dictionary project – currently including over 5 million entries in over 170 languages.
- http://ckan.net/package/read/k12-open-dictionary
- A project to build a basic public domain dictionary for children.
- http://ckan.net/package/read/oed
- Scans of the first several volumes of the Oxford English Dictionary (the portion which has fallen into the public domain). It would be great to have a machine-readable version of this!
- http://ckan.net/package/read/ding
- A German-English dictionary with over 216,000 entries. Under the GPL.
- http://ckan.net/package/read/eurfa
- A Welsh-English, English-Welsh dictionary with over 13,000 entries. Under the GPL.
- http://ckan.net/package/read/jmdict
- A Japanese-Multilingual dictionary available under a Creative Commons Attribution Sharealike license.
- http://ckan.net/package/read/open-thesaurus
- A set of thesauri in 8 different languages under the GPL.
We’d like to start using tags to correspond with the ISO 639-2 codes for the representation of names of languages, such as:
If you know of any other open dictionary projects – we’d love to hear about them! You can either pop us a line to the okfn-discuss list, or add packages directly to CKAN:
Dr. Jonathan Gray is Lecturer in Critical Infrastructure Studies at the Department of Digital Humanities, King’s College London, where he is currently writing a book on data worlds. He is also Cofounder of the Public Data Lab; and Research Associate at the Digital Methods Initiative (University of Amsterdam) and the médialab (Sciences Po, Paris). More about his work can be found at jonathangray.org and he tweets at @jwyg.
Hoi,
You may want to check out OmegaWiki.org. It provides lexical information like all the others. The difference is that the user interface can be changed to another language and it will be able to show the same information in the other language dependent on the availability of translations./
Thanks for the suggestion GerardM!
I’ve added a package for OmegaWiki at:
http://ckan.net/package/read/omegawiki
Feel free to amend or add to this entry!
I notice Macmillan have added an Open Dictionary to their new free online dictionary
http://www.macmillandictionary.com/open-dictionary/
Jean C: thanks for the pointer. Unfortunately it looks like Macmillan’s “Open Dictionary” isn’t open — at least not in any way we mean by that term.
Their “open” means letting you give them information for free (by submitting word suggestions) but getting nothing back — as the terms and conditions make quite clear (emphasis added):
To my mind this is clear abuse of the term open and and more than a little exploitative — you do work for them for free and they don’t even promise to give you credit let alone permission to use the material you helped create. Such potential for abuse of the “open” label is a major reason we created the open definition — where open content and data is clearly defined as material that anyone is free to use, reuse and redistribute without restriction.
From what I see, there is a figure of different positions on this. I mean you only have to browse the varied Internet forums and that gets starkly plain. Yet the trouble is, numerous people don’t appear to look that deep into this.
thank for share.
Lead Rocket 2.0