CEDICT

The CEDICT project was started by Paul Denisowski in 1997 and is maintained by MDBG, under the name CC-CEDICT, with the aim to provide a complete Chinese to English dictionary with pronunciation in pinyin for the Chinese characters.

Content

CEDICT is a text file; other programs (or simply Notepad or egrep or equivalent) are needed to search and display it. This project is considered a standard Chinese-English reference on the Internet and is used by several other Chinese-English projects. The Unihan Database uses CEDICT data for most of its information about character compounds, but this is auxiliary and is explicitly not a part of the main Unicode database .

Features:

The basic format of a CEDICT entry is:

Traditional Simplified [pin1 yin1] /American English equivalent 1/equivalent 2/
漢字 汉字 [han4 zi4] /Chinese character/CL:個|个/

Example of a simple egrep search:

$ egrep -i 有勇無謀 cedict.txt
有勇無謀 有勇无谋 [you3 yong3 wu2 mou2] /bold but not very astute/

History

Year Event
1991 EDICT Japanese dictionary project was started by Jim Breen.
1997 CEDICT project started by Paul Denisowski, on the model of EDICT.
1999 CEDICT ownership transferred to Erik Peterson of mandarintools.com.
2007 MDBG started a new project called CC-CEDICT which continues the CEDICT project with a new license: Creative Commons Attribution-Share Alike 3.0 License, allowing more projects to use it. Additionally a work flow has been set up to streamline the process of submitting, reviewing and processing new entries.

Related projects

CEDICT has shown the way to some other projects:

See also

External links

References

  1. http://cantonese.org/download.html
  2. http://writecantonese8.wordpress.com/2012/02/04/cantonese-cedict-project/ "Later, I was guided to merge data from Cantonese Stardict, which is an electronic version of “A Dictionary of Cantonese Slang”, into Cantonese CEDICT"
This article is issued from Wikipedia - version of the 10/22/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.