How are the dictionaries of Carabao populated?
The dictionaries are built semi-automatically. While some parts of the data must be set up manually, the bulk is imported from dictionaries in “human-readable” format. People have been building dictionaries for millenia and most of them are structured after a certain pattern. Human “hand-picked” dictionary data is much more accurate than the results of any statistical data harvester, and much more widely available – even for minority languages. Among other features, Carabao is able to parse plain ASCII (not XML or anything) data structured after the regular human dictionary pattern, such as: entryInSourceLanguage1 – I. part of speech 1 1. translationSense1Synonym1, translationSense1Synonym2 2. translationSense2 3. translationSense3Synonym1, translationSense3Synonym2, translationSense3Synonym3 II. part of speech 2 translationSense4 entryInSourceLanguage2 – part of speech entry2Translation The parsing tool is configurable for maximum flexibility. More information is available on the Ling