Help me build a better dictionary!
September 16, 2006 9:44 PM
Subscribe
How do you start digitizing a dictionary? More specifically, how (and where) do I begin putting together a digital and internally cross-referenceable edition of the
Hans Wehr Dictionary of Modern Written Arabic?
[Long explanation ahead. A thousand apologetic synonyms.]
If you've ever learned Arabic, there's a good chance you've probably glanced once or twice at the green bible. The truth is, "Hans," as Arabic language students know it, is simply the best learners reference available, despite a few fundamental problems. One of these has been the lack of any good way to look up words in the opposite direction. Hans only goes from Arabic to English, and no reliable dictionary designed expressly for the English-speaking student of Arabic exists.
Arabic is a langage based on trilateral roots, which makes the meaning of any one word is strongly dependent on context of the collective meanings within the entire root series. Looking up any unfamiliar word from English to Arabic involves a tedious process of finding the word, then cross-referencing it to Hans, which provides the proper context on two axes: how it fits into the conjugation patterns of other words derived from the same root, and how the terms was commonly used when the dictionary was written. A student can only be sure that the word she's chosen is the right one after she's double and triple-checked all the contexts.
One of the solutions to this problem, I believe, is to put the entire Hans Wehr dictionary into a computer-searchable format. This would mean that any word searched for in English could be easily linked back to its Arabic translation, which would then be found in the original context set out by Wehr.
I would guess off the top of my head that some kind of database is the best option, since the dictionary is arranged hierarchically and alphabetically.
My question for the Hive Mind is this: what's the best software to use to get this project off the ground? I'm not so technically inclined that I can design something like this from the ground up. I'd something that offers a frontend flexible enough that I can design my own hierarchy of tables and entries, and something that (even more ideally) would allow me to tag only certain texts to be available for search. Lastly, I'd like the software to allow for collaboration, maybe by making it easy to import other peoples' work into the main body.
Once I get this thing started up, I'll move it over to MeFiProjects so that everyone can get a chance to participate and see how progress is going.
This is a long question that might need some clarification if it seems I haven't explained things enough. Thanks for your collective help!
posted by awenner to writing & language (16 comments total)
3 users marked this as a favorite
(Or your first step is to contact an intellectual property lawyer and tell him to prepare to defend you when the copyright holder sues your tail feathers off.)
posted by Steven C. Den Beste at 10:04 PM on September 16, 2006