Introduction
Alphabetic Dictionary of the Foochow Dialect, Third Edition, 1929 is one of the first Foochow - English dictionaries written by missionaries in Foochow.
The first edition was published in 1870.
The third edition of this dictionary has 2000+ pages with tens of thousands of entries.
Why Digitalize This Dictionary?
Digitalizing this dictionary will not only create the largest Foochow dictionary openly available online, but it will also be benificial to scholars, especially those whose focus is on the evolution of Eastern Min over time.
Potential applications include:
- Dictionary apps
- An Input Method (IME) for Foochow
- Data Mining on the Tone Sandhi of Eastern Min
Plan & Outline
Stage 1 - Segmentation [Done]
- Decomposing pages into individual entries.
- Segmented images are available here.
Stage 2 - Machine OCR [Planning a 2nd attempt]
- We intend to use ABBYY OCR SDK for image recognition.
Using an OCR engine as accurate as ABBYY can greatly reduce the time and effort needed for manual input and review.
- To help with re-applying OCR, see this issue。.
Stage 3 - Crowdsourced Reviewing [Ongoing]
- Use the power of the crowd to manually review each entry.
- We have built a web application for this purpose.
Stage 4 - Post-processing
- Create the data structure for the dictionary.
- Publish the data.
- Create a web UI for querying dictionary entries.
Participate in Discussions
Please visit the
issue tracker of this project.