Nakdan: Professional Hebrew Diacritizer
The system addresses the need for accurate diacritization of Hebrew text, particularly in the preparation of scientific editions. The contribution is incremental: it builds on existing methods and adds domain-specific enhancements.
The authors address automatic diacritization of Hebrew text with a system that combines neural models with linguistic knowledge. They report state-of-the-art accuracy and support manual editing of the output for scientific editions.
We present a system for automatic diacritization of Hebrew text. The system combines modern neural models with carefully curated declarative linguistic knowledge and comprehensive, manually constructed tables and dictionaries. Besides providing state-of-the-art diacritization accuracy, the system offers an interface for manually editing and correcting the automatic output, and it has several features that make it particularly useful for preparing scientific editions of Hebrew texts. The system supports Modern Hebrew, Rabbinic Hebrew, and Poetic Hebrew. It is freely accessible for all use at http://nakdanpro.dicta.org.il.
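To make the task concrete, the following minimal Python sketch shows the inverse of diacritization: removing the vowel points from a pointed Hebrew word to recover the bare consonantal text that a diacritizer receives as input. It is an illustration only and is not part of the system described here. The function name `strip_diacritics` is hypothetical, and the sketch assumes that every character in Unicode category Mn (nonspacing mark) should be dropped, which covers Hebrew vowel points and cantillation marks but would also remove accents from non-Hebrew text.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Drop all nonspacing combining marks (Unicode category Mn).

    Hypothetical helper for illustration: for Hebrew this removes
    vowel points (niqqud) and cantillation marks, leaving the letters.
    """
    # NFD splits precomposed characters into base letter + combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


# "shalom" with vowel points: shin + qamats + shin dot, lamed, vav + holam, final mem
pointed = "\u05e9\u05b8\u05c1\u05dc\u05d5\u05b9\u05dd"

# Bare consonantal form: shin, lamed, vav, final mem
plain = strip_diacritics(pointed)
print(pointed, "->", plain)
```

Stripping the marks is deterministic, whereas restoring them is not: the same consonantal string can often be pointed in more than one way depending on context, which is why diacritization calls for a model together with linguistic resources rather than a simple lookup.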