Christopher Chu

h-index3

2papers

95citations

2 Papers

31.0CLOct 9, 2020

Solving Historical Dictionary Codes with a Neural Language Model

Christopher Chu, Raphael Valenti, Kevin Knight

We solve difficult word-based substitution codes by constructing a decoding lattice and searching that lattice with a neural language model. We apply our method to a set of enciphered letters exchanged between US Army General James Wilkinson and agents of the Spanish Crown in the late 1700s and early 1800s, obtained from the US Library of Congress. We are able to decipher 75.1% of the cipher-word tokens correctly.

31.0CLOct 9, 2020

Learning to Pronounce Chinese Without a Pronunciation Dictionary

Christopher Chu, Scot Fang, Kevin Knight

We demonstrate a program that learns to pronounce Chinese text in Mandarin, without a pronunciation dictionary. From non-parallel streams of Chinese characters and Chinese pinyin syllables, it establishes a many-to-many mapping between characters and pronunciations. Using unsupervised methods, the program effectively deciphers writing into speech. Its token-level character-to-syllable accuracy is 89%, which significantly exceeds the 22% accuracy of prior work.