Creating an Aligned Corpus of Sound and Text: The Multimodal Corpus of Shakespeare and Milton
This work creates a multimodal corpus for literary analysis, but it is incremental as it applies existing alignment techniques to new data.
The authors tackled the problem of aligning audio readings with text for poems by Shakespeare and Milton, resulting in a corpus enriched with multi-level alignments and scansion, and they provided a basic visualization platform.
In this work we present a corpus of poems by William Shakespeare and John Milton that have been enriched with readings from the public domain. We have aligned all the lines with their respective audio segments, at the line, word, syllable and phone level, and we have included their scansion. We make a basic visualization platform for these poems and we conclude by conjecturing possible future directions.