Exploring Tokenization Methods for Multitrack Sheet Music Generation
This addresses tokenization for sheet music generation, an incremental improvement in a domain-specific area.
The study tackled tokenization for multitrack sheet music generation in ABC notation, finding that bar-stream patching performed best in computational efficiency and musicality compared to existing methods.
This study explores the tokenization of multitrack sheet music in ABC notation, introducing two methods--bar-stream and line-stream patching. We compare these methods against existing techniques, including bar patching, byte patching, and Byte Pair Encoding (BPE). In terms of both computational efficiency and the musicality of the generated compositions, experimental results show that bar-stream patching performs best overall compared to the others, which makes it a promising tokenization strategy for sheet music generation.