Paraphrasing Techniques for Maritime QA system
This work addresses the challenge of enabling effective human-machine partnerships in defense systems by improving natural language to SQL translation with limited data, though it is incremental as it builds on existing paraphrasing techniques.
The paper tackles the problem of training semantic parsers for human-machine communication in defense scenarios where large annotated datasets are unavailable, by using paraphrasing methods to generate training data and reports experimental results on real-world maritime data.
There has been an increasing interest in incorporating Artificial Intelligence (AI) into Defence and military systems to complement and augment human intelligence and capabilities. However, much work still needs to be done toward achieving an effective human-machine partnership. This work is aimed at enhancing human-machine communications by developing a capability for automatically translating human natural language into a machine-understandable language (e.g., SQL queries). Techniques toward achieving this goal typically involve building a semantic parser trained on a very large amount of high-quality manually-annotated data. However, in many real-world Defence scenarios, it is not feasible to obtain such a large amount of training data. To the best of our knowledge, there are few works trying to explore the possibility of training a semantic parser with limited manually-paraphrased data, in other words, zero-shot. In this paper, we investigate how to exploit paraphrasing methods for the automated generation of large-scale training datasets (in the form of paraphrased utterances and their corresponding logical forms in SQL format) and present our experimental results using real-world data in the maritime domain.