CLOct 24, 2022

"Covid vaccine is against Covid but Oxford vaccine is made at Oxford!" Semantic Interpretation of Proper Noun Compounds

arXiv:2210.13039v1291 citationsh-index: 44Has Code
Originality Incremental advance
AI Analysis

This addresses a gap in information-seeking applications for short-form domains like news headlines, though it is incremental as it builds on existing noun compound interpretation methods.

The paper tackles the problem of interpreting proper noun compounds like 'Covid vaccine' by releasing ProNCI, a manually annotated dataset of 22.5K examples, and shows that adding targeted knowledge improves model performance by up to 2.8% and boosts an Open IE system's yield by 7.5% at 85% precision.

Proper noun compounds, e.g., "Covid vaccine", convey information in a succinct manner (a "Covid vaccine" is a "vaccine that immunizes against the Covid disease"). These are commonly used in short-form domains, such as news headlines, but are largely ignored in information-seeking applications. To address this limitation, we release a new manually annotated dataset, ProNCI, consisting of 22.5K proper noun compounds along with their free-form semantic interpretations. ProNCI is 60 times larger than prior noun compound datasets and also includes non-compositional examples, which have not been previously explored. We experiment with various neural models for automatically generating the semantic interpretations from proper noun compounds, ranging from few-shot prompting to supervised learning, with varying degrees of knowledge about the constituent nouns. We find that adding targeted knowledge, particularly about the common noun, results in performance gains of upto 2.8%. Finally, we integrate our model generated interpretations with an existing Open IE system and observe an 7.5% increase in yield at a precision of 85%. The dataset and code are available at https://github.com/dair-iitd/pronci.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes