Marvin: Semantic annotation using multiple knowledge sources
This tool addresses the challenge for professionals in various fields who struggle to manage large amounts of written material, though it appears incremental as it builds on existing annotation methods.
The authors tackled the problem of overwhelming publication volumes by developing Marvin, a Java-based semantic text annotator that supports multiple knowledge sources like WordNet and DBPedia, enabling improved information retrieval and extraction.
People are producing more written material then anytime in the history. The increase is so high that professionals from the various fields are no more able to cope with this amount of publications. Text mining tools can offer tools to help them and one of the tools that can aid information retrieval and information extraction is semantic text annotation. In this report we present Marvin, a text annotator written in Java, which can be used as a command line tool and as a Java library. Marvin is able to annotate text using multiple sources, including WordNet, MetaMap, DBPedia and thesauri represented as SKOS.