CLSep 24, 2020

N-LTP: An Open-source Neural Language Technology Platform for Chinese

arXiv:2009.11616v4674 citationsHas Code
Originality Incremental advance
AI Analysis

This provides a unified, open-source solution for researchers and practitioners working on Chinese NLP tasks, though it is incremental as it builds on existing multi-task and distillation methods.

The authors tackled the problem of developing a comprehensive toolkit for Chinese NLP by introducing N-LTP, an open-source platform that supports six fundamental tasks using a multi-task framework with a shared pre-trained model, achieving competitive performance through knowledge distillation.

We introduce \texttt{N-LTP}, an open-source neural language technology platform supporting six fundamental Chinese NLP tasks: {lexical analysis} (Chinese word segmentation, part-of-speech tagging, and named entity recognition), {syntactic parsing} (dependency parsing), and {semantic parsing} (semantic dependency parsing and semantic role labeling). Unlike the existing state-of-the-art toolkits, such as \texttt{Stanza}, that adopt an independent model for each task, \texttt{N-LTP} adopts the multi-task framework by using a shared pre-trained model, which has the advantage of capturing the shared knowledge across relevant Chinese tasks. In addition, a knowledge distillation method \cite{DBLP:journals/corr/abs-1907-04829} where the single-task model teaches the multi-task model is further introduced to encourage the multi-task model to surpass its single-task teacher. Finally, we provide a collection of easy-to-use APIs and a visualization tool to make users to use and view the processing results more easily and directly. To the best of our knowledge, this is the first toolkit to support six Chinese NLP fundamental tasks. Source code, documentation, and pre-trained models are available at \url{https://github.com/HIT-SCIR/ltp}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes