CLOct 23, 2020

BARThez: a Skilled Pretrained French Sequence-to-Sequence Model

arXiv:2010.12321v2671 citations
Originality Synthesis-oriented
AI Analysis

This addresses the problem of limited NLP resources for French, though it is incremental as it adapts existing methods to a new language.

The authors tackled the lack of large-scale pretrained sequence-to-sequence models for French by introducing BARThez, which achieved competitive performance with state-of-the-art BERT-based French models on discriminative tasks and showed improved generative performance with mBARThez.

Inductive transfer learning has taken the entire NLP field by storm, with models such as BERT and BART setting new state of the art on countless NLU tasks. However, most of the available models and research have been conducted for English. In this work, we introduce BARThez, the first large-scale pretrained seq2seq model for French. Being based on BART, BARThez is particularly well-suited for generative tasks. We evaluate BARThez on five discriminative tasks from the FLUE benchmark and two generative tasks from a novel summarization dataset, OrangeSum, that we created for this research. We show BARThez to be very competitive with state-of-the-art BERT-based French language models such as CamemBERT and FlauBERT. We also continue the pretraining of a multilingual BART on BARThez' corpus, and show our resulting model, mBARThez, to significantly boost BARThez' generative performance. Code, data and models are publicly available.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes