CL · Oct 12, 2020

The National Corpus of Contemporary Welsh: Project Report | Y Corpws Cenedlaethol Cymraeg Cyfoes: Adroddiad y Prosiect

arXiv:2010.05542v1 · 1 citation
Originality: Synthesis-oriented
AI Analysis

This project addresses the need for a modern Welsh-language corpus for linguists, educators, and other users. Its contribution is incremental, since it builds on existing corpus-building theories and practices.

The paper describes the development of the National Corpus of Contemporary Welsh (CorCenCC), an online corpus resource. It outlines the project's theoretical foundations, operational decisions, and applications; the result is a comprehensive corpus intended to support Welsh-language research and use.

This report provides an overview of the CorCenCC project and the online corpus resource that was developed as a result of work on the project. The report lays out the theoretical underpinnings of the research, demonstrating how the project has built on and extended this theory. We also raise and discuss some of the key operational questions that arose during the course of the project, outlining the ways in which they were answered, the impact of these decisions on the resource that has been produced and the longer-term contribution they will make to practices in corpus-building. Finally, we discuss some of the applications and the utility of the work, outlining the impact that CorCenCC is set to have on a range of different individuals and user groups.
