Óscar García-Sierra

CL
3 papers
37 citations
Novelty 10%
AI Score 17

3 Papers

CL Feb 13, 2023
Linguistic ambiguity analysis in ChatGPT

Miguel Ortega-Martín, Óscar García-Sierra, Alfonso Ardoiz et al.

Linguistic ambiguity is and has always been one of the main challenges in Natural Language Processing (NLP) systems. Modern Transformer architectures like BERT, T5 or, more recently, InstructGPT have achieved impressive improvements in many NLP fields, but there is still plenty of work to do. Motivated by the uproar caused by ChatGPT, in this paper we provide an introduction to linguistic ambiguity, its varieties and their relevance in modern NLP, and perform an extensive empirical analysis. ChatGPT's strengths and weaknesses are revealed, as well as strategies to get the most out of this model.

CL Feb 24, 2023
Spanish Built Factual Freectianary (Spanish-BFF): the first AI-generated free dictionary

Miguel Ortega-Martín, Óscar García-Sierra, Alfonso Ardoiz et al.

Dictionaries are one of the oldest and most used linguistic resources. Building them is a complex task that, to the best of our knowledge, has yet to be explored with generative Large Language Models (LLMs). We introduce the "Spanish Built Factual Freectianary" (Spanish-BFF) as the first Spanish AI-generated dictionary. This first-of-its-kind free dictionary uses GPT-3. We also define the future steps we aim to follow to improve this initial commitment to the field, such as adding more languages.

CL Jun 17, 2024
Building another Spanish dictionary, this time with GPT-4

Miguel Ortega-Martín, Óscar García-Sierra, Alfonso Ardoiz et al.

We present the "Spanish Built Factual Freectianary 2.0" (Spanish-BFF-2) as the second iteration of an AI-generated Spanish dictionary. Previously, we developed the inaugural version of this unique free dictionary employing GPT-3. In this study, we aim to improve the dictionary by using GPT-4-turbo instead. Furthermore, we explore improvements made to the initial version and compare the performance of both models.