CLAIDLMay 19, 2023

NAIST Academic Travelogue Dataset

arXiv:2305.11444v22 citations
Originality Synthesis-oriented
AI Analysis

This dataset addresses the problem of data scarcity and lack of reproducibility in travelogue research for academic researchers, though it is incremental as it provides a new dataset without introducing novel methods.

The authors tackled the scarcity of widely available travelogue data for research by constructing the NAIST Academic Travelogue Dataset, a Japanese text dataset with over 31 million words from 4,672 domestic and 9,607 overseas travelogues, enabling researchers to conduct investigations on the same data for transparency and reproducibility.

We have constructed NAIST Academic Travelogue Dataset (ATD) and released it free of charge for academic research. This dataset is a Japanese text dataset with a total of over 31 million words, comprising 4,672 Japanese domestic travelogues and 9,607 overseas travelogues. Before providing our dataset, there was a scarcity of widely available travelogue data for research purposes, and each researcher had to prepare their own data. This hinders the replication of existing studies and fair comparative analysis of experimental results. Our dataset enables any researchers to conduct investigation on the same data and to ensure transparency and reproducibility in research. In this paper, we describe the academic significance, characteristics, and prospects of our dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes