CLApr 30, 2025

The Distribution of Dependency Distance and Hierarchical Distance in Contemporary Written Japanese and Its Influencing Factors

arXiv:2504.21421v2h-index: 5
Originality Synthesis-oriented
AI Analysis

This work provides insights into linguistic complexity for researchers in computational linguistics and cognitive science, but it is incremental as it builds on existing theories of dependency grammar.

The study investigated the relationship between dependency distance and hierarchical distance in Japanese, finding that predicate valency is the key factor driving a trade-off between linear and hierarchical complexity, with mean dependency distance lower than mean hierarchical distance due to valency effects.

To explore the relationship between dependency distance (DD) and hierarchical distance (HD) in Japanese, we compared the probability distributions of DD and HD with and without sentence length fixed, and analyzed the changes in mean dependency distance (MDD) and mean hierarchical distance (MHD) as sentence length increases, along with their correlation coefficient based on the Balanced Corpus of Contemporary Written Japanese. It was found that the valency of the predicates is the underlying factor behind the trade-off relation between MDD and MHD in Japanese. Native speakers of Japanese regulate the linear complexity and hierarchical complexity through the valency of the predicates, and the relative sizes of MDD and MHD depend on whether the threshold of valency has been reached. Apart from the cognitive load, the valency of the predicates also affects the probability distributions of DD and HD. The effect of the valency of the predicates on the distribution of HD is greater than on that of DD, which leads to differences in their probability distributions and causes the mean of MDD to be lower than that of MHD.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes