Yun Fan

h-index45
2papers

2 Papers

CLFeb 4
ERNIE 5.0 Technical Report

Haifeng Wang, Hua Wu, Tian Wu et al.

In this report, we introduce ERNIE 5.0, a natively autoregressive foundation model desinged for unified multimodal understanding and generation across text, image, video, and audio. All modalities are trained from scratch under a unified next-group-of-tokens prediction objective, based on an ultra-sparse mixture-of-experts (MoE) architecture with modality-agnostic expert routing. To address practical challenges in large-scale deployment under diverse resource constraints, ERNIE 5.0 adopts a novel elastic training paradigm. Within a single pre-training run, the model learns a family of sub-models with varying depths, expert capacities, and routing sparsity, enabling flexible trade-offs among performance, model size, and inference latency in memory- or time-constrained scenarios. Moreover, we systematically address the challenges of scaling reinforcement learning to unified foundation models, thereby guaranteeing efficient and stable post-training under ultra-sparse MoE architectures and diverse multimodal settings. Extensive experiments demonstrate that ERNIE 5.0 achieves strong and balanced performance across multiple modalities. To the best of our knowledge, among publicly disclosed models, ERNIE 5.0 represents the first production-scale realization of a trillion-parameter unified autoregressive model that supports both multimodal understanding and generation. To facilitate further research, we present detailed visualizations of modality-agnostic expert routing in the unified model, alongside comprehensive empirical analysis of elastic training, aiming to offer profound insights to the community.

DMJun 4, 2014
Fourier Transforms and Bent Functions on Finite Abelian Group-Acted Sets

Yun Fan, Bangteng Xu

Let $G$ be a finite abelian group acting faithfully on a finite set $X$. As a natural generalization of the perfect nonlinearity of Boolean functions, the $G$-bentness and $G$-perfect nonlinearity of functions on $X$ are studied by Poinsot et al. [6,7] via Fourier transforms of functions on $G$. In this paper we introduce the so-called $G$-dual set $\widehat X$ of $X$, which plays the role similar to the dual group $\widehat G$ of $G$, and the Fourier transforms of functions on $X$, a generalization of the Fourier transforms of functions on finite abelian groups. Then we characterize the bent functions on $X$ in terms of their own Fourier transforms on $\widehat X$. Bent (perfect nonlinear) functions on finite abelian groups and $G$-bent ($G$-perfect nonlinear) functions on $X$ are treated in a uniform way in this paper, and many known results in [4,2,6,7] are obtained as direct consequences. Furthermore, we will prove that the bentness of a function on $X$ can be determined by its distance from the set of $G$-linear functions. In order to explain the main results clearly, examples are also presented.