Structural Feature Engineering for Generative Engine Optimization: How Content Structure Shapes Citation Behavior
This work addresses content visibility challenges for creators and publishers in LLM-powered information ecosystems, establishing structural optimization as a foundational component of Generative Engine Optimization.
The paper tackled the problem of how content structure influences citation behavior in AI-powered search engines, proposing a structural feature engineering framework (GEO-SFE) that improved citation rates by 17.3% and subjective quality by 18.5% across six generative engines.
The proliferation of AI-powered search engines has shifted information discovery from traditional link-based retrieval to direct answer generation with selective source citation, creating new challenges for content visibility. While existing Generative Engine Optimization (GEO) approaches focus primarily on semantic content modification, the role of structural features in influencing citation behavior remains underexplored. In this paper, we propose GEO-SFE, a systematic framework for structural feature engineering in generative engine optimization. Our approach decomposes content structure into three hierarchical levels: macro-structure (document architecture), meso-structure (information chunking), and micro-structure (visual emphasis), and models their impact on citation probability across different generative engine architectures. We develop architecture-aware optimization strategies and predictive models that preserve semantic integrity while improving structural effectiveness. Experimental evaluation across six mainstream generative engines demonstrates consistent improvements in citation rate (17.3 percent) and subjective quality (18.5 percent), validating the effectiveness and generalizability of the proposed framework. This work establishes structural optimization as a foundational component of GEO, providing a data-driven methodology for enhancing content visibility in LLM-powered information ecosystems.