26.1CYApr 19
The Inference Bottleneck: A Formal Model of Vertical Foreclosure in AI MarketsGaston Besanson
As generative AI commercializes, competitive advantage is shifting from model training toward inference, distribution, and routing. This paper develops a formal game-theoretic model of vertical foreclosure in inference markets, as the formal-model companion to Besanson and Celani (2026). The model isolates two foreclosure mechanisms operating without predatory pricing: quality-of-service (QoS) discrimination against downstream rivals via latency, throughput, context limits, or feature access; and routing bias in assistant-layer interfaces. An extension motivated by Anthropic's April 2026 release of Claude Opus 4.7 alongside the restricted-access Claude Mythos Preview introduces a third mechanism, tier-based access discrimination, parameterized by a tier gap (tau) and partner-exclusivity (kappa). The main result gives an explicit local equilibrium characterization of the QoS gap. Under logit demand and symmetric rivals, the gap is strictly increasing in inference-quality importance (alpha) and downstream margins, and strictly decreasing in API price and rival entry elasticity. Discrimination vanishes at a joint boundary rather than at a simple threshold in alpha alone. A stylized calibration to four providers using April 2026 data treats parameter values as inputs to a comparative risk mapping, not structural estimates. The mapping suggests Google and OpenAI face conditions most conducive to foreclosure; Microsoft's realized routing bias has been voluntarily constrained by a March 2026 multi-model pivot; Anthropic shows low consumer-channel risk and elevated risk in enterprise coding-agent segments. The policy section proposes Neutral Inference, a four-pillar conduct framework: QoS parity, routing transparency, FRAND-style non-discrimination, and tier transparency with release-pathway discipline. Illustrative welfare calculations suggest net gains in the tens of billions annually.
12.2SEMay 8
SARC: A Governance-by-Architecture Framework for Agentic AI SystemsGaston Besanson
Agentic AI systems increasingly act through tools, sub-agents, and external services, but governance controls are still commonly attached to prompts, dashboards, or post-hoc documentation. This creates a structural mismatch in regulated settings: obligations that must constrain execution are often evaluated only after execution has occurred. We introduce SARC, a runtime governance architecture for tool-using agents that treats constraints as first-class specification objects alongside state, action space, and reward. A SARC specification declares each constraint's source, class, predicate, verification point, response protocol, and operating point, and compiles these into four enforcement sites in the agent loop: a Pre-Action Gate, an Action-Time Monitor, a Post-Action Auditor, and an Escalation Router. We formalize the minimal invariants required for specification-trace correspondence, show why finite reward penalties do not generally substitute for hard runtime constraints, and extend the architecture to multi-agent workflows through constraint propagation, authority intersection, and attribution-preserving trace trees. We implement a prototype audit checker and report a reproducible synthetic evaluation over 50 seeds comparing SARC against post-hoc audit, output filtering, workflow rules, and policy-as-code-only baselines on a procurement task. SARC executes zero hard-constraint violations under exact predicates; its declared PAA throttling response reduces soft-window overages by 89.5% relative to policy-as-code-only. Predicate-noise and enforcement-failure sweeps are consistent with the claim that residual hard violations under SARC scale with enforcement-stack error rather than environmental violation opportunity. SARC provides the architectural substrate through which obligations can be made executable, inspectable, and auditable at runtime.
AIFeb 6
Accuracy Standards for AI at Work vs. Personal Life: Evidence from an Online SurveyGaston Besanson, Federico Todeschini
We study how people trade off accuracy when using AI-powered tools in professional versus personal contexts for adoption purposes, the determinants of those trade-offs, and how users cope when AI/apps are unavailable. Because modern AI systems (especially generative models) can produce acceptable but non-identical outputs, we define "accuracy" as context-specific reliability: the degree to which an output aligns with the user's intent within a tolerance threshold that depends on stakes and the cost of correction. In an online survey (N=300), among respondents with both accuracy items (N=170), the share requiring high accuracy (top-box) is 24.1% at work vs. 8.8% in personal life (+15.3 pp; z=6.29, p<0.001). The gap remains large under a broader top-two-box definition (67.0% vs. 32.9%) and on the full 1-5 ordinal scale (mean 3.86 vs. 3.08). Heavy app use and experience patterns correlate with stricter work standards (H2). When tools are unavailable (H3), respondents report more disruption in personal routines than at work (34.1% vs. 15.3%, p<0.01). We keep the main text focused on these substantive results and place test taxonomy and power derivations in a technical appendix.
37.8DBMar 31
The Data Hydration Gap: A Formal Model of Underinvestment in General-Purpose Data Products Under Decentralized GovernanceGaston Besanson
When organizations decentralize data product ownership, as in the data mesh paradigm, each domain team optimizes for its immediate analytical needs, underinvesting in the cross-domain generality that enables organization-wide reuse. We formalize this as a simultaneous-move game in which N domains choose quality (q) and generality (g). Generality creates positive externalities but is privately costly. The Nash equilibrium generality gap is increasing in the number of domains and the value of cross-domain analytics. Under plausible parameter configurations, a corner solution obtains in which no reusable silver layer emerges organically, a condition we term the data mesh trap. Technical debt from narrow products grows quadratically in N. An illustrative calibration suggests non-trivial organizational welfare losses under plausible enterprise parameters. We derive within-model conditions under which centralized, federated, and hybrid governance regimes dominate, and we identify the information asymmetries and transaction costs that complicate implementation. The model provides a formal foundation for empirical research on decentralized data governance.