Improving FMQA via Initial Training Data Design Considering Marginal Bit Coverage in One-Hot Encoding

Taiga Hayashi, Yuya Seki, Kotaro Terada, Yosuke Mukasa, Shuta Kikuchi, Shu Tanaka

arXiv:2605.048256.7h-index: 10

AI Analysis

Incremental improvement to FMQA for black-box optimization with discrete/continuous variables, addressing a known training data coverage issue.

FMQA for integer/continuous variables via one-hot encoding suffers from inactive binary variables in initial training data. By ensuring complete marginal bit coverage via Latin hypercube or Sobol' sampling, the proposed LHS-FMQA and Sobol'-FMQA achieve higher mean final cruising speeds on a 17- and 32-variable wing-shape optimization benchmark, with larger gains on the harder problem.

Factorization machine with quadratic-optimization annealing (FMQA) is a black-box optimization method that combines a factorization machine (FM) surrogate with QUBO-based search by an Ising machine. When FMQA is applied to integer or discretized continuous variables via one-hot encoding, uniform random initial sampling can leave many binary variables never active in the initial training data, and the corresponding FM parameters receive no direct gradient updates from the observed responses. We address this by designing the initial training data to achieve complete marginal bit coverage, namely, ensuring that every binary variable obtained by one-hot encoding takes the value one at least once. We use two space-filling sampling methods, Latin hypercube sampling (LHS) and the Sobol' sequence, yielding LHS-FMQA and Sobol'-FMQA. On the human-powered aircraft wing-shape optimization benchmark with 17 and 32 design variables, both proposed methods achieved numerically higher mean final cruising speeds than the baseline FMQA, with the advantage more pronounced on the 32-variable problem.

View on arXiv PDF

Similar