LGDec 8, 2022Code
Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning BenchmarksJiaqi Ma, Xingjian Zhang, Hezheng Fan et al.
Establishing open and general benchmarks has been a critical driving force behind the success of modern machine learning techniques. As machine learning is being applied to broader domains and tasks, there is a need to establish richer and more diverse benchmarks to better reflect the reality of the application scenarios. Graph learning is an emerging field of machine learning that urgently needs more and better benchmarks. To accommodate the need, we introduce Graph Learning Indexer (GLI), a benchmark curation platform for graph learning. In comparison to existing graph learning benchmark libraries, GLI highlights two novel design objectives. First, GLI is designed to incentivize \emph{dataset contributors}. In particular, we incorporate various measures to minimize the effort of contributing and maintaining a dataset, increase the usability of the contributed dataset, as well as encourage attributions to different contributors of the dataset. Second, GLI is designed to curate a knowledge base, instead of a plain collection, of benchmark datasets. We use multiple sources of meta information to augment the benchmark datasets with \emph{rich characteristics}, so that they can be easily selected and used in downstream research or development. The source code of GLI is available at \url{https://github.com/Graph-Learning-Benchmarks/gli}.
FLU-DYNMay 29
A scalable Ewald-free BIE framework for periodic Stokes flow via hierarchical proxy sumsTianyue Li, Dhairya Malhotra, Shravan Veerapaneni
Particulate Stokes flow in confined, periodic geometries underlies a broad class of problems in biophysics, microfluidics, and the rheology of complex fluids. Boundary integral equation (BIE) methods are a natural tool for such problems, but existing periodization schemes rely either on periodic Green's functions, which are restrictive for complex confining geometries, or on free-space schemes that solve auxiliary proxy strengths alongside the surface densities in an extended linear system whose cost scales unfavorably in three dimensions. We present a BIE framework for three-dimensional particulate Stokes flow in periodic pipes with circular cross-sections, wall-bounded doubly-periodic, and triply-periodic geometries that uses only the free-space Green's function and avoids both Ewald summation and the extended linear system. Proxy sources placed on equivalent surfaces of the kernel-independent FMM (KIFMM) form the auxiliary basis, and contributions from far image boxes are captured by a hierarchical proxy sum made absolutely convergent by a net-force-zero compatibility condition. The resulting periodization precomputation depends only on the periodic-box geometry, independent of the kernel and of the surfaces inside the box, and is reused verbatim across the Stokeslet, stresslet, and rotlet. Combined with high-order adaptive surface discretizations, the method achieves high-order accuracy at $\mathcal{O}(N)$ cost with a single layer of image boxes in the near field. Numerical examples on dense polydisperse suspensions with thousands of particles and on flow through complex periodic channels, together with strong and weak scaling studies, demonstrate efficient performance on systems with millions of degrees of freedom on distributed-memory architectures.
LGNov 10, 2025
Lightweight Time Series Data Valuation on Time Series Foundation Models via In-Context FinetuningShunyu Wu, Tianyue Li, Yixuan Leng et al.
Time series foundation models (TSFMs) have demonstrated increasing capabilities due to their extensive pretraining on large volumes of diverse time series data. Consequently, the quality of time series data is crucial to TSFM performance, rendering an accurate and efficient data valuation of time series for TSFMs indispensable. However, traditional data valuation methods, such as influence functions, face severe computational bottlenecks due to their poor scalability with growing TSFM model sizes and often fail to preserve temporal dependencies. In this paper, we propose LTSV, a Lightweight Time Series Valuation on TSFMS via in-context finetuning. Grounded in the theoretical evidence that in-context finetuning approximates the influence function, LTSV estimates a sample's contribution by measuring the change in context loss after in-context finetuning, leveraging the strong generalization capabilities of TSFMs to produce robust and transferable data valuations. To capture temporal dependencies, we introduce temporal block aggregation, which integrates per-block influence scores across overlapping time windows. Experiments across multiple time series datasets and models demonstrate that LTSV consistently provides reliable and strong valuation performance, while maintaining manageable computational requirements. Our results suggest that in-context finetuning on time series foundation models provides a practical and effective bridge between data attribution and model generalization in time series learning.