Kaiyi Wu

2papers

2 Papers

MLMar 7, 2020
Diffusion State Distances: Multitemporal Analysis, Fast Algorithms, and Applications to Biological Networks

Lenore Cowen, Kapil Devkota, Xiaozhe Hu et al.

Data-dependent metrics are powerful tools for learning the underlying structure of high-dimensional data. This article develops and analyzes a data-dependent metric known as diffusion state distance (DSD), which compares points using a data-driven diffusion process. Unlike related diffusion methods, DSDs incorporate information across time scales, which allows for the intrinsic data structure to be inferred in a parameter-free manner. This article develops a theory for DSD based on the multitemporal emergence of mesoscopic equilibria in the underlying diffusion process. New algorithms for denoising and dimension reduction with DSD are also proposed and analyzed. These approaches are based on a weighted spectral decomposition of the underlying diffusion process, and experiments on synthetic datasets and real biological networks illustrate the efficacy of the proposed algorithms in terms of both speed and accuracy. Throughout, comparisons with related methods are made, in order to illustrate the distinct advantages of DSD for datasets exhibiting multiscale structure.

PESep 15, 2018
Optimal spatial-dynamic management to minimize the damages caused by aquatic invasive species

Katherine Y. Zipp, Yangqingxiang Wu, Kaiyi Wu et al.

Invasive species have been recognized as a leading threat to biodiversity. In particular, lakes are especially affected by species invasions because they are closed systems sensitive to disruption. Accurately controlling the spread of invasive species requires solving a complex spatial-dynamic optimization problem. In this work we propose a novel framework for determining the optimal management strategy to maximize the value of a lake system net of damages from invasive species, including an endogenous diffusion mechanism for the spread of invasive species through boaters' trips between lakes. The proposed method includes a combined global iterative process which determines the optimal number of trips to each lake in each season and the spatial-dynamic optimal boat ramp fee.