Lixiang Zhang

AI
4papers
25citations
Novelty52%
AI Score37

4 Papers

NANov 5, 2015
Modeling and Simulation for Fluid-Rotating Structure Interaction

Kai Yang, Pengtao Sun, Lu Wang et al.

In this paper, we study a dynamic fluid-structure interaction (FSI) model for an elastic structure that is immersed and spinning in the fluid. We develop a linear constitutive model to describe the motion of a rotational elastic structure which is suitable for the application of arbitrary Lagrangian-Eulerian (ALE) method in FSI simulation. Additionally, a novel ALE mapping method is designed to generate the moving fluid mesh while the deformable structure spins in a non-axisymmetric fluid channel. The structure velocity is adopted as the principle unknown to form a monolithic saddle-point system together with fluid velocity and pressure. We discretize the nonlinear saddle-point system with mixed finite element method and Newton's linearization, and prove that the derived saddle-point problem is well-posed. The developed methodology is applied to a self-defined elastic structure and a realistic hydro-turbine under a prescribed angular velocity. Both illustrate the satisfactory numerical results of an elastic structure that is deforming and rotating while interacting with the fluid. The numerical validation is also conducted to demonstrate the modeling consistency.

LGNov 10, 2022
Robust DNN Surrogate Models with Uncertainty Quantification via Adversarial Training

Lixiang Zhang, Jia Li

For computational efficiency, surrogate models have been used to emulate mathematical simulators for physical or biological processes. High-speed simulation is crucial for conducting uncertainty quantification (UQ) when the simulation is repeated over many randomly sampled input points (aka, the Monte Carlo method). In some cases, UQ is only feasible with a surrogate model. Recently, Deep Neural Network (DNN) surrogate models have gained popularity for their hard-to-match emulation accuracy. However, it is well-known that DNN is prone to errors when input data are perturbed in particular ways, the very motivation for adversarial training. In the usage scenario of surrogate models, the concern is less of a deliberate attack but more of the high sensitivity of the DNN's accuracy to input directions, an issue largely ignored by researchers using emulation models. In this paper, we show the severity of this issue through empirical studies and hypothesis testing. Furthermore, we adopt methods in adversarial training to enhance the robustness of DNN surrogate models. Experiments demonstrate that our approaches significantly improve the robustness of the surrogate models without compromising emulation accuracy.

AIJan 14
DScheLLM: Enabling Dynamic Scheduling through a Fine-Tuned Dual-System Large language Model

Lixiang Zhang, Chenggong Zhao, Qing Gao et al.

Production scheduling is highly susceptible to dynamic disruptions, such as variations in processing times, machine availability, and unexpected task insertions. Conventional approaches typically rely on event-specific models and explicit analytical formulations, which limits their adaptability and generalization across previously unseen disturbances. To overcome these limitations, this paper proposes DScheLLM, a dynamic scheduling approach that leverages fine-tuned large language models within a dual-system (fast-slow) reasoning architecture to address disturbances of different scales. A unified large language model-based framework is constructed to handle dynamic events, where training datasets for both fast and slow reasoning modes are generated using exact schedules obtained from an operations research solver. The Huawei OpenPangu Embedded-7B model is subsequently fine-tuned under the hybrid reasoning paradigms using LoRA. Experimental evaluations on standard job shop scheduling benchmarks demonstrate that the fast-thinking mode can efficiently generate high-quality schedules and the slow-thinking mode can produce solver-compatible and well-formatted decision inputs. To the best of our knowledge, this work represents one of the earliest studies applying large language models to job shop scheduling in dynamic environments, highlighting their considerable potential for intelligent and adaptive scheduling optimization.

MLDec 7, 2019
Deep Variable-Block Chain with Adaptive Variable Selection

Lixiang Zhang, Lin Lin, Jia Li

The architectures of deep neural networks (DNN) rely heavily on the underlying grid structure of variables, for instance, the lattice of pixels in an image. For general high dimensional data with variables not associated with a grid, the multi-layer perceptron and deep belief network are often used. However, it is frequently observed that those networks do not perform competitively and they are not helpful for identifying important variables. In this paper, we propose a framework that imposes on blocks of variables a chain structure obtained by step-wise greedy search so that the DNN architecture can leverage the constructed grid. We call this new neural network Deep Variable-Block Chain (DVC). Because the variable blocks are used for classification in a sequential manner, we further develop the capacity of selecting variables adaptively according to a number of regions trained by a decision tree. Our experiments show that DVC outperforms other generic DNNs and other strong classifiers. Moreover, DVC can achieve high accuracy at much reduced dimensionality and sometimes reveals drastically different sets of relevant variables for different regions.