Dan Jiao

NA
3papers
47citations
Novelty72%
AI Score44

3 Papers

NAMar 17, 2017
Accuracy Directly Controlled Fast Direct Solutions of General ${\cal H}^2$-Matrices and Its Application to Electrically Large Integral-Equation-Based Electromagnetic Analysis

Miaomiao Ma, Dan Jiao

The dense matrix resulting from an integral equation (IE) based solution of Maxwell's equations can be compactly represented by an ${\cal H}^2$-matrix. Given a general dense ${\cal H}^2$-matrix, prevailing fast direct solutions involve approximations whose accuracy can only be indirectly controlled. In this work, we propose new accuracy-controlled direct solution algorithms, including both factorization and inversion, for solving general ${\cal H}^2$-matrices, which does not exist prior to this work. Different from existing direct solutions, where the cluster bases are kept unchanged in the solution procedure thus lacking explicit accuracy control, the proposed new algorithms update the cluster bases and their rank level by level based on prescribed accuracy, without increasing computational complexity. Zeros are also introduced level by level such that the size of the matrix blocks computed at each tree level is the rank at that level, and hence being small. The proposed new direct solution has been applied to solve electrically large volume IEs whose rank linearly grows with electric size. A complexity of $O(NlogN)$ in factorization and inversion time, and a complexity of $O(N)$ in storage and solution time are both theoretically proven and numerically demonstrated. For constant-rank cases, the proposed direct solution has a strict $O(N)$ complexity in both time and memory. Rapid direct solutions of millions of unknowns can be obtained on a single CPU core with directly controlled accuracy.

NAFeb 28, 2017
$O(N)$ Iterative and $O(NlogN)$ Fast Direct Volume Integral Equation Solvers with a Minimal-Rank ${\cal H}^2$-Representation for Large-Scale $3$-D Electrodynamic Analysis

Saad Omar, Dan Jiao

Linear complexity iterative and log-linear complexity direct solvers are developed for the volume integral equation (VIE) based general large-scale electrodynamic analysis. The dense VIE system matrix is first represented by a new cluster-based multilevel low-rank representation. In this representation, all the admissible blocks associated with a single cluster are grouped together and represented by a single low-rank block, whose rank is minimized based on prescribed accuracy. From such an initial representation, an efficient algorithm is developed to generate a minimal-rank ${\cal H}^2$-matrix representation. This representation facilitates faster computation, and ensures the same minimal rank's growth rate with electrical size as evaluated from singular value decomposition. Taking into account the rank's growth with electrical size, we develop linear-complexity ${\cal H}^2$-matrix-based storage and matrix-vector multiplication, and thereby an $O(N)$ iterative VIE solver regardless of electrical size. Moreover, we develop an $O(NlogN)$ matrix inversion, and hence a fast $O(NlogN)$ \emph{direct} VIE solver for large-scale electrodynamic analysis. Both theoretical analysis and numerical simulations of large-scale $1$-, $2$- and $3$-D structures on a single-core CPU, resulting in millions of unknowns, have demonstrated the low complexity and superior performance of the proposed VIE electrodynamic solvers. %The algorithms developed in this work are kernel-independent, and hence applicable to other IE operators as well.

COMP-PHMar 7
Full-Scale GPU-Accelerated Transient EM-Thermal-Mechanical Co-Simulation for Early-Stage Design of Advanced Packages

Hongyang Liu, Tejas Kulkarni, Ganesh Subbarayan et al.

In the early-stage design of advanced electronic packages, designers face a critical trade-off between simulation fidelity and computational turnaround time. Conventional early-stage methodologies typically achieve speed by relying on steady-state assumptions and structural homogenization. While computationally efficient, these approximations fundamentally fail to capture dynamic thermal events and stress concentrations at fine-grained internal interfaces, effectively masking failure mechanisms driven by transient signal bursts. In this work, we present a GPU-accelerated transient coupled Electromagnetic-Thermal-Mechanical solver that resolves this bottleneck. The proposed solver enables full-scale, non-homogenized, time-domain simulation of large-scale packages with runtimes amenable for rapid design iteration. Simulation of a NEC SX-Aurora TSUBASA package demonstrates that the tool allows for the identification of signal-induced adiabatic stress that is typically invisible to steady-state and homogenized baselines. This capability brings sign-off level physics fidelity to the early design phase, facilitating the prevention of costly late-stage design failures and broader transient thermal performance degradation risks.