AIApr 27, 2022Code
TranSHER: Translating Knowledge Graph Embedding with Hyper-Ellipsoidal RestrictionYizhi Li, Wei Fan, Chao Liu et al.
Knowledge graph embedding methods are important for the knowledge graph completion (or link prediction) task. One existing efficient method, PairRE, leverages two separate vectors to model complex relations (i.e., 1-to-N, N-to-1, and N-to-N) in knowledge graphs. However, such a method strictly restricts entities on the hyper-ellipsoid surfaces which limits the optimization of entity distribution, leading to suboptimal performance of knowledge graph completion. To address this issue, we propose a novel score function TranSHER, which leverages relation-specific translations between head and tail entities to relax the constraint of hyper-ellipsoid restrictions. By introducing an intuitive and simple relation-specific translation, TranSHER can provide more direct guidance on optimization and capture more semantic characteristics of entities with complex relations. Experimental results show that TranSHER achieves significant performance improvements on link prediction and generalizes well to datasets in different domains and scales. Our codes are public available at https://github.com/yizhilll/TranSHER.
NAFeb 27, 2016
Refined Schur Method for Robust Pole Assignment with Repeated PolesZhen-Chen Guo, Jiang Qian, Yun-feng Cai et al.
Schur-type methods in \cite{Chu2} and \cite{GCQX} solve the robust pole assignment problem by employing the departure from normality of the closed-loop system matrix as the measure of robustness. They work well generally when all poles to be assigned are simple. However, when some poles are close or even repeated, the eigenvalues of the computed closed-loop system matrix might be inaccurate. In this paper, we present a refined Schur method, which is able to deal with the case when some or all of the poles to be assigned are repeated. More importantly, the refined Schur method can still be applied when \verb|place| \cite{KNV} and \verb|robpole| \cite{Tits} fail to output a solution when the multiplicity of some repeated poles is greater than the input freedom.
NAJun 13, 2016
On Some Inverse Eigenvalue Problems of Quadratic Palindromic SystemsYunfeng Cai, Jiang Qian
This paper concerns some inverse eigenvalue problems of the quadratic $\star$-(anti)-palindromic system $Q(λ)=λ^2 A_1^{\star}+λA_0 + εA_1$, where $ε=\pm 1$, $A_1, A_0 \in \mathbb{C}^{n\times n}$, $A_0^{\star}=εA_0$, $A_1$ is nonsingular, and the symbol $\star$ is used as an abbreviation for transpose for real matrices and either transpose or conjugate transpose for complex matrices. By using the spectral decomposition of the quadratic $\star$-(anti)-palindromic system, the inverse eigenvalue problems with entire/partial eigenpairs given, and the model updating problems with no-spillover are considered. Some conditions on the solvabilities of these problems are given, and algorithms are proposed to find these solutions. These algorithms are illustrated by some numerical examples.
DBMar 12, 2024
Couler: Unified Machine Learning Workflow Optimization in CloudXiaoda Wang, Yuan Tang, Tengda Guo et al.
Machine Learning (ML) has become ubiquitous, fueling data-driven applications across various organizations. Contrary to the traditional perception of ML in research, ML workflows can be complex, resource-intensive, and time-consuming. Expanding an ML workflow to encompass a wider range of data infrastructure and data types may lead to larger workloads and increased deployment costs. Currently, numerous workflow engines are available (with over ten being widely recognized). This variety poses a challenge for end-users in terms of mastering different engine APIs. While efforts have primarily focused on optimizing ML Operations (MLOps) for a specific workflow engine, current methods largely overlook workflow optimization across different engines. In this work, we design and implement Couler, a system designed for unified ML workflow optimization in the cloud. Our main insight lies in the ability to generate an ML workflow using natural language (NL) descriptions. We integrate Large Language Models (LLMs) into workflow generation, and provide a unified programming interface for various workflow engines. This approach alleviates the need to understand various workflow engines' APIs. Moreover, Couler enhances workflow computation efficiency by introducing automated caching at multiple stages, enabling large workflow auto-parallelization and automatic hyperparameters tuning. These enhancements minimize redundant computational costs and improve fault tolerance during deep learning workflow training. Couler is extensively deployed in real-world production scenarios at Ant Group, handling approximately 22k workflows daily, and has successfully improved the CPU/Memory utilization by more than 15% and the workflow completion rate by around 17%.
LGMar 22, 2020
BS-NAS: Broadening-and-Shrinking One-Shot NAS with Searchable Numbers of ChannelsZan Shen, Jiang Qian, Bojin Zhuang et al.
One-Shot methods have evolved into one of the most popular methods in Neural Architecture Search (NAS) due to weight sharing and single training of a supernet. However, existing methods generally suffer from two issues: predetermined number of channels in each layer which is suboptimal; and model averaging effects and poor ranking correlation caused by weight coupling and continuously expanding search space. To explicitly address these issues, in this paper, a Broadening-and-Shrinking One-Shot NAS (BS-NAS) framework is proposed, in which `broadening' refers to broadening the search space with a spring block enabling search for numbers of channels during training of the supernet; while `shrinking' refers to a novel shrinking strategy gradually turning off those underperforming operations. The above innovations broaden the search space for wider representation and then shrink it by gradually removing underperforming operations, followed by an evolutionary algorithm to efficiently search for the optimal architecture. Extensive experiments on ImageNet illustrate the effectiveness of the proposed BS-NAS as well as the state-of-the-art performance.