Ziyi Zhou

h-index11

7papers

302citations

Novelty54%

AI Score45

Ranked #45,228 of 194,257 authors (top 23%)#9,071 in CL (top 29%)

7 Papers

27.1CLJun 14, 2023Code

LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming

Jingsheng Gao, Yixin Lian, Ziyi Zhou et al.

Open-domain dialogue systems have made promising progress in recent years. While the state-of-the-art dialogue agents are built upon large-scale text-based social media data and large pre-trained models, there is no guarantee these agents could also perform well in fast-growing scenarios, such as live streaming, due to the bounded transferability of pre-trained models and biased distributions of public datasets from Reddit and Weibo, etc. To improve the essential capability of responding and establish a benchmark in the live open-domain scenario, we introduce the LiveChat dataset, composed of 1.33 million real-life Chinese dialogues with almost 3800 average sessions across 351 personas and fine-grained profiles for each persona. LiveChat is automatically constructed by processing numerous live videos on the Internet and naturally falls within the scope of multi-party conversations, where the issues of Who says What to Whom should be considered. Therefore, we target two critical tasks of response modeling and addressee recognition and propose retrieval-based baselines grounded on advanced techniques. Experimental results have validated the positive effects of leveraging persona profiles and larger average sessions per persona. In addition, we also benchmark the transferability of advanced generation-based models on LiveChat and pose some future directions for current challenges.

6.7AIAug 11, 2023

Controlling Character Motions without Observable Driving Source

Weiyuan Li, Bin Dai, Ziyi Zhou et al.

How to generate diverse, life-like, and unlimited long head/body sequences without any driving source? We argue that this under-investigated research problem is non-trivial at all, and has unique technical challenges behind it. Without semantic constraints from the driving sources, using the standard autoregressive model to generate infinitely long sequences would easily result in 1) out-of-distribution (OOD) issue due to the accumulated error, 2) insufficient diversity to produce natural and life-like motion sequences and 3) undesired periodic patterns along the time. To tackle the above challenges, we propose a systematic framework that marries the benefits of VQ-VAE and a novel token-level control policy trained with reinforcement learning using carefully designed reward functions. A high-level prior model can be easily injected on top to generate unlimited long and diverse sequences. Although we focus on no driving sources now, our framework can be generalized for controlled synthesis with explicit driving sources. Through comprehensive evaluations, we conclude that our proposed framework can address all the above-mentioned challenges and outperform other strong baselines very significantly.

4.2CLApr 23, 2024Code

Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models

Yang Tan, Mingchen Li, Bingxin Zhou et al.

Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfer to life science tasks is non-trivial due to the different training strategies and data forms. To address this gap, we introduce SES-Adapter, a simple, efficient, and scalable adapter method for enhancing the representation learning of PLMs. SES-Adapter incorporates PLM embeddings with structural sequence embeddings to create structure-aware representations. We show that the proposed method is compatible with different PLM architectures and across diverse tasks. Extensive evaluations are conducted on 2 types of folding structures with notable quality differences, 9 state-of-the-art baselines, and 9 benchmark datasets across distinct downstream tasks. Results show that compared to vanilla PLMs, SES-Adapter improves downstream task performance by a maximum of 11% and an average of 3%, with significantly accelerated training speed by a maximum of 1034% and an average of 362%, the convergence rate is also improved by approximately 2 times. Moreover, positive optimization is observed even with low-quality predicted structures. The source code for SES-Adapter is available at https://github.com/tyang816/SES-Adapter.

0.6CLFeb 2

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Shuai Zhang, Jiayu Hu, Zijie Chen et al.

Current embodied VLM evaluation relies on static, expert-defined, manually annotated benchmarks that exhibit severe redundancy and coverage imbalance. This labor intensive paradigm drains computational and annotation resources, inflates costs, and distorts model rankings, ultimately stifling iterative development. To address this, we propose Agentic Automatic Evaluation (A2Eval), the first agentic framework that automates benchmark curation and evaluation through two collaborative agents. The Data Agent autonomously induces capability dimensions and assembles a balanced, compact evaluation suite, while the Eval Agent synthesizes and validates executable evaluation pipelines, enabling fully autonomous, high-fidelity assessment. Evaluated across 10 benchmarks and 13 models, A2Eval compresses evaluation suites by 85%, reduces overall computational costs by 77%, and delivers a 4.6x speedup while preserving evaluation quality. Crucially, A2Eval corrects systematic ranking biases, improves human alignment to Spearman's rho=0.85, and maintains high ranking fidelity (Kendall's tau=0.81), establishing a new standard for high-fidelity, low-cost embodied assessment. Our code and data will be public soon.

7.0ROOct 21, 2020Code

SyDeBO: Symbolic-Decision-Embedded Bilevel Optimization for Long-Horizon Manipulation in Dynamic Environments

Zhigen Zhao, Ziyi Zhou, Michael Park et al.

This study proposes a Task and Motion Planning (TAMP) method with symbolic decisions embedded in a bilevel optimization. This TAMP method exploits the discrete structure of sequential manipulation for long-horizon and versatile tasks in dynamically changing environments. At the symbolic planning level, we propose a scalable decision-making method for long-horizon manipulation tasks using the Planning Domain Definition Language (PDDL) with causal graph decomposition. At the motion planning level, we devise a trajectory optimization (TO) approach based on the Alternating Direction Method of Multipliers (ADMM), suitable for solving constrained, large-scale nonlinear optimization in a distributed manner. Distinct from conventional geometric motion planners, our approach generates highly dynamic manipulation motions by incorporating the full robot and object dynamics. Furthermore, in lieu of a hierarchical planning approach, we solve a holistically integrated bilevel optimization problem involving costs from both the low-level TO and the high-level search. Simulation and experimental results demonstrate dynamic manipulation for long-horizon object sorting tasks in clutter and on a moving conveyor belt.

14.4ROMar 18, 2020

Accelerated ADMM based Trajectory Optimization for Legged Locomotion with Coupled Rigid Body Dynamics

Ziyi Zhou, Ye Zhao

Trajectory optimization is becoming increasingly powerful in addressing motion planning problems of underactuated robotic systems. Numerous prior studies solve such a class of large non-convex optimal control problems in a hierarchical fashion. However, numerical accuracy issues are prone to occur when one uses a full-order model to track reference trajectories generated from a reduced-order model. This study investigates an approach of Alternating Direction Method of Multipliers (ADMM) and proposes a new splitting scheme for legged locomotion problems. Rigid body dynamics constraints and other general constraints such as box and cone constraints are decomposed to multiple sub-problems in a principled manner. The resulting multi-block ADMM framework enables us to leverage the efficiency of an unconstrained optimization method--Differential Dynamical Programming--to iteratively solve the optimizations using centroidal and whole-body models. Furthermore, we propose a Stage-wise Accelerated ADMM with over-relaxation and varying-penalty schemes to improve the overall convergence rate. We evaluate and validate the performance of the proposed ADMM algorithm on a car-parking example and a bipedal locomotion problem over rough terrains.

1.9RONov 15, 2019

Flexoskeleton printing for versatile insect-inspired robots

Mingsong Jiang, Ziyi Zhou, Nicholas G. Gravish

One of the many secrets to the success and prevalence of insects is their versatile, robust, and complex exoskeleton morphology. A fundamental challenge in insect-inspired robotics has been the fabrication of robotic exoskeletons that can match the complexity of exoskeleton structural mechanics. Hybrid robots composed of rigid and soft elements have previously required access to expensive multi-material 3D printers, multi-step casting and machining processes, or limited material choice when using consumer-grade fabrication methods. Here we introduce a new design and fabrication process to rapidly construct flexible exoskeleton-inspired robots called flexoskeleton printing. We modify a consumer-grade fused deposition material (FDM) 3D printer to deposit filament directly onto a heated thermoplastic base layer which provides extremely strong bond strength between the deposited material and the inextensible, flexible base layer. This process significantly improves the fatigue resistance of printed components and enables a new class of insect-inspired robot morphologies. We demonstrate these capabilities through design and testing of a wide library of canonical flexoskeleton elements; ultimately leading to the integration of elements into a flexoskeleton walking legged robot.