98.8SEApr 19Code
KnowPilot: Your Knowledge-Driven Copilot for Domain TasksZekun Xi, Yichen Nie, Ziyan Jiang et al.
Despite the rapid advancement of generative agents, their deployment in real-world industry scenarios often encounters significant challenges due to a lack of domain-specific knowledge. To address this gap, we present KnowPilot: a Domain-Specific Knowledge Augmented Generative Agent System. KnowPilot is an open-source framework that integrates task-specific priors, explicit knowledge, and experiential knowledge to enhance agent performance in specialized applications. It combines knowledge retrieval from structured repositories with a memory system capable of capturing expert experience through human AI interaction. Taking domain-specific writing generation as a representative case, KnowPilot enables private deployment, supports injection of task requirements, loads private knowledge bases, and stores tacit expert knowledge as persistent memory. Experimental results demonstrate that KnowPilot achieves superior performance in domain-oriented text generation and is applicable across fields such as medicine, finance and industry.
LGFeb 4
From Data to Behavior: Predicting Unintended Model Behaviors Before TrainingMengru Wang, Zhenqian Xu, Junfeng Fang et al.
Large Language Models (LLMs) can acquire unintended biases from seemingly benign training data even without explicit cues or malicious content. Existing methods struggle to detect such risks before fine-tuning, making post hoc evaluation costly and inefficient. To address this challenge, we introduce Data2Behavior, a new task for predicting unintended model behaviors prior to training. We also propose Manipulating Data Features (MDF), a lightweight approach that summarizes candidate data through their mean representations and injects them into the forward pass of a base model, allowing latent statistical signals in the data to shape model activations and reveal potential biases and safety risks without updating any parameters. MDF achieves reliable prediction while consuming only about 20% of the GPU resources required for fine-tuning. Experiments on Qwen3-14B, Qwen2.5-32B-Instruct, and Gemma-3-12b-it confirm that MDF can anticipate unintended behaviors and provide insight into pre-training vulnerabilities.