SDJun 19, 2025
Improved Intelligibility of Dysarthric Speech using Conditional Flow MatchingShoutrik Das, Nishant Singh, Arjun Gangwar et al.
Dysarthria is a neurological disorder that significantly impairs speech intelligibility, often rendering affected individuals unable to communicate effectively. This necessitates the development of robust dysarthric-to-regular speech conversion techniques. In this work, we investigate the utility and limitations of self-supervised learning (SSL) features and their quantized representations as an alternative to mel-spectrograms for speech generation. Additionally, we explore methods to mitigate speaker variability by generating clean speech in a single-speaker voice using features extracted from WavLM. To this end, we propose a fully non-autoregressive approach that leverages Conditional Flow Matching (CFM) with Diffusion Transformers to learn a direct mapping from dysarthric to clean speech. Our findings highlight the effectiveness of discrete acoustic units in improving intelligibility while achieving faster convergence compared to traditional mel-spectrogram-based approaches.
AIApr 28, 2025
Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of MindMouad Abrini, Omri Abend, Dina Acklin et al. · cambridge
This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.
CVJul 18, 2014
Analysis of Gait Pattern to Recognize the Human ActivitiesJay Prakash Gupta, Pushkar Dixit, Nishant Singh et al.
Human activity recognition based on the computer vision is the process of labelling image sequences with action labels. Accurate systems for this problem are applied in areas such as visual surveillance, human computer interaction and video retrieval.
MMJul 18, 2014
Robust Lossless Semi Fragile Information Protection in ImagesPushkar Dixit, Nishant Singh, Jay Prakash Gupta
Internet security finds it difficult to keep the information secure and to maintain the integrity of the data. Sending messages over the internet secretly is one of the major tasks as it is widely used for passing the message.