CL, AI · Oct 24, 2023

Speakerly: A Voice-based Writing Assistant for Text Composition

DeepMind
arXiv:2310.16251v1131 citationsh-index: 13
Originality: Synthesis-oriented
AI Analysis

The paper addresses the need for efficient writing tools for general users, but it appears incremental because it combines existing models and methods for a specific application.

The paper tackles the problem of text composition by introducing Speakerly, a real-time voice-based writing assistant that generates well-formatted documents from user instructions or dictation, achieving effective performance across use cases like emails and notes.

We present Speakerly, a new real-time voice-based writing assistance system that helps users with text composition across various use cases such as emails, instant messages, and notes. The user can interact with the system through instructions or dictation, and the system generates a well-formatted and coherent document. We describe the system architecture and detail how we address the various challenges while building and deploying such a system at scale. More specifically, our system uses a combination of small, task-specific models as well as pre-trained language models for fast and effective text composition while supporting a variety of input modes for better usability.

