NADec 19, 2011
Vertex-centroid finite volume scheme on tetrahedral grids for conservation lawsPraveen Chandrashekar, Ashish Garg
Vertex-centroid schemes are cell-centered finite volume schemes for conservation laws which make use of vertex values to construct high resolution schemes. The vertex values must be obtained through a consistent averaging (interpolation) procedure. A modified interpolation scheme is proposed which is better than existing schemes in giving positive weights in the interpolation formula. A simplified reconstruction scheme is also proposed which is also more accurate and efficient. For scalar conservation laws, we develop limited versions of the schemes which are stable in maximum norm by constructing suitable limiters. The schemes are applied to compressible flows governed by the Euler equations of inviscid gas dynamics.
LGSep 18, 2019
Leveraging User Engagement Signals For Entity Labeling in a Virtual AssistantDeepak Muralidharan, Justine Kao, Xiao Yang et al.
Personal assistant AI systems such as Siri, Cortana, and Alexa have become widely used as a means to accomplish tasks through natural language commands. However, components in these systems generally rely on supervised machine learning algorithms that require large amounts of hand-annotated training data, which is expensive and time consuming to collect. The ability to incorporate unsupervised, weakly supervised, or distantly supervised data holds significant promise in overcoming this bottleneck. In this paper, we describe a framework that leverages user engagement signals (user behaviors that demonstrate a positive or negative response to content) to automatically create granular entity labels for training data augmentation. Strategies such as multi-task learning and validation using an external knowledge base are employed to incorporate the engagement annotated data and to boost the model's accuracy on a sequence labeling task. Our results show that learning from data automatically labeled by user engagement signals achieves significant accuracy gains in a production deep learning system, when measured on both the sequence labeling task as well as on user facing results produced by the system end-to-end. We believe this is the first use of user engagement signals to help generate training data for a sequence labeling task on a large scale, and can be applied in practical settings to speed up new feature deployment when little human annotated data is available.
LGAug 29, 2019
Active Learning for Domain Classification in a Commercial Spoken Personal AssistantXi C. Chen, Adithya Sagar, Justine T. Kao et al.
We describe a method for selecting relevant new training data for the LSTM-based domain selection component of our personal assistant system. Adding more annotated training data for any ML system typically improves accuracy, but only if it provides examples not already adequately covered in the existing data. However, obtaining, selecting, and labeling relevant data is expensive. This work presents a simple technique that automatically identifies new helpful examples suitable for human annotation. Our experimental results show that the proposed method, compared with random-selection and entropy-based methods, leads to higher accuracy improvements given a fixed annotation budget. Although developed and tested in the setting of a commercial intelligent assistant, the technique is of wider applicability.