LGNov 19, 2022
Accuracy Booster: Enabling 4-bit Fixed-point Arithmetic for DNN TrainingSimla Burcu Harma, Ayan Chakraborty, Nicholas Sperry et al.
The unprecedented demand for computing resources to train DNN models has led to a search for minimal numerical encoding. Recent state-of-the-art (SOTA) proposals advocate for multi-level scaled narrow bitwidth numerical formats. In this paper, we show that single-level scaling is sufficient to maintain training accuracy while maximizing arithmetic density. We identify a previously proposed single-level scaled format for 8-bit training, Hybrid Block Floating Point (HBFP), as the optimal candidate to minimize. We perform a full-scale exploration of the HBFP design space using mathematical tools to study the interplay among various parameters and identify opportunities for even smaller encodings across layers and epochs. Based on our findings, we propose Accuracy Booster, a mixed-mantissa HBFP technique that uses 4-bit mantissas for over 99% of all arithmetic operations in training and 6-bit mantissas only in the last epoch and first/last layers. We show Accuracy Booster enables increasing arithmetic density over all other SOTA formats by at least 2.3x while achieving state-of-the-art accuracies in 4-bit training.
NAJun 24, 2018
Weighted Extended B-Spline Finite Element Analysis of a coupled system of general Elliptic equationsAyan Chakraborty, BV. Rathish Kumar
In this study we establish the existence and uniqueness of the solution of a coupled system of general elliptic equations with anisotropic diffusion , non-uniform advection and variably influencing reaction terms on Lipschitz continuous domain $Ω\subset \mathbb{R}^m $ (m$\geq$1) with a Dirichlet boundary. Later we consider the finite element (FE) approximation of the coupled equations in a meshless framework based on weighted extended B-Spine functions (WEBS).The a priori error estimates corresponding to the finite element analysis are derived to establish the convergence of the corresponding FE scheme and the numerical methodology has been tested on few examples.
NAJun 30, 2018
Non uniform weighted extended B-Spline finite element analysis of non linear elliptic partial differential equationsB. V. Rathish Kumar, Ayan Chakraborty
We propose a non uniform web spline based finite element analysis for elliptic partial differential equation with the gradient type nonlinearity in their principal coefficients like p-laplacian equation and Quasi-Newtonian fluid flow equations. We discuss the well-posednes of the problems and also derive the apriori error estimates for the proposed finite element analysis and obtain convergence rate of $\mathcal{O}(h^α)$ for $α> 0$.
NAJun 24, 2018
Web spline error estimation of non-cooperative elliptic equations for population dynamicsAyan Chakraborty, B. V. Rathish Kumar
We analyze the error of the WEB-S finite element method applied to elliptic systems with non-cooperative dominant coupling,with a mixed Dirichlet/Neumann/Robin boundary condition. This problem is strongly related to a posteriori error estimates, giving computable bounds for computational errors and detecting zones in the solution domain where such errors are too large and certain mesh refinements should be performed. These results are based on an extensive regularity analysis of the interface problems of concern.Finally, the error analysis is illustrated by numerical experiments.
LGDec 21, 2021
Multigoal-oriented dual-weighted-residual error estimation using deep neural networksAyan Chakraborty, Thomas Wick, Xiaoying Zhuang et al.
Deep learning has shown successful application in visual recognition and certain artificial intelligence tasks. Deep learning is also considered as a powerful tool with high flexibility to approximate functions. In the present work, functions with desired properties are devised to approximate the solutions of PDEs. Our approach is based on a posteriori error estimation in which the adjoint problem is solved for the error localization to formulate an error estimator within the framework of neural network. An efficient and easy to implement algorithm is developed to obtain a posteriori error estimate for multiple goal functionals by employing the dual-weighted residual approach, which is followed by the computation of both primal and adjoint solutions using the neural network. The present study shows that such a data-driven model based learning has superior approximation of quantities of interest even with relatively less training data. The novel algorithmic developments are substantiated with numerical test examples. The advantages of using deep neural network over the shallow neural network are demonstrated and the convergence enhancing techniques are also presented
SYNov 10, 2019
Synthesis of Feedback Controller for Nonlinear Control Systems with Optimal Region of AttractionAyan Chakraborty, Indranil Saha
We propose a framework for synthesizing a feedback control policy that maximizes the region of attraction (ROA) of a closed-loop nonlinear dynamical system. Our synthesis technique relies on stochastic optimization, which involves computation of an objective function capturing the ROA for a feedback control law. We employ a machine learning technique based on deep neural network to estimate the ROA for a given feedback controller. Overall, our technique is capable of synthesizing a controller co-optimizing traditional control objectives like LQR cost together with ROA. We demonstrate the efficacy of our technique through exhaustive experiments carried out on various nonlinear systems.
IRNov 28, 2013
Searching and Establishment of S-P-O Relationships for Linked RDF Graphs : An Adaptive ApproachAyan Chakraborty, Shiladitya Munshi, Debajyoti Mukhopadhyay
In the coming era of semantic web linked data analysis is a very burning issue for efficient searching and retrieval of information. One way of establishing this link is to implement subject predicate object relationship through Set Theory approach which is already done in our previous work. For analyzing inter relationship between two RDF Graphs, RDF- Schema (RDFS) should also be taken care of. In the present paper, an adaptive combination rule based framework has been proposed for establishment of S P O relationship and RDF Graph searching is reported. Hence the identification of criteria for inter-relationship of RDF Graphs opens up new road in semantic search.