Satish K. Singh

h-index: 29

3 papers

10 citations

Novelty: 33%

AI Score: 18

Ranked #189,320 of 194,257 authors (top 97%); #58,446 in CV (top 99%)

3 Papers

2.6 · CV · Feb 22, 2022

Bag of Visual Words (BoVW) with Deep Features -- Patch Classification Model for Limited Dataset of Breast Tumours

Suvidha Tripathi, Satish Kumar Singh, Lee Hwee Kuan

Currently, computational complexity limits the training of Convolutional Neural Networks (CNNs) on high-resolution gigapixel images, so such images are divided into patches or tiles. Because these high-resolution patches encode discriminative information, CNNs are trained on them to perform patch-level predictions. However, pathologists generally annotate at the image level rather than the patch level, so most of the patches may not contain enough class-relevant features. In this work, we incorporate patch-descriptive capability within the deep framework by using Bag of Visual Words (BoVW) as a kind of regularisation to improve generalizability. Using this hypothesis, we aim to build a patch-based classifier to discriminate between four classes of breast biopsy image patches (normal, benign, in situ carcinoma, invasive carcinoma). The task is to extract quality deep features with a CNN to describe relevant information in the images while simultaneously discarding irrelevant information using BoVW. The proposed method passes patches obtained from whole-slide images (WSI) and microscopy images through a pre-trained CNN to extract features. BoVW is used as a feature selector to select the most discriminative features among the CNN features. Finally, the selected feature sets are classified into one of the four classes. The hybrid model provides flexibility in the choice of pre-trained models for feature extraction. The pipeline is end-to-end since it does not require post-processing of patch predictions to select discriminative patches. We compared our results with state-of-the-art methods such as ResNet50, DenseNet169, and InceptionV3 on the BACH-2018 challenge dataset. Our proposed method performs better than all three.
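To make the Bag of Visual Words idea concrete, the following is a minimal sketch of the generic BoVW encoding step, not the authors' feature-selection pipeline. It assumes a codebook of visual words is already available (in practice it would typically be learned by clustering CNN features); the names `nearest_codeword`, `bovw_histogram`, and the toy 2-D vectors are hypothetical stand-ins for real deep features.

```python
import math

def nearest_codeword(feature, codebook):
    """Return the index of the codeword closest to `feature` (Euclidean distance)."""
    distances = [math.dist(feature, word) for word in codebook]
    return distances.index(min(distances))

def bovw_histogram(features, codebook):
    """Encode a set of feature vectors as a normalised visual-word histogram."""
    counts = [0] * len(codebook)
    for feature in features:
        counts[nearest_codeword(feature, codebook)] += 1
    total = sum(counts)
    # Normalise so patches with different numbers of features are comparable.
    return [c / total for c in counts] if total else counts

# Toy data (assumption): 2-D vectors standing in for CNN patch features.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
features = [(0.1, -0.1), (0.9, 1.2), (1.1, 0.8), (4.8, 5.1)]
histogram = bovw_histogram(features, codebook)
```

The resulting histogram is a fixed-length descriptor of the patch that can be passed to any standard classifier, independent of how many feature vectors the patch produced.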

1.4 · CV · Jul 13, 2021

A Novel Deep Learning Method for Thermal to Annotated Thermal-Optical Fused Images

Suranjan Goswami, Satish Kumar Singh, and Bidyut B. Chaudhuri

Thermal images profile the passive radiation of objects and capture it as grayscale images. Such images have a very different distribution of data compared to optical colored images. We present here a work that produces a grayscale thermo-optical fused mask given a thermal input. This is a pioneering deep-learning-based work since, to the best of our knowledge, there exists no other work on thermal-optical grayscale fusion. Our method is also unique in that the proposed deep learning method works in the Discrete Wavelet Transform (DWT) domain instead of the gray-level domain. As part of this work, we also present a new and unique database for obtaining the region of interest in thermal images, based on an existing thermal-visual paired database and containing the region of interest for 5 different classes of data. Finally, we propose a simple, low-overhead statistical measure for identifying the region of interest in the fused images, which we call the Region of Fusion (RoF). Experiments on the database show encouraging results in identifying the region of interest in the fused images. We also show that the images can be processed better in the mixed form than with thermal images alone.
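As an illustration of what working in the DWT domain means, here is a minimal sketch of a one-level 2-D Haar transform on a grayscale image. The choice of the Haar wavelet is an assumption made for simplicity (the abstract does not name the wavelet used), and `haar_dwt2` is a hypothetical helper, not part of the described method.

```python
def haar_dwt2(image):
    """One-level 2-D orthonormal Haar DWT of a grayscale image.

    `image` is a list of rows with even height and width. Returns four
    half-resolution sub-bands: the approximation and three detail bands.
    """
    approx, col_detail, row_detail, diag_detail = [], [], [], []
    for r in range(0, len(image), 2):
        a_row, c_row, r_row, d_row = [], [], [], []
        for c in range(0, len(image[0]), 2):
            # The four pixels of the current 2x2 block.
            p, q = image[r][c], image[r][c + 1]
            s, t = image[r + 1][c], image[r + 1][c + 1]
            a_row.append((p + q + s + t) / 2)  # local average (low-pass both ways)
            c_row.append((p - q + s - t) / 2)  # difference between adjacent columns
            r_row.append((p + q - s - t) / 2)  # difference between adjacent rows
            d_row.append((p - q - s + t) / 2)  # diagonal difference
        approx.append(a_row)
        col_detail.append(c_row)
        row_detail.append(r_row)
        diag_detail.append(d_row)
    return approx, col_detail, row_detail, diag_detail

# Toy 2x2 "image" (assumption): real inputs would be thermal frames.
approx, col_detail, row_detail, diag_detail = haar_dwt2([[1, 2], [3, 4]])
```

A network operating in this domain would take the sub-bands as input channels instead of raw gray levels; with this normalisation the transform preserves the total energy of the image.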

2.4 · IV · Jan 18, 2021

A Simple Mutual Information based Registration Method for Thermal-Optical Image Pairs applied on a Novel Dataset

Suranjan Goswami, Satish Kumar Singh

While registered thermal-optical datasets are becoming widely available, most of them are based on pre-registered image pairs. However, thermal imagers that register these images by default are quite expensive. In this work, we present a thermal image registration technique that is computationally lightweight and can be employed regardless of the resolution of the captured images. We use two different thermal imagers to create a completely new database, which we introduce as part of this work. The captured images cover 5 different classes and include subjects such as the Prayagraj Kumbh Mela, one of the largest public fairs in the world, captured over a period of 2 years.
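The similarity score behind mutual-information-based registration can be sketched as follows. This is a generic illustration under stated assumptions, not the authors' implementation: images are assumed to be flattened into equal-length sequences of quantised intensities, and `mutual_information` is a hypothetical helper name.

```python
import math
from collections import Counter

def mutual_information(img_a, img_b):
    """Mutual information (in bits) between two equal-length intensity sequences."""
    n = len(img_a)
    joint = Counter(zip(img_a, img_b))  # joint histogram of intensity pairs
    count_a = Counter(img_a)            # marginal histogram of the first image
    count_b = Counter(img_b)            # marginal histogram of the second image
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        # p_ab / (p_a * p_b), written in terms of raw counts.
        ratio = count * n / (count_a[a] * count_b[b])
        mi += p_ab * math.log2(ratio)
    return mi

# Toy flattened "images" (assumption): two intensity levels each.
aligned = mutual_information([0, 0, 1, 1], [0, 0, 1, 1])
unrelated = mutual_information([0, 0, 1, 1], [0, 1, 0, 1])
```

A registration routine built on this score would evaluate candidate transforms (for example shifts or scalings) of one image and keep the transform that maximises the mutual information with the other, which is why the measure suits image pairs from different modalities.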