0.4CVMay 16
A Systematic Survey on Deep Learning Architectures for Point Cloud Classification and SegmentationMinhas Kamal, Hiranya Garbha Kumar, Balakrishnan Prabhakaran
Point cloud stands as the most widely adopted format for representing 3D shapes and scenes due to its simplicity and geometric fidelity. However, its inherent unordered and irregular nature, exacerbated by sensor noise and occlusions, introduces unique challenges for machine learning based methodologies. To combat these issues, diverse strategies have been developed, including converting to a format that has orderliness, extracting local geometry, and permutation-invariant or self-attention-based processing. In this paper, our focus is directed towards deep learning models for three fundamental tasks in 3D vision: point cloud classification, part segmentation, and semantic segmentation. We begin by formally defining point cloud data, followed by an in-depth discussion on its structural characteristics. Then, we categorize notable works based on their backbone structure and evaluate their performance on popular benchmarks. Beyond empirical comparison, we offer insights into architectural innovations and limitations. We also outline open challenges and promising future directions for 3D point cloud understanding.
CVDec 2, 2020
Braille to Text Translation for Bengali Language: A Geometric ApproachMinhas Kamal, Amin Ahsan Ali, Muhammad Asif Hossain Khan et al.
Braille is the only system to visually impaired people for reading and writing. However, general people cannot read Braille. So, teachers and relatives find it hard to assist them with learning. Almost every major language has software solutions for this translation purpose. However, in Bengali there is an absence of this useful tool. Here, we propose Braille to Text Translator, which takes image of these tactile alphabets, and translates them to plain text. Image deterioration, scan-time page rotation, and braille dot deformation are the principal issues in this scheme. All of these challenges are directly checked using special image processing and geometric structure analysis. The technique yields 97.25% accuracy in recognizing Braille characters.
IRDec 1, 2020
A Statistical Real-Time Prediction Model for Recommender SystemMd Rifat Arefin, Minhas Kamal, Kishan Kumar Ganguly et al.
Recommender system has become an inseparable part of online shopping and its usability is increasing with the advancement of these e-commerce sites. An effective and efficient recommender system benefits both the seller and the buyer significantly. We considered user activities and product information for the filtering process in our proposed recommender system. Our model has achieved inspiring result (approximately 58% true-positive and 13% false-positive) for the data set provided by RecSys Challenge 2015. This paper aims to describe a statistical model that will help to predict the buying behavior of a user in real-time during a session.