LGJul 17, 2022
Coupling Adversarial Learning with Selective Voting Strategy for Distribution Alignment in Partial Domain AdaptationSandipan Choudhuri, Hemanth Venkateswara, Arunabha Sen
In contrast to a standard closed-set domain adaptation task, partial domain adaptation setup caters to a realistic scenario by relaxing the identical label set assumption. The fact of source label set subsuming the target label set, however, introduces few additional obstacles as training on private source category samples thwart relevant knowledge transfer and mislead the classification process. To mitigate these issues, we devise a mechanism for strategic selection of highly-confident target samples essential for the estimation of class-importance weights. Furthermore, we capture class-discriminative and domain-invariant features by coupling the process of achieving compact and distinct class distributions with an adversarial objective. Experimental findings over numerous cross-domain classification tasks demonstrate the potential of the proposed technique to deliver superior and comparable accuracy over existing methods.
AIJul 19, 2024
The Cardinality of Identifying Code Sets for Soccer Ball Graph with Application to Remote SensingAnna L. D. Latour, Arunabha Sen, Kaustav Basu et al.
In the context of satellite monitoring of the earth, we can assume that the surface of the earth is divided into a set of regions. We assume that the impact of a big social/environmental event spills into neighboring regions. Using Identifying Code Sets (ICSes), we can deploy sensors in such a way that the region in which an event takes place can be uniquely identified, even with fewer sensors than regions. As Earth is almost a sphere, we use a soccer ball as a model. We construct a Soccer Ball Graph (SBG), and provide human-oriented, analytical proofs that 1) the SBG has at least 26 ICSes of cardinality ten, implying that there are at least 26 different ways to deploy ten satellites to monitor the Earth and 2) that the cardinality of the minimum Identifying Code Set (MICS) for the SBG is at least nine. We then provide a machine-oriented formal proof that the cardinality of the MICS for the SBG is in fact ten, meaning that one must deploy at least ten satellites to monitor the Earth in the SBG model. We also provide machine-oriented proof that there are exactly 26 ICSes of cardinality ten for the SBG.
LGDec 3, 2022
Domain-Invariant Feature Alignment Using Variational Inference For Partial Domain AdaptationSandipan Choudhuri, Suli Adeniye, Arunabha Sen et al.
The standard closed-set domain adaptation approaches seek to mitigate distribution discrepancies between two domains under the constraint of both sharing identical label sets. However, in realistic scenarios, finding an optimal source domain with identical label space is a challenging task. Partial domain adaptation alleviates this problem of procuring a labeled dataset with identical label space assumptions and addresses a more practical scenario where the source label set subsumes the target label set. This, however, presents a few additional obstacles during adaptation. Samples with categories private to the source domain thwart relevant knowledge transfer and degrade model performance. In this work, we try to address these issues by coupling variational information and adversarial learning with a pseudo-labeling technique to enforce class distribution alignment and minimize the transfer of superfluous information from the source samples. The experimental findings in numerous cross-domain classification tasks demonstrate that the proposed technique delivers superior and comparable accuracy to existing methods.
CVJun 20, 2019Code
Predicting Future Opioid Incidences TodaySandipan Choudhuri, Kaustav Basu, Kevin Thomas et al.
According to the Center of Disease Control (CDC), the Opioid epidemic has claimed more than 72,000 lives in the US in 2017 alone. In spite of various efforts at the local, state and federal level, the impact of the epidemic is becoming progressively worse, as evidenced by the fact that the number of Opioid related deaths increased by 12.5\% between 2016 and 2017. Predictive analytics can play an important role in combating the epidemic by providing decision making tools to stakeholders at multiple levels - from health care professionals to policy makers to first responders. Generating Opioid incidence heat maps from past data, aid these stakeholders to visualize the profound impact of the Opioid epidemic. Such post-fact creation of the heat map provides only retrospective information, and as a result, may not be as useful for preventive action in the current or future time-frames. In this paper, we present a novel deep neural architecture, which learns subtle spatio-temporal variations in Opioid incidences data and accurately predicts future heat maps. We evaluated the efficacy of our model on two open source datasets- (i) The Cincinnati Heroin Overdose dataset, and (ii) Connecticut Drug Related Death Dataset.
CVOct 18, 2023
Robust Class-Conditional Distribution Alignment for Partial Domain AdaptationSandipan Choudhuri, Arunabha Sen
Unwanted samples from private source categories in the learning objective of a partial domain adaptation setup can lead to negative transfer and reduce classification performance. Existing methods, such as re-weighting or aggregating target predictions, are vulnerable to this issue, especially during initial training stages, and do not adequately address class-level feature alignment. Our proposed approach seeks to overcome these limitations by delving deeper than just the first-order moments to derive distinct and compact categorical distributions. We employ objectives that optimize the intra and inter-class distributions in a domain-invariant fashion and design a robust pseudo-labeling for efficient target supervision. Our approach incorporates a complement entropy objective module to reduce classification uncertainty and flatten incorrect category predictions. The experimental findings and ablation analysis of the proposed modules demonstrate the superior performance of our proposed model compared to benchmarks.
CVSep 7, 2023
A Robust Negative Learning Approach to Partial Domain Adaptation Using Source PrototypesSandipan Choudhuri, Suli Adeniye, Arunabha Sen
This work proposes a robust Partial Domain Adaptation (PDA) framework that mitigates the negative transfer problem by incorporating a robust target-supervision strategy. It leverages ensemble learning and includes diverse, complementary label feedback, alleviating the effect of incorrect feedback and promoting pseudo-label refinement. Rather than relying exclusively on first-order moments for distribution alignment, our approach offers explicit objectives to optimize intra-class compactness and inter-class separation with the inferred source prototypes and highly-confident target samples in a domain-invariant fashion. Notably, we ensure source data privacy by eliminating the need to access the source data during the adaptation phase through a priori inference of source prototypes. We conducted a series of comprehensive experiments, including an ablation analysis, covering a range of partial domain adaptation tasks. Comprehensive evaluations on benchmark datasets corroborate our framework's enhanced robustness and generalization, demonstrating its superiority over existing state-of-the-art PDA approaches.
CRMar 25, 2021
Identification and Mitigation of False Data Injection using Multi State Implicative Interdependency Model (MSIIM) for Smart GridSohini Roy, Arunabha Sen
Smart grid monitoring, automation and control will completely rely on PMU based sensor data soon. Accordingly, a high throughput, low latency Information and Communication Technology (ICT) infrastructure should be opted in this regard. Due to the low cost, low power profile, dynamic nature, improved accuracy and scalability, wireless sensor networks (WSNs) can be a good choice. Yet, the efficiency of a WSN depends a lot on the network design and the routing technique. In this paper a new design of the ICT network for smart grid using WSN is proposed. In order to understand the interactions between different entities, detect their operational levels, design the routing scheme and identify false data injection by particular ICT entities, a new model of interdependency called the Multi State Implicative Interdependency Model (MSIIM) is proposed in this paper, which is an updated version of the Modified Implicative Interdependency Model (MIIM) [1]. MSIIM considers the data dependency and operational accuracy of entities together with structural and functional dependencies between them. A multi-path secure routing technique is also proposed in this paper which relies on the MSIIM model for its functioning. Simulation results prove that MSIIM based False Data Injection (FDI) detection and mitigation works better and faster than existing methods.
CVJan 6, 2021
Partial Domain Adaptation Using Selective Representation Learning For Class-Weight ComputationSandipan Choudhuri, Riti Paul, Arunabha Sen et al.
The generalization power of deep-learning models is dependent on rich-labelled data. This supervision using large-scaled annotated information is restrictive in most real-world scenarios where data collection and their annotation involve huge cost. Various domain adaptation techniques exist in literature that bridge this distribution discrepancy. However, a majority of these models require the label sets of both the domains to be identical. To tackle a more practical and challenging scenario, we formulate the problem statement from a partial domain adaptation perspective, where the source label set is a super set of the target label set. Driven by the motivation that image styles are private to each domain, in this work, we develop a method that identifies outlier classes exclusively from image content information and train a label classifier exclusively on class-content from source images. Additionally, elimination of negative transfer of samples from classes private to the source domain is achieved by transforming the soft class-level weights into two clusters, 0 (outlier source classes) and 1 (shared classes) by maximizing the between-cluster variance between them.
MMFeb 25, 2018
User Satisfaction-Driven Bandwidth Allocation for Image Transmission in a Crowded EnvironmentSandipan Choudhuri, Kaustav Basu, Arunabha Sen
A major portion of postings on social networking sites constitute high quality digital images and videos. These images and videos require a fairly large amount of bandwidth during transmission. Accordingly, high quality image and video postings become a challenge for the network service provider, especially in a crowded environment where bandwidth is in high demand. In this paper we present a user satisfaction driven bandwidth allocation scheme for image transmission in such environments. In an image, there are always objects that stand out more than others. The reason behind some set of objects being more important in a scene is based on a number of visual, as well as, cognitive factors. Being motivated by the fact that user satisfaction is more dependent on the quality of these salient objects in an image than non-salient ones, we propose a quantifiable metric for measuring user-satisfiability (based on image quality and delay of transmission). The bandwidth allocation technique proposed thereafter, ensures that this user-satisfiability is maximized. Unlike the existing approaches that utilize some fixed set of non-linear functions for framing the user-satisfiability index, our metric is modelled over customer survey data, where the unknown parameters are trained with machine learning methods.
SYMay 21, 2017
Finding $K$ Contingency List in Power Networks using a New Model of DependencyJoydeep Banerjee, Anamitra Pal, Kaustav Basu et al.
Smart grid systems are composed of power and communication network components. The components in either network exhibit complex dependencies on components in its own as well as the other network to drive their functionality. Existing, models fail to capture these complex dependencies. In this paper, we restrict to the dependencies in the power network and propose the Multi-scale Implicative Interdependency Relation (MIIR) model that address the existing limitations. A formal description of the model along with its working dynamics and a brief validation with respect to the 2011 Southwest blackout are provided. Utilizing the MIIR model, the $K$ Contingency List problem is proposed. For a given time instant, the problem solves for a set of $K$ entities in a power network which when failed at that time instant would cause the maximum number of entities to fail eventually. Owing to the problem being NP-complete we devised a Mixed Integer Program (MIP) to obtain the optimal solution and a polynomial time sub-optimal heuristic. The efficacy of the heuristic with respect to the MIP is compared by using different bus system data. In general, the heuristic is shown to provide near optimal solution at a much faster time than the MIP.
NIJan 24, 2017
On Robustness in Multilayer Interdependent NetworkJoydeep Banerjee, Chenyang Zhou, Arunabha Sen
Critical Infrastructures like power and communication networks are highly interdependent on each other for their full functionality. Many significant research have been pursued to model the interdependency and failure analysis of these interdependent networks. However, most of these models fail to capture the complex interdependencies that might actually exist between the infrastructures. The \emph{Implicative Interdependency Model} that utilizes Boolean Logic to capture complex interdependencies was recently proposed which overcome the limitations of the existing models. A number of problems were studies based on this model. In this paper we study the \textit{Robustness} problem in Interdependent Power and Communication Network. The robustness is defined with respect to two parameters $K \in I^{+} \cup \{0\}$ and $ρ\in (0,1]$. We utilized the \emph{Implicative Interdependency Model} model to capture the complex interdependency between the two networks. The model classifies the interdependency relations into four cases. Computational complexity of the problem is analyzed for each of these cases. A polynomial time algorithm is designed for the first case that outputs the optimal solution. All the other cases are proved to be NP-complete. An in-approximability bound is provided for the third case. For the general case we formulate an Integer Linear Program to get the optimal solution and a polynomial time heuristic. The applicability of the heuristic is evaluated using power and communication network data of Maricopa County, Arizona. The experimental results showed that the heuristic almost always produced near optimal value of parameter $K$ for $ρ< 0.42$.