Lukasz Bolikowski

IR
3papers
14citations
Novelty28%
AI Score17

3 Papers

LGJul 26, 2021
Feature Synergy, Redundancy, and Independence in Global Model Explanations using SHAP Vector Decomposition

Jan Ittner, Lukasz Bolikowski, Konstantin Hemker et al.

We offer a new formalism for global explanations of pairwise feature dependencies and interactions in supervised models. Building upon SHAP values and SHAP interaction values, our approach decomposes feature contributions into synergistic, redundant and independent components (S-R-I decomposition of SHAP vectors). We propose a geometric interpretation of the components and formally prove its basic properties. Finally, we demonstrate the utility of synergy, redundancy and independence by applying them to a constructed data set and model.

IRMar 21, 2013
Taming the zoo - about algorithms implementation in the ecosystem of Apache Hadoop

Piotr Jan Dendek, Artur Czeczko, Mateusz Fedoryszak et al.

Content Analysis System (CoAnSys) is a research framework for mining scientific publications using Apache Hadoop. This article describes the algorithms currently implemented in CoAnSys including classification, categorization and citation matching of scientific publications. The size of the input data classifies these algorithms in the range of big data problems, which can be efficiently solved on Hadoop clusters.

SEMar 21, 2013
How to perform research in Hadoop environment not losing mental equilibrium - case study

Piotr Jan Dendek, Artur Czeczko, Mateusz Fedoryszak et al.

Conducting a research in an efficient, repetitive, evaluable, but also convenient (in terms of development) way has always been a challenge. To satisfy those requirements in a long term and simultaneously minimize costs of the software engineering process, one has to follow a certain set of guidelines. This article describes such guidelines based on the research environment called Content Analysis System (CoAnSys) created in the Center for Open Science (CeON). Best practices and tools for working in the Apache Hadoop environment, as well as the process of establishing these rules are portrayed.