LG, Nov 9, 2022

Minimalist Data Wrangling with Python

arXiv:2211.04630v1
h-index: 19
Originality: Synthesis-oriented
AI Analysis

The textbook is an introductory educational resource for students learning data science. Its contribution is incremental: it covers established topics and presents no new research results.

This textbook introduces data science concepts and methods for cleaning, transforming, analyzing, and reporting data, aimed at students as a first resource, with free online and PDF versions available.

Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results. This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/.

Code Implementations: 2 repos
