ML LG OCJun 27, 2020

Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python

Jason Ge, Xingguo Li, Haoming Jiang, Han Liu, Tong Zhang, Mengdi Wang, Tuo Zhao

arXiv:2006.15261v110.937 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This provides a tool for researchers and practitioners working with high-dimensional data, but it is incremental as it builds on existing optimization methods.

The authors introduced picasso, a library implementing pathwise coordinate optimization for various sparse learning problems, which scales efficiently to large datasets.

We describe a new library named picasso, which implements a unified framework of pathwise coordinate optimization for a variety of sparse learning problems (e.g., sparse linear regression, sparse logistic regression, sparse Poisson regression and scaled sparse linear regression) combined with efficient active set selection strategies. Besides, the library allows users to choose different sparsity-inducing regularizers, including the convex $\ell_1$, nonconvex MCP and SCAD regularizers. The library is coded in C++ and has user-friendly R and Python wrappers. Numerical experiments demonstrate that picasso can scale up to large problems efficiently.

View on arXiv PDF Code

Similar