Distributed Function Minimization in Apache Spark
This work provides a tool for distributed optimization in big data contexts, but it is incremental as it adapts existing methods to a new platform without novel algorithmic contributions.
The authors tackled the problem of distributed function minimization by implementing gradient and quasi-Newton methods in Apache Spark, showcasing it with applications like Optimal Transport and scalability tests on classification and regression tasks, but no concrete performance numbers are provided.
We report on an open-source implementation for distributed function minimization on top of Apache Spark by using gradient and quasi-Newton methods. We show-case it with an application to Optimal Transport and some scalability tests on classification and regression problems.