GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing
This toolkit release addresses the need for accessible and flexible deep learning tools in the CV and NLP communities, though the contribution is incremental because it builds on existing frameworks.
The authors introduce GluonCV and GluonNLP, deep learning toolkits for computer vision and natural language processing built on Apache MXNet. The toolkits provide state-of-the-art pre-trained models and modular APIs to facilitate rapid prototyping and reproducible research.
We present GluonCV and GluonNLP, deep learning toolkits for computer vision and natural language processing based on Apache MXNet (incubating). These toolkits provide state-of-the-art pre-trained models, training scripts, and training logs to facilitate rapid prototyping and promote reproducible research. We also provide modular APIs with flexible building blocks to enable efficient customization. Leveraging the MXNet ecosystem, the deep learning models in GluonCV and GluonNLP can be deployed to a variety of platforms using different programming languages. Both toolkits are released under the Apache 2.0 license, which permits distribution, modification, and use of the software.