NE AIMar 10, 2018

Evolutionary Architecture Search For Deep Multitask Networks

Jason Liang, Elliot Meyerson, Risto Miikkulainen

arXiv:1803.03745v229.5128 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of automating architecture design for multitask learning, which is incremental as it builds on existing soft ordering architectures.

The paper tackled the challenge of designing deep neural network architectures for multitask learning by using evolutionary optimization to evolve modules and routing, significantly improving state-of-the-art performance in the Omniglot character recognition domain with concrete gains.

Multitask learning, i.e. learning several tasks at once with the same neural network, can improve performance in each of the tasks. Designing deep neural network architectures for multitask learning is a challenge: There are many ways to tie the tasks together, and the design choices matter. The size and complexity of this problem exceeds human design ability, making it a compelling domain for evolutionary optimization. Using the existing state of the art soft ordering architecture as the starting point, methods for evolving the modules of this architecture and for evolving the overall topology or routing between modules are evaluated in this paper. A synergetic approach of evolving custom routings with evolved, shared modules for each task is found to be very powerful, significantly improving the state of the art in the Omniglot multitask, multialphabet character recognition domain. This result demonstrates how evolution can be instrumental in advancing deep neural network and complex system design in general.

View on arXiv PDF

Similar