pith. machine review for the scientific record.
sign in

arxiv: 1611.02167 · v3 · pith:7PTB2TD4new · submitted 2016-11-07 · 💻 cs.LG

Designing Neural Network Architectures using Reinforcement Learning

classification 💻 cs.LG
keywords learningarchitecturesexistingnetworknetworksagentclassificationdesigning
0
0 comments X
read the original abstract

At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using $Q$-learning with an $\epsilon$-greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Spiking Neural Network Architecture Search: A Survey

    cs.NE 2025-10 unverdicted novelty 2.0

    A survey of Spiking Neural Network architecture search techniques viewed through a hardware/software co-design lens.