DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

· 2019 · cs.CV · arXiv 1904.02216

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

This paper introduces an extremely efficient CNN architecture named DFANet for semantic segmentation under resource constraints. Our proposed network starts from a single lightweight backbone and aggregates discriminative features through sub-network and sub-stage cascade respectively. Based on the multi-scale feature propagation, DFANet substantially reduces the number of parameters, but still obtains sufficient receptive field and enhances the model learning ability, which strikes a balance between the speed and segmentation performance. Experiments on Cityscapes and CamVid datasets demonstrate the superior performance of DFANet with 8$\times$ less FLOPs and 2$\times$ faster than the existing state-of-the-art real-time semantic segmentation methods while providing comparable accuracy. Specifically, it achieves 70.3\% Mean IOU on the Cityscapes test dataset with only 1.7 GFLOPs and a speed of 160 FPS on one NVIDIA Titan X card, and 71.3\% Mean IOU with 3.4 GFLOPs while inferring on a higher resolution image.

representative citing papers

ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation

cs.CV · 2019-06-27 · unverdicted · novelty 4.0

ELKPPNet combines a balanced encoder-decoder, large kernel spatial pyramid pooling for multi-scale fusion, and an edge-aware loss to claim superior semantic segmentation performance on Cityscapes, CamVid, and NYUDv2 versus prior methods.

citing papers explorer

Showing 1 of 1 citing paper.

ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation cs.CV · 2019-06-27 · unverdicted · none · ref 36 · internal anchor
ELKPPNet combines a balanced encoder-decoder, large kernel spatial pyramid pooling for multi-scale fusion, and an edge-aware loss to claim superior semantic segmentation performance on Cityscapes, CamVid, and NYUDv2 versus prior methods.

DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

fields

years

verdicts

representative citing papers

citing papers explorer