B.1.2 S ELF -SUPERVISION We employ the masked patch prediction objective for preliminary self-supervision experiments

for all tasks · 2019

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

cs.CV · 2020-10-22 · accept · novelty 9.0

Vision Transformer (ViT) applies a standard transformer directly to image patches and matches or exceeds state-of-the-art CNN performance on classification benchmarks after large-scale pre-training.

citing papers explorer

Showing 1 of 1 citing paper.

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale cs.CV · 2020-10-22 · accept · none · ref 12
Vision Transformer (ViT) applies a standard transformer directly to image patches and matches or exceeds state-of-the-art CNN performance on classification benchmarks after large-scale pre-training.

B.1.2 S ELF -SUPERVISION We employ the masked patch prediction objective for preliminary self-supervision experiments

fields

years

verdicts

representative citing papers

citing papers explorer