Quoc V. Le — Pith Author Registry

Identifiers

name variant Quoc V. Le 0.60 · backfill

Papers (63)

Rethinking Generative Image Pretraining: How Far Are We From Scaling Up Next-Pixel Prediction? cs.CV · 2025 · author #6
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training cs.AI · 2025 · author #7
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling cs.LG · 2024 · author #5
AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions cs.NE · 2023 · author #6
Large Language Models as Optimizers cs.LG · 2023 · author #5
Simple synthetic data reduces sycophancy in large language models cs.CL · 2023 · author #5
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning cs.AI · 2023 · author #8
Scaling Instruction-Finetuned Language Models cs.LG · 2022 · author #34
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them cs.CL · 2022 · author #8
Finetuned Language Models Are Zero-Shot Learners cs.CL · 2021 · author #9
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators cs.CL · 2020 · author #3
BAM! Born-Again Multi-Task Networks for Natural Language Understanding cs.CL · 2019 · author #5
Neural Input Search for Large Scale Recommendation Models cs.LG · 2019 · author #5
Learning Data Augmentation Strategies for Object Detection cs.CV · 2019 · author #6
XLNet: Generalized Autoregressive Pretraining for Language Understanding cs.CL · 2019 · author #6
Selfie: Self-supervised Pretraining for Image Embedding cs.LG · 2019 · author #3
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks cs.LG · 2019 · author #2
The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study cs.LG · 2019 · author #3
NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection cs.CV · 2019 · author #4
The Evolved Transformer cs.LG · 2019 · author #3
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context cs.LG · 2019 · author #5
Domain Adaptive Transfer Learning with Specialist Models cs.CV · 2018 · author #5
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism cs.CV · 2018 · author #9
DropBlock: A regularization method for convolutional networks cs.CV · 2018 · author #3
Semi-Supervised Sequence Modeling with Cross-View Training cs.CL · 2018 · author #4
MnasNet: Platform-Aware Neural Architecture Search for Mobile cs.CV · 2018 · author #7
Stochastic natural gradient descent draws posterior samples in function space cs.LG · 2018 · author #4
AutoAugment: Learning Augmentation Policies from Data cs.CV · 2018 · author #5
Do Better ImageNet Models Transfer Better? cs.CV · 2018 · author #3
QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension cs.CL · 2018 · author #7
Learning Longer-term Dependencies in RNNs with Auxiliary Losses cs.LG · 2018 · author #4
Efficient Neural Architecture Search via Parameter Sharing cs.LG · 2018 · author #4
Neural Program Synthesis with Priority Queue Training cs.AI · 2018 · author #5
Intriguing Properties of Adversarial Examples stat.ML · 2017 · author #4
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? cs.AI · 2017 · author #5
Don't Decay the Learning Rate, Increase the Batch Size cs.LG · 2017 · author #4
A Bayesian Perspective on Generalization and Stochastic Gradient Descent cs.LG · 2017 · author #2
Searching for Activation Functions cs.NE · 2017 · author #3
Neural Optimizer Search with Reinforcement Learning cs.AI · 2017 · author #4
Learning Transferable Architectures for Scalable Image Recognition cs.CV · 2017 · author #4
Device Placement Optimization with Reinforcement Learning cs.LG · 2017 · author #3
Learning to Skim Text cs.CL · 2017 · author #3
Neural Combinatorial Optimization with Reinforcement Learning cs.AI · 2016 · author #3
Learning a Natural Language Interface with Neural Programmer cs.CL · 2016 · author #2
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation cs.CL · 2016 · author #3
Unsupervised Pretraining for Sequence to Sequence Learning cs.CL · 2016 · author #3
Neural Architecture Search with Reinforcement Learning cs.LG · 2016 · author #2
HyperNetworks cs.LG · 2016 · author #3
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation cs.CL · 2016 · author #4
Adding Gradient Noise Improves Learning for Very Deep Networks stat.ML · 2015 · author #3
Multi-task Sequence to Sequence Learning cs.LG · 2015 · author #2
A Neural Transducer cs.LG · 2015 · author #3
Neural Programmer: Inducing Latent Programs with Gradient Descent cs.LG · 2015 · author #2
Semi-supervised Sequence Learning cs.LG · 2015 · author #2
Listen, Attend and Spell cs.CL · 2015 · author #3
Document Embedding with Paragraph Vectors cs.CL · 2015 · author #3
A Simple Way to Initialize Recurrent Networks of Rectified Linear Units cs.NE · 2015 · author #1
Addressing the Rare Word Problem in Neural Machine Translation cs.CL · 2014 · author #3
Sequence to Sequence Learning with Neural Networks cs.CL · 2014 · author #3
Distributed Representations of Sentences and Documents cs.CL · 2014 · author #1
Exploiting Similarities among Languages for Machine Translation cs.CL · 2013 · author #2
Building high-level features using large scale unsupervised learning cs.LG · 2011 · author #1
Learning Graph Matching cs.CV · 2008 · author #4

Mentions

1511.06807 #3 · backfill · confidence 0.70 Quoc V. Le
1511.06114 #2 · backfill · confidence 0.70 Quoc V. Le
1511.04868 #3 · backfill · confidence 0.70 Quoc V. Le
1511.04834 #2 · backfill · confidence 0.70 Quoc V. Le
1511.01432 #2 · backfill · confidence 0.70 Quoc V. Le
1508.01211 #3 · backfill · confidence 0.70 Quoc V. Le
1507.07998 #3 · backfill · confidence 0.70 Quoc V. Le
2312.08472 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
1504.00941 #1 · backfill · confidence 0.70 Quoc V. Le
1410.8206 #3 · backfill · confidence 0.70 Quoc V. Le
1409.3215 #3 · backfill · confidence 0.70 Quoc V. Le
1405.4053 #1 · backfill · confidence 0.70 Quoc V. Le
1309.4168 #2 · backfill · confidence 0.70 Quoc V. Le
1112.6209 #1 · backfill · confidence 0.70 Quoc V. Le
2511.08704 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
1906.08237 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
2301.13688 #8 · arxiv_oai · confidence 0.70 Quoc V. Le
2308.03958 #5 · arxiv_oai · confidence 0.70 Quoc V. Le
1905.11946 #2 · arxiv_oai · confidence 0.70 Quoc V. Le
2003.10555 #3 · arxiv_oai · confidence 0.70 Quoc V. Le
0806.2890 #4 · backfill · confidence 0.70 Quoc V. Le

Frequent Coauthors

Barret Zoph 10 shared papers
Minh-Thang Luong 8 shared papers
Ilya Sutskever 7 shared papers
Oriol Vinyals 6 shared papers
Denny Zhou 5 shared papers
Mohammad Norouzi 5 shared papers
Vijay Vasudevan 5 shared papers
Adams Wei Yu 4 shared papers
Andrew M. Dai 4 shared papers
Jason Wei 4 shared papers
Jeff Dean 4 shared papers
Samuel L. Smith 4 shared papers
Arvind Neelakantan 3 shared papers
Christopher D. Manning 3 shared papers
Ekin D. Cubuk 3 shared papers
Golnaz Ghiasi 3 shared papers
Hieu Pham 3 shared papers
Hyung Won Chung 3 shared papers
Jonathon Shlens 3 shared papers
Kevin Clark 3 shared papers