Quoc V. Le
Identifiers
- name variant Quoc V. Le 0.60 · backfill
Papers (63)
- Rethinking Generative Image Pretraining: How Far Are We From Scaling Up Next-Pixel Prediction? cs.CV · 2025 · author #6
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training cs.AI · 2025 · author #7
- Large Language Monkeys: Scaling Inference Compute with Repeated Sampling cs.LG · 2024 · author #5
- AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions cs.NE · 2023 · author #6
- Large Language Models as Optimizers cs.LG · 2023 · author #5
- Simple synthetic data reduces sycophancy in large language models cs.CL · 2023 · author #5
- The Flan Collection: Designing Data and Methods for Effective Instruction Tuning cs.AI · 2023 · author #8
- Scaling Instruction-Finetuned Language Models cs.LG · 2022 · author #34
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them cs.CL · 2022 · author #8
- Finetuned Language Models Are Zero-Shot Learners cs.CL · 2021 · author #9
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators cs.CL · 2020 · author #3
- BAM! Born-Again Multi-Task Networks for Natural Language Understanding cs.CL · 2019 · author #5
- Neural Input Search for Large Scale Recommendation Models cs.LG · 2019 · author #5
- Learning Data Augmentation Strategies for Object Detection cs.CV · 2019 · author #6
- XLNet: Generalized Autoregressive Pretraining for Language Understanding cs.CL · 2019 · author #6
- Selfie: Self-supervised Pretraining for Image Embedding cs.LG · 2019 · author #3
- EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks cs.LG · 2019 · author #2
- The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study cs.LG · 2019 · author #3
- NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection cs.CV · 2019 · author #4
- The Evolved Transformer cs.LG · 2019 · author #3
- Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context cs.LG · 2019 · author #5
- Domain Adaptive Transfer Learning with Specialist Models cs.CV · 2018 · author #5
- GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism cs.CV · 2018 · author #9
- DropBlock: A regularization method for convolutional networks cs.CV · 2018 · author #3
- Semi-Supervised Sequence Modeling with Cross-View Training cs.CL · 2018 · author #4
- MnasNet: Platform-Aware Neural Architecture Search for Mobile cs.CV · 2018 · author #7
- Stochastic natural gradient descent draws posterior samples in function space cs.LG · 2018 · author #4
- AutoAugment: Learning Augmentation Policies from Data cs.CV · 2018 · author #5
- Do Better ImageNet Models Transfer Better? cs.CV · 2018 · author #3
- QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension cs.CL · 2018 · author #7
- Learning Longer-term Dependencies in RNNs with Auxiliary Losses cs.LG · 2018 · author #4
- Efficient Neural Architecture Search via Parameter Sharing cs.LG · 2018 · author #4
- Neural Program Synthesis with Priority Queue Training cs.AI · 2018 · author #5
- Intriguing Properties of Adversarial Examples stat.ML · 2017 · author #4
- Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? cs.AI · 2017 · author #5
- Don't Decay the Learning Rate, Increase the Batch Size cs.LG · 2017 · author #4
- A Bayesian Perspective on Generalization and Stochastic Gradient Descent cs.LG · 2017 · author #2
- Searching for Activation Functions cs.NE · 2017 · author #3
- Neural Optimizer Search with Reinforcement Learning cs.AI · 2017 · author #4
- Learning Transferable Architectures for Scalable Image Recognition cs.CV · 2017 · author #4
- Device Placement Optimization with Reinforcement Learning cs.LG · 2017 · author #3
- Learning to Skim Text cs.CL · 2017 · author #3
- Neural Combinatorial Optimization with Reinforcement Learning cs.AI · 2016 · author #3
- Learning a Natural Language Interface with Neural Programmer cs.CL · 2016 · author #2
- Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation cs.CL · 2016 · author #3
- Unsupervised Pretraining for Sequence to Sequence Learning cs.CL · 2016 · author #3
- Neural Architecture Search with Reinforcement Learning cs.LG · 2016 · author #2
- HyperNetworks cs.LG · 2016 · author #3
- Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation cs.CL · 2016 · author #4
- Adding Gradient Noise Improves Learning for Very Deep Networks stat.ML · 2015 · author #3
- Multi-task Sequence to Sequence Learning cs.LG · 2015 · author #2
- A Neural Transducer cs.LG · 2015 · author #3
- Neural Programmer: Inducing Latent Programs with Gradient Descent cs.LG · 2015 · author #2
- Semi-supervised Sequence Learning cs.LG · 2015 · author #2
- Listen, Attend and Spell cs.CL · 2015 · author #3
- Document Embedding with Paragraph Vectors cs.CL · 2015 · author #3
- A Simple Way to Initialize Recurrent Networks of Rectified Linear Units cs.NE · 2015 · author #1
- Addressing the Rare Word Problem in Neural Machine Translation cs.CL · 2014 · author #3
- Sequence to Sequence Learning with Neural Networks cs.CL · 2014 · author #3
- Distributed Representations of Sentences and Documents cs.CL · 2014 · author #1
- Exploiting Similarities among Languages for Machine Translation cs.CL · 2013 · author #2
- Building high-level features using large scale unsupervised learning cs.LG · 2011 · author #1
- Learning Graph Matching cs.CV · 2008 · author #4
Mentions
- 1511.06807 #3 · backfill · confidence 0.70 Quoc V. Le
- 1511.06114 #2 · backfill · confidence 0.70 Quoc V. Le
- 1511.04868 #3 · backfill · confidence 0.70 Quoc V. Le
- 1511.04834 #2 · backfill · confidence 0.70 Quoc V. Le
- 1511.01432 #2 · backfill · confidence 0.70 Quoc V. Le
- 1508.01211 #3 · backfill · confidence 0.70 Quoc V. Le
- 1507.07998 #3 · backfill · confidence 0.70 Quoc V. Le
- 2312.08472 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
- 1504.00941 #1 · backfill · confidence 0.70 Quoc V. Le
- 1410.8206 #3 · backfill · confidence 0.70 Quoc V. Le
- 1409.3215 #3 · backfill · confidence 0.70 Quoc V. Le
- 1405.4053 #1 · backfill · confidence 0.70 Quoc V. Le
- 1309.4168 #2 · backfill · confidence 0.70 Quoc V. Le
- 1112.6209 #1 · backfill · confidence 0.70 Quoc V. Le
- 2511.08704 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
- 1906.08237 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
- 2301.13688 #8 · arxiv_oai · confidence 0.70 Quoc V. Le
- 2308.03958 #5 · arxiv_oai · confidence 0.70 Quoc V. Le
- 1905.11946 #2 · arxiv_oai · confidence 0.70 Quoc V. Le
- 2003.10555 #3 · arxiv_oai · confidence 0.70 Quoc V. Le
- 0806.2890 #4 · backfill · confidence 0.70 Quoc V. Le
Frequent Coauthors
- Barret Zoph 10 shared papers
- Minh-Thang Luong 8 shared papers
- Ilya Sutskever 7 shared papers
- Oriol Vinyals 6 shared papers
- Denny Zhou 5 shared papers
- Mohammad Norouzi 5 shared papers
- Vijay Vasudevan 5 shared papers
- Adams Wei Yu 4 shared papers
- Andrew M. Dai 4 shared papers
- Jason Wei 4 shared papers
- Jeff Dean 4 shared papers
- Samuel L. Smith 4 shared papers
- Arvind Neelakantan 3 shared papers
- Christopher D. Manning 3 shared papers
- Ekin D. Cubuk 3 shared papers
- Golnaz Ghiasi 3 shared papers
- Hieu Pham 3 shared papers
- Hyung Won Chung 3 shared papers
- Jonathon Shlens 3 shared papers
- Kevin Clark 3 shared papers