Quoc V. Le

Identifiers

  • name variant Quoc V. Le 0.60 · backfill

Papers (63)

  1. Rethinking Generative Image Pretraining: How Far Are We From Scaling Up Next-Pixel Prediction? cs.CV · 2025 · author #6
  2. SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training cs.AI · 2025 · author #7
  3. Large Language Monkeys: Scaling Inference Compute with Repeated Sampling cs.LG · 2024 · author #5
  4. AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions cs.NE · 2023 · author #6
  5. Large Language Models as Optimizers cs.LG · 2023 · author #5
  6. Simple synthetic data reduces sycophancy in large language models cs.CL · 2023 · author #5
  7. The Flan Collection: Designing Data and Methods for Effective Instruction Tuning cs.AI · 2023 · author #8
  8. Scaling Instruction-Finetuned Language Models cs.LG · 2022 · author #34
  9. Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them cs.CL · 2022 · author #8
  10. Finetuned Language Models Are Zero-Shot Learners cs.CL · 2021 · author #9
  11. ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators cs.CL · 2020 · author #3
  12. BAM! Born-Again Multi-Task Networks for Natural Language Understanding cs.CL · 2019 · author #5
  13. Neural Input Search for Large Scale Recommendation Models cs.LG · 2019 · author #5
  14. Learning Data Augmentation Strategies for Object Detection cs.CV · 2019 · author #6
  15. XLNet: Generalized Autoregressive Pretraining for Language Understanding cs.CL · 2019 · author #6
  16. Selfie: Self-supervised Pretraining for Image Embedding cs.LG · 2019 · author #3
  17. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks cs.LG · 2019 · author #2
  18. The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study cs.LG · 2019 · author #3
  19. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection cs.CV · 2019 · author #4
  20. The Evolved Transformer cs.LG · 2019 · author #3
  21. Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context cs.LG · 2019 · author #5
  22. Domain Adaptive Transfer Learning with Specialist Models cs.CV · 2018 · author #5
  23. GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism cs.CV · 2018 · author #9
  24. DropBlock: A regularization method for convolutional networks cs.CV · 2018 · author #3
  25. Semi-Supervised Sequence Modeling with Cross-View Training cs.CL · 2018 · author #4
  26. MnasNet: Platform-Aware Neural Architecture Search for Mobile cs.CV · 2018 · author #7
  27. Stochastic natural gradient descent draws posterior samples in function space cs.LG · 2018 · author #4
  28. AutoAugment: Learning Augmentation Policies from Data cs.CV · 2018 · author #5
  29. Do Better ImageNet Models Transfer Better? cs.CV · 2018 · author #3
  30. QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension cs.CL · 2018 · author #7
  31. Learning Longer-term Dependencies in RNNs with Auxiliary Losses cs.LG · 2018 · author #4
  32. Efficient Neural Architecture Search via Parameter Sharing cs.LG · 2018 · author #4
  33. Neural Program Synthesis with Priority Queue Training cs.AI · 2018 · author #5
  34. Intriguing Properties of Adversarial Examples stat.ML · 2017 · author #4
  35. Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? cs.AI · 2017 · author #5
  36. Don't Decay the Learning Rate, Increase the Batch Size cs.LG · 2017 · author #4
  37. A Bayesian Perspective on Generalization and Stochastic Gradient Descent cs.LG · 2017 · author #2
  38. Searching for Activation Functions cs.NE · 2017 · author #3
  39. Neural Optimizer Search with Reinforcement Learning cs.AI · 2017 · author #4
  40. Learning Transferable Architectures for Scalable Image Recognition cs.CV · 2017 · author #4
  41. Device Placement Optimization with Reinforcement Learning cs.LG · 2017 · author #3
  42. Learning to Skim Text cs.CL · 2017 · author #3
  43. Neural Combinatorial Optimization with Reinforcement Learning cs.AI · 2016 · author #3
  44. Learning a Natural Language Interface with Neural Programmer cs.CL · 2016 · author #2
  45. Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation cs.CL · 2016 · author #3
  46. Unsupervised Pretraining for Sequence to Sequence Learning cs.CL · 2016 · author #3
  47. Neural Architecture Search with Reinforcement Learning cs.LG · 2016 · author #2
  48. HyperNetworks cs.LG · 2016 · author #3
  49. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation cs.CL · 2016 · author #4
  50. Adding Gradient Noise Improves Learning for Very Deep Networks stat.ML · 2015 · author #3
  51. Multi-task Sequence to Sequence Learning cs.LG · 2015 · author #2
  52. A Neural Transducer cs.LG · 2015 · author #3
  53. Neural Programmer: Inducing Latent Programs with Gradient Descent cs.LG · 2015 · author #2
  54. Semi-supervised Sequence Learning cs.LG · 2015 · author #2
  55. Listen, Attend and Spell cs.CL · 2015 · author #3
  56. Document Embedding with Paragraph Vectors cs.CL · 2015 · author #3
  57. A Simple Way to Initialize Recurrent Networks of Rectified Linear Units cs.NE · 2015 · author #1
  58. Addressing the Rare Word Problem in Neural Machine Translation cs.CL · 2014 · author #3
  59. Sequence to Sequence Learning with Neural Networks cs.CL · 2014 · author #3
  60. Distributed Representations of Sentences and Documents cs.CL · 2014 · author #1
  61. Exploiting Similarities among Languages for Machine Translation cs.CL · 2013 · author #2
  62. Building high-level features using large scale unsupervised learning cs.LG · 2011 · author #1
  63. Learning Graph Matching cs.CV · 2008 · author #4

Mentions

  • 1511.06807 #3 · backfill · confidence 0.70 Quoc V. Le
  • 1511.06114 #2 · backfill · confidence 0.70 Quoc V. Le
  • 1511.04868 #3 · backfill · confidence 0.70 Quoc V. Le
  • 1511.04834 #2 · backfill · confidence 0.70 Quoc V. Le
  • 1511.01432 #2 · backfill · confidence 0.70 Quoc V. Le
  • 1508.01211 #3 · backfill · confidence 0.70 Quoc V. Le
  • 1507.07998 #3 · backfill · confidence 0.70 Quoc V. Le
  • 2312.08472 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
  • 1504.00941 #1 · backfill · confidence 0.70 Quoc V. Le
  • 1410.8206 #3 · backfill · confidence 0.70 Quoc V. Le
  • 1409.3215 #3 · backfill · confidence 0.70 Quoc V. Le
  • 1405.4053 #1 · backfill · confidence 0.70 Quoc V. Le
  • 1309.4168 #2 · backfill · confidence 0.70 Quoc V. Le
  • 1112.6209 #1 · backfill · confidence 0.70 Quoc V. Le
  • 2511.08704 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
  • 1906.08237 #6 · arxiv_oai · confidence 0.70 Quoc V. Le
  • 2301.13688 #8 · arxiv_oai · confidence 0.70 Quoc V. Le
  • 2308.03958 #5 · arxiv_oai · confidence 0.70 Quoc V. Le
  • 1905.11946 #2 · arxiv_oai · confidence 0.70 Quoc V. Le
  • 2003.10555 #3 · arxiv_oai · confidence 0.70 Quoc V. Le
  • 0806.2890 #4 · backfill · confidence 0.70 Quoc V. Le

Frequent Coauthors