pith. machine review for the scientific record. sign in

foundational unreviewed works

The most-cited unreviewed works inside Pith’s own reviewed-paper corpus. This queue grows naturally as new papers are reviewed and their references are resolved.

  1. Instruction tuning with GPT-4.arXiv preprint arXiv:2304.03277, 2023 21 Pith citing papers · 2023

    Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley, and Jianfeng Gao

  2. mplug-owl: Modularization empowers large lan- guage models with multimodality 21 Pith citing papers · 2023

    Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, et al

  3. Eureka: Human- level reward design via coding large language models, 20 Pith citing papers · 2023

    Y

  4. Transfer between modalities with metaqueries.arXiv preprint arXiv:2504.06256, 2025 20 Pith citing papers · 2025

    Xichen Pan, Satya Narayan Shukla, Aashu Singh, Zhuokai Zhao, Shlok Kumar Mishra, Jialiang Wang, Zhiyang Xu, Jiuhai Chen, Kunpeng Li, Felix Juefei-Xu, et al

  5. Aligning text-to-image models using human feedback, 20 Pith citing papers · 2023

    K

  6. Cobaya: Code for Bayesian Analysis of hierarchical physical models, 20 Pith citing papers · 2021

    Jesus Torrado and Antony Lewis, “Cobaya: Code for Bayesian Analysis of hierarchical physical models,” JCAP 05, 057 (

  7. Deep retinex decomposition for low-light enhancement 20 Pith citing papers · 2018

    Chen Wei, Wenjing Wang, Wenhan Yang, and Jiaying Liu

  8. MMDetection: Open mmlab detection toolbox and benchmark, 20 Pith citing papers · 1906

    K

  9. One billion word benchmark for measuring progress in statistical language modeling 20 Pith citing papers · 2014

    Chelba, C

  10. Td-mpc2: Scalable, robust world models for continuous control 20 Pith citing papers · 2023

    Nicklas Hansen, Hao Su, and Xiaolong Wang

  11. Vbench-2.0: Advancing video generation benchmark suite for intrinsic faithfulness 20 Pith citing papers · 2025

    Dian Zheng, Ziqi Huang, Hongbo Liu, Kai Zou, Yinan He, Fan Zhang, Lulu Gu, Yuanhan Zhang, Jingwen He, Wei-Shi Zheng, et al

  12. Aligning large multi-modal model with robust instruction tuning 20 Pith citing papers · 2023

    Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, and Lijuan Wang

  13. arXiv preprint arXiv:2309.05463 (2023) ExploreVLA 17 20 Pith citing papers · 2023

    Li, Y

  14. Averaging weights leads to wider optima and better generalization.arXiv preprint arXiv:1803.05407, 2018 20 Pith citing papers · 2018

    Pavel Izmailov, Dmitrii Podoprikhin, Timur Garipov, Dmitry Vetrov, and Andrew Gordon Wilson

  15. Fast and accurate deep network learning by exponential linear units (elus).arXiv preprint arXiv:1511.07289 , 20 Pith citing papers · 2015

    Djork-Arné Clevert, Thomas Unterthiner, and Sepp Hochreiter

  16. Glm-130b: An open bilingual pre-trained model 20 Pith citing papers · 2022

    Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, et al

  17. Improving alignment of dialogue agents via targeted human judgements 20 Pith citing papers · 2022

    Amelia Glaese, Nat McAleese, Maja Tr˛ ebacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, et al

  18. Llm+ p: Em- powering large language models with optimal planning proficiency 20 Pith citing papers · 2023

    [Liu et al

  19. Mle-bench: Evaluating machine learning agents on machine learning engineering 20 Pith citing papers · 2024

    Jun Shern Chan, Neil Chowdhury, Oliver Jaffe, James Aung, Dane Sherburn, Evan Mays, Giulio Starace, Kevin Liu, Leon Maksin, Tejal Patwardhan, and 1 others

  20. Tests of general relativity with GW150914, 20 Pith citing papers · 2016

    B

  21. Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model, 2022 20 Pith citing papers · 2022

    Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song,

  22. Video- llava: Learning united visual representation by alignment before projection 20 Pith citing papers · 2023

    [Lin et al

  23. Di Valentinoet al.(CosmoVerse Network), The Cos- moVerseWhitePaper: Addressingobservationaltensions in cosmology with systematics and fundamental physics, Phys 19 Pith citing papers · 2025

    E

  24. Llada2.0: Scaling up diffusion language models to 100b, 2025 19 Pith citing papers · 2025

    Bie, T

  25. Luoet al.(TianQin), TianQin: a space-borne gravitational wave detector, Classical Quantum Gravity33, 035010 (2016), arXiv:1512.02076 [astro-ph.IM] 19 Pith citing papers · 2016

    J

  26. SoK: Agentic skills–beyond tool use in LLM agents.arXiv preprint arXiv:2602.20867, 2026 19 Pith citing papers · 2026

    Yanna Jiang, Delong Li, Haiyu Deng, Baihe Ma, Xu Wang, et al

  27. Training agents inside of scalable world models, 19 Pith citing papers · 2025

    D

  28. Agentspec: Customizable runtime enforcement for safe and reliable llm agents 19 Pith citing papers · 2025

    Haoyu Wang, Christopher M Poskitt, and Jun Sun

  29. Bigcodebench: Bench- marking code generation with diverse function calls and complex instructions.arXiv preprint arXiv:2406.15877, 19 Pith citing papers · 2024

    89 Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, et al

  30. Codebleu: a method for automatic evaluation of code synthesis 19 Pith citing papers · 2020

    Shuo Ren, Daya Guo, Shuai Lu, Long Zhou, Shujie Liu, Duyu Tang, Neel Sundaresan, Ming Zhou, Ambrosio Blanco, and Shuai Ma

  31. Enhancing chat language models by scaling high-quality instructional conversations 19 Pith citing papers · 2023

    Ding, N

  32. Measuring short-form factuality in large language models 19 Pith citing papers · 2024

    Jason Wei, Nguyen Karina, Hyung Won Chung, Yunxin Joy Jiao, Spencer Papay, Amelia Glaese, John Schulman, and William Fedus

  33. Project aria: A new tool for egocentric multi-modal ai research.arXiv preprint arXiv:2308.13561, 2023 19 Pith citing papers · 2023

    Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Mered- ith, et al

  34. Reasoning models don’t always say what they think.arXiv preprint arXiv:2505.05410, 2025 19 Pith citing papers · 2025

    Yanda Chen, Joe Benton, Ansh Radhakrishnan, Jonathan Uesato, Carson Denison, John Schul- man, Arushi Somani, Peter Hase, Misha Wagner, Fabien Roger, Vlad Mikulik, Samuel R Bowman, Jan Leike, Jared Kaplan, and Ethan Perez

  35. R-zero: Self-evolving reasoning llm from zero data.arXiv preprint arXiv:2508.05004, 19 Pith citing papers · 2025

    Chengsong Huang, Wenhao Yu, Xiaoyang Wang, Hongming Zhang, Zongxia Li, Ruosen Li, Jiaxin Huang, Haitao Mi, and Dong Yu

  36. Tabpfn-2.5: Advancing the state of the art in tabular foundation models, 2025 19 Pith citing papers · 2025

    Léo Grinsztajn, Klemens Flöge, Oscar Key, Felix Birkel, Philipp Jund, Brendan Roof, Benjamin Jäger, Dominik Safaric, Simone Alessi, Adrian Hayler, Mihir Manium, Rosen Yu, Felix Jablon- ski, Shi Bin Hoo, Anurag Garg, Jake Robertson, Magnus B

  37. Universal manipula- tion interface: In-the-wild robot teaching without in-the- wild robots, 19 Pith citing papers · 2024

    C

  38. Accelerating uni- verses with scaling dark matter, 19 Pith citing papers · 2001

    Michel Chevallier and David Polarski, “Accelerating uni- verses with scaling dark matter,” Int

  39. Audio flamingo 3: Advancing audio intelligence with fully open large audio language models, 19 Pith citing papers · 2025

    A

  40. Defending against indirect prompt injection attacks with spotlighting 19 Pith citing papers · 2024

    Keegan Hines et al

  41. DYNESTY: a dynamic nested sampling package for estimating Bayesian posteriors and evidences, 19 Pith citing papers · 2020

    J

  42. Gated linear attention transformers with hardware-efficient training.arXiv preprint arXiv:2312.06635, 2023 19 Pith citing papers · 2023

    Songlin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, and Yoon Kim

  43. How much knowledge can you pack into the parameters of a language model? arXiv preprint arXiv:2002.08910, 2020 19 Pith citing papers · 2002

    Roberts, A

  44. Jailbreak attacks and defenses against large language models: A survey.arXiv preprint arXiv:2407.04295, 2024 19 Pith citing papers · 2024

    Sibo Yi, Yule Liu, Zhen Sun, Tianshuo Cong, Xinlei He, Jiaxing Song, Ke Xu, and Qi Li

  45. Jailbroken: How does llm safety training fail?, 19 Pith citing papers · 2023

    A

  46. Llada 1.5: Variance-reduced preference optimization for large language diffusion models.arXiv preprint arXiv:2505.19223, 2025 19 Pith citing papers · 2025

    F

  47. Mlvu: A comprehensive benchmark for multi-task long video understanding, 19 Pith citing papers · 2024

    J

  48. Mobile aloha: Learning bimanual mobile manipulation with low-cost whole-body teleoperation.arXiv preprint arXiv:2401.02117, 19 Pith citing papers · 2024

    Zipeng Fu, Tony Z Zhao, and Chelsea Finn

  49. Nv-embed: Improved techniques for training llms as generalist embedding models.arXiv preprint arXiv:2405.17428, 19 Pith citing papers · 2024

    Chankyu Lee, Rajarshi Roy, Mengyao Xu, Jonathan Raiman, Mohammad Shoeybi, Bryan Catanzaro, and Wei Ping

  50. Open problems and fundamental limitations of reinforcement learning from human feedback 19 Pith citing papers · 2023

    Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, J ´er´emy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, et al

  51. Thyme: Think beyond images.arXiv preprint arXiv:2508.11630, 2025 19 Pith citing papers · 2025

    Yi-Fan Zhang, Xingyu Lu, Shukang Yin, Chaoyou Fu, Wei Chen, Xiao Hu, Bin Wen, Kaiyu Jiang, Changyi Liu, Tianke Zhang, et al

  52. TTRL: Test-time reinforcement learning.arXiv preprint arXiv:2504.16084, 2025 19 Pith citing papers · 2025

    Anonymous

  53. Workarena: How capable are web agents at solving common knowledge work tasks? arXiv preprint arXiv:2403.07718, 19 Pith citing papers · 2024

    Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H Laradji, Manuel Del Verme, Tom Marty, L ´eo Boisvert, Megh Thakkar, Quentin Cappart, David Vazquez, et al

  54. Probing classifiers: Promises, shortcomings, and advances 18 Pith citing papers · 130 global citations · 2022

    Yonatan Belinkov

  55. Vl-rethinker: Incentivizing self-reflection of vision-language models with reinforcement learning.arXiv preprint arXiv:2504.08837, 2025 18 Pith citing papers · 2025

    Haozhe Wang, Chao Qu, Zuming Huang, Wei Chu, Fangzhen Lin, and Wenhu Chen

  56. Pengcheng He, Jianfeng Gao, and Weizhu Chen 18 Pith citing papers · 2021

    Pengcheng He, Jianfeng Gao, and Weizhu Chen

  57. Gptfuzzer: Red teaming large language models with auto-generated jailbreak prompts, 18 Pith citing papers · 2023

    J

  58. Mass- editing memory in a transformer.arXiv preprint arXiv:2210.07229, 2023 18 Pith citing papers · 2023

    Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, and David Bau

  59. MEM1: learning to synergize memory and reasoning for efficient long-horizon agents.CoRR, abs/2506.15841,2025 18 Pith citing papers · 2025

    Zijian Zhou, Ao Qu, Zhaoxuan Wu, Sunghwan Kim, Alok Prakash, Daniela Rus, Jinhua Zhao, Bryan Kian Hsiang Low, and Paul Pu Liang

  60. mimic-video: Video-action models for generalizable robot control beyond vlas.arXiv preprint 2512.15692, 2025 18 Pith citing papers · 2025

    Jonas Pai, Liam Achenbach, Victoriano Montesinos, Benedek Forrai, Oier Mees, and Elvis Nava

  61. Orca: Progressive learning from complex explanation traces of GPT-4.arXiv preprint arXiv:2306.02707, 18 Pith citing papers · 2023

    Subhabrata Mukherjee, Arindam Mitra, Ganesh Jawahar, Sahaj Agarwal, Hamid Palangi, and Ahmed Awadallah

  62. Pal: Program-aided language models 18 Pith citing papers · 2022

    Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, and Graham Neubig

  63. Scaling relationship on learning mathematical reasoning with large language models.arXiv preprint arXiv:2308.01825, 2023 18 Pith citing papers · 2023

    Zheng Yuan, Hongyi Yuan, Chengpeng Li, Guanting Dong, Keming Lu, Chuanqi Tan, Chang Zhou, and Jingren Zhou

  64. Seedance 1.5 pro: A native audio-visual joint generation foundation model.arXiv preprint arXiv:2512.13507, 2025 18 Pith citing papers · 2025

    Team Seedance, Heyi Chen, Siyan Chen, Xin Chen, Yanfei Chen, Ying Chen, Zhuo Chen, Feng Cheng, Tianheng Cheng, Xinqi Cheng, et al

  65. Zoedepth: Zero- shot transfer by combining relative and metric depth.arXiv preprint arXiv:2302.12288, 2023 18 Pith citing papers · 2023

    Shariq Farooq Bhat, Reiner Birkl, Diana Wofk, Peter Wonka, and Matthias Müller

  66. 3d diffusion policy: Generalizable visuomotor policy learning via simple 3d representations, 18 Pith citing papers · 2024

    Y

  67. Akutsuet al.(KAGRA), Overview of KAGRA: Detector design and construction history, PTEP2021, 05A101 (2021), arXiv:2005.05574 [physics.ins-det] 18 Pith citing papers · 2021

    T

  68. arXiv preprint arXiv:2410.10629 (2024) PromptEvolver 19 18 Pith citing papers · 2024

    Xie, E

  69. A Unified Approach to Interpreting Model Predic- tions, November 2017 18 Pith citing papers · 2017

    Scott Lundberg and Su-In Lee

  70. DeepONet: Learning nonlinear operators for iden- tifying differential equations based on the universal approximation theorem of operators, 18 Pith citing papers · 2021

    L

  71. Feder Cooper, Daphne Ippolito, Christopher A 18 Pith citing papers · 2023

    Milad Nasr, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A

  72. From llm reasoning to autonomous ai agents: A comprehensive review.arXiv preprint arXiv:2504.19678, 18 Pith citing papers · 2025

    Mohamed Amine Ferrag, Norbert Tihanyi, and Merouane Debbah

  73. Internvla-m1: A spatially guided vision- language-action framework for generalist robot policy, 18 Pith citing papers · 2025

    I

  74. Leworldmodel: Stable end-to-end joint-embedding predictive archi- tecture from pixels, 18 Pith citing papers · 2026

    L

  75. Progress measures for grokking via mechanistic interpretability, 18 Pith citing papers · 2023

    N

  76. Ro- bodreamer: Learning compositional world models for robot imagination.arXiv preprint arXiv:2404.12377, 2024 18 Pith citing papers · 2024

    Siyuan Zhou, Yilun Du, Jiaben Chen, Yandong Li, Dit-Yan Yeung, and Chuang Gan

  77. Tabpfn: A transformer that solves small tabu- lar classification problems in a second, 18 Pith citing papers · 2023

    N

  78. Universal language model fine-tuning for text classification 18 Pith citing papers · 2018

    Jeremy Howard and Sebastian Ruder

  79. Vicreg: Variance-invariance-covariance regularization for self- supervised learning.arXiv preprint arXiv:2105.04906, 2021 18 Pith citing papers · 2021

    Adrien Bardes, Jean Ponce, and Yann LeCun

  80. Abbottet al.(LIGO Scientific, Virgo), Tests of general relativity with binary black holes from the second LIGO- Virgo gravitational-wave transient catalog, Phys 18 Pith citing papers · 2021

    R

  81. Active learning for convolutional neural networks: A core-set approach.arXiv preprint arXiv:1708.00489, 2017 18 Pith citing papers · 2017

    Ozan Sener and Silvio Savarese

  82. Agent skills in the wild: An empirical study of security vulnerabilities at scale, 18 Pith citing papers · 2026

    Y

  83. Blink: Multimodal large language models can see but not perceive 18 Pith citing papers · 2024

    Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A Smith, Wei-Chiu Ma, and Ranjay Krishna

  84. Coca: Con- trastive captioners are image-text foundation models 18 Pith citing papers · 2022

    Jiahui Yu, Zirui Wang, Vijay Vasudevan, Legg Yeung, Mojtaba Seyedhosseini, and Yonghui Wu

  85. Eagle: Speculative sampling requires rethinking feature uncertainty.arXiv preprint arXiv:2401.15077, 2024 18 Pith citing papers · 2024

    Yuhui Li, Fangyun Wei, Chao Zhang, and Hongyang Zhang

  86. ediff-i: Text-to-image diffusion models with ensemble of expert denoisers 18 Pith citing papers · 2022

    Balaji, Y

  87. Gen2act: Human video generation in novel scenarios enables generalizable robot manipulation.arXiv preprint arXiv:2409.16283, 2024 18 Pith citing papers · 2024

    Homanga Bharadhwaj, Debidatta Dwibedi, Abhinav Gupta, Shubham Tulsiani, Carl Doer- sch, Ted Xiao, Dhruv Shah, Fei Xia, Dorsa Sadigh, and Sean Kirmani

  88. Gsm-symbolic: Understanding the limitations of mathematical reasoning in large language models.arXiv preprint arXiv:2410.05229, 2024 18 Pith citing papers · 2024

    Iman Mirzadeh, Keivan Alizadeh, Hooman Shahrokhi, Oncel Tuzel, Samy Bengio, and Mehrdad Farajtabar

  89. GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants, 18 Pith citing papers · 2026

    A

  90. GWTC-4.0: Tests of General Relativity. I. Overview and General Tests, 18 Pith citing papers · 2026

    A

  91. Llama-adapter: Efficient fine-tuning of language models with zero-init at- tention 18 Pith citing papers · 2023

    Renrui Zhang, Jiaming Han, Aojun Zhou, Xi- angfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, and Yu Qiao

  92. Lrm: Large reconstruction model for single image to 3d, 18 Pith citing papers · 2023

    Y

  93. Mmmu: A massive multi-discipline multimodal understanding and reasoning benchmark for expert agi, 2024 18 Pith citing papers · 2024

    URL https://api

  94. MRKL systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning.arXiv preprint arXiv:2205.00445, 2022 18 Pith citing papers · 2022

    Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, et al

  95. Neural operator: Graph kernel network for partial differential equations.arXiv preprint arXiv:2003.03485, 2020 18 Pith citing papers · 2003

    Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhat- tacharya, Andrew Stuart, and Anima Anandkumar

  96. Pathvqa: 30000+ questions for medical visual question answering 18 Pith citing papers · 2003

    He, X

  97. Risks from learned optimization in advanced machine learning systems.arXiv preprint arXiv:1906.01820, 2019 18 Pith citing papers · 1906

    Evan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, and Scott Garrabrant

  98. Scaling latent reasoning via looped language models.arXiv preprint arXiv:2510.25741, 2025 18 Pith citing papers · 2025

    Rui-Jie Zhu, Zixuan Wang, Kai Hua, Tianyu Zhang, Ziniu Li, Haoran Que, Boyi Wei, Zixin Wen, Fan Yin, He Xing, et al

  99. Simcse: Simple contrastive learning of sentence embeddings, 18 Pith citing papers · 2021

    T

  100. The lottery ticket hypothesis: Finding sparse, trainable neural networks, 18 Pith citing papers · 2018

    J

  101. Vip: Towards universal visual reward and representation via value-implicit pre-training.arXiv preprint arXiv:2210.00030, 2022 18 Pith citing papers · 2022

    Yecheng Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Os- bert Bastani, Vikash Kumar, and Amy Zhang

  102. Write a recipe for chocolate cake 18 Pith citing papers · 2023

    Miles Turpin, Julian Michael, Ethan Perez, and Samuel R

  103. Auxiliary-loss-free load balancing strategy for mixture-of-experts.arXiv preprint arXiv:2408.15664, 2024 17 Pith citing papers · 2024

    Lean Wang, Huazuo Gao, Chenggang Zhao, Xu Sun, and Damai Dai

  104. Col- pali: Efficient document retrieval with vision language mod- els.arXiv preprint arXiv:2407.01449, 2024 17 Pith citing papers · 2024

    Manuel Faysse, Hugues Sibille, Tony Wu, Bilel Omrani, Gautier Viaud, C ´eline Hudelot, and Pierre Colombo

  105. Gui-r1: A generalist r1-style vision-language action model for gui agents.arXiv preprint arXiv:2504.10458, 2025 17 Pith citing papers · 2025

    Run Luo, Lu Wang, Wanwei He, Longze Chen, Jiaming Li, and Xiaobo Xia

  106. Molmoact: Action reasoning models that can reason in space.arXiv preprint arXiv:2508.07917, 2025 17 Pith citing papers · 2025

    Jason Lee, Jiafei Duan, Haoquan Fang, Yuquan Deng, Shuo Liu, Boyang Li, Bohan Fang, Jieyu Zhang, Yi Ru Wang, Sangho Lee, et al

  107. Robotic control via embodied chain-of-thought reasoning.arXiv preprint arXiv:2407.08693, 2024 17 Pith citing papers · 2024

    Michał Zawalski, William Chen, Karl Pertsch, Oier Mees, Chelsea Finn, and Sergey Levine

  108. Financebench: A new benchmark for financial question answering.arXiv preprint arXiv:2311.11944, 2023 17 Pith citing papers · 2023

    Pranab Islam, Anand Kannappan, Douwe Kiela, Rebecca Qian, Nino Scherrer, and Bertie Vidgen

  109. Gme: Improving universal multimodal retrieval by multimodal llms.arXiv preprint arXiv:2412.16855, 2024 17 Pith citing papers · 2024

    Xin Zhang, Yanzhao Zhang, Wen Xie, Mingxin Li, Ziqi Dai, Dingkun Long, Pengjun Xie, Meishan Zhang, Wenjie Li, and Min Zhang

  110. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 11170–11189, 2024 17 Pith citing papers · 2024

    Jiwoo Hong, Noah Lee, and James Thorne

  111. Latent visual reasoning.arXiv preprint arXiv:2509.24251, 2025 17 Pith citing papers · 2025

    Bangzheng Li, Ximeng Sun, Jiang Liu, Ze Wang, Jialian Wu, Xiaodong Yu, Hao Chen, Emad Barsoum, Muhao Chen, and Zicheng Liu

  112. Aaijet al.,LHCb detector performance, Int 17 Pith citing papers · 2015

    LHCb collaboration, R

  113. Egodex: Learning dexterous manipulation from large-scale egocentric video.arXiv preprint arXiv:2505.11709, 2025 17 Pith citing papers · 2025

    Ryan Hoque, Peide Huang, David J Yoon, Mouli Sivapurapu, and Jian Zhang

  114. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks 17 Pith citing papers · 2013

    Saxe, Andrew M

  115. From explicit cot to implicit cot: Learning to internalize cot step by step 17 Pith citing papers · 2024

    Yuntian Deng, Yejin Choi, and Stuart Shieber

  116. Generalizing verifiable instruction following.arXiv preprint arXiv:2507.02833, 17 Pith citing papers · 2025

    Valentina Pyatkin, Saumya Malik, Victoria Graf, Hamish Ivison, Shengyi Huang, Pradeep Dasigi, Nathan Lambert, and Hannaneh Hajishirzi

  117. GW231123: A Binary Black Hole Merger with Total Mass 190–265M ⊙, 17 Pith citing papers · 2025

    GW231123: a Binary Black Hole Merger with Total Mass 190-265M ⊙ (

  118. Internvid: A large-scale video-text dataset for multimodal understanding and generation 17 Pith citing papers · 2023

    Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, et al

  119. In the realm of the Hubble tension—a review of solutions, 17 Pith citing papers · 2021

    Eleonora Di Valentino, Olga Mena, Supriya Pan, Luca Visinelli, Weiqiang Yang, Alessandro Melchiorri, David F

  120. Matterport3d: Learning from rgb-d data in indoor environments, 17 Pith citing papers · 2017

    A

  121. Open- rlhf: An easy-to-use, scalable and high-performance rlhf framework.arXiv preprint arXiv:2405.11143, 17 Pith citing papers · 2024

    Jian Hu, Xibin Wu, Zilin Zhu, Xianyu, Weixun Wang, Dehao Zhang, and Yu Cao

  122. Pyramiddrop: Accelerating your large vision-language models via pyramid visual redundancy reduction.arXiv preprint arXiv:2410.17247, 2024 17 Pith citing papers · 2024

    Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, et al

  123. Qwen3-tts technical report.arXiv preprint arXiv:2601.15621, 2026 17 Pith citing papers · 2026

    Hangrui Hu, Xinfa Zhu, Ting He, Dake Guo, Bin Zhang, Xiong Wang, Zhifang Guo, Ziyue Jiang, Hongkun Hao, Zishan Guo, et al

  124. R1-onevision: Advancing gen- eralized multimodal reasoning through cross-modal formal- ization.arXiv preprint arXiv:2503.10615, 2025 17 Pith citing papers · 2025

    Yi Yang, Xiaoxuan He, Hongkun Pan, Xiyan Jiang, Yan Deng, Xingtao Yang, Haoyu Lu, Dacheng Yin, Fengyun Rao, Minfeng Zhu, et al

  125. Reasoning with exploration: An entropy perspective.arXiv preprint arXiv:2506.14758, 2025 17 Pith citing papers · 2025

    Daixuan Cheng, Shaohan Huang, Xuekai Zhu, Bo Dai, Wayne Xin Zhao, Zhenliang Zhang, and Furu Wei

  126. Simplevla-rl: Scaling vla training via reinforcement learning.arXiv preprint arXiv:2509.09674, 2025 17 Pith citing papers · 2025

    Li, H

  127. Spinquant: Llm quantization with learned rotations.arXiv preprint arXiv:2405.16406, 2024 17 Pith citing papers · 2024

    Zechun Liu, Changsheng Zhao, Igor Fedorov, Bilge Soran, Dhruv Choudhary, Raghuraman Krishnamoorthi, Vikas Chandra, Yuandong Tian, and Tijmen Blankevoort

  128. Videocrafter1: Open diffusion models for high-quality video generation, 17 Pith citing papers · 2023

    H

  129. Aligning large multimodal models with factually augmented RLHF 17 Pith citing papers · 2023

    Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Y an Gui, Y u-Xiong Wang, Yiming Y ang, Kurt Keutzer, and Trevor Dar- rell

  130. Aokiet al.(Flavour Lattice Averaging Group (FLAG)), FLAG review 2024, Phys 17 Pith citing papers · 2024

    Y

  131. A survey on large language model based autonomous agents 17 Pith citing papers · 2023

    [Wang et al

  132. Cogvlm: Visual expert for pretrained language models 17 Pith citing papers · 2023

    Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, et al

  133. Constraints on primordial black holes, 17 Pith citing papers · 2021

    B

  134. Cosyvoice: A scalable multi- lingual zero-shot text-to-speech synthesizer based on supervised semantic tokens 17 Pith citing papers · 2024

    Zhihao Du, Qian Chen, Shiliang Zhang, Kai Hu, Heng Lu, Yexin Yang, Hangrui Hu, Siqi Zheng, Yue Gu, Ziyang Ma, Zhifu Gao, and Zhijie Yan

  135. Deepeyesv2: Toward agentic multimodal model 17 Pith citing papers · 2025

    Hong, J

  136. Dexvla: Vision-language model with plug-in diffusion expert for general robot control.arXiv preprint arXiv:2502.05855, 2025 17 Pith citing papers · 2025

    Junjie Wen, Yichen Zhu, Jinming Li, Zhibin Tang, Chaomin Shen, and Feifei Feng

  137. Diffusion policies as an expressive policy class for offline reinforcement learning 17 Pith citing papers · 2022

    Zhendong Wang, Jonathan J Hunt, and Mingyuan Zhou

  138. Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps.Advances in neural information processing systems, 35:5775–5787, 2022a 17 Pith citing papers · 2023

    C

  139. Efficientnet: Rethinking model scaling for convolutional neural networks, 17 Pith citing papers · 1905

    M

  140. Extended dark energy analy- sis using DESI DR2 BAO measurements, 17 Pith citing papers · 2025

    K

  141. Finbert: Financial sentiment analysis with pre-trained language models 17 Pith citing papers · 1908

    Dogu Araci

  142. Gaia-2: A controllable multi-view generative world model for autonomous driving, 17 Pith citing papers · 2025

    L

  143. Gemini embedding: Generalizable embeddings from gemini.arXiv:2503.07891, 2025 17 Pith citing papers · 2025

    Jinhyuk Lee, Feiyang Chen, Sahil Dua, Daniel Cer, Madhuri Shanbhogue, Iftekhar Naim, Gustavo Hernández Ábrego, Zhe Li, Kaifeng Chen, Henrique Schechter Vera, Xiaoqi Ren, Shanfeng Zhang, Daniel Salz, Michael Boratko, Jay Han, Blair Chen, Shu

  144. GetDist: a Python package for analysing Monte Carlo samples, 17 Pith citing papers · 2019

    Antony Lewis, “GetDist: a Python package for analysing Monte Carlo samples,” (

  145. Griffin: Mixing gated linear recurrences with local attention for efficient language models, 17 Pith citing papers · 2024

    S

  146. GWTC-4.0: Tests of General Relativ- 10 ity. II. Parameterized Tests, 17 Pith citing papers · 2026

    A

  147. Helios: A 98-qubit trapped-ion quantum computer.arXiv preprint arXiv:2511.05465, 2025 17 Pith citing papers · 2025

    Anthony Ransford, MS Allman, Jake Arkinstall, JP Campora III, Samuel F Cooper, Robert D Delaney, Joan M Dreiling, Brian Estey, Caroline Figgatt, Alex Hall, et al

  148. Human motion dif- fusion model.arXiv preprint arXiv:2209.14916, 2022 17 Pith citing papers · 2022

    Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Daniel Cohen-Or, and Amit H Bermano

  149. Inference-time intervention: Eliciting truthful answers from a language model 17 Pith citing papers · 2023

    Kenneth Li, Oam Patel, Fernanda Viégas, Hanspeter Pfister, and Martin Wattenberg

  150. Justice or prejudice? quantifying biases in llm-as-a-judge 17 Pith citing papers · 2024

    Jiayi Ye, Yanbo Wang, Yue Huang, Dongping Chen, Qihui Zhang, Nuno Moniz, Tian Gao, Werner Geyer, Chao Huang, Pin-Yu Chen, Nitesh V Chawla, and Xiangliang Zhang

  151. Livebench: A challenging, contamination-free llm benchmark, 17 Pith citing papers · 2024

    C

  152. Longrope: Extending llm context window beyond 2 million tokens 17 Pith citing papers · 2024

    Yiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang, and Mao Yang

  153. Memagent: Re- shaping long-context llm with multi-conv rl-based mem- ory agent.arXiv preprint arXiv:2507.02259, 17 Pith citing papers · 2025

    Yu, H

  154. Mmada: Multimodal large diffusion language models.arXiv preprint arXiv:2505.15809, 17 Pith citing papers · 2025

    L

  155. Mteb: Massive text embedding benchmark 17 Pith citing papers · 2023

    Niklas Muennighoff, Nouamane Tazi, Loïc Magne, and Nils Reimers

  156. Primordial Black Holes as a dark matter candidate 17 Pith citing papers · 2021

    A

  157. Prompt infection: Llm-to-llm prompt injection within multi-agent systems, 17 Pith citing papers · 2024

    D

  158. REALM: retrieval-augmented language model pre-training 17 Pith citing papers · 2002

    Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, and Ming - Wei Chang

  159. Selfcheckgpt: Zero-resource black-box hallucination detection for generative large language models 17 Pith citing papers · 2023

    Potsawee Manakul, Adian Liusie, and Mark John Francis Gales

  160. Self-play fine- tuning converts weak language models to strong language models.arXiv preprint arXiv:2401.01335, 17 Pith citing papers · 2024

    Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, and Quanquan Gu

  161. Shor’s algorithm is possible with as few as 10,000 reconfigurable atomic qubits, 17 Pith citing papers · 2026

    Madelyn Cain, Qian Xu, Robbie King, Lewis R

  162. Soap: Improving and stabilizing shampoo using adam.arXiv preprint arXiv:2409.11321, 2024 17 Pith citing papers · 2024

    N

  163. Solving rubik’s cube with a robot hand, 17 Pith citing papers · 1910

    I

  164. Tests of General Relativity with the Binary Black Hole Signals from the LIGO-Virgo Catalog GWTC-1, 17 Pith citing papers · 2019

    B

  165. The Prompt Report: A Systematic Survey of Prompt Engineering Techniques 17 Pith citing papers · 2024

    Schulhoff S, Ilie M, Balepur N, Kahadze K, Liu A, Si C, et al

  166. Titans: Learning to 9 Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference memorize at test time.arXiv preprint arXiv:2501.00663, 17 Pith citing papers · 2024

    Behrouz, A

  167. Ultralight scalars as cosmological dark matter, 17 Pith citing papers · 2017

    L

  168. Modified Gravity Theories on a Nutshell: Inflation, Bounce and Late-time Evolution, 16 Pith citing papers · 2559 global citations · 2017

    S

  169. Visrag: Vision-based retrieval-augmented generation on multi-modality documents.arXiv preprint arXiv:2410.10594, 2024 16 Pith citing papers · 2024

    Shi Yu, Chaoyue Tang, Bokai Xu, Junbo Cui, Junhao Ran, Yukun Yan, Zhenghao Liu, Shuo Wang, Xu Han, Zhiyuan Liu, et al

  170. Fast-dllm: Training-free acceleration of diffusion LLM by enabling KV cache and parallel decoding.CoRR, abs/2505.22618, 2025 16 Pith citing papers · 2025

    Chengyue Wu, Hao Zhang, Shuchen Xue, Zhijian Liu, Shizhe Diao, Ligeng Zhu, Ping Luo, Song Han, and Enze Xie

  171. Fp8 formats for deep learning.arXiv preprint arXiv:2209.05433, 2022 16 Pith citing papers · 2022

    Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, et al

  172. RepoBench : Benchmarking repository-level code auto-completion systems 16 Pith citing papers · 2023

    Tianyang Liu, Canwen Xu, and Julian McAuley

  173. Spacer: Rein- forcing mllms in video spatial reasoning.arXiv preprint arXiv:2504.01805, 2025 16 Pith citing papers · 2025

    Kun Ouyang, Yuanxin Liu, Haoning Wu, Yi Liu, Hao Zhou, Jie Zhou, Fandong Meng, and Xu Sun

  174. Spurious rewards: Rethinking training signals in rlvr.arXiv preprint arXiv:2506.10947, 16 Pith citing papers · 2025

    Rulin Shao, Shuyue Stella Li, Rui Xin, Scott Geng, Yiping Wang, Sewoong Oh, Simon Shaolei Du, Nathan Lambert, Sewon Min, Ranjay Krishna, et al

  175. An empirical investigation of catastrophic forgetting in gradient-based neural networks.arXiv preprint arXiv:1312.6211, 16 Pith citing papers · 2013

    22 I

  176. Bayesian active learning for classification and preferenc e learning, 16 Pith citing papers · 2011

    N

  177. Clinicalbert: Modeling clinical notes and predicting hospital readmission 16 Pith citing papers · 2019

    Kexin Huang, Jaan Altosaar, and Rajesh Ranganath

  178. Cosmos-reason1: From physical common sense to embodied reasoning.arXiv preprint arXiv:2503.15558, 2025 16 Pith citing papers · 2025

    NVIDIA

  179. Deepmath-103k: A large-scale, challenging, de- contaminated, and verifiable mathematical dataset for advancing reasoning.arXiv preprint arXiv:2504.11456, 2025 16 Pith citing papers · 2025

    Zhiwei He, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xingyu Chen, Yue Wang, Linfeng Song, Dian Yu, Zhenwen Liang, Wenxuan Wang, et al

  180. Finite scalar quantization: Vq-vae made simple.arXiv preprint arXiv:2309.15505, 2023 16 Pith citing papers · 2024

    Fabian Mentzer, David Minnen, Eirikur Agustsson, and Michael Tschannen

  181. Genie envisioner: A unified world foundation platform for robotic manipulation.arXiv preprint arXiv:2508.05635, 2025 16 Pith citing papers · 2025

    Yue Liao, Pengfei Zhou, Siyuan Huang, Donglin Yang, Shengcong Chen, Yuxin Jiang, Yue Hu, Jingbin Cai, Si Liu, Jianlan Luo, Liliang Chen, Shuicheng Yan, Maoqing Yao, and Guanghui Ren

  182. Ghost in the minecraft: Generally capable agents for open-world enviroments via large language mod- els with text-based knowledge and memory 16 Pith citing papers · 2023

    Xizhou Zhu, Yuntao Chen, Hao Tian, Chenxin Tao, Weijie Su, Chenyu Yang, Gao Huang, Bin Li, Lewei Lu, Xiaogang Wang, et al

  183. Glm-4-voice: Towards intelligent and human-like end- to-end spoken chatbot, 16 Pith citing papers · 2024

    A

  184. Magpie: Alignment data synthesis from scratch by prompting aligned llms with nothing.arXiv preprint arXiv:2406.08464, 16 Pith citing papers · 2024

    Zhangchen Xu, Fengqing Jiang, Luyao Niu, Yuntian Deng, Radha Poovendran, Yejin Choi, and Bill Yuchen Lin

  185. Mme-realworld: Could your multimodal llm challenge high-resolution real-world scenarios that are difficult for humans?arXiv preprint arXiv:2408.13257, 2024 16 Pith citing papers · 2024

    Yi-Fan Zhang, Huanyu Zhang, Haochen Tian, Chaoyou Fu, Shuangqing Zhang, Junfei Wu, Feng Li, Kun Wang, Qingsong Wen, Zhang Zhang, et al

  186. Monst3r: A simple approach for estimat- ing geometry in the presence of motion.arXiv preprint arXiv:2410.03825, 2024 16 Pith citing papers · 2024

    Junyi Zhang, Charles Herrmann, Junhwa Hur, Varun Jam- pani, Trevor Darrell, Forrester Cole, Deqing Sun, and Ming- 10 Hsuan Yang

  187. nuPlan: A closed-loop ML- based planning benchmark for autonomous vehicles, 16 Pith citing papers · 2021

    H

  188. Pllava: Parameter-free llava extension from images to videos for video dense captioning, 16 Pith citing papers · 2024

    Lin Xu, Yilin Zhao, Daquan Zhou, Zhijie Lin, See Kiong Ng, and Jiashi Feng, “Pllava: Parameter-free llava extension from images to videos for video dense captioning,”arXiv preprint arXiv:2404

  189. Real-time execution of action chunking flow policies.arXiv preprint arXiv:2506.07339, 2025 16 Pith citing papers · 2025

    Kevin Black, Manuel Y Galliker, and Sergey Levine

  190. Securing AI agents with information-flow control 16 Pith citing papers · 2026

    Manuel Costa, Ahmed Salem, Aashish Kolluri, Boris Kopf, Shruti Tople, Andrew Paverd, Lukas Wutschitz, Mark Russinovich, and Santiago Zanella-Beguelin

  191. Seed-tts: A family of high-quality versatile speech generation models.arXiv preprint arXiv:2406.02430, 2024 16 Pith citing papers · 2024

    Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, et al

  192. Skill-inject: Measuring agent vulnerability to skill file attacks, 16 Pith citing papers · 2026

    D

  193. Video generators are robot policies.arXiv preprint arXiv:2508.00795, 2025 16 Pith citing papers · 2025

    Junbang Liang, Pavel Tokmakov, Ruoshi Liu, Sruthi Sudhakar, Paarth Shah, Rares Ambrus, and Carl V ondrick

  194. WASP: Benchmarking web agent security against prompt injection attacks 16 Pith citing papers · 2025

    Chaofan Li et al

  195. Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant for essay writing task 16 Pith citing papers · 2025

    Nataliya Kosmyna, Eugene Hauptmann, Ye Tong Yuan, Jessica Situ, Xian-Hao Liao, Ashly Vivian Beresnitzky, Iris Braunstein, and Pattie Maes

  196. Advancing open-source world models, 16 Pith citing papers · 2026

    R

  197. An empirical model of large-batch training, 2018, arXiv:1812.06162 http://arxiv.org/abs/arXiv:1812.06162 16 Pith citing papers · 2018

    Sam McCandlish, Jared Kaplan, Dario Amodei, and OpenAI Dota Team

  198. Black Hole Spectroscopy and Tests of General Relativity with GW250114, 16 Pith citing papers · 2026

    A

  199. Capabil- ities of GPT-4 on Medical Challenge Problems 16 Pith citing papers · 2023

    Nori, Harsha; King, Nicholas; McKinney, Scott Mayer; Carignan, Dean; Horvitz, Eric

  200. Classification with quantum neural networks on near term processors, 16 Pith citing papers · 2018

    Edward Farhi and Hartmut Neven