foundational unreviewed works

The most-cited unreviewed works inside Pith’s own reviewed-paper corpus. This queue grows naturally as new papers are reviewed and their references are resolved.

Instruction tuning with GPT-4.arXiv preprint arXiv:2304.03277, 2023 21 Pith citing papers · 2023
Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley, and Jianfeng Gao
mplug-owl: Modularization empowers large lan- guage models with multimodality 21 Pith citing papers · 2023
Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, et al
Eureka: Human- level reward design via coding large language models, 20 Pith citing papers · 2023
Y
Transfer between modalities with metaqueries.arXiv preprint arXiv:2504.06256, 2025 20 Pith citing papers · 2025
Xichen Pan, Satya Narayan Shukla, Aashu Singh, Zhuokai Zhao, Shlok Kumar Mishra, Jialiang Wang, Zhiyang Xu, Jiuhai Chen, Kunpeng Li, Felix Juefei-Xu, et al
Aligning text-to-image models using human feedback, 20 Pith citing papers · 2023
K
Cobaya: Code for Bayesian Analysis of hierarchical physical models, 20 Pith citing papers · 2021
Jesus Torrado and Antony Lewis, “Cobaya: Code for Bayesian Analysis of hierarchical physical models,” JCAP 05, 057 (
Deep retinex decomposition for low-light enhancement 20 Pith citing papers · 2018
Chen Wei, Wenjing Wang, Wenhan Yang, and Jiaying Liu
MMDetection: Open mmlab detection toolbox and benchmark, 20 Pith citing papers · 1906
K
One billion word benchmark for measuring progress in statistical language modeling 20 Pith citing papers · 2014
Chelba, C
Td-mpc2: Scalable, robust world models for continuous control 20 Pith citing papers · 2023
Nicklas Hansen, Hao Su, and Xiaolong Wang
Vbench-2.0: Advancing video generation benchmark suite for intrinsic faithfulness 20 Pith citing papers · 2025
Dian Zheng, Ziqi Huang, Hongbo Liu, Kai Zou, Yinan He, Fan Zhang, Lulu Gu, Yuanhan Zhang, Jingwen He, Wei-Shi Zheng, et al
Aligning large multi-modal model with robust instruction tuning 20 Pith citing papers · 2023
Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, and Lijuan Wang
arXiv preprint arXiv:2309.05463 (2023) ExploreVLA 17 20 Pith citing papers · 2023
Li, Y
Averaging weights leads to wider optima and better generalization.arXiv preprint arXiv:1803.05407, 2018 20 Pith citing papers · 2018
Pavel Izmailov, Dmitrii Podoprikhin, Timur Garipov, Dmitry Vetrov, and Andrew Gordon Wilson
Fast and accurate deep network learning by exponential linear units (elus).arXiv preprint arXiv:1511.07289 , 20 Pith citing papers · 2015
Djork-Arné Clevert, Thomas Unterthiner, and Sepp Hochreiter
Glm-130b: An open bilingual pre-trained model 20 Pith citing papers · 2022
Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, et al
Improving alignment of dialogue agents via targeted human judgements 20 Pith citing papers · 2022
Amelia Glaese, Nat McAleese, Maja Tr˛ ebacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, et al
Llm+ p: Em- powering large language models with optimal planning proficiency 20 Pith citing papers · 2023
[Liu et al
Mle-bench: Evaluating machine learning agents on machine learning engineering 20 Pith citing papers · 2024
Jun Shern Chan, Neil Chowdhury, Oliver Jaffe, James Aung, Dane Sherburn, Evan Mays, Giulio Starace, Kevin Liu, Leon Maksin, Tejal Patwardhan, and 1 others
Tests of general relativity with GW150914, 20 Pith citing papers · 2016
B
Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model, 2022 20 Pith citing papers · 2022
Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song,
Video- llava: Learning united visual representation by alignment before projection 20 Pith citing papers · 2023
[Lin et al
Di Valentinoet al.(CosmoVerse Network), The Cos- moVerseWhitePaper: Addressingobservationaltensions in cosmology with systematics and fundamental physics, Phys 19 Pith citing papers · 2025
E
Llada2.0: Scaling up diffusion language models to 100b, 2025 19 Pith citing papers · 2025
Bie, T
Luoet al.(TianQin), TianQin: a space-borne gravitational wave detector, Classical Quantum Gravity33, 035010 (2016), arXiv:1512.02076 [astro-ph.IM] 19 Pith citing papers · 2016
J
SoK: Agentic skills–beyond tool use in LLM agents.arXiv preprint arXiv:2602.20867, 2026 19 Pith citing papers · 2026
Yanna Jiang, Delong Li, Haiyu Deng, Baihe Ma, Xu Wang, et al
Training agents inside of scalable world models, 19 Pith citing papers · 2025
D
Agentspec: Customizable runtime enforcement for safe and reliable llm agents 19 Pith citing papers · 2025
Haoyu Wang, Christopher M Poskitt, and Jun Sun
Bigcodebench: Bench- marking code generation with diverse function calls and complex instructions.arXiv preprint arXiv:2406.15877, 19 Pith citing papers · 2024
89 Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, et al
Codebleu: a method for automatic evaluation of code synthesis 19 Pith citing papers · 2020
Shuo Ren, Daya Guo, Shuai Lu, Long Zhou, Shujie Liu, Duyu Tang, Neel Sundaresan, Ming Zhou, Ambrosio Blanco, and Shuai Ma
Enhancing chat language models by scaling high-quality instructional conversations 19 Pith citing papers · 2023
Ding, N
Measuring short-form factuality in large language models 19 Pith citing papers · 2024
Jason Wei, Nguyen Karina, Hyung Won Chung, Yunxin Joy Jiao, Spencer Papay, Amelia Glaese, John Schulman, and William Fedus
Project aria: A new tool for egocentric multi-modal ai research.arXiv preprint arXiv:2308.13561, 2023 19 Pith citing papers · 2023
Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Mered- ith, et al
Reasoning models don’t always say what they think.arXiv preprint arXiv:2505.05410, 2025 19 Pith citing papers · 2025
Yanda Chen, Joe Benton, Ansh Radhakrishnan, Jonathan Uesato, Carson Denison, John Schul- man, Arushi Somani, Peter Hase, Misha Wagner, Fabien Roger, Vlad Mikulik, Samuel R Bowman, Jan Leike, Jared Kaplan, and Ethan Perez
R-zero: Self-evolving reasoning llm from zero data.arXiv preprint arXiv:2508.05004, 19 Pith citing papers · 2025
Chengsong Huang, Wenhao Yu, Xiaoyang Wang, Hongming Zhang, Zongxia Li, Ruosen Li, Jiaxin Huang, Haitao Mi, and Dong Yu
Tabpfn-2.5: Advancing the state of the art in tabular foundation models, 2025 19 Pith citing papers · 2025
Léo Grinsztajn, Klemens Flöge, Oscar Key, Felix Birkel, Philipp Jund, Brendan Roof, Benjamin Jäger, Dominik Safaric, Simone Alessi, Adrian Hayler, Mihir Manium, Rosen Yu, Felix Jablon- ski, Shi Bin Hoo, Anurag Garg, Jake Robertson, Magnus B
Universal manipula- tion interface: In-the-wild robot teaching without in-the- wild robots, 19 Pith citing papers · 2024
C
Accelerating uni- verses with scaling dark matter, 19 Pith citing papers · 2001
Michel Chevallier and David Polarski, “Accelerating uni- verses with scaling dark matter,” Int
Audio flamingo 3: Advancing audio intelligence with fully open large audio language models, 19 Pith citing papers · 2025
A
Defending against indirect prompt injection attacks with spotlighting 19 Pith citing papers · 2024
Keegan Hines et al
DYNESTY: a dynamic nested sampling package for estimating Bayesian posteriors and evidences, 19 Pith citing papers · 2020
J
Gated linear attention transformers with hardware-efficient training.arXiv preprint arXiv:2312.06635, 2023 19 Pith citing papers · 2023
Songlin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, and Yoon Kim
How much knowledge can you pack into the parameters of a language model? arXiv preprint arXiv:2002.08910, 2020 19 Pith citing papers · 2002
Roberts, A
Jailbreak attacks and defenses against large language models: A survey.arXiv preprint arXiv:2407.04295, 2024 19 Pith citing papers · 2024
Sibo Yi, Yule Liu, Zhen Sun, Tianshuo Cong, Xinlei He, Jiaxing Song, Ke Xu, and Qi Li
Jailbroken: How does llm safety training fail?, 19 Pith citing papers · 2023
A
Llada 1.5: Variance-reduced preference optimization for large language diffusion models.arXiv preprint arXiv:2505.19223, 2025 19 Pith citing papers · 2025
F
Mlvu: A comprehensive benchmark for multi-task long video understanding, 19 Pith citing papers · 2024
J
Mobile aloha: Learning bimanual mobile manipulation with low-cost whole-body teleoperation.arXiv preprint arXiv:2401.02117, 19 Pith citing papers · 2024
Zipeng Fu, Tony Z Zhao, and Chelsea Finn
Nv-embed: Improved techniques for training llms as generalist embedding models.arXiv preprint arXiv:2405.17428, 19 Pith citing papers · 2024
Chankyu Lee, Rajarshi Roy, Mengyao Xu, Jonathan Raiman, Mohammad Shoeybi, Bryan Catanzaro, and Wei Ping
Open problems and fundamental limitations of reinforcement learning from human feedback 19 Pith citing papers · 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, J ´er´emy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, et al
Thyme: Think beyond images.arXiv preprint arXiv:2508.11630, 2025 19 Pith citing papers · 2025
Yi-Fan Zhang, Xingyu Lu, Shukang Yin, Chaoyou Fu, Wei Chen, Xiao Hu, Bin Wen, Kaiyu Jiang, Changyi Liu, Tianke Zhang, et al
TTRL: Test-time reinforcement learning.arXiv preprint arXiv:2504.16084, 2025 19 Pith citing papers · 2025
Anonymous
Workarena: How capable are web agents at solving common knowledge work tasks? arXiv preprint arXiv:2403.07718, 19 Pith citing papers · 2024
Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H Laradji, Manuel Del Verme, Tom Marty, L ´eo Boisvert, Megh Thakkar, Quentin Cappart, David Vazquez, et al
Probing classifiers: Promises, shortcomings, and advances 18 Pith citing papers · 130 global citations · 2022
Yonatan Belinkov
Vl-rethinker: Incentivizing self-reflection of vision-language models with reinforcement learning.arXiv preprint arXiv:2504.08837, 2025 18 Pith citing papers · 2025
Haozhe Wang, Chao Qu, Zuming Huang, Wei Chu, Fangzhen Lin, and Wenhu Chen
Pengcheng He, Jianfeng Gao, and Weizhu Chen 18 Pith citing papers · 2021
Pengcheng He, Jianfeng Gao, and Weizhu Chen
Gptfuzzer: Red teaming large language models with auto-generated jailbreak prompts, 18 Pith citing papers · 2023
J
Mass- editing memory in a transformer.arXiv preprint arXiv:2210.07229, 2023 18 Pith citing papers · 2023
Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, and David Bau
MEM1: learning to synergize memory and reasoning for efficient long-horizon agents.CoRR, abs/2506.15841,2025 18 Pith citing papers · 2025
Zijian Zhou, Ao Qu, Zhaoxuan Wu, Sunghwan Kim, Alok Prakash, Daniela Rus, Jinhua Zhao, Bryan Kian Hsiang Low, and Paul Pu Liang
mimic-video: Video-action models for generalizable robot control beyond vlas.arXiv preprint 2512.15692, 2025 18 Pith citing papers · 2025
Jonas Pai, Liam Achenbach, Victoriano Montesinos, Benedek Forrai, Oier Mees, and Elvis Nava
Orca: Progressive learning from complex explanation traces of GPT-4.arXiv preprint arXiv:2306.02707, 18 Pith citing papers · 2023
Subhabrata Mukherjee, Arindam Mitra, Ganesh Jawahar, Sahaj Agarwal, Hamid Palangi, and Ahmed Awadallah
Pal: Program-aided language models 18 Pith citing papers · 2022
Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, and Graham Neubig
Scaling relationship on learning mathematical reasoning with large language models.arXiv preprint arXiv:2308.01825, 2023 18 Pith citing papers · 2023
Zheng Yuan, Hongyi Yuan, Chengpeng Li, Guanting Dong, Keming Lu, Chuanqi Tan, Chang Zhou, and Jingren Zhou
Seedance 1.5 pro: A native audio-visual joint generation foundation model.arXiv preprint arXiv:2512.13507, 2025 18 Pith citing papers · 2025
Team Seedance, Heyi Chen, Siyan Chen, Xin Chen, Yanfei Chen, Ying Chen, Zhuo Chen, Feng Cheng, Tianheng Cheng, Xinqi Cheng, et al
Zoedepth: Zero- shot transfer by combining relative and metric depth.arXiv preprint arXiv:2302.12288, 2023 18 Pith citing papers · 2023
Shariq Farooq Bhat, Reiner Birkl, Diana Wofk, Peter Wonka, and Matthias Müller
3d diffusion policy: Generalizable visuomotor policy learning via simple 3d representations, 18 Pith citing papers · 2024
Y
Akutsuet al.(KAGRA), Overview of KAGRA: Detector design and construction history, PTEP2021, 05A101 (2021), arXiv:2005.05574 [physics.ins-det] 18 Pith citing papers · 2021
T
arXiv preprint arXiv:2410.10629 (2024) PromptEvolver 19 18 Pith citing papers · 2024
Xie, E
A Unified Approach to Interpreting Model Predic- tions, November 2017 18 Pith citing papers · 2017
Scott Lundberg and Su-In Lee
DeepONet: Learning nonlinear operators for iden- tifying differential equations based on the universal approximation theorem of operators, 18 Pith citing papers · 2021
L
Feder Cooper, Daphne Ippolito, Christopher A 18 Pith citing papers · 2023
Milad Nasr, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A
From llm reasoning to autonomous ai agents: A comprehensive review.arXiv preprint arXiv:2504.19678, 18 Pith citing papers · 2025
Mohamed Amine Ferrag, Norbert Tihanyi, and Merouane Debbah
Internvla-m1: A spatially guided vision- language-action framework for generalist robot policy, 18 Pith citing papers · 2025
I
Leworldmodel: Stable end-to-end joint-embedding predictive archi- tecture from pixels, 18 Pith citing papers · 2026
L
Progress measures for grokking via mechanistic interpretability, 18 Pith citing papers · 2023
N
Ro- bodreamer: Learning compositional world models for robot imagination.arXiv preprint arXiv:2404.12377, 2024 18 Pith citing papers · 2024
Siyuan Zhou, Yilun Du, Jiaben Chen, Yandong Li, Dit-Yan Yeung, and Chuang Gan
Tabpfn: A transformer that solves small tabu- lar classification problems in a second, 18 Pith citing papers · 2023
N
Universal language model fine-tuning for text classification 18 Pith citing papers · 2018
Jeremy Howard and Sebastian Ruder
Vicreg: Variance-invariance-covariance regularization for self- supervised learning.arXiv preprint arXiv:2105.04906, 2021 18 Pith citing papers · 2021
Adrien Bardes, Jean Ponce, and Yann LeCun
Abbottet al.(LIGO Scientific, Virgo), Tests of general relativity with binary black holes from the second LIGO- Virgo gravitational-wave transient catalog, Phys 18 Pith citing papers · 2021
R
Active learning for convolutional neural networks: A core-set approach.arXiv preprint arXiv:1708.00489, 2017 18 Pith citing papers · 2017
Ozan Sener and Silvio Savarese
Agent skills in the wild: An empirical study of security vulnerabilities at scale, 18 Pith citing papers · 2026
Y
Blink: Multimodal large language models can see but not perceive 18 Pith citing papers · 2024
Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A Smith, Wei-Chiu Ma, and Ranjay Krishna
Coca: Con- trastive captioners are image-text foundation models 18 Pith citing papers · 2022
Jiahui Yu, Zirui Wang, Vijay Vasudevan, Legg Yeung, Mojtaba Seyedhosseini, and Yonghui Wu
Eagle: Speculative sampling requires rethinking feature uncertainty.arXiv preprint arXiv:2401.15077, 2024 18 Pith citing papers · 2024
Yuhui Li, Fangyun Wei, Chao Zhang, and Hongyang Zhang
ediff-i: Text-to-image diffusion models with ensemble of expert denoisers 18 Pith citing papers · 2022
Balaji, Y
Gen2act: Human video generation in novel scenarios enables generalizable robot manipulation.arXiv preprint arXiv:2409.16283, 2024 18 Pith citing papers · 2024
Homanga Bharadhwaj, Debidatta Dwibedi, Abhinav Gupta, Shubham Tulsiani, Carl Doer- sch, Ted Xiao, Dhruv Shah, Fei Xia, Dorsa Sadigh, and Sean Kirmani
Gsm-symbolic: Understanding the limitations of mathematical reasoning in large language models.arXiv preprint arXiv:2410.05229, 2024 18 Pith citing papers · 2024
Iman Mirzadeh, Keivan Alizadeh, Hooman Shahrokhi, Oncel Tuzel, Samy Bengio, and Mehrdad Farajtabar
GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants, 18 Pith citing papers · 2026
A
GWTC-4.0: Tests of General Relativity. I. Overview and General Tests, 18 Pith citing papers · 2026
A
Llama-adapter: Efficient fine-tuning of language models with zero-init at- tention 18 Pith citing papers · 2023
Renrui Zhang, Jiaming Han, Aojun Zhou, Xi- angfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, and Yu Qiao
Lrm: Large reconstruction model for single image to 3d, 18 Pith citing papers · 2023
Y
Mmmu: A massive multi-discipline multimodal understanding and reasoning benchmark for expert agi, 2024 18 Pith citing papers · 2024
URL https://api
MRKL systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning.arXiv preprint arXiv:2205.00445, 2022 18 Pith citing papers · 2022
Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, et al
Neural operator: Graph kernel network for partial differential equations.arXiv preprint arXiv:2003.03485, 2020 18 Pith citing papers · 2003
Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhat- tacharya, Andrew Stuart, and Anima Anandkumar
Pathvqa: 30000+ questions for medical visual question answering 18 Pith citing papers · 2003
He, X
Risks from learned optimization in advanced machine learning systems.arXiv preprint arXiv:1906.01820, 2019 18 Pith citing papers · 1906
Evan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, and Scott Garrabrant
Scaling latent reasoning via looped language models.arXiv preprint arXiv:2510.25741, 2025 18 Pith citing papers · 2025
Rui-Jie Zhu, Zixuan Wang, Kai Hua, Tianyu Zhang, Ziniu Li, Haoran Que, Boyi Wei, Zixin Wen, Fan Yin, He Xing, et al
Simcse: Simple contrastive learning of sentence embeddings, 18 Pith citing papers · 2021
T
The lottery ticket hypothesis: Finding sparse, trainable neural networks, 18 Pith citing papers · 2018
J
Vip: Towards universal visual reward and representation via value-implicit pre-training.arXiv preprint arXiv:2210.00030, 2022 18 Pith citing papers · 2022
Yecheng Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Os- bert Bastani, Vikash Kumar, and Amy Zhang
Write a recipe for chocolate cake 18 Pith citing papers · 2023
Miles Turpin, Julian Michael, Ethan Perez, and Samuel R
Auxiliary-loss-free load balancing strategy for mixture-of-experts.arXiv preprint arXiv:2408.15664, 2024 17 Pith citing papers · 2024
Lean Wang, Huazuo Gao, Chenggang Zhao, Xu Sun, and Damai Dai
Col- pali: Efficient document retrieval with vision language mod- els.arXiv preprint arXiv:2407.01449, 2024 17 Pith citing papers · 2024
Manuel Faysse, Hugues Sibille, Tony Wu, Bilel Omrani, Gautier Viaud, C ´eline Hudelot, and Pierre Colombo
Gui-r1: A generalist r1-style vision-language action model for gui agents.arXiv preprint arXiv:2504.10458, 2025 17 Pith citing papers · 2025
Run Luo, Lu Wang, Wanwei He, Longze Chen, Jiaming Li, and Xiaobo Xia
Molmoact: Action reasoning models that can reason in space.arXiv preprint arXiv:2508.07917, 2025 17 Pith citing papers · 2025
Jason Lee, Jiafei Duan, Haoquan Fang, Yuquan Deng, Shuo Liu, Boyang Li, Bohan Fang, Jieyu Zhang, Yi Ru Wang, Sangho Lee, et al
Robotic control via embodied chain-of-thought reasoning.arXiv preprint arXiv:2407.08693, 2024 17 Pith citing papers · 2024
Michał Zawalski, William Chen, Karl Pertsch, Oier Mees, Chelsea Finn, and Sergey Levine
Financebench: A new benchmark for financial question answering.arXiv preprint arXiv:2311.11944, 2023 17 Pith citing papers · 2023
Pranab Islam, Anand Kannappan, Douwe Kiela, Rebecca Qian, Nino Scherrer, and Bertie Vidgen
Gme: Improving universal multimodal retrieval by multimodal llms.arXiv preprint arXiv:2412.16855, 2024 17 Pith citing papers · 2024
Xin Zhang, Yanzhao Zhang, Wen Xie, Mingxin Li, Ziqi Dai, Dingkun Long, Pengjun Xie, Meishan Zhang, Wenjie Li, and Min Zhang
In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 11170–11189, 2024 17 Pith citing papers · 2024
Jiwoo Hong, Noah Lee, and James Thorne
Latent visual reasoning.arXiv preprint arXiv:2509.24251, 2025 17 Pith citing papers · 2025
Bangzheng Li, Ximeng Sun, Jiang Liu, Ze Wang, Jialian Wu, Xiaodong Yu, Hao Chen, Emad Barsoum, Muhao Chen, and Zicheng Liu
Aaijet al.,LHCb detector performance, Int 17 Pith citing papers · 2015
LHCb collaboration, R
Egodex: Learning dexterous manipulation from large-scale egocentric video.arXiv preprint arXiv:2505.11709, 2025 17 Pith citing papers · 2025
Ryan Hoque, Peide Huang, David J Yoon, Mouli Sivapurapu, and Jian Zhang
Exact solutions to the nonlinear dynamics of learning in deep linear neural networks 17 Pith citing papers · 2013
Saxe, Andrew M
From explicit cot to implicit cot: Learning to internalize cot step by step 17 Pith citing papers · 2024
Yuntian Deng, Yejin Choi, and Stuart Shieber
Generalizing verifiable instruction following.arXiv preprint arXiv:2507.02833, 17 Pith citing papers · 2025
Valentina Pyatkin, Saumya Malik, Victoria Graf, Hamish Ivison, Shengyi Huang, Pradeep Dasigi, Nathan Lambert, and Hannaneh Hajishirzi
GW231123: A Binary Black Hole Merger with Total Mass 190–265M ⊙, 17 Pith citing papers · 2025
GW231123: a Binary Black Hole Merger with Total Mass 190-265M ⊙ (
Internvid: A large-scale video-text dataset for multimodal understanding and generation 17 Pith citing papers · 2023
Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, et al
In the realm of the Hubble tension—a review of solutions, 17 Pith citing papers · 2021
Eleonora Di Valentino, Olga Mena, Supriya Pan, Luca Visinelli, Weiqiang Yang, Alessandro Melchiorri, David F
Matterport3d: Learning from rgb-d data in indoor environments, 17 Pith citing papers · 2017
A
Open- rlhf: An easy-to-use, scalable and high-performance rlhf framework.arXiv preprint arXiv:2405.11143, 17 Pith citing papers · 2024
Jian Hu, Xibin Wu, Zilin Zhu, Xianyu, Weixun Wang, Dehao Zhang, and Yu Cao
Pyramiddrop: Accelerating your large vision-language models via pyramid visual redundancy reduction.arXiv preprint arXiv:2410.17247, 2024 17 Pith citing papers · 2024
Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, et al
Qwen3-tts technical report.arXiv preprint arXiv:2601.15621, 2026 17 Pith citing papers · 2026
Hangrui Hu, Xinfa Zhu, Ting He, Dake Guo, Bin Zhang, Xiong Wang, Zhifang Guo, Ziyue Jiang, Hongkun Hao, Zishan Guo, et al
R1-onevision: Advancing gen- eralized multimodal reasoning through cross-modal formal- ization.arXiv preprint arXiv:2503.10615, 2025 17 Pith citing papers · 2025
Yi Yang, Xiaoxuan He, Hongkun Pan, Xiyan Jiang, Yan Deng, Xingtao Yang, Haoyu Lu, Dacheng Yin, Fengyun Rao, Minfeng Zhu, et al
Reasoning with exploration: An entropy perspective.arXiv preprint arXiv:2506.14758, 2025 17 Pith citing papers · 2025
Daixuan Cheng, Shaohan Huang, Xuekai Zhu, Bo Dai, Wayne Xin Zhao, Zhenliang Zhang, and Furu Wei
Simplevla-rl: Scaling vla training via reinforcement learning.arXiv preprint arXiv:2509.09674, 2025 17 Pith citing papers · 2025
Li, H
Spinquant: Llm quantization with learned rotations.arXiv preprint arXiv:2405.16406, 2024 17 Pith citing papers · 2024
Zechun Liu, Changsheng Zhao, Igor Fedorov, Bilge Soran, Dhruv Choudhary, Raghuraman Krishnamoorthi, Vikas Chandra, Yuandong Tian, and Tijmen Blankevoort
Videocrafter1: Open diffusion models for high-quality video generation, 17 Pith citing papers · 2023
H
Aligning large multimodal models with factually augmented RLHF 17 Pith citing papers · 2023
Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Y an Gui, Y u-Xiong Wang, Yiming Y ang, Kurt Keutzer, and Trevor Dar- rell
Aokiet al.(Flavour Lattice Averaging Group (FLAG)), FLAG review 2024, Phys 17 Pith citing papers · 2024
Y
A survey on large language model based autonomous agents 17 Pith citing papers · 2023
[Wang et al
Cogvlm: Visual expert for pretrained language models 17 Pith citing papers · 2023
Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, et al
Constraints on primordial black holes, 17 Pith citing papers · 2021
B
Cosyvoice: A scalable multi- lingual zero-shot text-to-speech synthesizer based on supervised semantic tokens 17 Pith citing papers · 2024
Zhihao Du, Qian Chen, Shiliang Zhang, Kai Hu, Heng Lu, Yexin Yang, Hangrui Hu, Siqi Zheng, Yue Gu, Ziyang Ma, Zhifu Gao, and Zhijie Yan
Deepeyesv2: Toward agentic multimodal model 17 Pith citing papers · 2025
Hong, J
Dexvla: Vision-language model with plug-in diffusion expert for general robot control.arXiv preprint arXiv:2502.05855, 2025 17 Pith citing papers · 2025
Junjie Wen, Yichen Zhu, Jinming Li, Zhibin Tang, Chaomin Shen, and Feifei Feng
Diffusion policies as an expressive policy class for ofﬂine reinforcement learning 17 Pith citing papers · 2022
Zhendong Wang, Jonathan J Hunt, and Mingyuan Zhou
Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps.Advances in neural information processing systems, 35:5775–5787, 2022a 17 Pith citing papers · 2023
C
Efficientnet: Rethinking model scaling for convolutional neural networks, 17 Pith citing papers · 1905
M
Extended dark energy analy- sis using DESI DR2 BAO measurements, 17 Pith citing papers · 2025
K
Finbert: Financial sentiment analysis with pre-trained language models 17 Pith citing papers · 1908
Dogu Araci
Gaia-2: A controllable multi-view generative world model for autonomous driving, 17 Pith citing papers · 2025
L
Gemini embedding: Generalizable embeddings from gemini.arXiv:2503.07891, 2025 17 Pith citing papers · 2025
Jinhyuk Lee, Feiyang Chen, Sahil Dua, Daniel Cer, Madhuri Shanbhogue, Iftekhar Naim, Gustavo Hernández Ábrego, Zhe Li, Kaifeng Chen, Henrique Schechter Vera, Xiaoqi Ren, Shanfeng Zhang, Daniel Salz, Michael Boratko, Jay Han, Blair Chen, Shu
GetDist: a Python package for analysing Monte Carlo samples, 17 Pith citing papers · 2019
Antony Lewis, “GetDist: a Python package for analysing Monte Carlo samples,” (
Griffin: Mixing gated linear recurrences with local attention for efficient language models, 17 Pith citing papers · 2024
S
GWTC-4.0: Tests of General Relativ- 10 ity. II. Parameterized Tests, 17 Pith citing papers · 2026
A
Helios: A 98-qubit trapped-ion quantum computer.arXiv preprint arXiv:2511.05465, 2025 17 Pith citing papers · 2025
Anthony Ransford, MS Allman, Jake Arkinstall, JP Campora III, Samuel F Cooper, Robert D Delaney, Joan M Dreiling, Brian Estey, Caroline Figgatt, Alex Hall, et al
Human motion dif- fusion model.arXiv preprint arXiv:2209.14916, 2022 17 Pith citing papers · 2022
Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Daniel Cohen-Or, and Amit H Bermano
Inference-time intervention: Eliciting truthful answers from a language model 17 Pith citing papers · 2023
Kenneth Li, Oam Patel, Fernanda Viégas, Hanspeter Pfister, and Martin Wattenberg
Justice or prejudice? quantifying biases in llm-as-a-judge 17 Pith citing papers · 2024
Jiayi Ye, Yanbo Wang, Yue Huang, Dongping Chen, Qihui Zhang, Nuno Moniz, Tian Gao, Werner Geyer, Chao Huang, Pin-Yu Chen, Nitesh V Chawla, and Xiangliang Zhang
Livebench: A challenging, contamination-free llm benchmark, 17 Pith citing papers · 2024
C
Longrope: Extending llm context window beyond 2 million tokens 17 Pith citing papers · 2024
Yiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang, and Mao Yang
Memagent: Re- shaping long-context llm with multi-conv rl-based mem- ory agent.arXiv preprint arXiv:2507.02259, 17 Pith citing papers · 2025
Yu, H
Mmada: Multimodal large diffusion language models.arXiv preprint arXiv:2505.15809, 17 Pith citing papers · 2025
L
Mteb: Massive text embedding benchmark 17 Pith citing papers · 2023
Niklas Muennighoff, Nouamane Tazi, Loïc Magne, and Nils Reimers
Primordial Black Holes as a dark matter candidate 17 Pith citing papers · 2021
A
Prompt infection: Llm-to-llm prompt injection within multi-agent systems, 17 Pith citing papers · 2024
D
REALM: retrieval-augmented language model pre-training 17 Pith citing papers · 2002
Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, and Ming - Wei Chang
Selfcheckgpt: Zero-resource black-box hallucination detection for generative large language models 17 Pith citing papers · 2023
Potsawee Manakul, Adian Liusie, and Mark John Francis Gales
Self-play fine- tuning converts weak language models to strong language models.arXiv preprint arXiv:2401.01335, 17 Pith citing papers · 2024
Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, and Quanquan Gu
Shor’s algorithm is possible with as few as 10,000 reconfigurable atomic qubits, 17 Pith citing papers · 2026
Madelyn Cain, Qian Xu, Robbie King, Lewis R
Soap: Improving and stabilizing shampoo using adam.arXiv preprint arXiv:2409.11321, 2024 17 Pith citing papers · 2024
N
Solving rubik’s cube with a robot hand, 17 Pith citing papers · 1910
I
Tests of General Relativity with the Binary Black Hole Signals from the LIGO-Virgo Catalog GWTC-1, 17 Pith citing papers · 2019
B
The Prompt Report: A Systematic Survey of Prompt Engineering Techniques 17 Pith citing papers · 2024
Schulhoff S, Ilie M, Balepur N, Kahadze K, Liu A, Si C, et al
Titans: Learning to 9 Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference memorize at test time.arXiv preprint arXiv:2501.00663, 17 Pith citing papers · 2024
Behrouz, A
Ultralight scalars as cosmological dark matter, 17 Pith citing papers · 2017
L
Modified Gravity Theories on a Nutshell: Inflation, Bounce and Late-time Evolution, 16 Pith citing papers · 2559 global citations · 2017
S
Visrag: Vision-based retrieval-augmented generation on multi-modality documents.arXiv preprint arXiv:2410.10594, 2024 16 Pith citing papers · 2024
Shi Yu, Chaoyue Tang, Bokai Xu, Junbo Cui, Junhao Ran, Yukun Yan, Zhenghao Liu, Shuo Wang, Xu Han, Zhiyuan Liu, et al
Fast-dllm: Training-free acceleration of diffusion LLM by enabling KV cache and parallel decoding.CoRR, abs/2505.22618, 2025 16 Pith citing papers · 2025
Chengyue Wu, Hao Zhang, Shuchen Xue, Zhijian Liu, Shizhe Diao, Ligeng Zhu, Ping Luo, Song Han, and Enze Xie
Fp8 formats for deep learning.arXiv preprint arXiv:2209.05433, 2022 16 Pith citing papers · 2022
Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, et al
RepoBench : Benchmarking repository-level code auto-completion systems 16 Pith citing papers · 2023
Tianyang Liu, Canwen Xu, and Julian McAuley
Spacer: Rein- forcing mllms in video spatial reasoning.arXiv preprint arXiv:2504.01805, 2025 16 Pith citing papers · 2025
Kun Ouyang, Yuanxin Liu, Haoning Wu, Yi Liu, Hao Zhou, Jie Zhou, Fandong Meng, and Xu Sun
Spurious rewards: Rethinking training signals in rlvr.arXiv preprint arXiv:2506.10947, 16 Pith citing papers · 2025
Rulin Shao, Shuyue Stella Li, Rui Xin, Scott Geng, Yiping Wang, Sewoong Oh, Simon Shaolei Du, Nathan Lambert, Sewon Min, Ranjay Krishna, et al
An empirical investigation of catastrophic forgetting in gradient-based neural networks.arXiv preprint arXiv:1312.6211, 16 Pith citing papers · 2013
22 I
Bayesian active learning for classiﬁcation and preferenc e learning, 16 Pith citing papers · 2011
N
Clinicalbert: Modeling clinical notes and predicting hospital readmission 16 Pith citing papers · 2019
Kexin Huang, Jaan Altosaar, and Rajesh Ranganath
Cosmos-reason1: From physical common sense to embodied reasoning.arXiv preprint arXiv:2503.15558, 2025 16 Pith citing papers · 2025
NVIDIA
Deepmath-103k: A large-scale, challenging, de- contaminated, and verifiable mathematical dataset for advancing reasoning.arXiv preprint arXiv:2504.11456, 2025 16 Pith citing papers · 2025
Zhiwei He, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xingyu Chen, Yue Wang, Linfeng Song, Dian Yu, Zhenwen Liang, Wenxuan Wang, et al
Finite scalar quantization: Vq-vae made simple.arXiv preprint arXiv:2309.15505, 2023 16 Pith citing papers · 2024
Fabian Mentzer, David Minnen, Eirikur Agustsson, and Michael Tschannen
Genie envisioner: A unified world foundation platform for robotic manipulation.arXiv preprint arXiv:2508.05635, 2025 16 Pith citing papers · 2025
Yue Liao, Pengfei Zhou, Siyuan Huang, Donglin Yang, Shengcong Chen, Yuxin Jiang, Yue Hu, Jingbin Cai, Si Liu, Jianlan Luo, Liliang Chen, Shuicheng Yan, Maoqing Yao, and Guanghui Ren
Ghost in the minecraft: Generally capable agents for open-world enviroments via large language mod- els with text-based knowledge and memory 16 Pith citing papers · 2023
Xizhou Zhu, Yuntao Chen, Hao Tian, Chenxin Tao, Weijie Su, Chenyu Yang, Gao Huang, Bin Li, Lewei Lu, Xiaogang Wang, et al
Glm-4-voice: Towards intelligent and human-like end- to-end spoken chatbot, 16 Pith citing papers · 2024
A
Magpie: Alignment data synthesis from scratch by prompting aligned llms with nothing.arXiv preprint arXiv:2406.08464, 16 Pith citing papers · 2024
Zhangchen Xu, Fengqing Jiang, Luyao Niu, Yuntian Deng, Radha Poovendran, Yejin Choi, and Bill Yuchen Lin
Mme-realworld: Could your multimodal llm challenge high-resolution real-world scenarios that are difficult for humans?arXiv preprint arXiv:2408.13257, 2024 16 Pith citing papers · 2024
Yi-Fan Zhang, Huanyu Zhang, Haochen Tian, Chaoyou Fu, Shuangqing Zhang, Junfei Wu, Feng Li, Kun Wang, Qingsong Wen, Zhang Zhang, et al
Monst3r: A simple approach for estimat- ing geometry in the presence of motion.arXiv preprint arXiv:2410.03825, 2024 16 Pith citing papers · 2024
Junyi Zhang, Charles Herrmann, Junhwa Hur, Varun Jam- pani, Trevor Darrell, Forrester Cole, Deqing Sun, and Ming- 10 Hsuan Yang
nuPlan: A closed-loop ML- based planning benchmark for autonomous vehicles, 16 Pith citing papers · 2021
H
Pllava: Parameter-free llava extension from images to videos for video dense captioning, 16 Pith citing papers · 2024
Lin Xu, Yilin Zhao, Daquan Zhou, Zhijie Lin, See Kiong Ng, and Jiashi Feng, “Pllava: Parameter-free llava extension from images to videos for video dense captioning,”arXiv preprint arXiv:2404
Real-time execution of action chunking flow policies.arXiv preprint arXiv:2506.07339, 2025 16 Pith citing papers · 2025
Kevin Black, Manuel Y Galliker, and Sergey Levine
Securing AI agents with information-flow control 16 Pith citing papers · 2026
Manuel Costa, Ahmed Salem, Aashish Kolluri, Boris Kopf, Shruti Tople, Andrew Paverd, Lukas Wutschitz, Mark Russinovich, and Santiago Zanella-Beguelin
Seed-tts: A family of high-quality versatile speech generation models.arXiv preprint arXiv:2406.02430, 2024 16 Pith citing papers · 2024
Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, et al
Skill-inject: Measuring agent vulnerability to skill file attacks, 16 Pith citing papers · 2026
D
Video generators are robot policies.arXiv preprint arXiv:2508.00795, 2025 16 Pith citing papers · 2025
Junbang Liang, Pavel Tokmakov, Ruoshi Liu, Sruthi Sudhakar, Paarth Shah, Rares Ambrus, and Carl V ondrick
WASP: Benchmarking web agent security against prompt injection attacks 16 Pith citing papers · 2025
Chaofan Li et al
Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant for essay writing task 16 Pith citing papers · 2025
Nataliya Kosmyna, Eugene Hauptmann, Ye Tong Yuan, Jessica Situ, Xian-Hao Liao, Ashly Vivian Beresnitzky, Iris Braunstein, and Pattie Maes
Advancing open-source world models, 16 Pith citing papers · 2026
R
An empirical model of large-batch training, 2018, arXiv:1812.06162 http://arxiv.org/abs/arXiv:1812.06162 16 Pith citing papers · 2018
Sam McCandlish, Jared Kaplan, Dario Amodei, and OpenAI Dota Team
Black Hole Spectroscopy and Tests of General Relativity with GW250114, 16 Pith citing papers · 2026
A
Capabil- ities of GPT-4 on Medical Challenge Problems 16 Pith citing papers · 2023
Nori, Harsha; King, Nicholas; McKinney, Scott Mayer; Carignan, Dean; Horvitz, Eric
Classification with quantum neural networks on near term processors, 16 Pith citing papers · 2018
Edward Farhi and Hartmut Neven