Weinan Zhang

Identifiers

  • name variant Weinan Zhang 0.60 · backfill

Papers (114)

  1. Clarus: Coordinating Autonomous Research Agents toward Web-Scale Scientific Collaboration cs.AI · 2026 · author #18
  2. EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies cs.RO · 2026 · author #24
  3. BALTO: Balanced Token-Level Policy Optimization for Hallucination Mitigation cs.CL · 2026 · author #10
  4. DiffCold: A Diffusion-based Generative Model for Cold-Start Item Recommendation cs.IR · 2026 · author #3
  5. SkillJuror: Measuring How Agent Skill Organization Changes Runtime Behavior cs.AI · 2026 · author #7
  6. AHA-WAM: Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing cs.RO · 2026 · author #9
  7. LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents cs.CL · 2026 · author #10
  8. SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents cs.AI · 2026 · author #5
  9. Autoregressive Diffusion World Models for Off-Policy Evaluation of LLM Agents cs.LG · 2026 · author #3
  10. Agent Planning Benchmark: A Diagnostic Framework for Planning Capabilities in LLM Agents cs.CL · 2026 · author #5
  11. SkillPager: Query-Adaptive Intra-Skill Navigation via Semantic Node Retrieval cs.IR · 2026 · author #4
  12. DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval cs.IR · 2026 · author #8
  13. Anticipate and Learn: Unleashing Idle-Time Compute in Proactive Agents cs.CL · 2026 · author #9
  14. Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents cs.CL · 2026 · author #6
  15. XSearch: Explainable Code Search via Concept-to-Code Alignment cs.SE · 2026 · author #8
  16. Contexting as Recommendation: Evolutionary Collaborative Filtering for Context Engineering cs.CL · 2026 · author #11
  17. SMMBench: A Benchmark for Source-Distributed Multimodal Agent Memory cs.CL · 2026 · author #10
  18. MMSkills: Towards Multimodal Skills for General Visual Agents cs.AI · 2026 · author #10
  19. SWE-Cycle: Benchmarking Code Agents across the Complete Issue Resolution Cycle cs.SE · 2026 · author #10
  20. Position: Agentic AI System Is a Foreseeable Pathway to AGI cs.AI · 2026 · author #5
  21. Holder Policy Optimisation cs.LG · 2026 · author #10
  22. Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents cs.CL · 2026 · author #8
  23. SkillMAS: Skill Co-Evolution with LLM-based Multi-Agent System cs.MA · 2026 · author #9
  24. MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs cs.AI · 2026 · author #7
  25. Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Models cs.CL · 2026 · author #4
  26. Relative braid group symmetries on quantum supersymmetric pairs of type sAIII math.QA · 2026 · author #2
  27. PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations cs.AI · 2026 · author #10
  28. Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations cs.IR · 2026 · author #7
  29. AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search cs.SE · 2026 · author #3
  30. Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering cs.SE · 2026 · author #21
  31. SW-$A^2$-Bench: Benchmarking Autonomous Software Agent Generation for Agentic Web cs.MA · 2026 · author #16
  32. Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization cs.AI · 2026 · author #8
  33. PEPA: a Persistently Autonomous Embodied Agent with Personalities cs.RO · 2026 · author #4
  34. Beyond Imitation: Reinforcement Learning-Based Sim-Real Co-Training for VLA Models cs.RO · 2026 · author #9
  35. Scalable and General Whole-Body Control for Cross-Humanoid Locomotion cs.RO · 2026 · author #9
  36. MonoScale: Scaling Multi-Agent System with Monotonic Improvement cs.MA · 2026 · author #4
  37. UniCon: A Unified System for Efficient Robot Learning Transfers cs.RO · 2026 · author #5
  38. Holos: A Web-Scale LLM-Based Multi-Agent System for the Agentic Web cs.AI · 2026 · author #23
  39. ACE-Router: Generalizing History-Aware Routing from MCP Tools to the Agent Web cs.AI · 2026 · author #7
  40. ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution cs.HC · 2026 · author #8
  41. Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning cs.CL · 2026 · author #6
  42. MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory cs.CL · 2026 · author #8
  43. ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling cs.AI · 2025 · author #11
  44. A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models cs.CL · 2025 · author #11
  45. Quantum Howe duality and Schur duality of type AIII math.QA · 2025 · author #1
  46. VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents cs.CL · 2025 · author #8
  47. Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents cs.CL · 2025 · author #7
  48. CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models in Mathematical Reasoning cs.CL · 2025 · author #8
  49. KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills cs.RO · 2025 · author #7
  50. MARFT: Multi-Agent Reinforcement Fine-Tuning cs.MA · 2025 · author #4
  51. MOTOR: Learning ID-free Item Representation with Token Crossing for Embedding-based Multimodal Recommendation cs.IR · 2024 · author #7
  52. DyDiff: Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement Learning cs.LG · 2024 · author #7
  53. Vision-Language Foundation Models as Effective Robot Imitators cs.RO · 2023 · author #9
  54. Triple-to-Text: Converting RDF Triples into High-Quality Natural Languages via Optimizing an Inverse KL Divergence cs.CL · 2019 · author #6
  55. Dynamically Fused Graph Network for Multi-hop Reasoning cs.CL · 2019 · author #6
  56. CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario cs.MA · 2019 · author #7
  57. Deep Landscape Forecasting for Real-time Bidding Advertising cs.IR · 2019 · author #5
  58. Lifelong Sequential Modeling with Personalized Memorization for User Response Prediction cs.IR · 2019 · author #4
  59. Towards Efficient and Unbiased Implementation of Lipschitz Continuity in GANs cs.LG · 2019 · author #4
  60. Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space cs.LG · 2019 · author #3
  61. Lipschitz Generative Adversarial Nets cs.LG · 2019 · author #6
  62. Large-scale Interactive Recommendation with Tree-structured Policy Gradient cs.LG · 2018 · author #4
  63. Layout Design for Intelligent Warehouse by Evolution with Fitness Approximation cs.AI · 2018 · author #5
  64. AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods cs.LG · 2018 · author #5
  65. Retrieval-Enhanced Adversarial Training for Neural Response Generation cs.CL · 2018 · author #3
  66. Deep Recurrent Survival Analysis cs.LG · 2018 · author #5
  67. Learning Multi-touch Conversion Attribution with Dual-attention Mechanisms for Online Advertising cs.IR · 2018 · author #3
  68. AceKG: A Large-scale Knowledge Graph for Academic Data Mining cs.IR · 2018 · author #6
  69. On the Equilibrium of Query Reformulation and Document Retrieval cs.IR · 2018 · author #4
  70. Understanding the Effectiveness of Lipschitz-Continuity in Generative Adversarial Nets cs.LG · 2018 · author #6
  71. Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data cs.IR · 2018 · author #3
  72. Deep Reinforcement Learning for Chinese Zero pronoun Resolution cs.CL · 2018 · author #3
  73. Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances cs.IR · 2018 · author #1
  74. Path-Level Network Transformation for Efficient Architecture Search cs.LG · 2018 · author #3
  75. Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition cs.CL · 2018 · author #5
  76. CoT: Cooperative Training for Generative Modeling of Discrete Data cs.LG · 2018 · author #5
  77. QA4IE: A Question Answering based Framework for Information Extraction cs.IR · 2018 · author #4
  78. A Machine Learning Approach To Prevent Malicious Calls Over Telephony Networks cs.CR · 2018 · author #7
  79. Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning cs.IR · 2018 · author #5
  80. Neural Text Generation: Past, Present and Beyond cs.CL · 2018 · author #3
  81. Collaborative Filtering with Graph-based Implicit Feedback cs.IR · 2018 · author #2
  82. Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising cs.GT · 2018 · author #2
  83. Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising stat.ML · 2018 · author #6
  84. Texygen: A Benchmarking Platform for Text Generation Models cs.CL · 2018 · author #5
  85. MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence cs.LG · 2017 · author #4
  86. A Neural Stochastic Volatility Model cs.LG · 2017 · author #2
  87. GraphGAN: Graph Representation Learning with Generative Adversarial Nets cs.LG · 2017 · author #5
  88. Improving Negative Sampling for Word Representation using Self-embedded Features cs.LG · 2017 · author #4
  89. Face Transfer with Generative Adversarial Network cs.CV · 2017 · author #3
  90. Long Text Generation via Adversarial Training with Leaked Information cs.CL · 2017 · author #4
  91. A Study of AI Population Dynamics with Million-agent Reinforcement Learning cs.AI · 2017 · author #5
  92. Inception Score, Label Smoothing, Gradient Vanishing and -log(D(x)) Alternative cs.LG · 2017 · author #2
  93. Efficient Architecture Search by Network Transformation cs.LG · 2017 · author #3
  94. Wasserstein Distance Guided Representation Learning for Domain Adaptation stat.ML · 2017 · author #3
  95. IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models cs.IR · 2017 · author #3
  96. Activation Maximization Generative Adversarial Nets cs.LG · 2017 · author #6
  97. Unsupervised Diverse Colorization via Generative Adversarial Networks cs.CV · 2017 · author #3
  98. Real-Time Bidding by Reinforcement Learning in Display Advertising cs.LG · 2017 · author #3
  99. Managing Risk of Bidding in Display Advertising cs.GT · 2017 · author #2
  100. Product-based Neural Networks for User Response Prediction cs.LG · 2016 · author #4
  101. Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting cs.GT · 2016 · author #2
  102. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient cs.LG · 2016 · author #2
  103. Learning to Start for Sequence to Sequence Architecture cs.CL · 2016 · author #2
  104. Learning text representation using recurrent convolutional neural network with highway layers cs.CL · 2016 · author #2
  105. Generating and Exploiting Large-scale Pseudo Training Data for Zero Pronoun Resolution cs.CL · 2016 · author #4
  106. A Deep Neural Network for Chinese Zero Pronoun Resolution cs.CL · 2016 · author #2
  107. Feedback Control of Real-Time Display Advertising cs.GT · 2016 · author #1
  108. Optimal Real-Time Bidding Frameworks Discussion cs.GT · 2016 · author #1
  109. Implicit Look-alike Modelling in Display Ads: Transfer Collaborative Filtering to CTR Estimation cs.LG · 2016 · author #1
  110. Deep Learning over Multi-field Categorical Data: A Case Study on User Response Prediction cs.LG · 2016 · author #1
  111. Statistical Arbitrage Mining for Display Advertising cs.GT · 2015 · author #1
  112. An Empirical Study on Display Ad Impression Viewability Measurements cs.HC · 2015 · author #1
  113. Real-Time Bidding Benchmarking with iPinYou Dataset cs.GT · 2014 · author #1
  114. Feature-Based Matrix Factorization cs.AI · 2011 · author #4

Mentions

  • 2506.12851 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.30246 #18 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.15893 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.18239 #24 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.12245 #3 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2410.19276 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.11543 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2602.05791 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.09811 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2604.04226 #16 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.06087 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2602.12628 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.05761 #5 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.05558 #3 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 1603.01055 #1 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.04874 #5 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 1506.03837 #1 · backfill · confidence 0.70 Weinan Zhang
  • 1505.05788 #1 · backfill · confidence 0.70 Weinan Zhang
  • 2605.13527 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.04638 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2606.00822 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2504.16129 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.31377 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 1407.7073 #1 · backfill · confidence 0.70 Weinan Zhang
  • 2605.25971 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.12058 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2601.23219 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 1109.2271 #4 · backfill · confidence 0.70 Weinan Zhang
  • 2507.15698 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2405.19189 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.16986 #6 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.09341 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.16046 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.15721 #11 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2605.15710 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2601.03192 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
  • 2311.01378 #9 · arxiv_oai · confidence 0.70 Weinan Zhang

Frequent Coauthors