Weinan Zhang
Identifiers
- name variant Weinan Zhang 0.60 · backfill
Papers (114)
- Clarus: Coordinating Autonomous Research Agents toward Web-Scale Scientific Collaboration cs.AI · 2026 · author #18
- EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies cs.RO · 2026 · author #24
- BALTO: Balanced Token-Level Policy Optimization for Hallucination Mitigation cs.CL · 2026 · author #10
- DiffCold: A Diffusion-based Generative Model for Cold-Start Item Recommendation cs.IR · 2026 · author #3
- SkillJuror: Measuring How Agent Skill Organization Changes Runtime Behavior cs.AI · 2026 · author #7
- AHA-WAM:Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing cs.RO · 2026 · author #9
- LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents cs.CL · 2026 · author #10
- SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents cs.AI · 2026 · author #5
- Autoregressive Diffusion World Models for Off-Policy Evaluation of LLM Agents cs.LG · 2026 · author #3
- Agent Planning Benchmark: A Diagnostic Framework for Planning Capabilities in LLM Agents cs.CL · 2026 · author #5
- SkillPager: Query-Adaptive Intra-Skill Navigation via Semantic Node Retrieval cs.IR · 2026 · author #4
- DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval cs.IR · 2026 · author #8
- Anticipate and Learn: Unleashing Idle-Time Compute in Proactive Agents cs.CL · 2026 · author #9
- Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents cs.CL · 2026 · author #6
- XSearch: Explainable Code Search via Concept-to-Code Alignment cs.SE · 2026 · author #8
- Contexting as Recommendation: Evolutionary Collaborative Filtering for Context Engineering cs.CL · 2026 · author #11
- SMMBench: A Benchmark for Source-Distributed Multimodal Agent Memory cs.CL · 2026 · author #10
- MMSkills: Towards Multimodal Skills for General Visual Agents cs.AI · 2026 · author #10
- SWE-Cycle: Benchmarking Code Agents across the Complete Issue Resolution Cycle cs.SE · 2026 · author #10
- Position: Agentic AI System Is a Foreseeable Pathway to AGI cs.AI · 2026 · author #5
- Holder Policy Optimisation cs.LG · 2026 · author #10
- Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents cs.CL · 2026 · author #8
- SkillMAS: Skill Co-Evolution with LLM-based Multi-Agent System cs.MA · 2026 · author #9
- MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs cs.AI · 2026 · author #7
- Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Models cs.CL · 2026 · author #4
- Relative braid group symmetries on quantum supersymmetric pairs of type sAIII math.QA · 2026 · author #2
- PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations cs.AI · 2026 · author #10
- Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations cs.IR · 2026 · author #7
- AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search cs.SE · 2026 · author #3
- Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering cs.SE · 2026 · author #21
- SW-$A^2$-Bench: Benchmarking Autonomous Software Agent Generation for Agentic Web cs.MA · 2026 · author #16
- Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization cs.AI · 2026 · author #8
- PEPA: a Persistently Autonomous Embodied Agent with Personalities cs.RO · 2026 · author #4
- Beyond Imitation: Reinforcement Learning-Based Sim-Real Co-Training for VLA Models cs.RO · 2026 · author #9
- Scalable and General Whole-Body Control for Cross-Humanoid Locomotion cs.RO · 2026 · author #9
- MonoScale: Scaling Multi-Agent System with Monotonic Improvement cs.MA · 2026 · author #4
- UniCon: A Unified System for Efficient Robot Learning Transfers cs.RO · 2026 · author #5
- Holos: A Web-Scale LLM-Based Multi-Agent System for the Agentic Web cs.AI · 2026 · author #23
- ACE-Router: Generalizing History-Aware Routing from MCP Tools to the Agent Web cs.AI · 2026 · author #7
- ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution cs.HC · 2026 · author #8
- Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning cs.CL · 2026 · author #6
- MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory cs.CL · 2026 · author #8
- ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling cs.AI · 2025 · author #11
- A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models cs.CL · 2025 · author #11
- Quantum Howe duality and Schur duality of type AIII math.QA · 2025 · author #1
- VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents cs.CL · 2025 · author #8
- Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents cs.CL · 2025 · author #7
- CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models in Mathematical Reasoning cs.CL · 2025 · author #8
- KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills cs.RO · 2025 · author #7
- MARFT: Multi-Agent Reinforcement Fine-Tuning cs.MA · 2025 · author #4
- MOTOR: Learning ID-free Item Representation with Token Crossing for Embedding-based Multimodal Recommendation cs.IR · 2024 · author #7
- DyDiff: Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement Learning cs.LG · 2024 · author #7
- Vision-Language Foundation Models as Effective Robot Imitators cs.RO · 2023 · author #9
- Triple-to-Text: Converting RDF Triples into High-Quality Natural Languages via Optimizing an Inverse KL Divergence cs.CL · 2019 · author #6
- Dynamically Fused Graph Network for Multi-hop Reasoning cs.CL · 2019 · author #6
- CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario cs.MA · 2019 · author #7
- Deep Landscape Forecasting for Real-time Bidding Advertising cs.IR · 2019 · author #5
- Lifelong Sequential Modeling with Personalized Memorization for User Response Prediction cs.IR · 2019 · author #4
- Towards Efficient and Unbiased Implementation of Lipschitz Continuity in GANs cs.LG · 2019 · author #4
- Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space cs.LG · 2019 · author #3
- Lipschitz Generative Adversarial Nets cs.LG · 2019 · author #6
- Large-scale Interactive Recommendation with Tree-structured Policy Gradient cs.LG · 2018 · author #4
- Layout Design for Intelligent Warehouse by Evolution with Fitness Approximation cs.AI · 2018 · author #5
- AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods cs.LG · 2018 · author #5
- Retrieval-Enhanced Adversarial Training for Neural Response Generation cs.CL · 2018 · author #3
- Deep Recurrent Survival Analysis cs.LG · 2018 · author #5
- Learning Multi-touch Conversion Attribution with Dual-attention Mechanisms for Online Advertising cs.IR · 2018 · author #3
- AceKG: A Large-scale Knowledge Graph for Academic Data Mining cs.IR · 2018 · author #6
- On the Equilibrium of Query Reformulation and Document Retrieval cs.IR · 2018 · author #4
- Understanding the Effectiveness of Lipschitz-Continuity in Generative Adversarial Nets cs.LG · 2018 · author #6
- Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data cs.IR · 2018 · author #3
- Deep Reinforcement Learning for Chinese Zero pronoun Resolution cs.CL · 2018 · author #3
- Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances cs.IR · 2018 · author #1
- Path-Level Network Transformation for Efficient Architecture Search cs.LG · 2018 · author #3
- Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition cs.CL · 2018 · author #5
- CoT: Cooperative Training for Generative Modeling of Discrete Data cs.LG · 2018 · author #5
- QA4IE: A Question Answering based Framework for Information Extraction cs.IR · 2018 · author #4
- A Machine Learning Approach To Prevent Malicious Calls Over Telephony Networks cs.CR · 2018 · author #7
- Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning cs.IR · 2018 · author #5
- Neural Text Generation: Past, Present and Beyond cs.CL · 2018 · author #3
- Collaborative Filtering with Graph-based Implicit Feedback cs.IR · 2018 · author #2
- Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising cs.GT · 2018 · author #2
- Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising stat.ML · 2018 · author #6
- Texygen: A Benchmarking Platform for Text Generation Models cs.CL · 2018 · author #5
- MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence cs.LG · 2017 · author #4
- A Neural Stochastic Volatility Model cs.LG · 2017 · author #2
- GraphGAN: Graph Representation Learning with Generative Adversarial Nets cs.LG · 2017 · author #5
- Improving Negative Sampling for Word Representation using Self-embedded Features cs.LG · 2017 · author #4
- Face Transfer with Generative Adversarial Network cs.CV · 2017 · author #3
- Long Text Generation via Adversarial Training with Leaked Information cs.CL · 2017 · author #4
- A Study of AI Population Dynamics with Million-agent Reinforcement Learning cs.AI · 2017 · author #5
- Inception Score, Label Smoothing, Gradient Vanishing and -log(D(x)) Alternative cs.LG · 2017 · author #2
- Efficient Architecture Search by Network Transformation cs.LG · 2017 · author #3
- Wasserstein Distance Guided Representation Learning for Domain Adaptation stat.ML · 2017 · author #3
- IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models cs.IR · 2017 · author #3
- Activation Maximization Generative Adversarial Nets cs.LG · 2017 · author #6
- Unsupervised Diverse Colorization via Generative Adversarial Networks cs.CV · 2017 · author #3
- Real-Time Bidding by Reinforcement Learning in Display Advertising cs.LG · 2017 · author #3
- Managing Risk of Bidding in Display Advertising cs.GT · 2017 · author #2
- Product-based Neural Networks for User Response Prediction cs.LG · 2016 · author #4
- Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting cs.GT · 2016 · author #2
- SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient cs.LG · 2016 · author #2
- Learning to Start for Sequence to Sequence Architecture cs.CL · 2016 · author #2
- Learning text representation using recurrent convolutional neural network with highway layers cs.CL · 2016 · author #2
- Generating and Exploiting Large-scale Pseudo Training Data for Zero Pronoun Resolution cs.CL · 2016 · author #4
- A Deep Neural Network for Chinese Zero Pronoun Resolution cs.CL · 2016 · author #2
- Feedback Control of Real-Time Display Advertising cs.GT · 2016 · author #1
- Optimal Real-Time Bidding Frameworks Discussion cs.GT · 2016 · author #1
- Implicit Look-alike Modelling in Display Ads: Transfer Collaborative Filtering to CTR Estimation cs.LG · 2016 · author #1
- Deep Learning over Multi-field Categorical Data: A Case Study on User Response Prediction cs.LG · 2016 · author #1
- Statistical Arbitrage Mining for Display Advertising cs.GT · 2015 · author #1
- An Empirical Study on Display Ad Impression Viewability Measurements cs.HC · 2015 · author #1
- Real-Time Bidding Benchmarking with iPinYou Dataset cs.GT · 2014 · author #1
- Feature-Based Matrix Factorization cs.AI · 2011 · author #4
Mentions
- 2506.12851 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.30246 #18 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.15893 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.18239 #24 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.12245 #3 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2410.19276 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.11543 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2602.05791 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.09811 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2604.04226 #16 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.06087 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2602.12628 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.05761 #5 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.05558 #3 · arxiv_oai · confidence 0.70 Weinan Zhang
- 1603.01055 #1 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.04874 #5 · arxiv_oai · confidence 0.70 Weinan Zhang
- 1506.03837 #1 · backfill · confidence 0.70 Weinan Zhang
- 1505.05788 #1 · backfill · confidence 0.70 Weinan Zhang
- 2605.13527 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.04638 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2606.00822 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2504.16129 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.31377 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
- 1407.7073 #1 · backfill · confidence 0.70 Weinan Zhang
- 2605.25971 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.12058 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2601.23219 #4 · arxiv_oai · confidence 0.70 Weinan Zhang
- 1109.2271 #4 · backfill · confidence 0.70 Weinan Zhang
- 2507.15698 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2405.19189 #7 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.16986 #6 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.09341 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.16046 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.15721 #11 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2605.15710 #10 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2601.03192 #8 · arxiv_oai · confidence 0.70 Weinan Zhang
- 2311.01378 #9 · arxiv_oai · confidence 0.70 Weinan Zhang
Frequent Coauthors
- Yong Yu 55 shared papers
- Jun Wang 40 shared papers
- Weiwen Liu 22 shared papers
- Jianghao Lin 20 shared papers
- Kan Ren 10 shared papers
- Han Cai 9 shared papers
- Zhiming Zhou 9 shared papers
- Jiachen Zhu 7 shared papers
- Yanru Qu 7 shared papers
- Junwei Liao 6 shared papers
- Lantao Yu 6 shared papers
- Rong Shan 6 shared papers
- Ting Liu 6 shared papers
- Ying Wen 6 shared papers
- Zihan Guo 6 shared papers
- Kangning Zhang 5 shared papers
- Lin Qiu 5 shared papers
- Muning Wen 5 shared papers
- Xingyu Lou 5 shared papers
- Yaoming Zhu 5 shared papers