Wenhai Wang

Identifiers

  • name variant Wenhai Wang 0.60 · backfill

Papers (21)

  1. In-situ operation of amorphous circuits under heavy-ion irradiation cond-mat.mtrl-sci · 2026 · author #7
  2. Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning cs.AI · 2026 · author #9
  3. LLM-VA: Resolving the Jailbreak-Overrefusal Trade-off via Vector Alignment cs.LG · 2026 · author #5
  4. MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling cs.CL · 2025 · author #35
  5. GenExam: A Multidisciplinary Text-to-Image Exam cs.CV · 2025 · author #6
  6. InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency cs.CV · 2025 · author #74
  7. ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal cs.SE · 2025 · author #8
  8. ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows cs.AI · 2025 · author #18
  9. InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models cs.CV · 2025 · author #51
  10. MM-Eureka: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning cs.CV · 2025 · author #9
  11. InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling cs.CV · 2025 · author #13
  12. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024 · author #42
  13. Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization cs.CL · 2024 · author #3
  14. InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #19
  15. How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites cs.CV · 2024 · author #35
  16. InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks cs.CV · 2023 · author #3
  17. VideoChat: Chat-Centric Video Understanding cs.CV · 2023 · author #5
  18. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2019 · author #1
  19. Selective Kernel Networks cs.CV · 2019 · author #2
  20. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2018 · author #2
  21. Mixed Link Networks cs.LG · 2018 · author #1

Mentions

  • 2605.31206 #7 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2605.30039 #9 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2407.03320 #19 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2501.12386 #13 · arxiv_oai · confidence 0.70 Wenhai Wang
  • 2411.10442 #3 · arxiv_oai · confidence 0.70 Wenhai Wang

Frequent Coauthors