Yu-Gang Jiang

Identifiers

  • name variant Yu-Gang Jiang 0.60 · backfill

Papers (62)

  1. BraveGuard: From Open-World Threats to Safer Computer-Use Agents cs.CR · 2026 · author #16
  2. CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping cs.CV · 2026 · author #14
  3. VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models cs.RO · 2026 · author #6
  4. Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation cs.CV · 2026 · author #12
  5. Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance cs.RO · 2026 · author #9
  6. A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook cs.SD · 2026 · author #32
  7. Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling cs.CV · 2026 · author #7 · as printed: Yu-gang Jiang
  8. Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations cs.RO · 2026 · author #11
  9. TAME: Test-Time Adversarial Prompt Tuning via Mixture-of-Experts for Vision-Language Models cs.CV · 2026 · author #9
  10. DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models cs.CR · 2026 · author #10
  11. GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization cs.RO · 2026 · author #20
  12. World Action Models: The Next Frontier in Embodied AI cs.RO · 2026 · author #14
  13. Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval cs.CV · 2026 · author #4
  14. From Synthetic to Real: Toward Identity-Consistent Makeup Transfer with Synthetic and Real Data cs.CV · 2026 · author #5
  15. ML-Bench&Guard: Policy-Grounded Multilingual Safety Benchmark and Guardrail for Large Language Models cs.CL · 2026 · author #4
  16. CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026 · author #36
  17. Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses cs.CL · 2026 · author #11
  18. Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models cs.CV · 2026 · author #6
  19. SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning cs.CV · 2026 · author #7
  20. ROSE: Retrieval-Oriented Segmentation Enhancement cs.CV · 2026 · author #4
  21. HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models cs.RO · 2026 · author #11
  22. CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation cs.CV · 2026 · author #13
  23. AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly cs.RO · 2026 · author #6
  24. Steering the Verifiability of Multimodal AI Hallucinations cs.AI · 2026 · author #7
  25. Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses cs.CR · 2026 · author #38
  26. Robotic Grasping and Placement Controlled by EEG-Based Hybrid Visual and Motor Imagery cs.RO · 2026 · author #5
  27. SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents cs.CL · 2026 · author #20
  28. Memory in the Age of AI Agents cs.CL · 2025 · author #46
  29. Boosting Reasoning in Large Multimodal Models via Activation Replay cs.CV · 2025 · author #7
  30. Unify Robot Actions in Camera Frame cs.RO · 2025 · author #12
  31. LeakyCLIP: Extracting Training Data from CLIP cs.CR · 2025 · author #6
  32. Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #48
  33. Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #11
  34. Black-box Adversarial Attacks on Video Recognition Models cs.LG · 2019 · author #5
  35. A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization cs.LG · 2018 · author #5
  36. Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network cs.CV · 2018 · author #5
  37. Composite Binary Decomposition Networks cs.LG · 2018 · author #5
  38. Non-local NetVLAD Encoding for Video Classification cs.CV · 2018 · author #6
  39. Object Detection from Scratch with Deep Supervision cs.CV · 2018 · author #4
  40. NAIS: Neural Attentive Item Similarity Model for Recommendation cs.IR · 2018 · author #5
  41. Recurrent Fusion Network for Image Captioning cs.CV · 2018 · author #3
  42. Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks cs.CV · 2018 · author #6
  43. Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging cs.CV · 2018 · author #4
  44. Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images cs.CV · 2018 · author #6
  45. Learning to score the figure skating sports videos cs.MM · 2018 · author #5
  46. Pose-Normalized Image Generation for Person Re-identification cs.CV · 2017 · author #7
  47. Dual Skipping Networks cs.CV · 2017 · author #3
  48. Recent Advances in Zero-shot Recognition cs.CV · 2017 · author #3
  49. Multi-scale Deep Learning Architectures for Person Re-identification cs.CV · 2017 · author #3
  50. DSOD: Learning Deeply Supervised Object Detectors from Scratch cs.CV · 2017 · author #4
  51. Learning Fashion Compatibility with Bidirectional LSTMs cs.CV · 2017 · author #3
  52. Aggregating Frame-level Features for Large-Scale Video Classification cs.CV · 2017 · author #6
  53. Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification cs.MM · 2017 · author #1
  54. Weakly Supervised Dense Video Captioning cs.CV · 2017 · author #6
  55. Iterative Object and Part Transfer for Fine-Grained Recognition cs.CV · 2017 · author #2
  56. Deep Learning for Video Classification and Captioning cs.CV · 2016 · author #4
  57. The THUMOS Challenge on Action Recognition for Videos "in the Wild" cs.CV · 2016 · author #3
  58. Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization cs.CV · 2015 · author #3
  59. Fusing Multi-Stream Deep Networks for Video Classification cs.CV · 2015 · author #2
  60. Evaluating Two-Stream CNN for Video Classification cs.CV · 2015 · author #5
  61. Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification cs.CV · 2015 · author #3
  62. Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks cs.CV · 2015 · author #1

Mentions

  • 2605.12369 #20 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2606.01166 #16 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2602.12984 #20 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.30774 #14 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.29562 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.25195 #12 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.02900 #38 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.24203 #9 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2508.00756 #6 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.20266 #32 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.18868 #10 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.18599 #7 · arxiv_oai · confidence 0.70 Yu-gang Jiang
  • 2605.18059 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2605.17577 #9 · arxiv_oai · confidence 0.70 Yu-Gang Jiang
  • 2604.25850 #11 · arxiv_oai · confidence 0.70 Yu-Gang Jiang

Frequent Coauthors