pith. sign in

Guanglu Song

Identifiers

  • name variant Guanglu Song 0.60 · backfill

Papers (44)

  1. Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation cs.CV · 2026 · author #7
  2. AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization cs.CV · 2026 · author #6
  3. VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping cs.CV · 2024 · author #4
  4. EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM cs.CV · 2024 · author #4
  5. See Further When Clear: Curriculum Consistency Model cs.CV · 2024 · author #5
  6. Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning cs.RO · 2024 · author #6
  7. MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines cs.CV · 2024 · author #10
  8. Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models cs.CV · 2024 · author #3
  9. Phased Consistency Models cs.LG · 2024 · author #9
  10. Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models cs.CV · 2024 · author #6
  11. MoVA: Adapting Mixture of Vision Experts to Multimodal Context cs.CV · 2024 · author #4
  12. Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance cs.CV · 2024 · author #2
  13. CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching cs.CV · 2024 · author #2
  14. Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning cs.CV · 2024 · author #4
  15. Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation cs.CV · 2024 · author #6
  16. FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis cs.CV · 2024 · author #4
  17. AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data cs.CV · 2024 · author #6
  18. Towards Large-scale Masked Face Recognition cs.CV · 2023 · author #3
  19. Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection cs.CV · 2023 · author #2
  20. RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths cs.CV · 2023 · author #2
  21. Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising cs.CV · 2023 · author #3
  22. Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction cs.CV · 2023 · author #3
  23. DETRs with Collaborative Hybrid Assignments Training cs.CV · 2022 · author #2
  24. Teach-DETR: Better Training DETR with Teachers cs.CV · 2022 · author #3
  25. Large-batch Optimization for Dense Visual Predictions cs.CV · 2022 · author #3
  26. Towards Robust Face Recognition with Comprehensive Search cs.CV · 2022 · author #2
  27. Unifying Visual Perception by Dispersible Points Learning cs.CV · 2022 · author #2
  28. Rethinking Robust Representation Learning Under Fine-grained Noisy Faces cs.CV · 2022 · author #2
  29. UniNet: Unified Architecture Search with Convolution, Transformer, and MLP cs.CV · 2022 · author #3
  30. UniFormer: Unifying Convolution and Self-attention for Visual Recognition cs.CV · 2022 · author #5
  31. UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning cs.CV · 2022 · author #4
  32. Self-slimmed Vision Transformer cs.CV · 2021 · author #3
  33. INTERN: A New Learning Paradigm Towards General Vision cs.CV · 2021 · author #12
  34. UniNet: Unified Architecture Search with Convolution, Transformer, and MLP cs.CV · 2021 · author #3
  35. FNAS: Uncertainty-Aware Fast Neural Architecture Search cs.LG · 2021 · author #5
  36. Discriminability Distillation in Group Representation Learning cs.CV · 2020 · author #2
  37. 1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020 cs.CV · 2020 · author #3
  38. 1st Place Solutions for OpenImage2019 -- Object Detection and Instance Segmentation cs.CV · 2020 · author #2
  39. KPNet: Towards Minimal Face Detector cs.CV · 2020 · author #1
  40. Revisiting the Sibling Head in Object Detector cs.CV · 2020 · author #1
  41. Top-1 Solution of Multi-Moments in Time Challenge 2019 cs.CV · 2020 · author #3
  42. Towards Flops-constrained Face Recognition cs.CV · 2019 · author #2
  43. Beyond Trade-off: Accelerate FCN-based Face Detector with Higher Accuracy cs.CV · 2018 · author #1
  44. Region-based Quality Estimation Network for Large-scale Person Re-identification cs.CV · 2017 · author #1

Mentions

  • 2412.11279 #4 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2412.09618 #4 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2412.06295 #5 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2406.11831 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2405.18407 #9 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2409.12959 #10 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2404.03653 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2403.16999 #4 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2404.13046 #4 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2201.09450 #5 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2402.00769 #6 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2410.01529 #6 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2405.00760 #6 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2404.05384 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2403.13745 #6 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2403.12963 #4 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2305.18295 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2310.16364 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2310.15955 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2211.12860 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2305.18264 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2304.00967 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2211.11953 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2210.11078 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2208.13600 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2208.08630 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2207.05420 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2111.12624 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2208.04352 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2111.08687 #12 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2201.04676 #4 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2110.04035 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2105.11694 #5 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2008.10850 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2006.09116 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2003.07557 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2003.07543 #1 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2003.07540 #1 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2003.05837 #3 · arxiv_oai · confidence 0.70 Guanglu Song
  • 1909.00632 #2 · arxiv_oai · confidence 0.70 Guanglu Song
  • 1804.05197 #1 · arxiv_oai · confidence 0.70 Guanglu Song
  • 1711.08766 #1 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2604.03118 #7 · arxiv_oai · confidence 0.70 Guanglu Song
  • 2603.17461 #6 · arxiv_oai · confidence 0.70 Guanglu Song

Frequent Coauthors