Commit: Coordinated instruction tuning for multimodal large language models.arXiv preprint arXiv:2407.20454, 2024

Junda Wu, Xintong Li, Tong Yu, Yu Wang, Xiang Chen, Jiuxiang Gu, Lina Yao, Jingbo Shang, Julian McAuley · 2024 · arXiv 2407.20454

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Pareto LoRA: Mitigating Modality Imbalance in Unified Multimodal Models via Pareto-Optimal Gradient Integration

cs.CV · 2026-06-15 · unverdicted · novelty 6.0

Pareto LoRA applies Pareto-optimal gradient integration to balance text and image objectives in LoRA-based fine-tuning of unified multimodal models, reporting up to 44.9% gains in image quality on the CoMM benchmark with Emu2 while preserving text performance.

citing papers explorer

Showing 1 of 1 citing paper.

Pareto LoRA: Mitigating Modality Imbalance in Unified Multimodal Models via Pareto-Optimal Gradient Integration cs.CV · 2026-06-15 · unverdicted · none · ref 40
Pareto LoRA applies Pareto-optimal gradient integration to balance text and image objectives in LoRA-based fine-tuning of unified multimodal models, reporting up to 44.9% gains in image quality on the CoMM benchmark with Emu2 while preserving text performance.

Commit: Coordinated instruction tuning for multimodal large language models.arXiv preprint arXiv:2407.20454, 2024

fields

years

verdicts

representative citing papers

citing papers explorer