pith. sign in

arxiv: 2606.10617 · v1 · pith:3CFSYBS3new · submitted 2026-06-09 · 💻 cs.CV

SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models

Pith reviewed 2026-06-27 13:34 UTC · model grok-4.3

classification 💻 cs.CV
keywords LoRA mergingdiffusion modelssubspace signal routingtraining-free mergingparameter interferenceOLS optimalitystreaming algorithmgenerative models
0
0 comments X

The pith

SSR merges LoRAs by routing decorrelated signals through a unified subspace instead of combining parameters directly.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Subspace Signal Routing to combine multiple trained LoRAs for diffusion models without retraining or task data. It builds a single subspace by joining the LoRAs along the rank dimension, applies an inverse correlation matrix to separate their mixed signals, and uses a directional guide matrix to steer each signal back to its original task subspace. This construction is shown to match the solution obtained from ordinary least squares, which the authors treat as mathematical optimality. A streaming version exploits the additivity of sufficient statistics to support efficient on-the-fly updates. Experiments indicate the method outperforms prior merging techniques at comparable computational cost.

Core claim

SSR first constructs a unified subspace by concatenating candidate LoRAs along the rank dimension. Next, SSR employs an inverse correlation matrix to decorrelate mixed signals within this space. Finally, a directional guide matrix steers these purified signals into their respective task-specific subspaces. The approach aligns with the Ordinary Least Squares solution, ensuring mathematical optimality, and supports a streaming algorithm via the additivity of sufficient statistics.

What carries the argument

Subspace Signal Routing via rank-wise subspace concatenation, followed by an inverse correlation matrix for decorrelation and a directional guide matrix for task-specific routing.

If this is right

  • Merging works without task-specific data or additional training for any collection of LoRAs.
  • Parameter interference is removed through explicit decorrelation rather than heuristic weighting.
  • On-the-fly updates become possible with reduced memory via the streaming algorithm.
  • Generation quality exceeds that of earlier parameter-space merging methods at similar speed.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same subspace construction could be tested on LoRA-style adapters outside diffusion models.
  • The OLS equivalence may suggest analogous routing methods for other forms of model composition.
  • Dynamic selection among many LoRAs at inference time could reuse the same decorrelation step.
  • Hardware accelerators might exploit the streaming property for real-time multi-task generation.

Load-bearing premise

The rank-wise concatenation into one unified subspace together with the inverse correlation matrix and directional guide matrix fully resolves parameter interference for arbitrary LoRAs without any task data or retraining.

What would settle it

Merge several LoRAs trained on mutually incompatible tasks and check whether the merged model produces outputs whose quality equals or exceeds that of the best individual LoRA under the same prompts.

Figures

Figures reproduced from arXiv: 2606.10617 by Hong Gu, Qi Fan, Shaofeng Zhang, Wenbin Li, Xianhui Lin, Xing Liu, Yi Dong, Zhengxuan Wei, Zonghui Li.

Figure 1
Figure 1. Figure 1: Visualization of Task-LoRA Activation Alignment. We display the max-normed activation intensity between task in￾structions (T) and LoRA modules (L). Left: The static baseline exhibits severe crosstalk, where instructions spuriously activate unrelated modules, indicating high interference. Right: Our SSR￾Merge achieves precise signal routing, showing a clean diagonal structure where each task exclusively ac… view at source ↗
Figure 2
Figure 2. Figure 2: Overview of Subspace Signal Routing (SSR). The framework expands individual LoRA bottlenecks into a unified subspace via Acomb. Within this space, the Router R resolves pa￾rameter interference through a two-stage mechanism: (1) Decorre￾lation (G−1 ), which acts as a decorrelation operator to disentangle mixed intermediate features; and (2) Steering (Q), which precisely guides the purified signals towards t… view at source ↗
Figure 3
Figure 3. Figure 3: Visual comparison of single-task generation results under increasing merging scales with K ranging from 3 to 9. The top row displays ground truth reference images and text prompts, where [V] represents the learned unique identifier token for each specific subject. Subsequent rows present the generated outputs from Linear Average, Task Arithmetic, TIES-Merging, DARE, and SSR. For each column, the unified mo… view at source ↗
Figure 4
Figure 4. Figure 4 [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: Overview of the datasets used in our experiments. The top panel displays the 10 custom subjects curated for the single-task (RQ1) and multi-task (RQ2) benchmarks. The bottom panel illustrates the three specific facial editing instructions designed for the RQ3 benchmark on the FFHQ dataset. B. Dataset and Baseline Details B.1. Dataset Construction Single-Task & Multi-Task Benchmark (RQ1 & RQ2). We curate a … view at source ↗
Figure 7
Figure 7. Figure 7: Visual analysis of RegMean stability. Since the RegMean covariance matrix is singular (d ≫ N), we introduce regularization λI to enable inversion. However, the results reveal a severe trade-off due to the ill-conditioned nature of the matrix: small λ fails to suppress numerical explosion (noise), while large λ dominates the signal, erasing task-specific features (identity loss). The Regularization Dilemma.… view at source ↗
Figure 8
Figure 8. Figure 8: Visual comparison of single-task preservation on Qwen-Image . We illustrate the generation quality of the target subject mixed with increasing numbers of distractor LoRAs (K ∈ {3, 5, 7, 9}). While baseline methods (e.g., Task Arithmetic, DARE) exhibit severe semantic collapse and identity loss—often generating unrecognizable noise or generic concepts under high interference (N = 9)—SSR consistently preserv… view at source ↗
Figure 9
Figure 9. Figure 9: Impact of calibration steps on performance and cost (K = 9). The red solid line indicates the generation fidelity (DINO Similarity), while the blue dashed line represents the calibration time cost. We observe that model performance saturates immediately at T = 1, whereas the computational overhead increases linearly. This justifies our choice of one-shot calibration as the optimal efficiency-performance tr… view at source ↗
read the original abstract

Low-Rank Adaptation (LoRA) merging can efficiently combine diverse generative capabilities from multiple trained LoRAs for a diffusion model. However, existing LoRA merging techniques often suffer from severe parameter interference, causing destructive collisions in the shared parameter space. To address this, we propose Subspace Signal Routing (SSR), which resolves interference by routing internal signals instead of performing parameter-space merge. Specifically, SSR first constructs a unified subspace by concatenating candidate LoRAs along the rank dimension. Next, SSR employs an inverse correlation matrix to decorrelate mixed signals within this space. Finally, a directional guide matrix steers these purified signals into their respective task-specific subspaces. We provide a rigorous theoretical analysis proving that SSR aligns with the Ordinary Least Squares (OLS) solution, thereby ensuring mathematical optimality. We utilize the additivity of sufficient statistics to design a streaming algorithm. This enables on-the-fly updates that significantly reduce memory overhead and computation time. Extensive experiments validate that SSR significantly outperforms state-of-the-art methods while maintaining comparable efficiency. Code is available at https://github.com/nagara214/SSR-Merge.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes Subspace Signal Routing (SSR) for training-free merging of LoRAs in diffusion models to mitigate parameter interference. SSR constructs a unified subspace via rank-wise concatenation of candidate LoRAs, applies an inverse correlation matrix for decorrelation, and uses a directional guide matrix to route signals to task-specific subspaces. It claims a rigorous proof that this procedure aligns with the OLS estimator for mathematical optimality, introduces a streaming algorithm exploiting additivity of sufficient statistics for efficiency, and reports superior empirical performance over prior merging methods.

Significance. If the claimed OLS equivalence is rigorously established without circularity or hidden assumptions, the work would supply a parameter-free optimality guarantee for data-free LoRA merging, a practical streaming implementation, and reproducible code. This could meaningfully advance multi-task adaptation of diffusion models by replacing heuristic merging with a provably optimal routing mechanism.

major comments (2)
  1. [Abstract, §3] Abstract and §3 (theoretical analysis): the central claim that SSR 'aligns with the Ordinary Least Squares (OLS) solution' is load-bearing for the optimality guarantee, yet the provided text supplies no derivation showing that the composite operation (rank-wise concatenation + inverse correlation matrix + directional guide matrix) reproduces the normal equations of any stated merging objective. Without the explicit mapping from the subspace construction to the OLS estimator (including the precise definition of the correlation matrix and the interference model), the equivalence cannot be verified and may reduce to a tautology or hold only under unstated orthogonality assumptions.
  2. [§4] §4 (streaming algorithm) and experiments: the additivity of sufficient statistics is invoked to justify on-the-fly updates, but no explicit statement is given of the underlying statistical model or the precise sufficient statistics being accumulated; this leaves open whether the streaming procedure preserves the claimed OLS optimality when LoRAs are added incrementally.
minor comments (2)
  1. [§3] Notation for the correlation matrix and guide matrix should be introduced with explicit dimensions and a small worked example to clarify how they act on the concatenated subspace.
  2. [§5] The experimental section should report the exact number of LoRAs merged per task, the rank values used, and any data-exclusion criteria applied when computing the correlation matrix.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback on the theoretical claims and streaming implementation. We address each major comment below and will incorporate the requested clarifications and derivations into the revised manuscript.

read point-by-point responses
  1. Referee: [Abstract, §3] Abstract and §3 (theoretical analysis): the central claim that SSR 'aligns with the Ordinary Least Squares (OLS) solution' is load-bearing for the optimality guarantee, yet the provided text supplies no derivation showing that the composite operation (rank-wise concatenation + inverse correlation matrix + directional guide matrix) reproduces the normal equations of any stated merging objective. Without the explicit mapping from the subspace construction to the OLS estimator (including the precise definition of the correlation matrix and the interference model), the equivalence cannot be verified and may reduce to a tautology or hold only under unstated orthogonality assumptions.

    Authors: We agree that an explicit derivation is necessary for verifiability. In the revised manuscript we will expand §3 with a complete proof that maps the SSR operations to the OLS normal equations: rank-wise concatenation constructs the design matrix X whose columns are the LoRA basis vectors; the inverse correlation matrix is defined as (X^T X)^{-1} where the correlation matrix is the Gram matrix of these vectors; and the directional guide matrix implements the projection that solves the multi-task least-squares objective under the additive interference model (parameter collisions as summed updates). This derivation will be presented without hidden orthogonality assumptions beyond those stated in the merging objective. revision: yes

  2. Referee: [§4] §4 (streaming algorithm) and experiments: the additivity of sufficient statistics is invoked to justify on-the-fly updates, but no explicit statement is given of the underlying statistical model or the precise sufficient statistics being accumulated; this leaves open whether the streaming procedure preserves the claimed OLS optimality when LoRAs are added incrementally.

    Authors: We will revise §4 to state the statistical model explicitly as multivariate linear regression with the merged weight as the response and the concatenated LoRA directions as predictors. The sufficient statistics are the accumulated Gram matrix X^T X and the cross-term X^T y. Because these statistics are additive, appending a new LoRA updates them by simple matrix addition, and the resulting OLS solution remains identical to the batch solution. A short proof of invariance under incremental addition will be added. revision: yes

Circularity Check

0 steps flagged

No circularity: OLS alignment presented as independent theoretical result

full rationale

The abstract describes SSR via explicit construction (rank-wise concatenation into unified subspace, inverse correlation matrix for decorrelation, directional guide matrix) and separately claims a rigorous theoretical analysis proving alignment with the OLS solution. No equations, self-citations, or reductions are shown that would make the alignment tautological or force the prediction by definition of the inputs. The method is not defined in terms of OLS; instead, the paper asserts an external equivalence via analysis. No load-bearing self-citation or fitted-input-as-prediction pattern appears. The derivation chain is therefore self-contained against the given text.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The method rests on standard linear algebra assumptions plus the novel construction of the subspace and matrices; no explicit free parameters or new physical entities are introduced in the abstract.

axioms (1)
  • domain assumption Ordinary Least Squares yields the optimal linear estimator for the signal routing task
    Invoked to establish mathematical optimality of SSR.
invented entities (1)
  • Subspace Signal Routing (SSR) mechanism no independent evidence
    purpose: To merge LoRAs by routing rather than parameter addition
    Core proposed technique; no independent evidence supplied beyond the method itself.

pith-pipeline@v0.9.1-grok · 5748 in / 1289 out tokens · 20185 ms · 2026-06-27T13:34:57.183696+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

57 extracted references · 3 linked inside Pith

  1. [1]

    K., Hayase, J., and Srinivasa, S

    Ainsworth, S. K., Hayase, J., and Srinivasa, S. Git re-basin: Merging models modulo permutation symmetries. In ICLR, 2023

  2. [2]

    J., Lu, Z., Wu, E., and Hu, J

    Chen, D., Tan, V. J., Lu, Z., Wu, E., and Hu, J. Openfed: A comprehensive and versatile open-source federated learning framework. In CVPR Workshops, 2023

  3. [3]

    Iteris: Iterative inference-solving alignment for lora merging

    Chen, H., Li, R., Zhu, B., Wang, Z., and Chen, L. Iteris: Iterative inference-solving alignment for lora merging. In CVPR, 2025 a

  4. [4]

    Se-merging: A self-enhanced approach for dynamic model merging

    Chen, Z., Zhou, Z., Zhang, B., Zhang, W., Sun, X., and Yan, J. Se-merging: A self-enhanced approach for dynamic model merging. arXiv preprint arXiv:2506.18135, 2025 b

  5. [5]

    and Belilovsky, E

    Davari, M. and Belilovsky, E. Model breadcrumbs: Scaling multi-task model merging with sparse masks. In ECCV, 2024

  6. [6]

    Arcface: Additive angular margin loss for deep face recognition

    Deng, J., Guo, J., Xue, N., and Zafeiriou, S. Arcface: Additive angular margin loss for deep face recognition. In CVPR, 2019

  7. [7]

    Implicit style-content separation using b-lora

    Frenkel, Y., Vinker, Y., Shamir, A., and Cohen-Or, D. Implicit style-content separation using b-lora. In ECCV, 2024

  8. [8]

    A., Crisostomi, D., Bucarelli, M

    Gargiulo, A. A., Crisostomi, D., Bucarelli, M. S., Scardapane, S., Silvestri, F., and Rodola, E. Task singular vectors: Reducing task interference in model merging. In CVPR, 2025

  9. [9]

    Z., Shi, Y., Chen, Y., Fan, Z., Xiao, W., Zhao, R., Chang, S., Wu, W., et al

    Gu, Y., Wang, X., Wu, J. Z., Shi, Y., Chen, Y., Fan, Z., Xiao, W., Zhao, R., Chang, S., Wu, W., et al. Mix-of-show: Decentralized low-rank adaptation for multi-concept customization of diffusion models. In NeurIPS, 2023

  10. [10]

    Denoising diffusion probabilistic models

    Ho, J., Jain, A., and Abbeel, P. Denoising diffusion probabilistic models. In NeurIPS, 2020

  11. [11]

    Parameter-efficient transfer learning for nlp

    Houlsby, N., Giurgiu, A., Jastrzebski, S., Morrone, B., De Laroussilhe, Q., Gesmundo, A., Attariyan, M., and Gelly, S. Parameter-efficient transfer learning for nlp. In ICML, 2019

  12. [12]

    J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al

    Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al. Lora: Low-rank adaptation of large language models. In ICLR, 2022

  13. [13]

    Y., Pang, T., Du, C., and Lin, M

    Huang, C., Liu, Q., Lin, B. Y., Pang, T., Du, C., and Lin, M. Lorahub: Efficient cross-task generalization via dynamic lora composition. In COLM, 2023

  14. [14]

    Emr-merging: Tuning-free high-performance model merging

    Huang, C., Ye, P., Chen, T., He, T., Yue, X., and Ouyang, W. Emr-merging: Tuning-free high-performance model merging. In NeurIPS, 2024

  15. [15]

    T., Wortsman, M., Gururangan, S., Schmidt, L., Hajishirzi, H., and Farhadi, A

    Ilharco, G., Ribeiro, M. T., Wortsman, M., Gururangan, S., Schmidt, L., Hajishirzi, H., and Farhadi, A. Editing models with task arithmetic. In ICLR, 2023

  16. [16]

    Dataless knowledge fusion by merging weights of language models

    Jin, X., Ren, X., Preotiuc-Pietro, D., and Cheng, P. Dataless knowledge fusion by merging weights of language models. In ICLR, 2023

  17. [17]

    A style-based generator architecture for generative adversarial networks

    Karras, T., Laine, S., and Aila, T. A style-based generator architecture for generative adversarial networks. In CVPR, 2019

  18. [18]

    Labs, B. F. Flux. https://github.com/black-forest-labs/flux, 2024

  19. [19]

    The power of scale for parameter-efficient prompt tuning

    Lester, B., Al-Rfou, R., and Constant, N. The power of scale for parameter-efficient prompt tuning. In EMNLP, 2021

  20. [20]

    Li, X. L. and Liang, P. Prefix-tuning: Optimizing continuous prompts for generation. In ACL, 2021

  21. [21]

    Grounding dino: Marrying dino with grounded pre-training for open-set object detection

    Liu, S., Zeng, Z., Ren, T., Li, F., Zhang, H., Yang, J., Jiang, Q., Li, C., Yang, J., Su, H., et al. Grounding dino: Marrying dino with grounded pre-training for open-set object detection. In ECCV, 2024

  22. [22]

    Twin-merging: Dynamic integration of modular expertise in model merging

    Lu, Z., Fan, C., Wei, W., Qu, X., Chen, D., and Cheng, Y. Twin-merging: Dynamic integration of modular expertise in model merging. In NeurIPS, 2024

  23. [23]

    Unipelt: A unified framework for parameter-efficient language model tuning

    Mao, Y., Mathias, L., Hou, R., Almahairi, A., Ma, H., Han, J., Yih, S., and Khabsa, M. Unipelt: A unified framework for parameter-efficient language model tuning. In ACL, 2022

  24. [24]

    D., and van de Weijer, J

    Marczak, D., Magistri, S., Cygert, S., Twardowski, B., Bagdanov, A. D., and van de Weijer, J. No task left behind: Isotropic model merging with common and task-specific subspaces. In ICML, 2025

  25. [25]

    Matena, M. S. and Raffel, C. A. Merging models with fisher-weighted averaging. In NeurIPS, 2022

  26. [26]

    Dinov2: Learning robust visual features without supervision

    Oquab, M., Darcet, T., Moutakanni, T., Vo, H., Szafraniec, M., Khalidov, V., Fernandez, P., Haziza, D., Massa, F., El-Nouby, A., et al. Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193, 2023

  27. [27]

    K-lora: Unlocking training-free fusion of any subject and style loras

    Ouyang, Z., Li, Z., and Hou, Q. K-lora: Unlocking training-free fusion of any subject and style loras. In CVPR, 2025

  28. [28]

    D., Calderara, S., and van de Weijer, J

    Panariello, A., Marczak, D., Magistri, S., Porrello, A., Twardowski, B., Bagdanov, A. D., Calderara, S., and van de Weijer, J. Accurate and efficient low-rank model merging in core space. In NeurIPS, 2025

  29. [29]

    W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al

    Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al. Learning transferable visual models from natural language supervision. In ICML, 2021

  30. [30]

    Hierarchical text-conditional image generation with clip latents

    Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., and Chen, M. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125, 2022

  31. [31]

    High-resolution image synthesis with latent diffusion models

    Rombach, R., Blattmann, A., Lorenz, D., Esser, P., and Ommer, B. High-resolution image synthesis with latent diffusion models. In CVPR, 2022

  32. [32]

    Duolora: Cycle-consistent and rank-disentangled content-style personalization

    Roy, A., Borse, S., Kadambi, S., Das, D., Mahajan, S., Garrepalli, R., Park, H., Nayak, A., Chellappa, R., Hayat, M., et al. Duolora: Cycle-consistent and rank-disentangled content-style personalization. In ICCV, 2025

  33. [33]

    Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation

    Ruiz, N., Li, Y., Jampani, V., Pritch, Y., Rubinstein, M., and Aberman, K. Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation. In CVPR, 2023

  34. [34]

    L., Ghasemipour, K., Gontijo Lopes, R., Karagol Ayan, B., Salimans, T., et al

    Saharia, C., Chan, W., Saxena, S., Li, L., Whang, J., Denton, E. L., Ghasemipour, K., Gontijo Lopes, R., Karagol Ayan, B., Salimans, T., et al. Photorealistic text-to-image diffusion models with deep language understanding. In NeurIPS, 2022

  35. [35]

    Ziplora: Any subject in any style by effectively merging loras

    Shah, V., Ruiz, N., Cole, F., Lu, E., Lazebnik, S., Li, Y., and Jampani, V. Ziplora: Any subject in any style by effectively merging loras. In ECCV, 2024

  36. [36]

    Shenaj, D., Bohdal, O., Ozay, M., Zanuttigh, P., and Michieli, U. Lora. rar: Learning to merge loras via hypernetworks for subject-style conditioned image generation. In ICCV, 2025

  37. [37]

    P., Kumar, A., Ermon, S., and Poole, B

    Song, Y., Sohl-Dickstein, J., Kingma, D. P., Kumar, A., Ermon, S., and Poole, B. Score-based generative modeling through stochastic differential equations. In ICLR, 2021

  38. [38]

    Zipit! merging models from different tasks without training

    Stoica, G., Bolya, D., Bjorner, J., Ramesh, P., Hearn, T., and Hoffman, J. Zipit! merging models from different tasks without training. In ICLR, 2024

  39. [39]

    Model merging with svd to tie the knots

    Stoica, G., Ramesh, P., Ecsedi, B., Choshen, L., and Hoffman, J. Model merging with svd to tie the knots. In ICLR, 2025

  40. [40]

    Concrete subspace learning based interference elimination for multi-task model fusion

    Tang, A., Shen, L., Luo, Y., Ding, L., Hu, H., Du, B., and Tao, D. Concrete subspace learning based interference elimination for multi-task model fusion. arXiv preprint arXiv:2312.06173, 2023

  41. [41]

    Diffusers: State-of-the-art diffusion models

    von Platen, P., Patil, S., Lozhkov, A., Cuenca, P., Lambert, N., Rasul, K., Davaadorj, M., Nair, D., Paul, S., Berman, W., Xu, Y., Liu, S., and Wolf, T. Diffusers: State-of-the-art diffusion models. https://github.com/huggingface/diffusers, 2022

  42. [42]

    Lora-flow: Dynamic lora fusion for large language models in generative tasks

    Wang, H., Ping, B., Wang, S., Han, X., Chen, Y., Liu, Z., and Sun, M. Lora-flow: Dynamic lora fusion for large language models in generative tasks. In ACL, 2024 a

  43. [43]

    Localizing task information for improved model merging and compression

    Wang, K., Dimitriadis, N., Ortiz-Jimenez, G., Fleuret, F., and Frossard, P. Localizing task information for improved model merging and compression. In ICML, 2024 b

  44. [44]

    Modeling multi-task model merging as adaptive projective gradient descent

    Wei, Y., Tang, A., Shen, L., Hu, Z., Yuan, C., and Cao, X. Modeling multi-task model merging as adaptive projective gradient descent. In ICML, 2025

  45. [45]

    Wiggins, W. F. and Tejani, A. S. On the opportunities and risks of foundation models for natural language processing in radiology. Radiology: Artificial Intelligence, 2022

  46. [46]

    L., Gugger, S., Drame, M., Lhoest, Q., and Rush, A

    Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., Cistac, P., Rault, T., Louf, R., Funtowicz, M., Davison, J., Shleifer, S., von Platen, P., Ma, C., Jernite, Y., Plu, J., Xu, C., Scao, T. L., Gugger, S., Drame, M., Lhoest, Q., and Rush, A. M. Transformers: State-of-the-art natural language processing. In EMNLP, 2020

  47. [47]

    Y., Roelofs, R., Gontijo-Lopes, R., Morcos, A

    Wortsman, M., Ilharco, G., Gadre, S. Y., Roelofs, R., Gontijo-Lopes, R., Morcos, A. S., Namkoong, H., Farhadi, A., Carmon, Y., Kornblith, S., et al. Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time. In ICML, 2022

  48. [48]

    Qwen-image technical report, 2025

    Wu, C., Li, J., Zhou, J., Lin, J., Gao, K., Yan, K., ming Yin, S., Bai, S., Xu, X., Chen, Y., Chen, Y., Tang, Z., Zhang, Z., Wang, Z., Yang, A., Yu, B., Cheng, C., Liu, D., Li, D., Zhang, H., Meng, H., Wei, H., Ni, J., Chen, K., Cao, K., Peng, L., Qu, L., Wu, M., Wang, P., Yu, S., Wen, T., Feng, W., Xu, X., Wang, Y., Zhang, Y., Zhu, Y., Wu, Y., Cai, Y., a...

  49. [49]

    Mixture of lora experts

    Wu, X., Huang, S., and Wei, F. Mixture of lora experts. arXiv preprint arXiv:2404.13628, 2024

  50. [50]

    A., and Bansal, M

    Yadav, P., Tam, D., Choshen, L., Raffel, C. A., and Bansal, M. Ties-merging: Resolving interference when merging models. In NeurIPS, 2023

  51. [51]

    Representation surgery for multi-task model merging

    Yang, E., Shen, L., Wang, Z., Guo, G., Chen, X., Wang, X., and Tao, D. Representation surgery for multi-task model merging. In ICML, 2024 a

  52. [52]

    Adamerging: Adaptive model merging for multi-task learning

    Yang, E., Wang, Z., Shen, L., Liu, S., Guo, G., Wang, X., and Tao, D. Adamerging: Adaptive model merging for multi-task learning. In ICLR, 2024 b

  53. [53]

    Lora-composer: Leveraging low-rank adaptation for multi-concept customization in training-free diffusion models

    Yang, Y., Wang, W., Peng, L., Song, C., Chen, Y., Li, H., Yang, X., Lu, Q., Cai, D., He, X., et al. Lora-composer: Leveraging low-rank adaptation for multi-concept customization in training-free diffusion models. IEEE Transactions on Image Processing, 2025

  54. [54]

    Language models are super mario: Absorbing abilities from homologous models as a free lunch

    Yu, L., Yu, B., Yu, H., Huang, F., and Li, Y. Language models are super mario: Absorbing abilities from homologous models as a free lunch. In ICML, 2024

  55. [55]

    Robustmerge: Parameter-efficient model merging for mllms with direction robustness

    Zeng, F., Guo, H., Zhu, F., Shen, L., and Tang, H. Robustmerge: Parameter-efficient model merging for mllms with direction robustness. In The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025

  56. [56]

    Adding conditional control to text-to-image diffusion models

    Zhang, L., Rao, A., and Agrawala, M. Adding conditional control to text-to-image diffusion models. In ICCV, 2023

  57. [57]

    Loraretriever: Input-aware lora retrieval and composition for mixed tasks in the wild

    Zhao, Z., Gan, L., Wang, G., Zhou, W., Yang, H., Kuang, K., and Wu, F. Loraretriever: Input-aware lora retrieval and composition for mixed tasks in the wild. arXiv preprint arXiv:2402.09997, 2024