A DIRT-T Approach to Unsupervised Domain Adaptation

Rui Shu , Hung H. Bui , Hirokazu Narui , Stefano Ermon

Authors on Pith no claims yet

classification 📊 stat.ML cs.CVcs.LG

keywords domainadaptationfeaturemodeltrainingadversarialsourcetarget

read the original abstract

Domain adaptation refers to the problem of leveraging labeled data in a source domain to learn an accurate model in a target domain where labels are scarce or unavailable. A recent approach for finding a common representation of the two domains is via domain adversarial training (Ganin & Lempitsky, 2015), which attempts to induce a feature extractor that matches the source and target feature distributions in some feature space. However, domain adversarial training faces two critical limitations: 1) if the feature extraction function has high-capacity, then feature distribution matching is a weak constraint, 2) in non-conservative domain adaptation (where no single classifier can perform well in both the source and target domains), training the model to do well on the source domain hurts performance on the target domain. In this paper, we address these issues through the lens of the cluster assumption, i.e., decision boundaries should not cross high-density data regions. We propose two novel and related models: 1) the Virtual Adversarial Domain Adaptation (VADA) model, which combines domain adversarial training with a penalty term that punishes the violation the cluster assumption; 2) the Decision-boundary Iterative Refinement Training with a Teacher (DIRT-T) model, which takes the VADA model as initialization and employs natural gradient steps to further minimize the cluster assumption violation. Extensive empirical results demonstrate that the combination of these two models significantly improve the state-of-the-art performance on the digit, traffic sign, and Wi-Fi recognition domain adaptation benchmarks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Generalizable Action Representations via Pre-training AEMG
cs.LG 2026-05 unverdicted novelty 7.0

AEMG pre-trains EMG representations by treating neuromuscular signals as language via a novel tokenizer and cross-device vocabulary, yielding 5.79-9.25% zero-shot LOSO gains and over 90% few-shot performance with 5% t...
A General Representation-Based Approach to Multi-Source Domain Adaptation
cs.LG 2026-04 unverdicted novelty 7.0

A representation learning approach for multi-source domain adaptation achieves identifiability by partitioning the label's Markov blanket into parents, children, and spouses.
STDA-Net: Spectrogram-Based Domain Adaptation for cross-dataset Sleep Stage Classification
cs.LG 2026-05 unverdicted novelty 6.0

STDA-Net achieves 89.03% average accuracy and 87.64% macro F1 in cross-dataset sleep staging by processing 2D spectrograms with temporal modeling and unsupervised adversarial alignment, outperforming 1D baselines with...
On the Blessing of Pre-training in Weak-to-Strong Generalization
cs.LG 2026-05 unverdicted novelty 6.0

Pre-training provides a geometric warm start in a single-index model that enables weak-to-strong generalization up to a supervisor-limited bound, with empirical phase-transition evidence in LLMs.