pith. sign in

Animalbooth: multimodal feature enhancement for animal subject personalization

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

Personalized animal image generation is challenging due to rich appearance cues and large morphological variability. Existing approaches often exhibit feature misalignment across domains, which leads to identity drift. We present AnimalBooth, a framework that strengthens identity preservation with an Animal Net and an adaptive attention module, mitigating cross domain alignment errors. We further introduce a frequency controlled feature integration module that applies Discrete Cosine Transform filtering in the latent space to guide the diffusion process, enabling a coarse to fine progression from global structure to detailed texture. To advance research in this area, we curate AnimalBench, a high resolution dataset for animal personalization. Extensive experiments show that AnimalBooth consistently outperforms strong baselines on multiple benchmarks and improves both identity fidelity and perceptual quality.

fields

cs.CV 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

Animalbooth: multimodal feature enhancement for animal subject personalization

cs.CV · 2025-09-20 · unverdicted · novelty 5.0

AnimalBooth introduces an Animal Net, adaptive attention module, and frequency-controlled DCT feature integration to improve identity preservation and perceptual quality in personalized animal image generation, supported by a new high-resolution dataset AnimalBench.

citing papers explorer

Showing 1 of 1 citing paper.

  • Animalbooth: multimodal feature enhancement for animal subject personalization cs.CV · 2025-09-20 · unverdicted · none · ref 2 · internal anchor

    AnimalBooth introduces an Animal Net, adaptive attention module, and frequency-controlled DCT feature integration to improve identity preservation and perceptual quality in personalized animal image generation, supported by a new high-resolution dataset AnimalBench.