Explicit Context-Driven Neural Acoustic Modeling for High-Fidelity RIR Generation

Chaitanya Amballa; Chen Si; Qianyi Wu; Romit Roy Choudhury

arxiv: 2509.15210 · v2 · pith:TVTICSNRnew · submitted 2025-09-18 · 💻 cs.SD · cs.AI· cs.LG

Explicit Context-Driven Neural Acoustic Modeling for High-Fidelity RIR Generation

Chen Si , Qianyi Wu , Chaitanya Amballa , Romit Roy Choudhury This is my paper

classification 💻 cs.SD cs.AIcs.LG

keywords explicitgeometricneuralsoundcontextenvironmentfeaturesgiven

0 comments

read the original abstract

Realistic sound simulation plays a critical role in many applications. A key element in sound simulation is the room impulse response (RIR), which characterizes how sound propagates within a given space. Recent studies have applied neural implicit methods to learn RIR using context information collected from the environment, such as scene images. However, these approaches do not effectively leverage explicit geometric information from the environment. To further exploit neural implicit models with direct geometric features, we present MiNAF, which queries a rough room mesh at given locations and extracts distance distributions as an explicit representation of local context. Our approach demonstrates that incorporating explicit local geometric features can better guide the model in generating more accurate RIR predictions. Through comparisons with conventional and state-of-the-art methods, we show that MiNAF performs competitively across various evaluation metrics.

This paper has not been read by Pith yet.

Explicit Context-Driven Neural Acoustic Modeling for High-Fidelity RIR Generation

discussion (0)