Audio Content based Geotagging in Multimedia

Anurag Kumar; Benjamin Elizalde; Bhiksha Raj

arxiv: 1606.02816 · v2 · pith:IDVMCYGNnew · submitted 2016-06-09 · 💻 cs.SD · cs.MM



keywords: audio, information, recording, classes, content, location, multimedia, semantic


In this paper we propose methods to extract geographically relevant information from a multimedia recording using its audio. Our method is based primarily on the fact that the urban acoustic environment consists of a variety of sounds. Hence, location information can be inferred from the composition of sound events/classes present in the audio. More specifically, we adopt matrix factorization techniques to obtain the semantic content of a recording in terms of different sound classes. This semantic information is then combined to identify the location of the recording.
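
The abstract does not say which matrix factorization is used, so the following is only a minimal sketch of the general idea, assuming a non-negative matrix factorization (NMF) with a fixed dictionary of per-class spectral bases and multiplicative updates for the Euclidean objective. The function names `class_activations` and `recording_feature`, the toy dictionary `W`, and the basis-to-class mapping are all hypothetical and chosen for illustration; the final step of mapping the pooled feature to a location (for example with a classifier) is not shown.

```python
import numpy as np

def class_activations(V, W, n_iter=200, eps=1e-9):
    """Estimate nonnegative H with V ~= W @ H, keeping the dictionary W fixed."""
    rng = np.random.default_rng(0)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        # Multiplicative update for the squared-error NMF objective
        H *= (W.T @ V) / (W.T @ W @ H + eps)
    return H

def recording_feature(H, class_of_basis, n_classes):
    """Pool activations over time, then over the bases of each sound class."""
    per_basis = H.sum(axis=1)
    feat = np.zeros(n_classes)
    np.add.at(feat, class_of_basis, per_basis)
    return feat / (feat.sum() + 1e-9)  # normalize to a class composition

# Toy dictionary (assumed): 6 frequency bins, 2 bases per sound class.
W = np.array([[1, 0, 0, 0],
              [1, 1, 0, 0],
              [0, 1, 0, 0],
              [0, 0, 1, 0],
              [0, 0, 1, 1],
              [0, 0, 0, 1]], dtype=float)
class_of_basis = np.array([0, 0, 1, 1])

# Synthetic recording that contains only sound class 0.
rng = np.random.default_rng(1)
V = W[:, :2] @ (rng.random((2, 50)) + 0.1)

H = class_activations(V, W)
feat = recording_feature(H, class_of_basis, n_classes=2)
```

Here `feat` is a recording-level vector describing how much each sound class contributes to the audio. In this toy case nearly all of the mass falls on class 0, because the synthetic recording has no energy in the bins covered by the class 1 bases.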


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

GeoGNN: Time Series Geo-Localization using Two-Tower Graph Neural Networks
cs.LG 2026-06 unverdicted novelty 6.0

GeoGNN is a two-tower GNN that learns geographic cell embeddings from adjacency graphs and matches them to temporal representations via dot-product similarity plus classification, improving geolocalization accuracy by...