
arXiv: 1403.2345 · v1 · submitted 2014-03-07 · 💻 cs.SI · cs.CL · cs.CY


Home Location Identification of Twitter Users

classification 💻 cs.SI cs.CL cs.CY
keywords users, location, twitter, algorithm, geographic, home, time, accuracy

We present a new algorithm for inferring the home location of Twitter users at different granularities, including city, state, time zone, or geographic region, using the content of users' tweets and their tweeting behavior. Unlike existing approaches, our algorithm uses an ensemble of statistical and heuristic classifiers to predict locations and makes use of a geographic gazetteer dictionary to identify place-name entities. We find that a hierarchical classification approach, where time zone, state, or geographic region is predicted first and city is predicted next, can improve prediction accuracy. We have also analyzed movement variations of Twitter users, built a classifier to predict whether a user was travelling in a certain period of time, and used that classifier to further improve location detection accuracy. Experimental evidence suggests that our algorithm works well in practice and outperforms the best existing algorithms for predicting the home location of Twitter users.
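
The hierarchical idea in the abstract (predict a coarse region first, then a city within it) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say how the ensemble combines its classifiers, so plain majority voting stands in for that step, and `CITY_TO_STATE`, `majority_vote`, and `predict_home_city` are hypothetical names with toy data rather than the paper's gazetteer or code.

```python
from collections import Counter

# Hypothetical toy gazetteer mapping city -> state (illustrative only).
CITY_TO_STATE = {
    "Austin": "TX",
    "Houston": "TX",
    "Portland": "OR",
    "Salem": "OR",
}


def majority_vote(votes):
    """Return the most common non-None label, or None if there are no votes."""
    counts = Counter(v for v in votes if v is not None)
    return counts.most_common(1)[0][0] if counts else None


def predict_home_city(state_votes, city_votes):
    """Two-stage prediction: pick a state first, then a city inside that state.

    state_votes and city_votes are the labels proposed by the individual
    classifiers of an ensemble (assumed here to vote with equal weight).
    """
    state = majority_vote(state_votes)
    if state is None:
        return None, None
    # Keep only the city votes that are consistent with the predicted state.
    consistent = [c for c in city_votes if CITY_TO_STATE.get(c) == state]
    return state, majority_vote(consistent)


state, city = predict_home_city(
    ["TX", "TX", "OR"],
    ["Portland", "Portland", "Austin", "Houston", "Austin"],
)
print(state, city)  # TX Austin
```

In this toy input a flat vote over cities is tied between Portland and Austin, while conditioning on the predicted state removes the Portland votes and leaves Austin as the clear choice. That is one plausible reading of why a coarse-to-fine scheme could help; the paper itself should be consulted for the actual mechanism.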


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Reddit's Globalization over Twenty Years: Inferring Community Time Zone from Activity Timestamps

    cs.SI 2026-05 unverdicted novelty 6.0

    A 4 a.m. activity minimum heuristic infers community time zones from timestamps with sub-hour accuracy on Reddit, enabling scalable analysis of the platform's globalization without user location data.
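
An activity-minimum heuristic of the kind summarized above can be sketched as follows. This is a rough, hour-resolution illustration only, not the cited paper's method: the summary mentions sub-hour accuracy, which this sketch does not attempt, and the function name `infer_utc_offset` and its parameters are hypothetical. It assumes the quietest hour of the day in UTC corresponds to 4 a.m. local time.

```python
from collections import Counter


def infer_utc_offset(utc_hours, quiet_local_hour=4):
    """Estimate a whole-hour UTC offset from the UTC hour-of-day of each post.

    Assumes activity is lowest around quiet_local_hour in local time.
    """
    counts = Counter(h % 24 for h in utc_hours)
    # UTC hour with the fewest posts; ties go to the earliest hour.
    quiet_utc_hour = min(range(24), key=lambda h: counts[h])
    # local = utc + offset, so offset = local - utc, wrapped into [-12, 11].
    return (quiet_local_hour - quiet_utc_hour + 12) % 24 - 12


# Toy data: activity in every UTC hour except 09:00, repeated three times.
hours = [h for h in range(24) if h != 9] * 3
print(infer_utc_offset(hours))  # -5
```

With the quiet hour at 09:00 UTC, the estimate is UTC-5, since 09:00 UTC is 04:00 local time at that offset.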