pith. sign in

arxiv: 1904.08926 · v1 · pith:3VRRKVX7new · submitted 2019-04-15 · 💻 cs.SI · cs.CL

Characterization of citizens using word2vec and latent topic analysis in a large set of tweets

classification 💻 cs.SI cs.CL
keywords citizenstweetsanalysiscitycommunitiesideaslearningmachine
0
0 comments X
read the original abstract

With the increasing use of the Internet and mobile devices, social networks are becoming the most used media to communicate citizens' ideas and thoughts. This information is very useful to identify communities with common ideas based on what they publish in the network. This paper presents a method to automatically detect city communities based on machine learning techniques applied to a set of tweets from Bogot\'a's citizens. An analysis was performed in a collection of 2,634,176 tweets gathered from Twitter in a period of six months. Results show that the proposed method is an interesting tool to characterize a city population based on a machine learning methods and text analytics.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Using machine learning to build public policy agenda from social media conversations

    cs.CY 2026-05 unverdicted novelty 3.0

    A pipeline combining LDA, Top2Vec, GPT-2, similarity analysis, and human evaluation extracts policy agendas from social media with reported good inter-rater agreement and cosine similarity scores.