Inference of Socioeconomic Status in a Communication Graph
In this work, we examine the socio-economic correlations present among users in a mobile phone network in Mexico. First, we find that the distribution of income for a subset of users –for which we have income information given by a large bank in Mexico– follows closely, but not exactly, the income d...
Guardado en:
Autores principales: | , , , , |
---|---|
Formato: | Objeto de conferencia |
Lenguaje: | Inglés |
Publicado: |
2016
|
Materias: | |
Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/56824 http://45jaiio.sadio.org.ar/sites/default/files/AGRANDA-09.pdf |
Aporte de: |
id |
I19-R120-10915-56824 |
---|---|
record_format |
dspace |
institution |
Universidad Nacional de La Plata |
institution_str |
I-19 |
repository_str |
R-120 |
collection |
SEDICI (UNLP) |
language |
Inglés |
topic |
Ciencias Informáticas mobile phone network socio-economic correlations |
spellingShingle |
Ciencias Informáticas mobile phone network socio-economic correlations Fixman, Martín Berenstein, Ariel Brea, Jorge Minnoni, Martín Sarraute, Carlos Inference of Socioeconomic Status in a Communication Graph |
topic_facet |
Ciencias Informáticas mobile phone network socio-economic correlations |
description |
In this work, we examine the socio-economic correlations present among users in a mobile phone network in Mexico. First, we find that the distribution of income for a subset of users –for which we have income information given by a large bank in Mexico– follows closely, but not exactly, the income distribution for the whole population of Mexico.
We also show the existence of a strong socio-economic homophily in the mobile phone network, where users linked in the network are more likely to have similar income. The main contribution of this work is that we leverage this homophily in order to propose a methodology, based on Bayesian statistics, to infer the socio-economic status for a large subset of users in the network (for which we have no banking information). With our proposed algorithm, we achieve an accuracy of 0.71 in a two-class classification problem (low and high income) which significantly outperforms a simpler method based on a frequentist approach. Finally, we extend the two-class classification problem to multiple classes by using the Dirichlet distribution. |
format |
Objeto de conferencia Objeto de conferencia |
author |
Fixman, Martín Berenstein, Ariel Brea, Jorge Minnoni, Martín Sarraute, Carlos |
author_facet |
Fixman, Martín Berenstein, Ariel Brea, Jorge Minnoni, Martín Sarraute, Carlos |
author_sort |
Fixman, Martín |
title |
Inference of Socioeconomic Status in a Communication Graph |
title_short |
Inference of Socioeconomic Status in a Communication Graph |
title_full |
Inference of Socioeconomic Status in a Communication Graph |
title_fullStr |
Inference of Socioeconomic Status in a Communication Graph |
title_full_unstemmed |
Inference of Socioeconomic Status in a Communication Graph |
title_sort |
inference of socioeconomic status in a communication graph |
publishDate |
2016 |
url |
http://sedici.unlp.edu.ar/handle/10915/56824 http://45jaiio.sadio.org.ar/sites/default/files/AGRANDA-09.pdf |
work_keys_str_mv |
AT fixmanmartin inferenceofsocioeconomicstatusinacommunicationgraph AT berensteinariel inferenceofsocioeconomicstatusinacommunicationgraph AT breajorge inferenceofsocioeconomicstatusinacommunicationgraph AT minnonimartin inferenceofsocioeconomicstatusinacommunicationgraph AT sarrautecarlos inferenceofsocioeconomicstatusinacommunicationgraph |
bdutipo_str |
Repositorios |
_version_ |
1764820477652500480 |