On the Assessment of Information Quality in Spanish Wikipedia


Bibliographic Details
Main authors: Urquiza, Guido; Soria, Matías; Pérez Casseignau, Sebastián; Ferretti, Edgardo; Gómez, Sergio Alejandro; Errecalde, Marcelo Luis
Format: Conference object
Language: English
Published: 2016
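Subjects: Computer Science; Featured Articles (FA); Wikipedia; Quality Flaws Prediction
Online access: http://sedici.unlp.edu.ar/handle/10915/56750
Contributed by: SEDICI (UNLP), Universidad Nacional de La Plata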
id I19-R120-10915-56750
record_format dspace
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language English
topic Computer Science
Featured Articles (FA)
Wikipedia
Quality Flaws Prediction
description Featured Articles (FA) are considered the best articles that Wikipedia has to offer, and in recent years researchers have found it interesting to analyze whether and how they can be distinguished from “ordinary” articles. Likewise, identifying which issues must be fixed or improved in ordinary articles to raise their quality is a recent key research trend. Most of the approaches developed along these lines have been proposed for the English Wikipedia. However, little work has been done on Spanish Wikipedia, even though Spanish is one of the most widely spoken languages in the world by number of native speakers. In this respect, we present a first breakdown of Spanish Wikipedia’s quality flaw structure. In addition, we carry out a study to automatically assess information quality in Spanish Wikipedia, in which FA identification is evaluated as a binary classification task. The results show that FA identification can be performed with an F1 score of 0.81, using a document model consisting of only twenty-six features and AdaBoosted C4.5 decision trees as the classification algorithm.
format Conference object
author Urquiza, Guido
Soria, Matías
Pérez Casseignau, Sebastián
Ferretti, Edgardo
Gómez, Sergio Alejandro
Errecalde, Marcelo Luis
title On the Assessment of Information Quality in Spanish Wikipedia
publishDate 2016
url http://sedici.unlp.edu.ar/handle/10915/56750
bdutipo_str Repositories
_version_ 1764820477558128642