On the Assessment of Information Quality in Spanish Wikipedia
Featured Articles (FA) are considered to be the best articles that Wikipedia has to offer and in the last years, researchers have found interesting to analyze whether and how they can be distinguished from “ordinary” articles. Likewise, identifying what issues have to be enhanced or fixed in ordinar...
Autores principales: | , , , , , |
---|---|
Formato: | Objeto de conferencia |
Lenguaje: | Inglés |
Publicado: |
2016
|
Materias: | |
Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/56750 |
Aporte de: |
id |
I19-R120-10915-56750 |
---|---|
record_format |
dspace |
institution |
Universidad Nacional de La Plata |
institution_str |
I-19 |
repository_str |
R-120 |
collection |
SEDICI (UNLP) |
language |
Inglés |
topic |
Ciencias Informáticas Featured Articles (FA) Wikipedia Quality Flaws Prediction |
spellingShingle |
Ciencias Informáticas Featured Articles (FA) Wikipedia Quality Flaws Prediction Urquiza, Guido Soria, Matías Pérez Casseignau, Sebastián Ferretti, Edgardo Gómez, Sergio Alejandro Errecalde, Marcelo Luis On the Assessment of Information Quality in Spanish Wikipedia |
topic_facet |
Ciencias Informáticas Featured Articles (FA) Wikipedia Quality Flaws Prediction |
description |
Featured Articles (FA) are considered to be the best articles that Wikipedia has to offer and in the last years, researchers have found interesting to analyze whether and how they can be distinguished from “ordinary” articles. Likewise, identifying what issues have to be enhanced or fixed in ordinary articles in order to improve their quality is a recent key research trend. Most of the approaches developed in these research trends have been proposed for the English Wikipedia. However, few efforts have been accomplished in Spanish Wikipedia, despite being Spanish, one of the most spoken languages in the world by native speakers.
In this respect, we present a first breakdown of Spanish Wikipedia’s quality flaw structure. Besides, we carry out a study to automatically assess information quality in Spanish Wikipedia, where FA identification is evaluated as a binary classification task. The results obtained show that FA identification can be performed with an F1 score of 0.81, using a document model consisting of only twenty six features and AdaBoosted C4.5 decision trees as classification algorithm. |
format |
Objeto de conferencia Objeto de conferencia |
author |
Urquiza, Guido Soria, Matías Pérez Casseignau, Sebastián Ferretti, Edgardo Gómez, Sergio Alejandro Errecalde, Marcelo Luis |
author_facet |
Urquiza, Guido Soria, Matías Pérez Casseignau, Sebastián Ferretti, Edgardo Gómez, Sergio Alejandro Errecalde, Marcelo Luis |
author_sort |
Urquiza, Guido |
title |
On the Assessment of Information Quality in Spanish Wikipedia |
title_short |
On the Assessment of Information Quality in Spanish Wikipedia |
title_full |
On the Assessment of Information Quality in Spanish Wikipedia |
title_fullStr |
On the Assessment of Information Quality in Spanish Wikipedia |
title_full_unstemmed |
On the Assessment of Information Quality in Spanish Wikipedia |
title_sort |
on the assessment of information quality in spanish wikipedia |
publishDate |
2016 |
url |
http://sedici.unlp.edu.ar/handle/10915/56750 |
work_keys_str_mv |
AT urquizaguido ontheassessmentofinformationqualityinspanishwikipedia AT soriamatias ontheassessmentofinformationqualityinspanishwikipedia AT perezcasseignausebastian ontheassessmentofinformationqualityinspanishwikipedia AT ferrettiedgardo ontheassessmentofinformationqualityinspanishwikipedia AT gomezsergioalejandro ontheassessmentofinformationqualityinspanishwikipedia AT errecaldemarceloluis ontheassessmentofinformationqualityinspanishwikipedia |
bdutipo_str |
Repositorios |
_version_ |
1764820477558128642 |