Submitted by Igor Leturia on 09/30/2010 - 11:03
| Title | Errores ortográficos y de competencia en textos de la web en euskera |
| Publication Type | Journal Article |
| Year of Publication | 2010 |
| Authors | Alegria, Iñaki, Izaskun Etxeberria, and Igor Leturia |
| Journal | Revista de la Asociación Española para el Procesamiento del Lenguaje Natural |
| Volume | 45 |
| ISSN | 1135-5948 |
| Abstract | The objective of the work presented in this paper is to estimate the quality of corpora retrieved from the Basque Web. The methodology i followed is similar to that used for English and Germany by Ringlstetter et al. (2006). The main difference lies in the fact that we reuse spelling checkers for detecting errors. We think that by this way we obtain a higher error coverage and that the method can be applied to other languages with practically no manual work provided such tools are available for them. The results |
| URL | http://www.elhuyar.org/hizkuntza-zerbitzuak/informazioa/corpus-tresnak/Errores_web.pdf |


