Methodology and steps towards the construction of EPEC, a corpus of written Basque tagged at morphological and syntactic levels for the automatic processing

Abstract : This article describes the different steps in the construction of EPEC (Reference Corpus for the Processing of Basque). EPEC is a corpus of standard written Basque that has been manually tagged at different levels (morphology, surface syntax, phrases) and is currently being hand tagged at deep syntax level following the Dependency Structure-based Scheme. It is aimed to be a "reference" corpus for the development and improvement of several NLP tools for Basque. This corpus has already been used for the construction of some tools such as a morphological analyser, a lemmatiser, or a shallow syntactic analyser.
Type de document :
Chapitre d'ouvrage
56, Rodopi. Book series: Language and Computers., pp.1-15, 2006
Liste complète des métadonnées

Littérature citée [20 références]  Voir  Masquer  Télécharger

https://artxiker.ccsd.cnrs.fr/artxibo-00080508
Contributeur : Izaskun Aldezabal <>
Soumis le : jeudi 22 juin 2006 - 13:19:14
Dernière modification le : jeudi 22 juin 2006 - 13:58:44
Document(s) archivé(s) le : lundi 20 septembre 2010 - 16:04:56

Fichier

Identifiants

  • HAL Id : artxibo-00080508, version 2

Collections

Citation

I. Aduriz, M. Aranzabe, J. Arriola, A. Atutxa, A. Diaz-De-Ilarraza, et al.. Methodology and steps towards the construction of EPEC, a corpus of written Basque tagged at morphological and syntactic levels for the automatic processing. 56, Rodopi. Book series: Language and Computers., pp.1-15, 2006. 〈artxibo-00080508v2〉

Partager

Métriques

Consultations de la notice

692

Téléchargements de fichiers

599