Creating an MTT treebank of Spanish

Simon Mille; Vanesa Vidal; Alicia Burga; Leo Wanner

Authors: Simon Mille, Vanesa Vidal, Alicia Burga, Leo Wanner
Source: Proceedings [of the] Fourth International Conference on Meaning-Text Theory [Electronic resource] / David Beck (ed.), Kim Gerdes (ed.), Jasmina Milicevic (ed.), Alain Polguère (ed.), 2009, ISBN 978-2-9811149-0-7, pp. 287-297
Language: English
Abstract
We present a cost-effective strategy for creating a mid-size, fine-grained dependency treebank of surface- and deep-syntactic structures for Spanish, as defined in the Meaning-Text Theory. The strategy starts from a small seed dependency corpus, the AnCora corpus, whose annotation is considerably more coarse-grained than our target annotation. We show that this discrepancy can largely be bridged by automatic means that rely on contextual information, thus leaving minimal work to the annotators. This allows us to develop the resources with limited human effort within a limited period of time. We also propose a preliminary evaluation of the actual amount of work that the annotation process requires.
