Ayuda
Ir al contenido

Dialnet


Corpus annotation of macro discourse structures

  • Autores: Marion Ho-Dac, Cécile Fabre, Marie-Paule Péry-Woodley, Josette Rebeyrolle
  • Localización: A survey of corpus-based research [Recurso electrónico] / Pascual Cantos Gómez (ed. lit.), Aquilino Sánchez Pérez (ed. lit.), 2009, ISBN 978-84-692-2198-3, págs. 894-905
  • Idioma: inglés
  • Enlaces
  • Resumen
    • We present our discourse annotation project, ANNODIS, which aims to make available a diversified French corpus annotated with discourse information, along with a set of tools for annotation and corpus exploitation. An original aspect of the project is that it combines two theoretically and methodologically different points of view on discourse: bottom-up and topdown.

      In the bottom-up perspective, basic constituents are identified and linked via discourse relations. In a complementary manner, the top-down approach starts from the text as a whole and focuses on the identification of configurations of cues signalling higher-level text segments, in an attempt to address the interplay of continuity and discontinuity within discourse. The focus of this paper is the annotation scheme used in the top-down approach, which revolves around enumerative structures. These structures, which are of particular interest to our project because of their ability to occur in nested configurations and at all levels of granularity (from within a sentence to across text sections), are the discourse object chosen to "bootstrap" our approach. We describe the different stages involved: corpus selection, pre-processing and "marking" techniques, and the specific interface facilities, designed to make it possible for coders to navigate and scan the text in order to identify relevant spans at different granularity levels.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno