Ayuda
Ir al contenido

Dialnet


Structure and usage of the Tartu University Corpus of written Estonian

    1. [1] University of Tartu

      University of Tartu

      Tartu linn, Estonia

  • Localización: International journal of corpus linguistics, ISSN-e 1569-9811, ISSN 1384-6655, Vol. 3, Nº 2, 1998, págs. 279-304
  • Idioma: inglés
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • This paper provides an overview of the first computer corpus of the Estonian language compiled at the University of Tartu. It was based on the design principles of the LOB and Brown corpora. The main part of the corpus was assembled from 1991-1995 and contains about 1 million textual words. It was compiled by an interdepartmental computational linguistics research group of the university. This paper gives a survey of the text groups in the corpus and of the problems the compilers had to solve together with the proposed solutions and outlines the main differences from the model corpora and the underlying reasons for them. These are followed by a review of the available computer routines for processing the corpus.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno