Ayuda
Ir al contenido

Dialnet


Automatic transcription of the Polish newsreel

    1. [1] Polish-Japanese Academy of Information Technology

      Polish-Japanese Academy of Information Technology

      Warszawa, Polonia

  • Localización: Poznan Studies in Contemporary Linguistics, ISSN 1732-0747, ISSN-e 1897-7499, Vol. 55, Nº. 2, 2019 (Ejemplar dedicado a: Current state of the art in language technology for polish), págs. 183-209
  • Idioma: inglés
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • This paper describes an automatic transcription system for the Polish Newsreel, which is a collection of mid to late 20th century news segments presented in audio and video form. They are characterized by their use of archaic language and poor audio quality, which makes them a demanding problem for speech recognition systems. Acoustic and language models had to be retrained using data from in-domain corpora. During the adaptation of the models, experiments were carried out to select optimal adaptation parameters. The experiments showed that the adaptation of the speech recognition system to a narrow and clearly defined domain significantly increases its efficiency. The final word error rate obtained for this domain was 10.97%


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno