Automatic closed captions and subtitles in academic video presentations: possibilities and shortcomings

Keywords: Automatic closed captions, subtitles, machine translation, Quality Assessment, academic presentations, Auto-generated captions
Agencies: The research presented in this study has been (partially) carried out in the framework of research project

Abstract

In light of the increasing number of academic events being recorded or held online since the onset of the COVID-19 pandemic, the present work combines automation processes in audiovisual translation and academic texts–more specifically, video presentations. The research questions are whether the automatic generation of captions is functional to ensure accessibility in academic events and how much post-editing effort would such content require in case a machine translation of the subtitles is to be applied. The research method comprises several phases. First, in a corpus of video presentations of specialised content in English, captions were generated automatically using YouTube Studio to ascertain the general quality and the type of errors generated in the automatically generated closed captions according to Multidimensional Quality Metrics (MQM) framework. These auto-generated captions were corrected and annotated by considering the following parameters: a) pre-editing time, b) type of error according to MQM framework, and c) severity of the error. Second, the auto-generated captions and corrected were machine translated into Spanish. Furthermore, errors detected in the machine translation of the subtitles (English-Spanish) were post-edited and errors were analysed following the MQM. Reception by a potential audience was also studied, as evaluated by academics from the same field of expertise. The main conclusion is that most errors in machine-translated subtitles stem from incorrect caption segmentation and lack of context awareness, making it essential to correct the closed captions before translation. This thesis is supported by the reception study in which the level of comprehension was higher when the transcription was pre-edited, as most of the problems arise from the closed captions rather than from the translation itself.

 

Downloads

Download data is not yet available.
View citations

Article download

Crossmark

Metrics

Published
2024-10-01
How to Cite
Veroz-González M. A. y Castillo Bernal P. (2024). Automatic closed captions and subtitles in academic video presentations: possibilities and shortcomings . Complutense Journal of English Studies, 32, e94649. https://doi.org/10.5209/cjes.94649
Section
Articles