Forensic Phonetics: definition and main areas of application
Eugenia San Segundo Fernández (UNED), https://orcid.org/0000-0002-0127-552X
Abstract
This work critically reviews the field of applied linguistics known as Forensic Phonetics. The very name of the discipline is subject to terminological controversy, concerning not only how to refer to this branch of knowledge but also what its main areas of application are and how they should be named. Drawing on a detailed literature review, we describe the five major fields of Forensic Phonetics, with an emphasis on dispelling possible misconceptions about the scope of the discipline. We also present the results of the most recent research, especially in forensic speaker comparison, the best-known task in Forensic Phonetics. For this reason, more space is devoted to this sub-area, focusing on current methodological approaches and on the most commonly used phonetic parameters.
License
To support the global exchange of knowledge, the journal Círculo de Lingüística Aplicada a la Comunicación allows unrestricted access to its content from the moment of publication in this electronic edition, and is therefore an open-access journal. The originals published in this journal are the property of the Complutense University of Madrid, and any reproduction of them, in full or in part, must cite the source. All content is distributed under a Creative Commons Attribution 4.0 use and distribution licence (CC BY 4.0). This circumstance must be expressly stated in these terms where necessary. The summary and the complete legal text of the licence are available for consultation.