ASVspoof 5: Crowdsourced speech data, deepfakes, and adversarial attacks at scale

Wang, Xin ; Delgado, Hector ; Tak, Hemlata ; Jung, Jee-weon ; Shim, Hye-jin ; Todisco, Massimiliano ; Kukanov, Ivan ; Liu, Xuechen ; Sahidullah, Md ; Kinnunen, Tomi ; Evans, Nicholas ; Aik Lee, Kong ; Yamagishi, Junichi
Submitted to ArXiV, 16 August 2024

ASVspoof 5 is the fifth edition in a series of challenges that promote the study of speech spoofing and deepfake attacks, and the design of detection solutions. Compared to previous challenges, the ASVspoof 5 database is built from crowdsourced data collected from a vastly greater number of speakers in diverse acoustic conditions. Attacks, also crowdsourced, are generated and tested using surrogate detection models, while adversarial attacks are incorporated for the first time. New metrics support the evaluation of spoofing-robust automatic speaker verification (SASV) as well as stand-alone detection solutions, i.e., countermeasures without ASV. We describe the two challenge tracks, the new database, the evaluation metrics, baselines, and the evaluation platform, and present a summary of the results. Attacks significantly compromise the baseline systems, while submissions bring substantial improvements.


Type:
Conference
Date:
2024-08-16
Department:
Digital Security
Eurecom Ref:
7829
Copyright:
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in Submitted to ArXiV, 16 August 2024 and is available at :

PERMALINK : https://www.eurecom.fr/publication/7829