ASVspoof 5: Crowdsourced speech data, deepfakes, and adversarial attacks at scale

Wang, Xin ; Delgado, Hector ; Tak, Hemlata ; Jung, Jee-weon ; Shim, Hye-jin ; Todisco, Massimiliano ; Kukanov, Ivan ; Liu, Xuechen ; Sahidullah, Md ; Kinnunen, Tomi ; Evans, Nicholas ; Aik Lee, Kong ; Yamagishi, Junichi

Submitted to ArXiV, 16 August 2024

ASVspoof 5 is the fifth edition in a series of challenges that promote the study of speech spoofing and deepfake attacks, and the design of detection solutions. Compared to previous challenges, the ASVspoof 5 database is built from crowdsourced data collected from a vastly greater number of speakers in diverse acoustic conditions. Attacks, also crowdsourced, are generated and tested using surrogate detection models, while adversarial attacks are incorporated for the first time. New metrics support the evaluation of spoofing-robust automatic speaker verification (SASV) as well as stand-alone detection solutions, i.e., countermeasures without ASV. We describe the two challenge tracks, the new database, the evaluation metrics, baselines, and the evaluation platform, and present a summary of the results. Attacks significantly compromise the baseline systems, while submissions bring substantial improvements.

Detail

ARXIV

BIBTEX

Type:

Conference

Date:

2024-08-16

Department:

Digital Security

Eurecom Ref:

7829