SPEECHFAKES: Generalized Voice Anti-Spoofing and Voice Biometrics
Funders
Main funder
Academy of Finland research project
Artificial speech can nowadays be flexibly generated with speech synthesis and there are methods to alter speaker identity. Quality of artificial speech is no longer robotic but has reached the limit where a listener may no longer hear difference of real and artificial speech. Artificial and modified speech are known to deteriorate the performance of automatic speaker verification (ASV) in the form of spoofing attacks, and in the future we may face with new forms of “speech deepfakes”. Speech anti-spoofing is the task of computer-based differentiation of human and artificial speech from audio waveforms. SPEECHFAKES addresses anti-spoofing with a special focus on improving generality across datasets and attacks to improve their fault tolerance and explainability. Part of the project is implemented in collaboration with international collaborators. The project contributes new machine learning based detectors and contributes to public-domain datasets to promote further research.
News
-
A paper accepted to IEEE T-PAMI
We're proud to announce a journal paper accepted to IEEE T-PAMI, one of the high-impact journals in the field of machine learning! T. Kinnunen, K.A.…
Cooperation
Publications
6 items-
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild
Liu, Xuechen; Wang, Xin; Sahidullah, Md; Patino, Jose; Delgado, Hector; Kinnunen, Tomi; Todisco, Massimiliano; Yamagishi, Junichi; Evans, Nicholas; Nautsch, Andreas; Lee, Kong Aik. 2023. IEEE/ACM transactions on audio, speech, and language processing. 31: 2507-2522 A1 Journal article (refereed), original research -
How to Construct Perfect and Worse-than-Coin-Flip Spoofing Countermeasures: A Word of Warning on Shortcut Learning
Shim, Hye-jin; Gonzalez Hautamäki, Rosa; Sahidullah, Md; Kinnunen, Tomi. Teoksessa: (toim.) , 2023. Proceedings of Interspeech 2023. s. 785-789. International Speech Communication Association (ISCA) A4 Conference proceedings -
Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing
Shim, Hye-jin; Jung, Jee-weon; Kinnunen, Tomi. Teoksessa: (toim.) , 2023. Proceedings of Interspeech 2023. s. 3804-3808. International Speech Communication Association (ISCA) A4 Conference proceedings -
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech
Singh, Vishwanath Pratap; Sahidullah, Md; Kinnunen, Tomi. Teoksessa: (toim.) , 2023. Proceedings of Interspeech 2023. s. 1948-1952. International Speech Communication Association (ISCA) A4 Conference proceedings -
Speaker-Aware Anti-spoofing
Liu, Xuechen; Sahidullah, Md; Lee, Kong Aik; Kinnunen, Tomi. Teoksessa: (toim.) , 2023. Proceedings of Interspeech 2023. s. 2498-2502. International Speech Communication Association (ISCA) A4 Conference proceedings -
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings
Mun, Sung Hwan; Shim, Hye-jin; Tak, Hemlata; Wang, Xin; Liu, Xuechen; Sahidullah, Md; Jeong, Myeonghun; Han, Min Hyun; Todisco, Massimiliano; Lee, Kong Aik; Yamagishi, Junichi; Evans, Nicholas; Kinnunen, Tomi; Kim, Nam Soo; Jung, Jee-weon. Teoksessa: (toim.) , 2023. Proceedings of Interspeech 2023. s. 3989-3993. International Speech Communication Association (ISCA) A4 Conference proceedings