Open Windows Speech Recognition Tutorial

Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC

Abstract: Multi-talker speech recognition (MTASR) faces unique challenges in disentangling and transcribing overlapping speech. To address these challenges, this paper investigates the role of ...

Tech Critter

NVIDIA’s Audio2Face technology moves to open source space for industry-wide face-accurate speech on AI avatars

NVIDIA has open sourced its Audio2Face technology, making lifelike AI avatars more accessible to developers, researchers, and ...

IEEE

Audio Steganography Based Backdoor Attack for Speech Recognition Software

Abstract: With the growing prevalence of deep learning in the speech area, speech recognition, voice control, and related applications have become integral parts of people's lives. However, the rise ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC

NVIDIA’s Audio2Face technology moves to open source space for industry-wide face-accurate speech on AI avatars

Audio Steganography Based Backdoor Attack for Speech Recognition Software

Trending now