default search action
Computer Speech & Language, Volume 86
Volume 86, 2024
- Geoffroy Vanderreydt, Kris Demuynck:
A novel channel estimate for noise robust speech recognition. 101598 - Souvik Sinha, Spandan Dey, Goutam Saha:
Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion. 101599 - Francesca Alloatti, Francesca Grasso, Roger Ferrod, Giovanni Siragusa, Luigi Di Caro, Federica Cena:
A tag-based methodology for the detection of user repair strategies in task-oriented conversational agents. 101603 - Asalah Thiab, Luay Alawneh, Mohammad Al-Smadi:
Contextual emotion detection using ensemble deep learning. 101604 - Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan:
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement. 101605 - Long Dai, Jiarong Mao, Liaoran Xu, Xuefeng Fan, Xiaoyi Zhou:
SecNLP: An NLP classification model watermarking framework based on multi-task learning. 101606 - Asma Mekki, Inès Zribi, Mariem Ellouze, Lamia Hadrich Belguith:
TTK: A toolkit for Tunisian linguistic analysis. 101617 - Yihao Li, Meng Sun, Xiongwei Zhang, Hugo Van hamme:
Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement. 101618 - Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi:
Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances. 101619 - Yi Zhu, Tiago H. Falk:
Spectral-temporal saliency masks and modulation tensorgrams for generalizable COVID-19 detection. 101620 - B. M. Mala, Smita Sandeep Darandale:
Effective infant cry signal analysis and reasoning using IARO based leaky Bi-LSTM model. 101621 - Titouan Parcollet, Ha Nguyen, Solène Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia A. Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Estève, Mickael Rouvier, Jérôme Goulian, Benjamin Lecouteux, François Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier:
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech. 101622 - Bowen Jiang, Qianhui Dong, Guojin Liu:
A method of phonemic annotation for Chinese dialects based on a deep learning model with adaptive temporal attention and a feature disentangling structure. 101624 - Sania Gul, Muhammad Salman Khan, Muhammad Fazeel:
Single-channel speech enhancement using colored spectrograms. 101626
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.