default search action
Dhananjaya Gowda
Person information
- affiliation: Aalto University, Espoo, Finland
- affiliation: IIIT Hyderabad, India
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j11]Joshua Tian Jin Tee, Kang Zhang, Hee Suk Yoon, Dhananjaya N. Gowda, Chanwoo Kim, Chang D. Yoo:
Physics Informed Distillation for Diffusion Models. Trans. Mach. Learn. Res. 2024 (2024) - [c44]Abhinav Garg, Jiyeon Kim, Sushil Khyalia, Chanwoo Kim, Dhananjaya Gowda:
Data Driven Grapheme-to-Phoneme Representations for a Lexicon-Free Text-to-Speech. ICASSP 2024: 11091-11095 - [i16]Abhinav Garg, Jiyeon Kim, Sushil Khyalia, Chanwoo Kim, Dhananjaya Gowda:
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech. CoRR abs/2401.10465 (2024) - 2023
- [j10]Paavo Alku, Sudarsana Reddy Kadiri, Dhananjaya Gowda:
Refining a deep learning-based formant tracker using linear prediction methods. Comput. Speech Lang. 81: 101515 (2023) - [c43]Othman Istaiteh, Yasmeen Kussad, Yahya Daqour, Maria Habib, Mohammad Habash, Dhananjaya Gowda:
A Transformer-Based E2E SLU Model for Improved Semantic Parsing. ICASSP 2023: 1-2 - [c42]Mehul Kumar, Jiyeon Kim, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim:
Self-Supervised Accent Learning for Under-Resourced Accents Using Native Language Data. ICASSP 2023: 1-5 - [c41]Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda, SooHwan Eom, Daehyeok Kim, John B. Harvill, Heting Gao, Mark Hasegawa-Johnson, Chanwoo Kim, Chang D. Yoo:
Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction. INTERSPEECH 2023: 2028-2032 - [i15]Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda, SooHwan Eom, Daehyeok Kim, John B. Harvill, Heting Gao, Mark Hasegawa-Johnson, Chanwoo Kim, Chang D. Yoo:
Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction. CoRR abs/2308.08442 (2023) - [i14]Paavo Alku, Sudarsana Reddy Kadiri, Dhananjaya Gowda:
Refining a Deep Learning-based Formant Tracker using Linear Prediction Methods. CoRR abs/2308.09051 (2023) - [i13]Dhananjaya Gowda, Sudarsana Reddy Kadiri, Brad H. Story, Paavo Alku:
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals. CoRR abs/2308.16540 (2023) - [i12]Nagaraj Adiga, Jinhwan Park, Chintigari Shiva Kumar, Shatrughan Singh, Kyungmin Lee, Chanwoo Kim, Dhananjaya Gowda:
On the compression of shallow non-causal ASR models using knowledge distillation and tied-and-reduced decoder for low-latency on-device speech recognition. CoRR abs/2312.09842 (2023) - 2022
- [c40]Seongkyu Mun, Dhananjaya Gowda, Jihwan Lee, Changwoo Han, Dokyun Lee, Chanwoo Kim:
Prototypical speaker-interference loss for target voice separation using non-parallel audio samples. INTERSPEECH 2022: 276-280 - [c39]Jash Rathod, Nauman Dawalatabad, Shatrughan Singh, Dhananjaya Gowda:
Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition. INTERSPEECH 2022: 1691-1695 - [i11]Dhananjaya Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo Alku:
Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks. CoRR abs/2201.01525 (2022) - [i10]Nauman Dawalatabad, Tushar Vatsal, Ashutosh Gupta, Sungsoo Kim, Shatrughan Singh, Dhananjaya Gowda, Chanwoo Kim:
Two-Pass End-to-End ASR Model Compression. CoRR abs/2201.02741 (2022) - [i9]Jash Rathod, Nauman Dawalatabad, Shatrughan Singh, Dhananjaya Gowda:
Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition. CoRR abs/2210.00169 (2022) - 2021
- [j9]Dhananjaya N. Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo Alku:
Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks. IEEE Access 9: 151631-151640 (2021) - [j8]Jinfang Wang, Ke Lv, Chang Liu, Xinli Nie, Dhananjaya Gowda, Shuxin Luan:
Automatic Assessment for Severe Self-Reported Depressive Symptoms Using Speech Cues. IEEE Trans. Cogn. Dev. Syst. 13(4): 875-884 (2021) - [c38]Sachin Singh, Ashutosh Gupta, Aman Maghan, Dhananjaya Gowda, Shatrughan Singh, Chanwoo Kim:
Comparative Study of Different Tokenization Strategies for Streaming End-to-End ASR. ASRU 2021: 388-394 - [c37]Dhananjaya Gowda, Abhinav Garg, Jiyeon Kim, Mehul Kumar, Sachin Singh, Ashutosh Gupta, Ankur Kumar, Nauman Dawalatabad, Aman Maghan, Shatrughan Singh, Chanwoo Kim:
HiTNet: Byte-to-BPE Hierarchical Transcription Network for End-to-End Speech Recognition. ASRU 2021: 395-402 - [c36]Nauman Dawalatabad, Tushar Vatsal, Ashutosh Gupta, Sungsoo Kim, Shatrughan Singh, Dhananjaya Gowda, Chanwoo Kim:
Two-Pass End-to-End ASR Model Compression. ASRU 2021: 403-410 - [c35]Ashutosh Gupta, Aditya Jayasimha, Aman Maghan, Shatrughan Singh, Dhananjaya Gowda, Chanwoo Kim:
Voice to Action: Spoken Language Understanding for Memory-Constrained Systems. ASRU 2021: 473-479 - [c34]Jiyeon Kim, Mehul Kumar, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim:
Semi-Supervised Transfer Learning for Language Expansion of End-to-End Speech Recognition Models to Low-Resource Languages. ASRU 2021: 984-988 - [c33]Jiyeon Kim, Mehul Kumar, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim:
A Comparison of Streaming Models and Data Augmentation Methods for Robust Speech Recognition. ASRU 2021: 989-995 - [c32]Ashutosh Gupta, Ankur Kumar, Dhananjaya Gowda, Kwangyoun Kim, Sachin Singh, Shatrughan Singh, Chanwoo Kim:
Neural Utterance Confidence Measure for RNN-Transducers and Two Pass Models. ICASSP 2021: 6398-6402 - [c31]Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han:
Streaming End-to-End Speech Recognition with Jointly Trained Neural Feature Enhancement. ICASSP 2021: 6773-6777 - [i8]Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han:
Streaming end-to-end speech recognition with jointly trained neural feature enhancement. CoRR abs/2105.01254 (2021) - [i7]Jiyeon Kim, Mehul Kumar, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim:
A comparison of streaming models and data augmentation methods for robust speech recognition. CoRR abs/2111.10043 (2021) - [i6]Jiyeon Kim, Mehul Kumar, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim:
Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages. CoRR abs/2111.10047 (2021) - 2020
- [j7]Dhananjaya N. Gowda, Sudarsana Reddy Kadiri, Brad H. Story, Paavo Alku:
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1901-1914 (2020) - [c30]Chanwoo Kim, Dhananjaya Gowda, Dongsoo Lee, Jiyeon Kim, Ankur Kumar, Sungsoo Kim, Abhinav Garg, Changwoo Han:
A Review of On-Device Fully Neural End-to-End Automatic Speech Recognition Algorithms. ACSSC 2020: 277-283 - [c29]Abhinav Garg, Ashutosh Gupta, Dhananjaya Gowda, Shatrughan Singh, Chanwoo Kim:
Hierarchical Multi-Stage Word-to-Grapheme Named Entity Corrector for Automatic Speech Recognition. INTERSPEECH 2020: 1793-1797 - [c28]Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Hejung Yang, Abhinav Garg, Sachin Singh, Jiyeon Kim, Mehul Kumar, Sichen Jin, Shatrughan Singh, Chanwoo Kim:
Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition. INTERSPEECH 2020: 2827-2831 - [c27]Abhinav Garg, Gowtham P. Vadisetti, Dhananjaya Gowda, Sichen Jin, Aditya Jayasimha, Youngho Han, Jiyeon Kim, Junmo Park, Kwangyoun Kim, Sooyeon Kim, Young-Yoon Lee, Kyungbo Min, Chanwoo Kim:
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing. INTERSPEECH 2020: 3371-3375 - [c26]Ankur Kumar, Sachin Singh, Dhananjaya Gowda, Abhinav Garg, Shatrughan Singh, Chanwoo Kim:
Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios. INTERSPEECH 2020: 4357-4361 - [i5]Kwangyoun Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim:
Attention based on-device streaming speech recognition with large speech corpus. CoRR abs/2001.00577 (2020) - [i4]Chanwoo Kim, Dhananjaya Gowda, Dongsoo Lee, Jiyeon Kim, Ankur Kumar, Sungsoo Kim, Abhinav Garg, Changwoo Han:
A review of on-device fully neural end-to-end automatic speech recognition algorithms. CoRR abs/2012.07974 (2020)
2010 – 2019
- 2019
- [c25]Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Improved Multi-Stage Training of Online Attention-Based Encoder-Decoder Models. ASRU 2019: 70-77 - [c24]Chanwoo Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim:
End-to-End Training of a Large Vocabulary End-to-End Speech Recognition System. ASRU 2019: 562-569 - [c23]Kwangyoun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim:
Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus. ASRU 2019: 956-963 - [c22]Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda:
Power-Law Nonlinearity with Maximally Uniform Distribution Criterion for Improved Neural Network Training in Automatic Speech Recognition. ASRU 2019: 988-995 - [c21]Chanwoo Kim, Minkyu Shin, Abhinav Garg, Dhananjaya Gowda:
Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System. INTERSPEECH 2019: 739-743 - [c20]Dhananjaya Gowda, Abhinav Garg, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition. INTERSPEECH 2019: 2783-2787 - [i3]Chanwoo Kim, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda:
end-to-end training of a large vocabulary end-to-end speech recognition system. CoRR abs/1912.11040 (2019) - [i2]Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda:
power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition. CoRR abs/1912.11041 (2019) - [i1]Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim:
Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models. CoRR abs/1912.12384 (2019) - 2018
- [j6]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction. Speech Commun. 99: 62-79 (2018) - 2017
- [c19]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions. INTERSPEECH 2017: 1512-1516 - 2016
- [j5]Jinfang Wang, Yongqiang Shang, Shuangshuang Jiang, Dhananjaya N. Gowda, Ke Lv:
Whispered Speech Detection Using Fusion of Group-Delay-Based Subband Modulation Spectrum and Correntropy Features. IEEE Signal Process. Lett. 23(8): 1042-1046 (2016) - [c18]Dhananjaya N. Gowda, Manu Airaksinen, Paavo Alku:
Quasi closed phase analysis of speech signals using time varying weighted linear prediction for accurate formant tracking. ICASSP 2016: 4980-4984 - [c17]Dhananjaya N. Gowda, Paavo Alku:
Time-Varying Quasi-Closed-Phase Weighted Linear Prediction Analysis of Speech for Accurate Formant Detection and Tracking. INTERSPEECH 2016: 1760-1764 - 2015
- [c16]Dhananjaya N. Gowda, Rahim Saeidi, Paavo Alku:
AM-FM based filter bank analysis for estimation of spectro-temporal envelopes and its application for speaker recognition in noisy reverberant environments. INTERSPEECH 2015: 1166-1170 - [c15]Rizwan Ishaq, Dhananjaya N. Gowda, Paavo Alku, Begonya Garcia-Zapirain:
Vowel Enhancement in Early Stage Spanish Esophageal Speech Using Natural Glottal Flow Pulse and Vocal Tract Frequency Warping. SLPAT@Interspeech 2015: 55-59 - 2014
- [c14]Antti Suni, Tuomo Raitio, Dhananjaya Gowda, Reima Karhila, Matthew Gibson, Oliver Watts:
The Simple4All entry to the Blizzard Challenge 2014. Blizzard Challenge 2014 - [c13]Dhananjaya N. Gowda, Heikki Kallasjoki, Reima Karhila, Cristian Contan, Kalle J. Palomäki, Mircea Giurgiu, Mikko Kurimo:
On the role of missing data imputation and NMF feature enhancement in building synthetic voices using reverberant speech. INTERSPEECH 2014: 2947-2951 - 2013
- [j4]Chetana Prakash, Dhananjaya N. Gowda, Suryakanth V. Gangashetty:
Analysis of Acoustic Events in Speech Signals Using Bessel Series Expansion. Circuits Syst. Signal Process. 32(6): 2915-2938 (2013) - [j3]Bayya Yegnanarayana, Dhananjaya N. Gowda:
Spectro-temporal analysis of speech signals using zero-time windowing and group delay function. Speech Commun. 55(6): 782-795 (2013) - [c12]Dhananjaya N. Gowda, Jouni Pohjalainen, Mikko Kurimo, Paavo Alku:
Robust formant detection using group delay function and stabilized weighted linear prediction. INTERSPEECH 2013: 49-53 - [c11]Dhananjaya N. Gowda, Mikko Kurimo:
Analysis of breathy, modal and pressed phonation based on low frequency spectral density. INTERSPEECH 2013: 3206-3210 - [c10]Dhananjaya N. Gowda, Jouni Pohjalainen, Paavo Alku, Mikko Kurimo:
Robust spectral representation using group delay function and stabilized weighted linear prediction for additive noise degradations. SpeD 2013: 1-7 - 2012
- [c9]Vinay Kumar Mittal, N. Dhananjaya, Bayya Yegnanarayana:
Effect of Tongue Tip Trilling on the Glottal Excitation Source. INTERSPEECH 2012: 1596-1599 - 2011
- [c8]N. Dhananjaya, B. Yegnanarayana, Suryakanth V. Gangashetty:
Acoustic-phonetic information from excitation source for refining manner hypotheses of a phone recognizer. ICASSP 2011: 5252-5255 - [c7]Bayya Yegnanarayana, Anand Joseph Xavier Medabalimi, Suryakanth V. Gangashetty, N. Dhananjaya:
Decomposition of speech signals for analysis of aperiodic components of excitation. ICASSP 2011: 5396-5399 - [c6]Chetana Prakash, N. Dhananjaya, Suryakanth V. Gangashetty:
Exploring Bessel Features for Detection of Glottal Closure Instants. INTERSPEECH 2011: 1985-1988 - 2010
- [j2]N. Dhananjaya, B. Yegnanarayana:
Voiced/Nonvoiced Detection Based on Robustness of Voiced Epochs. IEEE Signal Process. Lett. 17(3): 273-276 (2010)
2000 – 2009
- 2008
- [j1]N. Dhananjaya, B. Yegnanarayana:
Speaker change detection in casual conversations using excitation source features. Speech Commun. 50(2): 153-161 (2008) - [c5]C. Krishna Mohan, N. Dhananjaya, B. Yegnanarayana:
Video Shot Segmentation Using Late Fusion Technique. ICMLA 2008: 267-270 - [c4]N. Dhananjaya, S. Rajendran, B. Yegnanarayana:
Features for automatic detection of voice bars in continuous speech. INTERSPEECH 2008: 1321-1324 - [c3]B. Yegnanarayana, S. Rajendran, Hussien Seid Worku, N. Dhananjaya:
Analysis of glottal stops in speech signals. INTERSPEECH 2008: 1481-1484 - 2006
- [c2]N. Dhananjaya, B. Yegnanarayana:
Correlation-Based Similarity Between Signals for Speaker Verification with Limited Amount of Speech Data. MRCS 2006: 17-25 - 2004
- [c1]N. Dhananjaya, S. Guruprasad, B. Yegnanarayana:
Speaker Segmentation Based on Subsegmental Features and Neural Network Models. ICONIP 2004: 1210-1215
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-28 01:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint