Hagen Soltau

2020 – today

2024
[c60]
  - https://dblp.org/rec/conf/icassp/WangSS00YS24
Mingqiu Wang, Izhak Shafran, Hagen Soltau, Wei Han, Yuan Cao, Dian Yu, Laurent El Shafey:
Retrieval Augmented End-to-End Spoken Dialog Models. ICASSP 2024: 12056-12060
[i18]
  - https://dblp.org/rec/journals/corr/abs-2402-01828
Mingqiu Wang, Izhak Shafran, Hagen Soltau, Wei Han, Yuan Cao, Dian Yu, Laurent El Shafey:
Retrieval Augmented End-to-End Spoken Dialog Models. CoRR abs/2402.01828 (2024)
[i17]
  - https://dblp.org/rec/journals/corr/abs-2405-13640
Ying Ma, Owen Burns, Mingqiu Wang, Gang Li, Nan Du, Laurent El Shafey, Liqiang Wang, Izhak Shafran, Hagen Soltau:
Knowledge Graph Reasoning with Self-supervised Reinforcement Learning. CoRR abs/2405.13640 (2024)
2023
[c59]
  - https://dblp.org/rec/conf/asru/SoltauSODUBSWJB23
Hagen Soltau, Izhak Shafran, Alex Ottenwess, Joseph R. Duffy, Rene L. Utianski, Leland R. Barnard, John L. Stricker, Daniela A. Wiepert, David T. Jones, Hugo Botha:
Detecting Speech Abnormalities With a Perceiver-Based Sequence Classifier that Leverages a Universal Speech Model. ASRU 2023: 1-7
[c58]
  - https://dblp.org/rec/conf/asru/WangHSWCCCZSRZYPSSW23
Mingqiu Wang, Wei Han, Izhak Shafran, Zelin Wu, Chung-Cheng Chiu, Yuan Cao, Nanxin Chen, Yu Zhang, Hagen Soltau, Paul K. Rubenstein, Lukas Zilka, Dian Yu, Golan Pundak, Nikhil Siddhartha, Johan Schalkwyk, Yonghui Wu:
SLM: Bridge the Thin Gap Between Speech and Text Foundation Models. ASRU 2023: 1-8
[c57]
  - https://dblp.org/rec/conf/emnlp/Zhao0G0RWSSW23
Jeffrey Zhao, Yuan Cao, Raghav Gupta, Harrison Lee, Abhinav Rastogi, Mingqiu Wang, Hagen Soltau, Izhak Shafran, Yonghui Wu:
AnyTOD: A Programmable Task-Oriented Dialog System. EMNLP 2023: 16189-16204
[c56]
  - https://dblp.org/rec/conf/interspeech/SoltauSWRZJ00M23
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Abhinav Rastogi, Jeffrey Zhao, Ye Jia, Wei Han, Yuan Cao, Aramys Miranda:
Speech Aware Dialog System Technology Challenge (DSTC11). INTERSPEECH 2023: 4668-4672
[i16]
  - https://dblp.org/rec/journals/corr/abs-2303-01037
Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023)
[i15]
  - https://dblp.org/rec/journals/corr/abs-2306-07944
Mingqiu Wang, Izhak Shafran, Hagen Soltau, Wei Han, Yuan Cao, Dian Yu, Laurent El Shafey:
Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding. CoRR abs/2306.07944 (2023)
[i14]
  - https://dblp.org/rec/journals/corr/abs-2306-08131
Nanxin Chen, Izhak Shafran, Yu Zhang, Chung-Cheng Chiu, Hagen Soltau, James Qin, Yonghui Wu:
Efficient Adapters for Giant Speech Models. CoRR abs/2306.08131 (2023)
[i13]
  - https://dblp.org/rec/journals/corr/abs-2310-00230
Mingqiu Wang, Wei Han, Izhak Shafran, Zelin Wu, Chung-Cheng Chiu, Yuan Cao, Yongqiang Wang, Nanxin Chen, Yu Zhang, Hagen Soltau, Paul K. Rubenstein, Lukas Zilka, Dian Yu, Zhong Meng, Golan Pundak, Nikhil Siddhartha, Johan Schalkwyk, Yonghui Wu:
SLM: Bridge the thin gap between speech and text foundation models. CoRR abs/2310.00230 (2023)
[i12]
  - https://dblp.org/rec/journals/corr/abs-2310-13010
Hagen Soltau, Izhak Shafran, Alex Ottenwess, Joseph R. Duffy, Rene L. Utianski, Leland R. Barnard, John L. Stricker, Daniela A. Wiepert, David T. Jones, Hugo Botha:
Detecting Speech Abnormalities with a Perceiver-based Sequence Classifier that Leverages a Universal Speech Model. CoRR abs/2310.13010 (2023)
2022
[c55]
  - https://dblp.org/rec/conf/emnlp/YuW0SSS22
Dian Yu, Mingqiu Wang, Yuan Cao, Laurent El Shafey, Izhak Shafran, Hagen Soltau:
Knowledge-grounded Dialog State Tracking. EMNLP (Findings) 2022: 3428-3435
[c54]
  - https://dblp.org/rec/conf/interspeech/SoltauSWS22
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Laurent El Shafey:
RNN Transducers for Named Entity Recognition with constraints on alignment for understanding medical conversations. INTERSPEECH 2022: 1901-1905
[c53]
  - https://dblp.org/rec/conf/naacl/YuW0SSS22
Dian Yu, Mingqiu Wang, Yuan Cao, Izhak Shafran, Laurent El Shafey, Hagen Soltau:
Unsupervised Slot Schema Induction for Task-oriented Dialog. NAACL-HLT 2022: 1174-1193
[i11]
  - https://dblp.org/rec/journals/corr/abs-2203-03543
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Laurent El Shafey:
RNN Transducers for Nested Named Entity Recognition with constraints on alignment for long sequences. CoRR abs/2203.03543 (2022)
[i10]
  - https://dblp.org/rec/journals/corr/abs-2205-04515
Dian Yu, Mingqiu Wang, Yuan Cao, Izhak Shafran, Laurent El Shafey, Hagen Soltau:
Unsupervised Slot Schema Induction for Task-oriented Dialog. CoRR abs/2205.04515 (2022)
[i9]
  - https://dblp.org/rec/journals/corr/abs-2210-06656
Dian Yu, Mingqiu Wang, Yuan Cao, Izhak Shafran, Laurent El Shafey, Hagen Soltau:
Knowledge-grounded Dialog State Tracking. CoRR abs/2210.06656 (2022)
[i8]
  - https://dblp.org/rec/journals/corr/abs-2212-08704
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Abhinav Rastogi, Jeffrey Zhao, Ye Jia, Wei Han, Yuan Cao, Aramys Miranda:
Speech Aware Dialog System Technology Challenge (DSTC11). CoRR abs/2212.08704 (2022)
[i7]
  - https://dblp.org/rec/journals/corr/abs-2212-09939
Jeffrey Zhao, Yuan Cao, Raghav Gupta, Harrison Lee, Abhinav Rastogi, Mingqiu Wang, Hagen Soltau, Izhak Shafran, Yonghui Wu:
AnyTOD: A Programmable Task-Oriented Dialog System. CoRR abs/2212.09939 (2022)
2021
[c52]
  - https://dblp.org/rec/conf/asru/WangSSS21
Mingqiu Wang, Hagen Soltau, Laurent El Shafey, Izhak Shafran:
Word-Level Confidence Estimation for RNN Transducers. ASRU 2021: 1170-1177
[c51]
  - https://dblp.org/rec/conf/interspeech/SoltauWSS21
Hagen Soltau, Mingqiu Wang, Izhak Shafran, Laurent El Shafey:
Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction. Interspeech 2021: 4418-4422
[i6]
  - https://dblp.org/rec/journals/corr/abs-2104-02219
Hagen Soltau, Mingqiu Wang, Izhak Shafran, Laurent El Shafey:
Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction. CoRR abs/2104.02219 (2021)
[i5]
  - https://dblp.org/rec/journals/corr/abs-2110-15222
Mingqiu Wang, Hagen Soltau, Laurent El Shafey, Izhak Shafran:
Word-level confidence estimation for RNN transducers. CoRR abs/2110.15222 (2021)
2020
[c50]
  - https://dblp.org/rec/conf/lrec/ShafranDTPKKDHC20
Izhak Shafran, Nan Du, Linh Tran, Amanda Perry, Lauren Keyes, Mark Knichel, Ashley Domin, Lei Huang, Yuhui Chen, Gang Li, Mingqiu Wang, Laurent El Shafey, Hagen Soltau, Justin S. Paul:
The Medical Scribe: Corpus Development and Model Performance Analyses. LREC 2020: 2036-2044
[i4]
  - https://dblp.org/rec/journals/corr/abs-2003-11531
Izhak Shafran, Nan Du, Linh Tran, Amanda Perry, Lauren Keyes, Mark Knichel, Ashley Domin, Lei Huang, Yuhui Chen, Gang Li, Mingqiu Wang, Laurent El Shafey, Hagen Soltau, Justin S. Paul:
The Medical Scribe: Corpus Development and Model Performance Analyses. CoRR abs/2003.11531 (2020)

2010 – 2019

2019
[c49]
  - https://dblp.org/rec/conf/asru/TripathiLSS19
Anshuman Tripathi, Han Lu, Hasim Sak, Hagen Soltau:
Monotonic Recurrent Neural Network Transducer and Decoding Strategies. ASRU 2019: 944-948
[c48]
  - https://dblp.org/rec/conf/interspeech/ShafeySS19
Laurent El Shafey, Hagen Soltau, Izhak Shafran:
Joint Speech Recognition and Speaker Diarization via Sequence Transduction. INTERSPEECH 2019: 396-400
[i3]
  - https://dblp.org/rec/journals/corr/abs-1907-05337
Laurent El Shafey, Hagen Soltau, Izhak Shafran:
Joint Speech Recognition and Speaker Diarization via Sequence Transduction. CoRR abs/1907.05337 (2019)
2017
[c47]
  - https://dblp.org/rec/conf/asru/SoltauLS17
Hagen Soltau, Hank Liao, Hasim Sak:
Reducing the computational complexity for whole word models. ASRU 2017: 63-68
[c46]
  - https://dblp.org/rec/conf/interspeech/SoltauLS17
Hagen Soltau, Hank Liao, Hasim Sak:
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition. INTERSPEECH 2017: 3707-3711
2016
[i2]
  - https://dblp.org/rec/journals/corr/SoltauLS16
Hagen Soltau, Hank Liao, Hasim Sak:
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition. CoRR abs/1610.09975 (2016)
2015
[j5]
  - https://dblp.org/rec/journals/nn/SainathKSSMDR15
Tara N. Sainath, Brian Kingsbury, George Saon, Hagen Soltau, Abdel-rahman Mohamed, George E. Dahl, Bhuvana Ramabhadran:
Deep Convolutional Neural Networks for Large-scale Speech Tasks. Neural Networks 64: 39-48 (2015)
2014
[c45]
  - https://dblp.org/rec/conf/icassp/ThomasGSS14
Samuel Thomas, Sriram Ganapathy, George Saon, Hagen Soltau:
Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions. ICASSP 2014: 2519-2523
[c44]
  - https://dblp.org/rec/conf/icassp/NoldenSN14
David Nolden, Hagen Soltau, Hermann Ney:
Progress in dynamic network decoding. ICASSP 2014: 3276-3280
[c43]
  - https://dblp.org/rec/conf/icassp/SaonS14
George Saon, Hagen Soltau:
A comparison of two optimization techniques for sequence discriminative training of deep neural networks. ICASSP 2014: 5567-5571
[c42]
  - https://dblp.org/rec/conf/icassp/SoltauSS14
Hagen Soltau, George Saon, Tara N. Sainath:
Joint training of convolutional and non-convolutional neural networks. ICASSP 2014: 5572-5576
[c41]
  - https://dblp.org/rec/conf/icassp/KuoKMSB14
Hong-Kwang Kuo, Ellen Eide Kislal, Lidia Mangu, Hagen Soltau, Tomás Beran:
Out-of-vocabulary word detection in a speech-to-speech translation system. ICASSP 2014: 7108-7112
[c40]
  - https://dblp.org/rec/conf/icassp/ManguKSKP14
Lidia Mangu, Brian Kingsbury, Hagen Soltau, Hong-Kwang Kuo, Michael Picheny:
Efficient spoken term detection using confusion networks. ICASSP 2014: 7844-7848
[c39]
  - https://dblp.org/rec/conf/interspeech/SaonSEP14
George Saon, Hagen Soltau, Ahmad Emami, Michael Picheny:
Unfolded recurrent neural networks for speech recognition. INTERSPEECH 2014: 343-347
[c38]
  - https://dblp.org/rec/conf/interspeech/NoldenSPGMN14
David Nolden, Hagen Soltau, Daniel Povey, Pegah Ghahremani, Lidia Mangu, Hermann Ney:
Removing redundancy from lattices. INTERSPEECH 2014: 656-660
[p1]
  - https://dblp.org/rec/series/tanlp/SoltauSMKKCB14
Hagen Soltau, George Saon, Lidia Mangu, Hong-Kwang Kuo, Brian Kingsbury, Stephen M. Chu, Fadi Biadsy:
Automatic Speech Recognition. NLP of Semitic Languages 2014: 409-459
2013
[j4]
  - https://dblp.org/rec/journals/taslp/SainathKSR13
Tara N. Sainath, Brian Kingsbury, Hagen Soltau, Bhuvana Ramabhadran:
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks. IEEE Trans. Speech Audio Process. 21(11): 2267-2276 (2013)
[c37]
  - https://dblp.org/rec/conf/asru/SaonSNP13
George Saon, Hagen Soltau, David Nahamoo, Michael Picheny:
Speaker adaptation of neural network acoustic models using i-vectors. ASRU 2013: 55-59
[c36]
  - https://dblp.org/rec/conf/asru/ManguSKS13
Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, George Saon:
The IBM keyword search system for the DARPA RATS program. ASRU 2013: 204-209
[c35]
  - https://dblp.org/rec/conf/asru/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to Deep Convolutional Neural Networks for LVCSR. ASRU 2013: 315-320
[c34]
  - https://dblp.org/rec/conf/icassp/ManguSKKS13
Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, Brian Kingsbury, George Saon:
Exploiting diversity for spoken term detection. ICASSP 2013: 8282-8286
[c33]
  - https://dblp.org/rec/conf/icassp/MousaKMS13
Amr El-Desoky Mousa, Hong-Kwang Jeff Kuo, Lidia Mangu, Hagen Soltau:
Morpheme-based feature-rich language models using Deep Neural Networks for LVCSR of Egyptian Arabic. ICASSP 2013: 8435-8439
[c32]
  - https://dblp.org/rec/conf/interspeech/SoltauKMSB13
Hagen Soltau, Hong-Kwang Kuo, Lidia Mangu, George Saon, Tomás Beran:
Neural network acoustic models for the DARPA RATS program. INTERSPEECH 2013: 3092-3096
[c31]
  - https://dblp.org/rec/conf/interspeech/SaonTSGK13
George Saon, Samuel Thomas, Hagen Soltau, Sriram Ganapathy, Brian Kingsbury:
The IBM speech activity detection system for the DARPA RATS program. INTERSPEECH 2013: 3497-3501
[i1]
  - https://dblp.org/rec/journals/corr/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to deep convolutional neural networks for LVCSR. CoRR abs/1309.1501 (2013)
2012
[j3]
  - https://dblp.org/rec/journals/speech/SaonS12
George Saon, Hagen Soltau:
Boosting systems for large vocabulary continuous speech recognition. Speech Commun. 54(2): 212-218 (2012)
[c30]
  - https://dblp.org/rec/conf/interspeech/KingsburySS12
Brian Kingsbury, Tara N. Sainath, Hagen Soltau:
Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization. INTERSPEECH 2012: 10-13
2011
[c29]
  - https://dblp.org/rec/conf/asru/SoltauMB11
Hagen Soltau, Lidia Mangu, Fadi Biadsy:
From Modern Standard Arabic to Levantine ASR: Leveraging GALE for dialects. ASRU 2011: 266-271
[c28]
  - https://dblp.org/rec/conf/asru/ManguKCKSSB11
Lidia Mangu, Hong-Kwang Kuo, Stephen M. Chu, Brian Kingsbury, George Saon, Hagen Soltau, Fadi Biadsy:
The IBM 2011 GALE Arabic speech transcription system. ASRU 2011: 272-277
[c27]
  - https://dblp.org/rec/conf/icassp/KingsburySSCKMRMJ11
Brian Kingsbury, Hagen Soltau, George Saon, Stephen M. Chu, Hong-Kwang Kuo, Lidia Mangu, Suman V. Ravuri, Nelson Morgan, Adam Janin:
The IBM 2009 GALE Arabic speech transcription system. ICASSP 2011: 4672-4675
2010
[c26]
  - https://dblp.org/rec/conf/icassp/SaonSCCKKMP10
George Saon, Hagen Soltau, Upendra V. Chaudhari, Stephen M. Chu, Brian Kingsbury, Hong-Kwang Kuo, Lidia Mangu, Daniel Povey:
The IBM 2008 GALE Arabic speech transcription system. ICASSP 2010: 4378-4381
[c25]
  - https://dblp.org/rec/conf/icassp/MaKSCCML10
Chengyuan Ma, Hong-Kwang Jeff Kuo, Hagen Soltau, Xiaodong Cui, Upendra V. Chaudhari, Lidia Mangu, Chin-Hui Lee:
A comparative study on system combination schemes for LVCSR. ICASSP 2010: 4394-4397
[c24]
  - https://dblp.org/rec/conf/interspeech/EmamiCISZ10
Ahmad Emami, Stanley F. Chen, Abraham Ittycheriah, Hagen Soltau, Bing Zhao:
Decoding with shrinkage-based language models. INTERSPEECH 2010: 1033-1036
[c23]
  - https://dblp.org/rec/conf/interspeech/SaonS10
George Saon, Hagen Soltau:
Boosting systems for LVCSR. INTERSPEECH 2010: 1341-1344
[c22]
  - https://dblp.org/rec/conf/odyssey/BiadsySMNH10
Fadi Biadsy, Hagen Soltau, Lidia Mangu, Jirí Navrátil, Julia Hirschberg:
Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers. Odyssey 2010: 44
[c21]
  - https://dblp.org/rec/conf/slt/SoltauSK10
Hagen Soltau, George Saon, Brian Kingsbury:
The IBM Attila speech recognition toolkit. SLT 2010: 97-102

2000 – 2009

2009
[j2]
  - https://dblp.org/rec/journals/taslp/SoltauSKKMPE09
Hagen Soltau, George Saon, Brian Kingsbury, Hong-Kwang Jeff Kuo, Lidia Mangu, Daniel Povey, Ahmad Emami:
Advances in Arabic Speech Transcription at IBM Under the DARPA GALE Program. IEEE Trans. Speech Audio Process. 17(5): 884-894 (2009)
[c20]
  - https://dblp.org/rec/conf/asru/SoltauS09
Hagen Soltau, George Saon:
Dynamic network decoding revisited. ASRU 2009: 276-281
[c19]
  - https://dblp.org/rec/conf/icassp/SaonPS09
George Saon, Daniel Povey, Hagen Soltau:
Large margin semi-tied covariance transforms for discriminative training. ICASSP 2009: 3753-3756
2008
[c18]
  - https://dblp.org/rec/conf/interspeech/PoveyKS08
Daniel Povey, Hong-Kwang Jeff Kuo, Hagen Soltau:
Fast speaker adaptive training for speech recognition. INTERSPEECH 2008: 1245-1248
2007
[c17]
  - https://dblp.org/rec/conf/icassp/SoltauSKKMPZ07
Hagen Soltau, George Saon, Brian Kingsbury, Hong-Kwang Jeff Kuo, Lidia Mangu, Daniel Povey, Geoffrey Zweig:
The IBM 2006 Gale Arabic ASR System. ICASSP (4) 2007: 349-352
2006
[j1]
  - https://dblp.org/rec/journals/taslp/ChenKMPSSZ06
Stanley F. Chen, Brian Kingsbury, Lidia Mangu, Daniel Povey, George Saon, Hagen Soltau, Geoffrey Zweig:
Advances in speech transcription at IBM under the DARPA EARS program. IEEE Trans. Speech Audio Process. 14(5): 1596-1608 (2006)
2005
[b1]
  - https://dblp.org/rec/phd/de/Soltau2005
Hagen Soltau:
Compensating hyperarticulation for automatic speech recognition. Karlsruhe Institute of Technology, Germany, 2005, pp. 1-156
[c16]
  - https://dblp.org/rec/conf/icassp/SoltauKMPSZ05
Hagen Soltau, Brian Kingsbury, Lidia Mangu, Daniel Povey, George Saon, Geoffrey Zweig:
The IBM 2004 Conversational Telephony System for Rich Transcription. ICASSP (1) 2005: 205-208
[c15]
  - https://dblp.org/rec/conf/icassp/PoveyKMSSZ05
Daniel Povey, Brian Kingsbury, Lidia Mangu, George Saon, Hagen Soltau, Geoffrey Zweig:
fMPE: Discriminatively Trained Features for Speech Recognition. ICASSP (1) 2005: 961-964
2004
[c14]
  - https://dblp.org/rec/conf/icassp/SoltauYMFJJ04
Hagen Soltau, Hua Yu, Florian Metze, Christian Fügen, Qin Jin, Szu-Chen Stan Jou:
The 2003 ISL rich transcription system for conversational telephony speech. ICASSP (1) 2004: 773-776
2002
[c13]
  - https://dblp.org/rec/conf/icassp/SoltauMFW02
Hagen Soltau, Florian Metze, Christian Fügen, Alex Waibel:
Efficient language model lookahead through polymorphic linguistic context assignment. ICASSP 2002: 709-712
[c12]
  - https://dblp.org/rec/conf/interspeech/SoltauMW02
Hagen Soltau, Florian Metze, Alex Waibel:
Compensating for hyperarticulation by modeling articulatory properties. INTERSPEECH 2002: 841-844
2001
[c11]
  - https://dblp.org/rec/conf/icassp/SoltauSMW01
Hagen Soltau, Thomas Schaaf, Florian Metze, Alex Waibel:
The ISL evaluation system for Verbmobil-II. ICASSP 2001: 65-68
[c10]
  - https://dblp.org/rec/conf/icassp/McDonoughMSW01
John W. McDonough, Florian Metze, Hagen Soltau, Alex Waibel:
Speaker compensation with sine-log all-pass transforms. ICASSP 2001: 369-372
[c9]
  - https://dblp.org/rec/conf/icassp/WaibelBMRSSSYZ01
Alex Waibel, Michael Bett, Florian Metze, Klaus Ries, Thomas Schaaf, Tanja Schultz, Hagen Soltau, Hua Yu, Klaus Zechner:
Advances in automatic meeting record creation and access. ICASSP 2001: 597-600
[c8]
  - https://dblp.org/rec/conf/interspeech/MetzeMS01
Florian Metze, John W. McDonough, Hagen Soltau:
Speech recognition over netmeeting connections. INTERSPEECH 2001: 2389-2392
[c7]
  - https://dblp.org/rec/conf/naacl/WaibelYSPBWSSM01
Alex Waibel, Hua Yu, Tanja Schultz, Yue Pan, Michael Bett, Martin Westphal, Hagen Soltau, Thomas Schaaf, Florian Metze:
Advances in meeting recognition. HLT 2001
2000
[c6]
  - https://dblp.org/rec/conf/icassp/SoltauW00
Hagen Soltau, Alex Waibel:
Specialized acoustic models for hyperarticulated speech. ICASSP 2000: 1779-1782
[c5]
  - https://dblp.org/rec/conf/icassp/MetzeKSSS00
Florian Metze, Thomas Kemp, Thomas Schaaf, Tanja Schultz, Hagen Soltau:
Confidence measure based language identification. ICASSP 2000: 1827-1830
[c4]
  - https://dblp.org/rec/conf/interspeech/SoltauW00
Hagen Soltau, Alex Waibel:
Phone dependent modeling of hyperarticulated effects. INTERSPEECH 2000: 105-108

1990 – 1999

1998
[c3]
  - https://dblp.org/rec/conf/icassp/SoltauSWW98
Hagen Soltau, Tanja Schultz, Martin Westphal, Alex Waibel:
Recognition of music types. ICASSP 1998: 1137-1140
[c2]
  - https://dblp.org/rec/conf/interspeech/SoltauW98
Hagen Soltau, Alex Waibel:
On the influence of hyperarticulated speech on recognition performance. ICSLP 1998
1996
[c1]
  - https://dblp.org/rec/conf/konvens/SchultzS96
Tanja Schultz, Hagen Soltau:
Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen. KONVENS 1996: 102-110
