default search action
Mark J. F. Gales
Person information
- affiliation: University of Cambridge, UK
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j61]Guanfeng Wu, Abbas Haider, Xing Tian, Erfan Loweimi, Chi-Ho Chan, Mengjie Qian, Muhammad Junaid Awan, Ivor T. A. Spence, Rob Cooper, Wing W. Y. Ng, Josef Kittler, Mark J. F. Gales, Hui Wang:
Multi-modal video search by examples - A video quality impact analysis. IET Comput. Vis. 18(7): 1017-1033 (2024) - [c309]Luran Wang, Mark J. F. Gales, Vatsal Raina:
An Information-Theoretic Approach to Analyze NLP Classification Tasks. ACL (1) 2024: 530-551 - [c308]Adian Liusie, Yassir Fathullah, Mark J. F. Gales:
Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models. ACL (Findings) 2024: 1376-1387 - [c307]Stefano Bannò, Hari Krishna Vydana, Kate M. Knill, Mark J. F. Gales:
Can GPT-4 do L2 analytic assessment? BEA 2024: 149-164 - [c306]Asma Farajidizaji, Vatsal Raina, Mark J. F. Gales:
Is It Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models. LREC/COLING 2024: 9325-9339 - [c305]Adian Liusie, Potsawee Manakul, Mark J. F. Gales:
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models. EACL (1) 2024: 139-151 - [c304]Yassir Fathullah, Puria Radmard, Adian Liusie, Mark J. F. Gales:
Who Needs Decoders? Efficient Estimation of Sequence-Level Attributes with Proxies. EACL (1) 2024: 1478-1496 - [c303]Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark J. F. Gales:
Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons. EMNLP 2024: 6835-6855 - [c302]Vyas Raina, Adian Liusie, Mark J. F. Gales:
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment. EMNLP 2024: 7499-7517 - [c301]Vyas Raina, Rao Ma, Charles McGhee, Kate M. Knill, Mark J. F. Gales:
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models. EMNLP 2024: 7549-7565 - [c300]Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark J. F. Gales, Mario Fritz:
LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History. EMNLP 2024: 14633-14652 - [c299]Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales:
Towards End-to-End Spoken Grammatical Error Correction. ICASSP 2024: 10791-10795 - [c298]Hui Wang, Josef Kittler, Mark J. F. Gales, Rob Cooper, Maurice D. Mulvenna, Wing W. Y. Ng, Yang Hua, Richard Gault, Abbas Haider, Guanfeng Wu:
MVRMLM 2024: Multimodal Video Retrieval and Multimodal Language Modelling. ICMR 2024: 1345-1346 - [c297]Yassir Fathullah, Mark J. F. Gales:
Efficient Sample-Specific Encoder Perturbations. NAACL (Short Papers) 2024: 663-671 - [c296]Piotr Molenda, Adian Liusie, Mark J. F. Gales:
WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models. NAACL-HLT (Findings) 2024: 3515-3525 - [c295]Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill:
Investigating the Emergent Audio Classification Ability of ASR Foundation Models. NAACL-HLT 2024: 4746-4760 - [i76]Luran Wang, Mark J. F. Gales, Vatsal Raina:
An Information-Theoretic Approach to Analyze NLP Classification Tasks. CoRR abs/2402.00978 (2024) - [i75]Vyas Raina, Adian Liusie, Mark J. F. Gales:
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment. CoRR abs/2402.14016 (2024) - [i74]Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark J. F. Gales, Mario Fritz:
LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History. CoRR abs/2402.18216 (2024) - [i73]Adian Liusie, Yassir Fathullah, Mark J. F. Gales:
Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models. CoRR abs/2403.13590 (2024) - [i72]Piotr Molenda, Adian Liusie, Mark J. F. Gales:
WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models. CoRR abs/2403.19548 (2024) - [i71]Vatsal Raina, Mark J. F. Gales:
Question Difficulty Ranking for Multiple-Choice Reading Comprehension. CoRR abs/2404.10704 (2024) - [i70]Stefano Bannò, Hari Krishna Vydana, Kate M. Knill, Mark J. F. Gales:
Can GPT-4 do L2 analytic assessment? CoRR abs/2404.18557 (2024) - [i69]Yassir Fathullah, Mark J. F. Gales:
Efficient Sample-Specific Encoder Perturbations. CoRR abs/2405.01601 (2024) - [i68]Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark J. F. Gales:
Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons. CoRR abs/2405.05894 (2024) - [i67]Vyas Raina, Rao Ma, Charles McGhee, Kate M. Knill, Mark J. F. Gales:
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models. CoRR abs/2405.06134 (2024) - [i66]Vatsal Raina, Mark J. F. Gales:
Question-Based Retrieval using Atomic Units for Enterprise RAG. CoRR abs/2405.12363 (2024) - [i65]Guangzhi Sun, Potsawee Manakul, Adian Liusie, Kunat Pipatanakul, Chao Zhang, Philip C. Woodland, Mark J. F. Gales:
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models. CoRR abs/2405.13684 (2024) - [i64]Rao Ma, Yassir Fathullah, Mengjie Qian, Siyuan Tang, Mark J. F. Gales, Kate M. Knill:
Cross-Lingual Transfer Learning for Speech Translation. CoRR abs/2407.01130 (2024) - [i63]Vyas Raina, Mark J. F. Gales:
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models. CoRR abs/2407.04482 (2024) - [i62]Mengjie Qian, Siyuan Tang, Rao Ma, Kate M. Knill, Mark J. F. Gales:
Learn and Don't Forget: Adding a New Language to ASR Foundation Models. CoRR abs/2407.06800 (2024) - [i61]Stefano Bannò, Kate M. Knill, Mark J. F. Gales:
Grammatical Error Feedback: An Implicit Evaluation Approach. CoRR abs/2408.09565 (2024) - [i60]Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill:
ASR Error Correction using Large Language Models. CoRR abs/2409.09554 (2024) - [i59]Vatsal Raina, Adian Liusie, Mark J. F. Gales:
Finetuning LLMs for Comparative Assessment Tasks. CoRR abs/2409.15979 (2024) - 2023
- [c294]Potsawee Manakul, Yassir Fathullah, Adian Liusie, Vyas Raina, Vatsal Raina, Mark J. F. Gales:
CUED at ProbSum 2023: Hierarchical Ensemble of Summarization Models. BioNLP@ACL 2023: 516-523 - [c293]Potsawee Manakul, Adian Liusie, Mark J. F. Gales:
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models. EMNLP 2023: 9004-9017 - [c292]Vatsal Raina, Adian Liusie, Mark J. F. Gales:
Assessing Distractors in Multiple-Choice Tests. Eval4NLP 2023: 12-22 - [c291]Yi Yang, Qihua Li, Xing Tian, Wing W. Y. Ng, Hui Wang, Josef Kittler, Mark J. F. Gales, Rob Cooper:
Unsupervised Multi-Hashing for Image Retrieval in Non-stationary Environments. ICACI 2023: 1-6 - [c290]Tian Huey Teh, Vivian Hu, Devang S. Ram Mohan, Zack Hodari, Christopher G. R. Wallis, Tomás Gómez Ibarrondo, Alexandra Torresquintero, James Leoni, Mark J. F. Gales, Simon King:
Ensemble Prosody Prediction For Expressive Speech Synthesis. ICASSP 2023: 1-5 - [c289]Potsawee Manakul, Adian Liusie, Mark J. F. Gales:
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization. IJCNLP (1) 2023: 39-53 - [c288]Vyas Raina, Mark J. F. Gales:
Minimum Bayes' Risk Decoding for System Combination of Grammatical Error Correction Systems. IJCNLP (2) 2023: 105-112 - [c287]Adian Liusie, Potsawee Manakul, Mark J. F. Gales:
Mitigating Word Bias in Zero-shot Prompt-based Classifiers. IJCNLP (Findings) 2023: 327-335 - [c286]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245 - [c285]Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill:
Adapting an Unadaptable ASR System. INTERSPEECH 2023: 989-993 - [c284]Rao Ma, Mark J. F. Gales, Kate M. Knill, Mengjie Qian:
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space. INTERSPEECH 2023: 3267-3271 - [c283]Diane Nicholls, Kate M. Knill, Mark J. F. Gales, Anton Ragni, Paul Ricketts:
Speak & Improve: L2 English Speaking Practice Tool. INTERSPEECH 2023: 3669-3670 - [c282]Nataliia Molchanova, Vatsal Raina, Andrey Malinin, Francesco La Rosa, Henning Müller, Mark J. F. Gales, Cristina Granziera, Mara Graziani, Meritxell Bach Cuadra:
Novel Structural-Scale Uncertainty Measures and Error Retention Curves: Application to Multiple Sclerosis. ISBI 2023: 1-5 - [c281]Vatsal Raina, Nataliia Molchanova, Mara Graziani, Andrey Malinin, Henning Müller, Meritxell Bach Cuadra, Mark J. F. Gales:
Tackling Bias in the Dice Similarity Coefficient: Introducing NDSC for White Matter Lesion Segmentation. ISBI 2023: 1-5 - [c280]Vatsal Raina, Adian Liusie, Mark J. F. Gales:
Analyzing Multiple-Choice Reading and Listening Comprehension Tests. SLaTE 2023: 1-5 - [c279]Charles McGhee, Katherine M. Knill, Mark J. F. Gales:
Towards Acoustic-to-Articulatory Inversion for Pronunciation Training. SLaTE 2023: 66-70 - [c278]Simon W. McKnight, Arda Civelekoglu, Mark J. F. Gales, Stefano Bannò, Adian Liusie, Katherine M. Knill:
Automatic Assessment of Conversational Speaking Tests. SLaTE 2023: 99-103 - [c277]Rao Ma, Mengjie Qian, Mark J. F. Gales, Katherine M. Knill:
Adapting an ASR Foundation Model for Spoken Language Assessment. SLaTE 2023: 104-108 - [c276]Stefano Bannò, Katherine M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales:
Assessment of L2 Oral Proficiency Using Self-Supervised Speech Representation Learning. SLaTE 2023: 126-130 - [c275]Katherine M. Knill, Diane Nicholls, Mark J. F. Gales, Pawel Stroinski, Alex Watkinson:
Annotation of L2 English Speech for Developing and Evaluating End-to-End Spoken Grammatical Error Correction. SLaTE 2023: 146-150 - [c274]Yassir Fathullah, Guoxuan Xia, Mark J. F. Gales:
Logit-based ensemble distribution distillation for robust autoregressive sequence uncertainties. UAI 2023: 582-591 - [i58]Potsawee Manakul, Adian Liusie, Mark J. F. Gales:
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization. CoRR abs/2301.12307 (2023) - [i57]Vyas Raina, Mark J. F. Gales:
Identifying Adversarially Attackable and Robust Samples. CoRR abs/2301.12896 (2023) - [i56]Vatsal Raina, Nataliia Molchanova, Mara Graziani, Andrey Malinin, Henning Müller, Meritxell Bach Cuadra, Mark J. F. Gales:
Tackling Bias in the Dice Similarity Coefficient: Introducing nDSC for White Matter Lesion Segmentation. CoRR abs/2302.05432 (2023) - [i55]Rao Ma, Mark J. F. Gales, Kate M. Knill, Mengjie Qian:
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space. CoRR abs/2303.00456 (2023) - [i54]Potsawee Manakul, Adian Liusie, Mark J. F. Gales:
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models. CoRR abs/2303.08896 (2023) - [i53]Vyas Raina, Mark J. F. Gales:
Sentiment Perception Adversarial Attacks on Neural Machine Translation Systems. CoRR abs/2305.01437 (2023) - [i52]Yassir Fathullah, Puria Radmard, Adian Liusie, Mark J. F. Gales:
Who Needs Decoders? Efficient Estimation of Sequence-level Attributes. CoRR abs/2305.05098 (2023) - [i51]Yassir Fathullah, Guoxuan Xia, Mark J. F. Gales:
Logit-Based Ensemble Distribution Distillation for Robust Autoregressive Sequence Uncertainties. CoRR abs/2305.10384 (2023) - [i50]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023) - [i49]Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill:
Adapting an Unadaptable ASR System. CoRR abs/2306.01208 (2023) - [i48]Potsawee Manakul, Yassir Fathullah, Adian Liusie, Vyas Raina, Vatsal Raina, Mark J. F. Gales:
CUED at ProbSum 2023: Hierarchical Ensemble of Summarization Models. CoRR abs/2306.05317 (2023) - [i47]Vyas Raina, Mark J. F. Gales:
Sample Attackability in Natural Language Adversarial Attacks. CoRR abs/2306.12043 (2023) - [i46]Adian Liusie, Vatsal Raina, Andrew Mullooly, Kate M. Knill, Mark J. F. Gales:
CamChoice: A Corpus of Multiple Choice Questions and Candidate Response Distributions. CoRR abs/2306.13047 (2023) - [i45]Vatsal Raina, Adian Liusie, Mark J. F. Gales:
Analyzing Multiple-Choice Reading and Listening Comprehension Tests. CoRR abs/2307.01076 (2023) - [i44]Rao Ma, Mengjie Qian, Potsawee Manakul, Mark J. F. Gales, Kate M. Knill:
Can Generative Large Language Models Perform ASR Error Correction? CoRR abs/2307.04172 (2023) - [i43]Adian Liusie, Potsawee Manakul, Mark J. F. Gales:
Zero-shot NLG evaluation through Pairware Comparisons with LLMs. CoRR abs/2307.07889 (2023) - [i42]Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill:
Adapting an ASR Foundation Model for Spoken Language Assessment. CoRR abs/2307.09378 (2023) - [i41]Adian Liusie, Potsawee Manakul, Mark J. F. Gales:
Mitigating Word Bias in Zero-shot Prompt-based Classifiers. CoRR abs/2309.04992 (2023) - [i40]Vyas Raina, Mark J. F. Gales:
Minimum Bayes' Risk Decoding for System Combination of Grammatical Error Correction Systems. CoRR abs/2309.06520 (2023) - [i39]Mengjie Qian, Rao Ma, Adian Liusie, Erfan Loweimi, Kate M. Knill, Mark J. F. Gales:
Zero-shot Audio Topic Reranking using Large Language Models. CoRR abs/2309.07606 (2023) - [i38]Asma Farajidizaji, Vatsal Raina, Mark J. F. Gales:
Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models. CoRR abs/2309.12551 (2023) - [i37]Vatsal Raina, Adian Liusie, Mark J. F. Gales:
Assessing Distractors in Multiple-Choice Tests. CoRR abs/2311.04554 (2023) - [i36]Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales:
Towards End-to-End Spoken Grammatical Error Correction. CoRR abs/2311.05550 (2023) - [i35]Nataliia Molchanova, Vatsal Raina, Andrey Malinin, Francesco La Rosa, Adrien Depeursinge, Mark J. F. Gales, Cristina Granziera, Henning Müller, Mara Graziani, Meritxell Bach Cuadra:
Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion Segmentation. CoRR abs/2311.08931 (2023) - [i34]Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill:
Investigating the Emergent Audio Classification Ability of ASR Foundation Models. CoRR abs/2311.09363 (2023) - 2022
- [j60]Anton Ragni, Mark J. F. Gales, Oliver Rose, Katherine M. Knill, Alexandros Kastanos, Qiujia Li, Preben Ness:
Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1319-1329 (2022) - [c273]Vatsal Raina, Mark J. F. Gales:
Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension. ACL (Findings) 2022: 1020-1034 - [c272]Andrew McDonald, Mark J. F. Gales, Anurag Agarwal:
Detection of Heart Murmurs in Phonocardiograms with Parallel Hidden Semi-Markov Models. CinC 2022: 1-4 - [c271]Adian Liusie, Vatsal Raina, Vyas Raina, Mark J. F. Gales:
Analyzing Biases to Spurious Correlations in Text Classification Tasks. AACL/IJCNLP (2) 2022: 78-84 - [c270]Vyas Raina, Yiting Lu, Mark J. F. Gales:
Grammatical Error Correction Systems for Automated Assessment: Are They Susceptible to Universal Adversarial Attacks? AACL/IJCNLP (1) 2022: 158-171 - [c269]Stefano Bannò, Bhanu Balusu, Mark J. F. Gales, Kate M. Knill, Konstantinos Kyriakopoulos:
View-Specific Assessment of L2 Spoken English. INTERSPEECH 2022: 4471-4475 - [c268]Vyas Raina, Mark J. F. Gales:
Residue-Based Natural Language Adversarial Attack Detection. NAACL-HLT 2022: 3836-3848 - [c267]Adian Liusie, Mengjie Qian, Xiang Li, Mark J. F. Gales:
University of Cambridge at TREC Cast 2022. TREC 2022 - [c266]Yassir Fathullah, Mark J. F. Gales:
Self-distribution distillation: efficient uncertainty estimation. UAI 2022: 663-673 - [i33]Yassir Fathullah, Mark J. F. Gales:
Self-Distribution Distillation: Efficient Uncertainty Estimation. CoRR abs/2203.08295 (2022) - [i32]Vyas Raina, Mark J. F. Gales:
Residue-Based Natural Language Adversarial Attack Detection. CoRR abs/2204.10192 (2022) - [i31]Andrey Malinin, Andreas Athanasopoulos, Muhamed Barakovic, Meritxell Bach Cuadra, Mark J. F. Gales, Cristina Granziera, Mara Graziani, Nikolay Kartashev, Konstantinos Kyriakopoulos, Po-Jui Lu, Nataliia Molchanova, Antonis Nikitakis, Vatsal Raina, Francesco La Rosa, Eli Sivena, Vasileios Tsarsitalidis, Efi Tsompopoulou, Elena Volf:
Shifts 2.0: Extending The Dataset of Real Distributional Shifts. CoRR abs/2206.15407 (2022) - [i30]Vyas Raina, Mark J. F. Gales:
Gender Bias and Universal Substitution Adversarial Attacks on Grammatical Error Correction Systems for Automated Assessment. CoRR abs/2208.09466 (2022) - [i29]Potsawee Manakul, Mark J. F. Gales:
Podcast Summary Assessment: A Resource for Evaluating Summary Assessment Methods. CoRR abs/2208.13265 (2022) - [i28]Vatsal Raina, Mark J. F. Gales:
Multiple-Choice Question Generation: Towards an Automated Assessment Framework. CoRR abs/2209.11830 (2022) - [i27]Qingyun Dou, Mark J. F. Gales:
Deliberation Networks and How to Train Them. CoRR abs/2211.03217 (2022) - [i26]Qingyun Dou, Mark J. F. Gales:
Parallel Attention Forcing for Machine Translation. CoRR abs/2211.03237 (2022) - [i25]Nataliia Molchanova, Vatsal Raina, Andrey Malinin, Francesco La Rosa, Henning Müller, Mark J. F. Gales, Cristina Granziera, Mara Graziani, Meritxell Bach Cuadra:
Novel structural-scale uncertainty measures and error retention curves: application to multiple sclerosis. CoRR abs/2211.04825 (2022) - [i24]Adian Liusie, Vatsal Raina, Mark J. F. Gales:
"World Knowledge" in Multiple Choice Reading Comprehension. CoRR abs/2211.07040 (2022) - [i23]Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales:
L2 proficiency assessment using self-supervised speech representations. CoRR abs/2211.08849 (2022) - 2021
- [c265]Potsawee Manakul, Mark J. F. Gales:
Long-Span Summarization via Local Attention and Content Selection. ACL/IJCNLP (1) 2021: 6026-6041 - [c264]Potsawee Manakul, Mark J. F. Gales:
Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems. EMNLP (1) 2021: 9359-9368 - [c263]Yassir Fathullah, Mark J. F. Gales, Andrey Malinin:
Ensemble Distillation Approaches for Grammatical Error Correction. ICASSP 2021: 2745-2749 - [c262]Yiting Lu, Yu Wang, Mark J. F. Gales:
Efficient Use of End-to-End Data in Spoken Language Processing. ICASSP 2021: 7518-7522 - [c261]Xizi Wei, Mark J. F. Gales, Kate M. Knill:
Analysing Bias in Spoken Language Assessment Using Concept Activation Vectors. ICASSP 2021: 7753-7757 - [c260]Andrey Malinin, Mark J. F. Gales:
Uncertainty Estimation in Autoregressive Structured Prediction. ICLR 2021 - [c259]Qingyun Dou, Xixin Wu, Moquan Wan, Yiting Lu, Mark J. F. Gales:
Deliberation-Based Multi-Pass Speech Synthesis. Interspeech 2021: 136-140 - [c258]Andrey Malinin, Neil Band, Yarin Gal, Mark J. F. Gales, Alexander Ganshin, German Chesnokov, Alexey Noskov, Andrey Ploskonosov, Liudmila Prokhorenkova, Ivan Provilkov, Vatsal Raina, Vyas Raina, Denis Roginskiy, Mariya Shmatova, Panagiotis Tigas, Boris Yangel:
Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks. NeurIPS Datasets and Benchmarks 2021 - [c257]Max Ryabinin, Andrey Malinin, Mark J. F. Gales:
Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets. NeurIPS 2021: 6023-6035 - [i22]Xixin Wu, Mark J. F. Gales:
Should Ensemble Members Be Calibrated? CoRR abs/2101.05397 (2021) - [i21]Qingyun Dou, Yiting Lu, Potsawee Manakul, Xixin Wu, Mark J. F. Gales:
Attention Forcing for Machine Translation. CoRR abs/2104.01264 (2021) - [i20]Potsawee Manakul, Mark J. F. Gales:
Long-Span Dependencies in Transformer-based Summarization Systems. CoRR abs/2105.03801 (2021) - [i19]Max Ryabinin, Andrey Malinin, Mark J. F. Gales:
Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets. CoRR abs/2105.06987 (2021) - [i18]Vatsal Raina, Mark J. F. Gales:
An Initial Investigation of Non-Native Spoken Question-Answering. CoRR abs/2107.04691 (2021) - [i17]Andrey Malinin, Neil Band, Alexander Ganshin, German Chesnokov, Yarin Gal, Mark J. F. Gales, Alexey Noskov, Andrey Ploskonosov, Liudmila Prokhorenkova, Ivan Provilkov, Vatsal Raina, Vyas Raina, Mariya Shmatova, Panos Tigas, Boris Yangel:
Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks. CoRR abs/2107.07455 (2021) - [i16]Potsawee Manakul, Mark J. F. Gales:
Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems. CoRR abs/2109.03888 (2021) - 2020
- [c256]Vatsal Raina, Mark J. F. Gales, Kate M. Knill:
Complementary Systems for Off-Topic Spoken Response Detection. BEA@ACL 2020: 41-51 - [c255]Alexandros Kastanos, Anton Ragni, Mark J. F. Gales:
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks. ICASSP 2020: 6329-6333 - [c254]Andrey Malinin, Bruno Mlodozeniec, Mark J. F. Gales:
Ensemble Distribution Distillation. ICLR 2020 - [c253]Kate M. Knill, Linlin Wang, Yu Wang, Xixin Wu, Mark J. F. Gales:
Non-Native Children's Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems. INTERSPEECH 2020: 255-259 - [c252]Konstantinos Kyriakopoulos, Kate M. Knill, Mark J. F. Gales:
Automatic Detection of Accent and Lexical Pronunciation Errors in Spontaneous Non-Native English Speech. INTERSPEECH 2020: 3052-3056 - [c251]Yiting Lu, Mark J. F. Gales, Yu Wang:
Spoken Language 'Grammatical Error Correction'. INTERSPEECH 2020: 3840-3844 - [c250]Vyas Raina, Mark J. F. Gales, Kate M. Knill:
Universal Adversarial Attacks on Spoken Language Assessment Systems. INTERSPEECH 2020: 3855-3859 - [c249]Xixin Wu, Kate M. Knill, Mark J. F. Gales, Andrey Malinin:
Ensemble Approaches for Uncertainty in Spoken Language Assessment. INTERSPEECH 2020: 3860-3864 - [c248]Qingyun Dou, Joshua Efiong, Mark J. F. Gales:
Attention Forcing for Speech Synthesis. INTERSPEECH 2020: 4014-4018 - [c247]Potsawee Manakul, Mark J. F. Gales, Linlin Wang:
Abstractive Spoken Document Summarization Using Hierarchical Model with Multi-Stage Attention Diversity Optimization. INTERSPEECH 2020: 4248-4252 - [c246]Potsawee Manakul, Mark J. F. Gales:
CUED_SPEECH at TREC 2020 Podcast Summarisation Track. TREC 2020 - [i15]Andrey Malinin, Mark J. F. Gales:
Uncertainty in Structured Prediction. CoRR abs/2002.07650 (2020) - [i14]Andrey Malinin, Sergey Chervontsev, Ivan Provilkov, Mark J. F. Gales:
Regression Prior Networks. CoRR abs/2006.11590 (2020) - [i13]Potsawee Manakul, Mark J. F. Gales:
CUED_speech at TREC 2020 Podcast Summarisation Track. CoRR abs/2012.02535 (2020) - [i12]Yassir Fathullah, Mark J. F. Gales, Andrey Malinin:
Ensemble Distillation Approaches for Grammatical Error Correction. CoRR abs/2012.07535 (2020)
2010 – 2019
- 2019
- [j59]Xie Chen, Xunying Liu, Yu Wang, Anton Ragni, Jeremy Heng Meng Wong, Mark J. F. Gales:
Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(9): 1444-1454 (2019) - [j58]Jeremy Heng Meng Wong, Mark John Francis Gales, Yu Wang:
General Sequence Teacher-Student Learning. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1725-1736 (2019) - [c245]Jeremy Heng Meng Wong, Mark J. F. Gales, Yu Wang:
Learning Between Different Teacher and Student Models in ASR. ASRU 2019: 93-99 - [c244]Douglas W. Oard, Marine Carpuat, Petra Galuscáková, Joseph Barrow, Suraj Nair, Xing Niu, Han-Chin Shing, Weijia Xu, Elena Zotkina, Kathleen R. McKeown, Smaranda Muresan, Efsun Selin Kayi, Ramy Eskander, Chris Kedzie, Yan Virin, Dragomir R. Radev, Rui Zhang, Mark J. F. Gales, Anton Ragni, Kenneth Heafield:
Surprise Languages: Rapid-Response Cross-Language IR. EVIA@NTCIR 2019 - [c243]Qiujia Li, Preben Ness, Anton Ragni, Mark J. F. Gales:
Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation. ICASSP 2019: 6755-6759 - [c242]Kate M. Knill, Mark J. F. Gales, P. P. Manakul, Andrew Caines:
Automatic Grammatical Error Detection of Non-native Spoken Learner English. ICASSP 2019: 8127-8131 - [c241]Konstantinos Kyriakopoulos, Kate M. Knill, Mark J. F. Gales:
A Deep Learning Approach to Automatic Characterisation of Rhythm in Non-Native English Speech. INTERSPEECH 2019: 1836-1840 - [c240]Yiting Lu, Mark J. F. Gales, Kate M. Knill, P. P. Manakul, Linlin Wang, Yu Wang:
Impact of ASR Performance on Spoken Grammatical Error Detection. INTERSPEECH 2019: 1876-1880 - [c239]Andrey Malinin, Mark J. F. Gales:
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness. NeurIPS 2019: 14520-14531 - [c238]Yiting Lu, Mark J. F. Gales, Katherine M. Knill, Potsawee Manakul, Yu Wang:
Disfluency Detection for Spoken Learner English. SLaTE 2019: 74-78 - [i11]Andrey Malinin, Bruno Mlodozeniec, Mark J. F. Gales:
Ensemble Distribution Distillation. CoRR abs/1905.00076 (2019) - [i10]Andrey Malinin, Mark J. F. Gales:
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness. CoRR abs/1905.13472 (2019) - [i9]Qingyun Dou, Yiting Lu, Joshua Efiong, Mark J. F. Gales:
Attention Forcing for Sequence-to-sequence Model Training. CoRR abs/1909.12289 (2019) - [i8]Linlin Wang, Yu Wang, Mark J. F. Gales:
Non-native Speaker Verification for Spoken Language Assessment. CoRR abs/1909.13695 (2019) - [i7]Alexandros Kastanos, Anton Ragni, Mark J. F. Gales:
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks. CoRR abs/1910.11933 (2019) - 2018
- [j57]Yu Wang, Mark J. F. Gales, Kate M. Knill, Konstantinos Kyriakopoulos, Andrey Malinin, Rogier C. van Dalen, M. Rashid:
Towards automatic assessment of spontaneous spoken English. Speech Commun. 104: 47-56 (2018) - [j56]Gilles Degottex, Pierre Lanchantin, Mark J. F. Gales:
A Log Domain Pulse Model for Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 57-70 (2018) - [j55]Chunyang Wu, Mark J. F. Gales, Anton Ragni, Penny Karanasou, Khe Chai Sim:
Improving Interpretability and Regularization in Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 256-265 (2018) - [c237]Yu Wang, Xie Chen, Mark J. F. Gales, Anton Ragni, Jeremy Heng Meng Wong:
Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription. ICASSP 2018: 5899-5903 - [c236]Yu Wang, Chao Zhang, Mark J. F. Gales, Philip C. Woodland:
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems. INTERSPEECH 2018: 872-876 - [c235]Moquan Wan, Gilles Degottex, Mark J. F. Gales:
Waveform-Based Speaker Representations for Speech Synthesis. INTERSPEECH 2018: 897-901 - [c234]Konstantinos Kyriakopoulos, Kate M. Knill, Mark J. F. Gales:
A Deep Learning Approach to Assessing Non-native Pronunciation of English Using Phone Distances. INTERSPEECH 2018: 1626-1630 - [c233]Kate M. Knill, Mark J. F. Gales, Konstantinos Kyriakopoulos, Andrey Malinin, Anton Ragni, Yu Wang, Andrew Caines:
Impact of ASR Performance on Free Speaking Language Assessment. INTERSPEECH 2018: 1641-1645 - [c232]Anton Ragni, Mark J. F. Gales:
Automatic Speech Recognition System Development in the "Wild". INTERSPEECH 2018: 2217-2221 - [c231]Oscar Chen, Anton Ragni, Mark J. F. Gales, Xie Chen:
Active Memory Networks for Language Modeling. INTERSPEECH 2018: 3338-3342 - [c230]Andrey Malinin, Mark J. F. Gales:
Predictive Uncertainty Estimation via Prior Networks. NeurIPS 2018: 7047-7058 - [c229]Anton Ragni, Qiujia Li, Mark J. F. Gales, Yongqiang Wang:
Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks. SLT 2018: 204-211 - [c228]Gilles Degottex, Mark J. F. Gales:
A Spectrally Weighted Mixture of Least Square Error and Wasserstein Discriminator Loss for Generative SPSS. SLT 2018: 603-609 - [c227]Qingyun Dou, Moquan Wan, Gilles Degottex, Zhiyi Ma, Mark J. F. Gales:
Hierarchical RNNs for Waveform-Level Speech Synthesis. SLT 2018: 618-625 - [c226]Marco Del Vecchio, Andrey Malinin, Mark J. F. Gales:
Improved Auto-Marking Confidence for Spoken Language Assessment. SLT 2018: 957-963 - [c225]Yu Wang, Jeremy Heng Meng Wong, Mark J. F. Gales, Kate M. Knill, Anton Ragni:
Sequence Teacher-Student Training of Acoustic Models for Automatic Free Speaking Language Assessment. SLT 2018: 994-1000 - [i6]Yu Wang, Xie Chen, Mark J. F. Gales, Anton Ragni, Jeremy Heng Meng Wong:
Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription. CoRR abs/1802.00254 (2018) - [i5]Andrey Malinin, Mark J. F. Gales:
Predictive Uncertainty Estimation via Prior Networks. CoRR abs/1802.10501 (2018) - [i4]Qiujia Li, Preben Ness, Anton Ragni, Mark J. F. Gales:
Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation. CoRR abs/1810.13024 (2018) - [i3]Anton Ragni, Qiujia Li, Mark J. F. Gales, Yu Wang:
Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks. CoRR abs/1810.13025 (2018) - [i2]Andrey Malinin, Mark J. F. Gales:
Prior Networks for Detection of Adversarial Attacks. CoRR abs/1812.02575 (2018) - 2017
- [j54]Penny Karanasou, Chunyang Wu, Mark J. F. Gales, Philip C. Woodland:
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 818-828 (2017) - [c224]Andrey Malinin, Anton Ragni, Kate M. Knill, Mark J. F. Gales:
Incorporating Uncertainty into Deep Learning for Spoken Language Assessment. ACL (2) 2017: 45-50 - [c223]Jeremy Heng Meng Wong, Mark J. F. Gales:
Multi-task ensembles with teacher-student training. ASRU 2017: 84-90 - [c222]Xie Chen, X. Liu, Anton Ragni, Y. Wang, Mark J. F. Gales:
Future word contexts in neural network language models. ASRU 2017: 97-103 - [c221]Andrey Malinin, Kate M. Knill, Mark J. F. Gales:
A hierarchical attention based model for off-topic spontaneous spoken response detection. ASRU 2017: 397-403 - [c220]Moquan Wan, Gilles Degottex, Mark J. F. Gales:
Integrated speaker-adaptive speech synthesis. ASRU 2017: 705-711 - [c219]Gilles Degottex, Pierre Lanchantin, Mark J. F. Gales:
Light Supervised Data Selection, Voice Quality Normalized Training and Log Domain Pulse Synthesis. Blizzard Challenge 2017 - [c218]Anton Ragni, Chunyang Wu, Mark J. F. Gales, J. Vasilakes, Kate M. Knill:
Stimulated training for automatic speech recognition and keyword search in limited resource conditions. ICASSP 2017: 4830-4834 - [c217]Anton Ragni, Danielle Saunders, P. Zahemszky, J. Vasilakes, Mark J. F. Gales, Kate M. Knill:
Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search. ICASSP 2017: 5770-5774 - [c216]Xie Chen, Anton Ragni, J. Vasilakes, Xunying Liu, Kate M. Knill, Mark J. F. Gales:
Recurrent neural network language models for keyword search. ICASSP 2017: 5775-5779 - [c215]Jeremy Heng Meng Wong, Mark J. F. Gales:
Student-Teacher Training with Diverse Decision Tree Ensembles. INTERSPEECH 2017: 117-121 - [c214]Xie Chen, Anton Ragni, Xunying Liu, Mark J. F. Gales:
Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition. INTERSPEECH 2017: 269-273 - [c213]Chunyang Wu, Mark J. F. Gales:
Deep Activation Mixture Model for Speech Recognition. INTERSPEECH 2017: 1611-1615 - [c212]Kate M. Knill, Mark J. F. Gales, Konstantinos Kyriakopoulos, Anton Ragni, Yu Wang:
Use of Graphemic Lexicons for Spoken Language Assessment. INTERSPEECH 2017: 2774-2778 - [c211]Konstantinos Kyriakopoulos, Mark J. F. Gales, Kate M. Knill:
Automatic Characterisation of the Pronunciation of Non-native English Speakers using Phone Distance Features. SLaTE 2017: 59-64 - [c210]Andrey Malinin, Kate M. Knill, Anton Ragni, Yu Wang, Mark J. F. Gales:
An attention based model for off-topic spontaneous spoken response detection: An Initial Study. SLaTE 2017: 144-149 - [c209]Mark J. F. Gales, Kate M. Knill, Anton Ragni:
Low-Resource Speech Recognition and Keyword-Spotting. SPECOM 2017: 3-19 - [i1]Xie Chen, Xunying Liu, Anton Ragni, Yu Wang, Mark J. F. Gales:
Future Word Contexts in Neural Network Language Models. CoRR abs/1708.05592 (2017) - 2016
- [j53]Xunying Liu, Xie Chen, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1438-1449 (2016) - [j52]Xie Chen, Xunying Liu, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2146-2157 (2016) - [c208]Andrey Malinin, Rogier C. van Dalen, Kate M. Knill, Yu Wang, Mark J. F. Gales:
Off-topic Response Detection for Spontaneous Spoken English Assessment. ACL (1) 2016 - [c207]Chunyang Wu, Penny Karanasou, Mark J. F. Gales:
Combining i-vector representation and structured neural networks for rapid adaptation. ICASSP 2016: 5000-5004 - [c206]J. Yang, Chao Zhang, Anton Ragni, Mark J. F. Gales, Philip C. Woodland:
System combination with log-linear models. ICASSP 2016: 5675-5679 - [c205]Linlin Wang, Chao Zhang, Philip C. Woodland, Mark J. F. Gales, Panagiota Karanasou, Pierre Lanchantin, Xunying Liu, Yanmin Qian:
Improved DNN-based segmentation for multi-genre broadcast audio. ICASSP 2016: 5700-5704 - [c204]Xie Chen, Xunying Liu, Y. Qian, Mark J. F. Gales, Philip C. Woodland:
CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models. ICASSP 2016: 6000-6004 - [c203]Chunyang Wu, Penny Karanasou, Mark J. F. Gales, Khe Chai Sim:
Stimulated Deep Neural Network for Speech Recognition. INTERSPEECH 2016: 400-404 - [c202]Jingzhou Yang, Anton Ragni, Mark J. F. Gales, Kate M. Knill:
Log-Linear System Combination Using Structured Support Vector Machines. INTERSPEECH 2016: 1898-1902 - [c201]Souvik Kundu, Khe Chai Sim, Mark J. F. Gales:
Incorporating a Generative Front-End Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition. INTERSPEECH 2016: 2359-2363 - [c200]Jeremy Heng Meng Wong, Mark J. F. Gales:
Sequence Student-Teacher Training of Deep Neural Networks. INTERSPEECH 2016: 2761-2765 - [c199]Anton Ragni, Edgar Dakin, Xie Chen, Mark J. F. Gales, Kate M. Knill:
Multi-Language Neural Network Language Models. INTERSPEECH 2016: 3042-3046 - [c198]Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanman Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems. INTERSPEECH 2016: 3057-3061 - [c197]Diane J. Litman, Steve J. Young, Mark J. F. Gales, Kate M. Knill, Karen Ottewell, Rogier C. van Dalen, David Vandyke:
Towards Using Conversations with Spoken Dialogue Systems in the Automated Assessment of Non-Native Speakers of English. SIGDIAL Conference 2016: 270-275 - [c196]Gilles Degottex, Pierre Lanchantin, Mark J. F. Gales:
A Pulse Model in Log-domain for a Uniform Synthesizer. SSW 2016: 214-220 - 2015
- [j51]Takuya Yoshioka, Mark J. F. Gales:
Environmentally robust ASR front-end for deep neural network acoustic models. Comput. Speech Lang. 31(1): 65-86 (2015) - [j50]Langzhou Chen, Norbert Braunschweiler, Mark J. F. Gales:
Speaker and Expression Factorization for Audiobook Data: Expressiveness and Transplantation. IEEE ACM Trans. Audio Speech Lang. Process. 23(4): 605-618 (2015) - [c195]Rogier C. van Dalen, Jingzhou Yang, Haipeng Wang, Anton Ragni, Chao Zhang, Mark J. F. Gales:
Structured discriminative models using deep neural-network features. ASRU 2015: 160-166 - [c194]Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Investigation of back-off based interpolation between recurrent neural network and n-gram language models. ASRU 2015: 181-186 - [c193]Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland:
Multilingual representations for low resource speech recognition and keyword search. ASRU 2015: 259-266 - [c192]Shawn Tan, Khe Chai Sim, Mark J. F. Gales:
Improving the interpretability of deep neural networks with stimulated learning. ASRU 2015: 617-623 - [c191]Philip C. Woodland, Xunying Liu, Yanmin Qian, Chao Zhang, Mark J. F. Gales, Penny Karanasou, Pierre Lanchantin, Linlin Wang:
Cambridge university transcription systems for the multi-genre broadcast challenge. ASRU 2015: 639-646 - [c190]Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
The development of the cambridge university alignment systems for the multi-genre broadcast challenge. ASRU 2015: 647-653 - [c189]Penny Karanasou, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
Speaker diarisation and longitudinal linking in multi-genre broadcast data. ASRU 2015: 660-666 - [c188]Peter Bell, Mark J. F. Gales, Thomas Hain, Jonathan Kilgour, Pierre Lanchantin, Xunying Liu, Andrew McParland, Steve Renals, Oscar Saz, Mirjam Wester, Philip C. Woodland:
The MGB challenge: Evaluating multi-genre broadcast media recognition. ASRU 2015: 687-693 - [c187]Chunyang Wu, Mark J. F. Gales:
Multi-basis adaptive neural network for rapid adaptation in speech recognition. ICASSP 2015: 4315-4319 - [c186]Anton Ragni, Mark J. F. Gales, Kate M. Knill:
A language space representation for speech recognition. ICASSP 2015: 4634-4638 - [c185]Thomas Drugman, Yannis Stylianou, Langzhou Chen, Xie Chen, Mark J. F. Gales:
Robust excitation-based features for Automatic Speech Recognition. ICASSP 2015: 4664-4668 - [c184]Rogier C. van Dalen, Kate M. Knill, Pirros Tsiakoulis, Mark J. F. Gales:
Improving multiple-crowd-sourced transcriptions using a speech recogniser. ICASSP 2015: 4709-4713 - [c183]Mark J. F. Gales, Kate M. Knill, Anton Ragni:
Unicode-based graphemic systems for limited resource languages. ICASSP 2015: 5186-5190 - [c182]Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Improving the training and evaluation efficiency of recurrent neural network language models. ICASSP 2015: 5401-5405 - [c181]Xunying Liu, Xie Chen, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic recurrent neural network language models. ICASSP 2015: 5406-5410 - [c180]Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Recurrent neural network language model training with noise contrastive estimation for speech recognition. ICASSP 2015: 5411-5415 - [c179]Gideon Mendels, Erica Cooper, Victor Soto, Julia Hirschberg, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang:
Improving speech recognition and keyword search for low resource languages using web data. INTERSPEECH 2015: 829-833 - [c178]Pierre Lanchantin, Christophe Veaux, Mark J. F. Gales, Simon King, Junichi Yamagishi:
Reconstructing voices within the multiple-average-voice-model framework. INTERSPEECH 2015: 2232-2236 - [c177]Rogier C. van Dalen, Mark J. F. Gales:
Annotating large lattices with the exact word error. INTERSPEECH 2015: 2625-2629 - [c176]Penny Karanasou, Mark J. F. Gales, Philip C. Woodland:
I-vector estimation using informative priors for adaptation of deep neural networks. INTERSPEECH 2015: 2872-2876 - [c175]Xunying Liu, Federico Flego, Linlin Wang, Chao Zhang, Mark J. F. Gales, Philip C. Woodland:
The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation. INTERSPEECH 2015: 3145-3149 - [c174]Xie Chen, T. Tan, Xunying Liu, Pierre Lanchantin, M. Wan, Mark J. F. Gales, Philip C. Woodland:
Recurrent neural network language model adaptation for multi-genre broadcast speech recognition. INTERSPEECH 2015: 3511-3515 - [c173]Haipeng Wang, Anton Ragni, Mark J. F. Gales, Kate M. Knill, Philip C. Woodland, Chao Zhang:
Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages. INTERSPEECH 2015: 3660-3664 - [c172]Rogier C. van Dalen, Kate M. Knill, Mark J. F. Gales:
Automatically grading learners' English using a Gaussian process. SLaTE 2015: 7-12 - 2014
- [j49]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic language models. Comput. Speech Lang. 28(6): 1298-1316 (2014) - [j48]Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Norbert Braunschweiler, Langzhou Chen, Mark J. F. Gales, Masami Akamine:
Building HMM-TTS Voices on Diverse Data. IEEE J. Sel. Top. Signal Process. 8(2): 296-306 (2014) - [j47]Langzhou Chen, Mark J. F. Gales, Norbert Braunschweiler, Masami Akamine, Kate M. Knill:
Integrated Expression Prediction and Speech Synthesis From Text. IEEE J. Sel. Top. Signal Process. 8(2): 323-335 (2014) - [c171]Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Mark J. F. Gales, Yannis Stylianou:
Cluster adaptive training of average voice models. ICASSP 2014: 280-284 - [c170]Pierre Lanchantin, Mark J. F. Gales, Simon King, Junichi Yamagishi:
Multiple-average-voice-based speech synthesis. ICASSP 2014: 285-289 - [c169]Langzhou Chen, Norbert Braunschweiler, Mark J. F. Gales:
Speaker dependent expression predictor from text: Expressiveness and transplantation. ICASSP 2014: 2574-2578 - [c168]Jingzhou Yang, Rogier C. van Dalen, Shi-Xiong Zhang, Mark J. F. Gales:
Infinite structured support vector machines for speech recognition. ICASSP 2014: 3320-3324 - [c167]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic neural network language models. ICASSP 2014: 4903-4907 - [c166]Xunying Liu, Yongqiang Wang, Xie Chen, Mark J. F. Gales, Philip C. Woodland:
Efficient lattice rescoring using recurrent neural network language models. ICASSP 2014: 4908-4912 - [c165]Takuya Yoshioka, Xie Chen, Mark J. F. Gales:
Impact of single-microphone dereverberation on DNN-based meeting transcription systems. ICASSP 2014: 5527-5531 - [c164]Takuya Yoshioka, Anton Ragni, Mark J. F. Gales:
Investigation of unsupervised adaptation of DNN acoustic models with filter bank input. ICASSP 2014: 6344-6348 - [c163]Kate M. Knill, Mark J. F. Gales, Anton Ragni, Shakti P. Rath:
Language independent and unsupervised acoustic models for speech recognition and keyword spotting. INTERSPEECH 2014: 16-20 - [c162]Xie Chen, Yongqiang Wang, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch. INTERSPEECH 2014: 641-645 - [c161]Anton Ragni, Kate M. Knill, Shakti P. Rath, Mark J. F. Gales:
Data augmentation for low resource languages. INTERSPEECH 2014: 810-814 - [c160]Shakti P. Rath, Kate M. Knill, Anton Ragni, Mark J. F. Gales:
Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages. INTERSPEECH 2014: 835-839 - [c159]Xie Chen, Mark J. F. Gales, Kate M. Knill, Catherine Breslin, Langzhou Chen, K. K. Chin, Vincent Wan:
An initial investigation of long-term adaptation for meeting transcription. INTERSPEECH 2014: 954-958 - [c158]BalaKrishna Kolluru, Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Mark J. F. Gales:
Generating multiple-accent pronunciations for TTS using joint sequence model interpolation. INTERSPEECH 2014: 1273-1277 - [c157]Kayoko Yanagisawa, Langzhou Chen, Mark J. F. Gales:
Noise-robust TTS speaker adaptation with statistics smoothing. INTERSPEECH 2014: 1519-1523 - [c156]Penny Karanasou, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Adaptation of deep neural network acoustic models using factorised i-vectors. INTERSPEECH 2014: 2180-2184 - [c155]Javier Latorre, Kayoko Yanagisawa, Vincent Wan, BalaKrishna Kolluru, Mark J. F. Gales:
Speech intonation for TTS: study on evaluation methodology. INTERSPEECH 2014: 2957-2961 - [c154]Mark J. F. Gales, Kate M. Knill, Anton Ragni, Shakti P. Rath:
Speech recognition and keyword spotting for low-resource languages: Babel project research at CUED. SLTU 2014: 16-23 - 2013
- [j46]Xunying Liu, Mark John Francis Gales, Philip C. Woodland:
Use of contexts in language model interpolation and adaptation. Comput. Speech Lang. 27(1): 301-321 (2013) - [j45]Rogier C. van Dalen, Mark John Francis Gales:
Importance sampling to compute likelihoods of noise-corrupted speech. Comput. Speech Lang. 27(1): 322-349 (2013) - [j44]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Language model cross adaptation for LVCSR system combination. Comput. Speech Lang. 27(4): 928-942 (2013) - [j43]Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum for statistical parametric speech synthesis. Speech Commun. 55(5): 606-618 (2013) - [j42]Shi-Xiong Zhang, Mark J. F. Gales:
Structured SVMs for Automatic Speech Recognition. IEEE Trans. Speech Audio Process. 21(3): 544-555 (2013) - [c153]Kate M. Knill, Mark J. F. Gales, Shakti P. Rath, Philip C. Woodland, Chao Zhang, Shi-Xiong Zhang:
Investigation of multilingual deep neural networks for spoken term detection. ASRU 2013: 138-143 - [c152]Javier Latorre, Mark J. F. Gales, Kate M. Knill, Masami Akamine:
Training a supra-segmental parametric F0 model without interpolating F0. ICASSP 2013: 6880-6884 - [c151]Shi-Xiong Zhang, Mark J. F. Gales:
Kernelized log linear models for continuous speech recognition. ICASSP 2013: 6950-6954 - [c150]Rogier C. van Dalen, Anton Ragni, Mark J. F. Gales:
Efficient decoding with generative score-spaces using the expectation semiring. ICASSP 2013: 7619-7623 - [c149]Yongqiang Wang, Mark J. F. Gales:
Tandem system adaptation using multiple linear feature transforms. ICASSP 2013: 7932-7936 - [c148]Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum analysis based on the minimum mean squared error. ICASSP 2013: 7972-7976 - [c147]Langzhou Chen, Mark J. F. Gales, Norbert Braunschweiler, Masami Akamine, Kate M. Knill:
Integrated automatic expression prediction and speech synthesis from text. ICASSP 2013: 7977-7981 - [c146]Jonathan Mamou, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Brian Kingsbury, Kate M. Knill, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
System combination and score normalization for spoken term detection. ICASSP 2013: 8272-8276 - [c145]Brian Kingsbury, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Kate M. Knill, Jonathan Mamou, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
A high-performance Cantonese keyword search system. ICASSP 2013: 8277-8281 - [c144]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic language models and combination with neural network language models. ICASSP 2013: 8421-8425 - [c143]Matthew Stephen Seigel, Philip C. Woodland, Mark J. F. Gales:
A confidence-based approach for improving keyword hypothesis scores. ICASSP 2013: 8565-8569 - [c142]Pierre Lanchantin, Peter Bell, Mark J. F. Gales, Thomas Hain, Xunying Liu, Yanhua Long, Jennifer Quinnell, Steve Renals, Oscar Saz, Matthew Stephen Seigel, Pawel Swietojanski, Philip C. Woodland:
Automatic Transcription of Multi-genre Media Archives. SLAM@INTERSPEECH 2013: 26-31 - [c141]Yongqiang Wang, Mark J. F. Gales:
An explicit independence constraint for factorised adaptation in speech recognition. INTERSPEECH 2013: 1233-1237 - [c140]Yanhua Long, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Matthew Stephen Seigel, Philip C. Woodland:
Improving lightly supervised training for broadcast transcription. INTERSPEECH 2013: 2187-2191 - [c139]Ranniery Maia, Mark J. F. Gales, Yannis Stylianou, Masami Akamine:
Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis. INTERSPEECH 2013: 2336-2340 - [c138]Vincent Wan, Robert Anderson, Art Blokland, Norbert Braunschweiler, Langzhou Chen, BalaKrishna Kolluru, Javier Latorre, Ranniery Maia, Björn Stenger, Kayoko Yanagisawa, Yannis Stylianou, Masami Akamine, Mark J. F. Gales, Roberto Cipolla:
Photo-realistic expressive text to talking head synthesis. INTERSPEECH 2013: 2667-2669 - [c137]Jingzhou Yang, Rogier C. van Dalen, Mark J. F. Gales:
Infinite support vector machines in speech recognition. INTERSPEECH 2013: 3303-3307 - [c136]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Cross-domain paraphrasing for improving language modelling using out-of-domain data. INTERSPEECH 2013: 3424-3428 - [c135]Kayoko Yanagisawa, Javier Latorre, Vincent Wan, Mark J. F. Gales, Simon King:
Noise robustness in HMM-TTS speaker adaptation. SSW 2013: 119-124 - 2012
- [j41]Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Morphological decomposition in Arabic ASR systems. Comput. Speech Lang. 26(4): 229-243 (2012) - [j40]Heiga Zen, Mark J. F. Gales, Yoshihiko Nankaku, Keiichi Tokuda:
Product of Experts for Statistical Parametric Speech Synthesis. IEEE Trans. Speech Audio Process. 20(3): 794-805 (2012) - [j39]Heiga Zen, Norbert Braunschweiler, Sabine Buchholz, Mark J. F. Gales, Kate M. Knill, Sacha Krstulovic, Javier Latorre:
Statistical Parametric Speech Synthesis Based on Speaker and Language Factorization. IEEE Trans. Speech Audio Process. 20(6): 1713-1724 (2012) - [j38]Yongqiang Wang, Mark J. F. Gales:
Speaker and Noise Factorization for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 20(7): 2149-2158 (2012) - [c134]Florian Eyben, Sabine Buchholz, Norbert Braunschweiler, Javier Latorre, Vincent Wan, Mark J. F. Gales, Kate M. Knill:
Unsupervised clustering of emotion and voice styles for expressive TTS. ICASSP 2012: 4009-4012 - [c133]Anton Ragni, Mark J. F. Gales:
Inference algorithms for generative score-spaces. ICASSP 2012: 4149-4152 - [c132]Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum as phase information in statistical parametric speech synthesis. ICASSP 2012: 4581-4584 - [c131]Federico Flego, Mark J. F. Gales:
Factor analysis based VTS discriminative adaptive training. ICASSP 2012: 4669-4672 - [c130]Langzhou Chen, Mark J. F. Gales, Vincent Wan, Javier Latorre, Masami Akamine:
Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training. INTERSPEECH 2012: 959-962 - [c129]Javier Latorre, Vincent Wan, Mark J. F. Gales, Langzhou Chen, K. K. Chin, Kate M. Knill, Masami Akamine:
Speech factorization for HMM-TTS based on cluster adaptive training. INTERSPEECH 2012: 971-974 - [c128]Vincent Wan, Javier Latorre, K. K. Chin, Langzhou Chen, Mark J. F. Gales, Heiga Zen, Kate M. Knill, Masami Akamine:
Combining multiple high quality corpora for improving HMM-TTS. INTERSPEECH 2012: 1135-1138 - [c127]Yongqiang Wang, Mark J. F. Gales:
Model-based approaches to adaptive training in reverberant environments. INTERSPEECH 2012: 1195-1198 - [c126]Mark J. F. Gales, Federico Flego:
Model-Based Approaches for Degraded Channel Modelling in Robust ASR. INTERSPEECH 2012: 1199-1202 - [c125]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic Language Models. INTERSPEECH 2012: 1656-1659 - [c124]Zoi Roupakia, Anton Ragni, Mark J. F. Gales:
Rapid Nonlinear Speaker Adaptation for Large-Vocabulary Continuous Speech Recognition. INTERSPEECH 2012: 1784-1787 - [c123]Mark J. F. Gales, Anton Ragni, Austin Zhang, Rogier C. van Dalen:
Structured discriminative models for speech recognition. MLSLP 2012 - [c122]Peter Bell, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanhua Long, Steve Renals, Pawel Swietojanski, Philip C. Woodland:
Transcription of multi-genre media archives using out-of-domain data. SLT 2012: 324-329 - 2011
- [j37]Junho Park, Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
The efficient incorporation of MLP features into automatic speech recognition systems. Comput. Speech Lang. 25(3): 519-534 (2011) - [j36]Zoi Roupakia, Mark J. F. Gales:
Kernel Eigenvoices (Revisited) for Large-Vocabulary Speech Recognition. IEEE Signal Process. Lett. 18(12): 709-712 (2011) - [j35]D. K. Kim, Mark J. F. Gales:
Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition. IEEE Trans. Speech Audio Process. 19(2): 315-325 (2011) - [j34]Rogier C. van Dalen, Mark J. F. Gales:
Extended VTS for Noise-Robust Speech Recognition. IEEE Trans. Speech Audio Process. 19(4): 733-743 (2011) - [j33]Haitian Xu, Mark J. F. Gales, K. K. Chin:
Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition. IEEE Trans. Speech Audio Process. 19(6): 1665-1676 (2011) - [c121]Shi-Xiong Zhang, Mark J. F. Gales:
Extending noise robust structured support vector machines to larger vocabulary tasks. ASRU 2011: 18-23 - [c120]Yongqiang Wang, Mark J. F. Gales:
Improving reverberant VTS for hands-free robust speech recognition. ASRU 2011: 113-118 - [c119]Anton Ragni, Mark J. F. Gales:
Derivative kernels for noise robust ASR. ASRU 2011: 119-124 - [c118]Rogier C. van Dalen, Mark J. F. Gales:
A variational perspective on noise-robust speech recognition. ASRU 2011: 125-130 - [c117]Heiga Zen, Mark J. F. Gales:
Decision tree-based context clustering based on cross validation and hierarchical priors. ICASSP 2011: 4560-4563 - [c116]Yongqiang Wang, Mark J. F. Gales:
Speaker and noise factorisation on the AURORA4 task. ICASSP 2011: 4584-4587 - [c115]Javier Latorre, Mark J. F. Gales, Sabine Buchholz, Kate M. Knill, Masatsune Tamura, Yamato Ohtani, Masami Akamine:
Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification? ICASSP 2011: 4724-4727 - [c114]Anton Ragni, Mark John Francis Gales:
Structured discriminative models for noise robust continuous speech recognition. ICASSP 2011: 4788-4791 - [c113]Federico Flego, Mark John Francis Gales:
Factor analysis based VTS and JUD noise estimation and compensation. ICASSP 2011: 4792-4795 - [c112]Xunying Liu, Mark John Francis Gales, Jim L. Hieronymus, Philip C. Woodland:
Investigation of acoustic units for LVCSR systems. ICASSP 2011: 4872-4875 - [c111]Langzhou Chen, Mark J. F. Gales, K. K. Chin:
Constrained discriminative mapping transforms for unsupervised speaker adaptation. ICASSP 2011: 5344-5347 - [c110]K. K. Chin, Haitian Xu, Mark J. F. Gales, Catherine Breslin, Kate M. Knill:
Rapid joint speaker and noise compensation for robust speech recognition. ICASSP 2011: 5500-5503 - [c109]Frank Diehl, Mark John Francis Gales, Xunying Liu, Marcus Tomalin, Philip C. Woodland:
Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems. INTERSPEECH 2011: 777-780 - [c108]Shi-Xiong Zhang, Mark J. F. Gales:
Structured Support Vector Machines for Noise Robust Continuous Speech Recognition. INTERSPEECH 2011: 989-990 - [c107]Catherine Breslin, K. K. Chin, Mark J. F. Gales, Kate M. Knill:
Integrated Online Speaker Clustering and Adaptation. INTERSPEECH 2011: 1085-1088 - [c106]Ranniery Maia, Heiga Zen, Kate M. Knill, Mark J. F. Gales, Sabine Buchholz:
Multipulse Sequences for Residual Signal Modeling. INTERSPEECH 2011: 1833-1836 - [c105]T. Li, Philip C. Woodland, Frank Diehl, Mark J. F. Gales:
Graphone Model Interpolation and Arabic Pronunciation Generation. INTERSPEECH 2011: 2309-2312 - [c104]Nicholas Pilkington, Heiga Zen, Mark J. F. Gales:
Gaussian Process Experts for Voice Conversion. INTERSPEECH 2011: 2761-2764 - [c103]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation. INTERSPEECH 2011: 2857-2860 - [p1]Mark J. F. Gales:
Model-Based Approaches to Handling Uncertainty. Robust Speech Recognition of Uncertain or Missing Data 2011: 101-125 - 2010
- [j32]Mark J. F. Gales, Federico Flego:
Discriminative classifiers with adaptive kernels for noise robust speech recognition. Comput. Speech Lang. 24(4): 648-662 (2010) - [j31]Kai Yu, Mark J. F. Gales, Lan Wang, Philip C. Woodland:
Unsupervised training and directed manual transcription for LVCSR. Speech Commun. 52(7-8): 652-663 (2010) - [j30]Shi-Xiong Zhang, Anton Ragni, Mark J. F. Gales:
Structured Log Linear Models for Noise Robust Speech Recognition. IEEE Signal Process. Lett. 17(11): 945-948 (2010) - [c102]Heiga Zen, Mark J. F. Gales, Yoshihiko Nankaku, Keiichi Tokuda:
Statistical parametric speech synthesis based on product of experts. ICASSP 2010: 4242-4245 - [c101]Marcus Tomalin, Frank Diehl, Mark J. F. Gales, Junho Park, Philip C. Woodland:
Recent improvements to the Cambridge Arabic Speech-to-Text systems. ICASSP 2010: 4382-4385 - [c100]Xunying Liu, Mark J. F. Gales, Jim L. Hieronymus, Philip C. Woodland:
Language model combination and adaptation usingweighted finite state transducers. ICASSP 2010: 5390-5393 - [c99]Mark J. F. Gales, Kai Yu:
Canonical state models for automatic speech recognition. INTERSPEECH 2010: 58-61 - [c98]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Language model cross adaptation for LVCSR system combination. INTERSPEECH 2010: 342-345 - [c97]Rogier C. van Dalen, Mark J. F. Gales:
Asymptotically exact noise-corrupted speech likelihoods. INTERSPEECH 2010: 709-712 - [c96]Junho Park, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Improved neural network based language modelling and adaptation. INTERSPEECH 2010: 1041-1044 - [c95]Catherine Breslin, K. K. Chin, Mark J. F. Gales, Kate M. Knill, Haitian Xu:
Prior information for rapid speaker adaptation. INTERSPEECH 2010: 1644-1647 - [c94]Javier Latorre, Mark J. F. Gales, Heiga Zen:
Training a parametric-based logF0 model with the minimum generation error criterion. INTERSPEECH 2010: 2174-2177 - [c93]Norbert Braunschweiler, Mark J. F. Gales, Sabine Buchholz:
Lightly supervised recognition for automatic alignment of large coherent speech recordings. INTERSPEECH 2010: 2222-2225 - [c92]Ranniery Maia, Heiga Zen, Mark J. F. Gales:
Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters. SSW 2010: 88-93
2000 – 2009
- 2009
- [j29]Catherine Breslin, Mark J. F. Gales:
Directed decision trees for generating complementary systems. Speech Commun. 51(3): 284-295 (2009) - [j28]Kai Yu, Mark J. F. Gales, Philip C. Woodland:
Unsupervised Adaptation With Discriminative Mapping Transforms. IEEE Trans. Speech Audio Process. 17(4): 714-723 (2009) - [j27]Chris Longworth, Mark J. F. Gales:
Combining Derivative and Parametric Kernels for Speaker Verification. IEEE Trans. Speech Audio Process. 17(4): 748-757 (2009) - [c91]Mark J. F. Gales:
Acoustic modelling for speech recognition: Hidden Markov models and beyond? ASRU 2009: 44 - [c90]Federico Flego, Mark J. F. Gales:
Discriminative adaptive training with VTS and JUD. ASRU 2009: 170-175 - [c89]Mark J. F. Gales, Anton Ragni, H. AlDamarki, C. Gautier:
Support vector machines for noise robust ASR. ASRU 2009: 205-210 - [c88]Haitian Xu, Mark J. F. Gales, K. K. Chin:
Improving joint uncertainty decoding performance by predictive methods for noise robust speech recognition. ASRU 2009: 222-227 - [c87]Mark J. F. Gales, Federico Flego:
Combining VTS model compensation and support vector machines. ICASSP 2009: 3821-3824 - [c86]Rogier C. van Dalen, Mark J. F. Gales:
Extended VTS for noise-robust speech recognition. ICASSP 2009: 3829-3832 - [c85]Federico Flego, Mark J. F. Gales:
Incremental predictive and adaptive noise compensation. ICASSP 2009: 3837-3840 - [c84]Chandra Kant Raut, Mark J. F. Gales:
Bayesian discriminative adaptation for speech recognition. ICASSP 2009: 4361-4364 - [c83]Junho Park, Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Training and adapting MLP features for Arabic speech recognition. ICASSP 2009: 4461-4464 - [c82]Junho Park, Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Efficient generation and use of MLP features for Arabic speech recognition. INTERSPEECH 2009: 236-239 - [c81]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Use of contexts in language model interpolation and adaptation. INTERSPEECH 2009: 360-363 - [c80]Jim L. Hieronymus, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Exploiting Chinese character models to improve speech recognition performance. INTERSPEECH 2009: 364-367 - [c79]Federico Flego, Mark J. F. Gales:
Incremental adaptation with VTS and joint adaptively trained systems. INTERSPEECH 2009: 1251-1254 - [c78]Chris Longworth, Rogier C. van Dalen, Mark J. F. Gales:
Variational dynamic kernels for speaker verification. INTERSPEECH 2009: 1571-1574 - [c77]D. K. Kim, Mark J. F. Gales:
Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition. INTERSPEECH 2009: 2383-2386 - [c76]Rogier C. van Dalen, Federico Flego, Mark J. F. Gales:
Transforming features to compensate speech recogniser models for noise. INTERSPEECH 2009: 2499-2502 - [c75]Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Morphological analysis and decomposition for Arabic speech-to-text systems. INTERSPEECH 2009: 2675-2678 - 2008
- [j26]Hank Liao, Mark J. F. Gales:
Issues with uncertainty decoding for noise robust automatic speech recognition. Speech Commun. 50(4): 265-277 (2008) - [c74]Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Phonetic pronunciations for arabic speech-to-text systems. ICASSP 2008: 1573-1576 - [c73]Chris Longworth, Mark J. F. Gales:
Multiple kernel learning for speaker verification. ICASSP 2008: 1581-1584 - [c72]Kai Yu, Mark J. F. Gales, Philip C. Woodland:
Unsupervised discriminative adaptation using discriminative mapping transforms. ICASSP 2008: 4273-4276 - [c71]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Context dependent language model adaptation. INTERSPEECH 2008: 837-840 - [c70]Chris Longworth, Mark J. F. Gales:
A generalised derivative kernel for speaker verification. INTERSPEECH 2008: 1381-1384 - [c69]Chandra Kant Raut, Kai Yu, Mark J. F. Gales:
Adaptive training using discriminative mapping transforms. INTERSPEECH 2008: 1697-1700 - [c68]Mark J. F. Gales, Chris Longworth:
Discriminative classifiers with generative kernels for noise robust ASR. INTERSPEECH 2008: 1996-1999 - [c67]Rogier C. van Dalen, Mark J. F. Gales:
Covariance modelling for noise-robust speech recognition. INTERSPEECH 2008: 2000-2003 - 2007
- [j25]Khe Chai Sim, Mark J. F. Gales:
Discriminative semi-parametric trajectory model for speech recognition. Comput. Speech Lang. 21(4): 669-687 (2007) - [j24]Mark J. F. Gales, Steve J. Young:
The Application of Hidden Markov Models in Speech Recognition. Found. Trends Signal Process. 1(3): 195-304 (2007) - [j23]Xunying Liu, Mark J. F. Gales:
Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions. IEEE Trans. Speech Audio Process. 15(4): 1414-1424 (2007) - [j22]Kai Yu, Mark J. F. Gales:
Bayesian Adaptive Inference and Adaptive Training. IEEE Trans. Speech Audio Process. 15(6): 1932-1943 (2007) - [j21]Martin I. Layton, Mark J. F. Gales:
Acoustic Modelling Using Continuous Rational Kernels. J. VLSI Signal Process. 48(1-2): 67-82 (2007) - [c66]Mark J. F. Gales, Frank Diehl, Chandra Kant Raut, Marcus Tomalin, Philip C. Woodland, Kai Yu:
Development of a phonetic system for large vocabulary Arabic speech recognition. ASRU 2007: 24-29 - [c65]Mark J. F. Gales, Rogier C. van Dalen:
Predictive linear transforms for noise robust speech recognition. ASRU 2007: 59-64 - [c64]Xunying Liu, William J. Byrne, Mark J. F. Gales, Adrià de Gispert, Marcus Tomalin, Philip C. Woodland, Kai Yu:
Discriminative language model adaptation for Mandarin broadcast speech transcription and translation. ASRU 2007: 153-158 - [c63]Marcus Tomalin, Mark J. F. Gales, X. Andrew Liu, Khe Chai Sim, Rohit Sinha, Lan Wang, Philip C. Woodland, Kai Yu:
Improving Speech Transcription for Mandarin-English Translation. ICASSP (4) 2007: 97-100 - [c62]Khe Chai Sim, William J. Byrne, Mark J. F. Gales, Hichem Sahbi, Philip C. Woodland:
Consensus Network Decoding for Statistical Machine Translation System Combination. ICASSP (4) 2007: 105-108 - [c61]Catherine Breslin, Mark J. F. Gales:
Complementary System Generation using Directed Decision Trees. ICASSP (4) 2007: 337-340 - [c60]Lan Wang, Mark J. F. Gales, Philip C. Woodland:
Unsupervised Training for Mandarin Broadcast News and Conversation Transcription. ICASSP (4) 2007: 353-356 - [c59]Hank Liao, Mark J. F. Gales:
Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data. ICASSP (4) 2007: 389-392 - [c58]Mark J. F. Gales, Xunying Liu, Rohit Sinha, Philip C. Woodland, Kai Yu, Spyros Matsoukas, Tim Ng, Kham Nguyen, Long Nguyen, Jean-Luc Gauvain, Lori Lamel, Abdelkhalek Messaoudi:
Speech Recognition System Combination for Machine Translation. ICASSP (4) 2007: 1277-1280 - [c57]Chris Longworth, Mark J. F. Gales:
Derivative and parametric kernels for speaker verification. INTERSPEECH 2007: 310-313 - [c56]Catherine Breslin, Mark J. F. Gales:
Building multiple complementary systems using directed decision trees. INTERSPEECH 2007: 1441-1444 - [c55]Kai Yu, Mark J. F. Gales, Philip C. Woodland:
Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio. INTERSPEECH 2007: 1709-1712 - 2006
- [j20]Mark J. F. Gales, S. S. Airey:
Product of Gaussians for speech recognition. Comput. Speech Lang. 20(1): 22-40 (2006) - [j19]Mark J. F. Gales, Martin I. Layton:
Training Augmented Models Using SVMs. IEICE Trans. Inf. Syst. 89-D(3): 892-899 (2006) - [j18]Thomas Hain, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu, Gareth L. Moore, Daniel Povey, Lan Wang:
Corrections to "Automatic Transcription of Conversational Telephone Speech". IEEE Trans. Speech Audio Process. 14(2): 727-727 (2006) - [j17]Khe Chai Sim, Mark J. F. Gales:
Minimum phone error training of precision matrix models. IEEE Trans. Speech Audio Process. 14(3): 882-889 (2006) - [j16]Mark J. F. Gales, Do Yeong Kim, Philip C. Woodland, Ho Yin Chan, David Mrva, Rohit Sinha, S. E. Tranter:
Progress in the CU-HTK broadcast news transcription system. IEEE Trans. Speech Audio Process. 14(5): 1513-1525 (2006) - [j15]Kai Yu, Mark J. F. Gales:
Discriminative cluster adaptive training. IEEE Trans. Speech Audio Process. 14(5): 1694-1703 (2006) - [c54]Martin I. Layton, Mark J. F. Gales:
Augmented Statistical Models for Speech Recognition. ICASSP (1) 2006: 129-132 - [c53]Kai Yu, Mark J. F. Gales:
Incremental Adaptation using Bayesian Inference. ICASSP (1) 2006: 217-220 - [c52]Rohit Sinha, Mark J. F. Gales, Do Yeong Kim, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland:
The Cu-Htk Mandarin Broadcast News Transcription System. ICASSP (1) 2006: 1077-1080 - [c51]Catherine Breslin, Mark J. F. Gales:
Generating complementary systems for speech recognition. INTERSPEECH 2006 - [c50]Hank Liao, Mark J. F. Gales:
Issues with uncertainty decoding for noise robust speech recognition. INTERSPEECH 2006 - [c49]Chris Longworth, Mark J. F. Gales:
Discriminative adaptation for speaker verification. INTERSPEECH 2006 - 2005
- [j14]Thomas Hain, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu, Gareth L. Moore, Daniel Povey, Lan Wang:
Automatic transcription of conversational telephone speech. IEEE Trans. Speech Audio Process. 13(6): 1173-1185 (2005) - [c48]Khe Chai Sim, Mark J. F. Gales:
Adaptation of Precision Matrix Models on Large Vocabulary Continuous Speech Recognition. ICASSP (1) 2005: 97-100 - [c47]Gunnar Evermann, Ho Yin Chan, Mark J. F. Gales, Bin Jia, David Mrva, Philip C. Woodland, Kai Yu:
Training LVCSR Systems on Thousands of Hours of Data. ICASSP (1) 2005: 209-212 - [c46]Mark J. F. Gales, Bin Jia, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland, Kai Yu:
Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System. ICASSP (1) 2005: 841-844 - [c45]Xunying Liu, Mark J. F. Gales, Khe Chai Sim, Kai Yu:
Investigation of Acoustic Modeling Techniques for LVCSR Systems. ICASSP (1) 2005: 849-852 - [c44]Do Yeong Kim, Ho Yin Chan, Gunnar Evermann, Mark J. F. Gales, David Mrva, Khe Chai Sim, Philip C. Woodland:
Development of the CU-HTK 2004 Broadcast News Transcription Systems. ICASSP (1) 2005: 861-864 - [c43]Khe Chai Sim, Mark J. F. Gales:
Temporally varying model parameters for large vocabulary continuous speech recognition. INTERSPEECH 2005: 2137-2140 - [c42]Rohit Sinha, S. E. Tranter, Mark J. F. Gales, Philip C. Woodland:
The Cambridge University March 2005 speaker diarisation system. INTERSPEECH 2005: 2437-2440 - [c41]Hank Liao, Mark J. F. Gales:
Joint uncertainty decoding for noise robust speech recognition. INTERSPEECH 2005: 3129-3132 - 2004
- [j13]Antti-Veikko I. Rosti, Mark J. F. Gales:
Factor analysed hidden Markov models for speech recognition. Comput. Speech Lang. 18(2): 181-200 (2004) - [c40]Gunnar Evermann, Ho Yin Chan, Mark J. F. Gales, Thomas Hain, Xunying Liu, David Mrva, Lan Wang, Philip C. Woodland:
Development of the 2003 CU-HTK conversational telephone speech transcription system. ICASSP (1) 2004: 249-252 - [c39]Kai Yu, Mark J. F. Gales:
Adaptive training using structured transforms. ICASSP (1) 2004: 317-320 - [c38]Xunying Liu, Mark J. F. Gales:
Model complexity control and compression using discriminative growth functions. ICASSP (1) 2004: 797-800 - [c37]Khe Chai Sim, Mark J. F. Gales:
Basis superposition precision matrix modelling for large vocabulary continuous speech recognition. ICASSP (1) 2004: 801-804 - [c36]Antti-Veikko I. Rosti, Mark J. F. Gales:
Rao-Blackwellised Gibbs sampling for switching linear dynamical systems. ICASSP (1) 2004: 809-812 - [c35]Do Yeong Kim, Srinivasan Umesh, Mark J. F. Gales, Thomas Hain, Philip C. Woodland:
Using VTLN for broadcast news transcription. INTERSPEECH 2004: 1953-1956 - 2003
- [c34]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Automatic complexity control for HLDA systems. ICASSP (1) 2003: 132-135 - [c33]Daniel Povey, Philip C. Woodland, Mark J. F. Gales:
Discriminative map for acoustic model adaptation. ICASSP (1) 2003: 312-315 - [c32]Mark J. F. Gales, Yuan Dong, Daniel Povey, Philip C. Woodland:
Porting: SwitchBoard to the VoiceMail task. ICASSP (1) 2003: 536-539 - [c31]S. S. Airey, Mark J. F. Gales:
Product of Gaussians and multiple stream systems. ICASSP (1) 2003: 844-847 - [c30]S. S. Airey, Mark J. F. Gales:
Product of Gaussians as a distributed representation for speech recognition. INTERSPEECH 2003: 877-880 - [c29]Daniel Povey, Mark J. F. Gales, Do Yeong Kim, Philip C. Woodland:
MMI-MAP and MPE-MAP for acoustic model adaptation. INTERSPEECH 2003: 1981-1984 - 2002
- [j12]Mark J. F. Gales:
Transformation streams and the HMM error model. Comput. Speech Lang. 16(2): 225-243 (2002) - [j11]Scott Saobing Chen, Ellen Eide, Mark J. F. Gales, Ramesh A. Gopinath, D. Kanvesky, Peder A. Olsen:
Automatic transcription of Broadcast News. Speech Commun. 37(1-2): 69-87 (2002) - [j10]Mark J. F. Gales:
Maximum likelihood multiple subspace projections for hidden Markov models. IEEE Trans. Speech Audio Process. 10(2): 37-47 (2002) - [c28]Nathan D. Smith, Mark J. F. Gales:
Using SVMS and discriminative models for speech recognition. ICASSP 2002: 77-80 - [c27]Ricardo de Córdoba, Philip C. Woodland, Mark J. F. Gales:
Improved cross-task recognition using MMIE training. ICASSP 2002: 85-88 - [c26]Mark J. F. Gales:
The HMM error model. ICASSP 2002: 937-940 - [c25]Antti-Veikko I. Rosti, Mark J. F. Gales:
Factor analysed hidden Markov models. ICASSP 2002: 949-952 - [c24]Matthew N. Stuttle, Mark J. F. Gales:
Combining a Gaussian mixture model front end with MFCC parameters. INTERSPEECH 2002: 1565-1568 - 2001
- [c23]Mark J. F. Gales:
Multiple-cluster adaptive training schemes. ICASSP 2001: 361-364 - [c22]Matthew N. Stuttle, Mark J. F. Gales:
A mixture of Gaussians front end for speech recognition. INTERSPEECH 2001: 675-678 - [c21]N. Smith, Mark J. F. Gales:
Speech Recognition using SVMs. NIPS 2001: 1197-1204 - 2000
- [j9]Mark J. F. Gales:
Cluster adaptive training of hidden Markov models. IEEE Trans. Speech Audio Process. 8(4): 417-428 (2000) - [c20]Anuradha Aiyer, Mark J. F. Gales, Michael A. Picheny:
Rapid likelihood calculation of subspace clustered Gaussian components. ICASSP 2000: 1519-1522 - [c19]Ellen Eide, Benoît Maison, Dimitri Kanevsky, Peder A. Olsen, Scott Saobing Chen, Lidia Mangu, Mark J. F. Gales, Miroslav Novak, Ramesh A. Gopinath:
Transcription of broadcast news with a time constraint: IBM's 10xRT HUB4 system. INTERSPEECH 2000: 851-854 - [c18]Mark J. F. Gales:
Factored Semi-Tied Covariance Matrices. NIPS 2000: 779-785
1990 – 1999
- 1999
- [j8]Mark J. F. Gales, Katherine M. Knill, Steve J. Young:
State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs. IEEE Trans. Speech Audio Process. 7(2): 152-161 (1999) - [j7]Mark J. F. Gales:
Semi-tied covariance matrices for hidden Markov models. IEEE Trans. Speech Audio Process. 7(3): 272-281 (1999) - [c17]Scott Saobing Chen, Ellen Eide, Mark J. F. Gales, Ramesh A. Gopinath, Dimitri Kanevsky, Peder A. Olsen:
Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news. ICASSP 1999: 37-40 - [c16]Mark J. F. Gales, Peder A. Olsen:
Tail distribution modelling using the richter and power exponential distributions. EUROSPEECH 1999: 1507-1510 - 1998
- [j6]Mark J. F. Gales:
Maximum likelihood linear transformations for HMM-based speech recognition. Comput. Speech Lang. 12(2): 75-98 (1998) - [j5]Mark J. F. Gales:
Predictive model-based compensation schemes for robust speech recognition. Speech Commun. 25(1-3): 49-74 (1998) - [c15]Mark J. F. Gales:
Semi-tied covariance matrices. ICASSP 1998: 657-660 - [c14]Mark J. F. Gales:
Cluster adaptive training for speech recognition. ICSLP 1998 - 1997
- [c13]Philip C. Woodland, Mark J. F. Gales, David Pye, Steve J. Young:
Broadcast news transcription using HTK. ICASSP 1997: 719-722 - [c12]Harriet J. Nock, Mark J. F. Gales, Steve J. Young:
A comparative study of methods for phonetic decision-tree state clustering. EUROSPEECH 1997: 111-114 - [c11]Mark J. F. Gales:
Transformation smoothing for speaker and environmental adaptation. EUROSPEECH 1997: 2067-2070 - 1996
- [j4]Mark J. F. Gales, Philip C. Woodland:
Mean and variance adaptation within the MLLR framework. Comput. Speech Lang. 10(4): 249-264 (1996) - [j3]Mark J. F. Gales, Steve J. Young:
Robust continuous speech recognition using parallel model combination. IEEE Trans. Speech Audio Process. 4(5): 352-359 (1996) - [c10]Philip C. Woodland, Mark John Francis Gales, David Pye:
Improving environmental robustness in large vocabulary speech recognition. ICASSP 1996: 65-68 - [c9]Kate M. Knill, Mark J. F. Gales, Steve J. Young:
Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs. ICSLP 1996: 470-473 - [c8]Philip C. Woodland, David Pye, Mark J. F. Gales:
Iterative unsupervised adaptation using maximum likelihood linear regression. ICSLP 1996: 1133-1136 - [c7]Mark J. F. Gales, David Pye, Philip C. Woodland:
Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation. ICSLP 1996: 1832-1835 - 1995
- [j2]Mark J. F. Gales, Steve J. Young:
Robust speech recognition in additive and convolutional noise using parallel model combination. Comput. Speech Lang. 9(4): 289-307 (1995) - [c6]Mark John Francis Gales, Steve J. Young:
A fast and flexible implementation of parallel model combination. ICASSP 1995: 133-136 - [c5]Mark J. F. Gales, Steve J. Young:
The application of parallel model combination to a large vocabulary dictation task. EUROSPEECH 1995: 1983-1986 - 1994
- [c4]Mark J. F. Gales, Steve J. Young:
Parallel model combination on a noise corrupted resource management task. ICSLP 1994: 255-258 - 1993
- [j1]Mark J. F. Gales, Steve J. Young:
Cepstral parameter compensation for HMM recognition in noise. Speech Commun. 12(3): 231-239 (1993) - [c3]Mark J. F. Gales, Steve J. Young:
HMM recognition in noise using parallel model combination. EUROSPEECH 1993: 837-840 - [c2]Mark J. F. Gales, Steve J. Young:
Segmental hidden Markov models. EUROSPEECH 1993: 1579-1582 - 1992
- [c1]Mark J. F. Gales, Steve J. Young:
An improved approach to the hidden Markov model decomposition of speech and noise. ICASSP 1992: 233-236
Coauthor Index
aka: Penny Karanasou
aka: Katherine M. Knill
aka: X. Andrew Liu
aka: Potsawee Manakul
aka: Jeremy Heng Meng Wong
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 20:38 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint