default search action

combined dblp search
author search
venue search
publication search

ask others

Cong-Thanh Do

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c23]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/DoIDH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DoIDH24
Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain:
Improving Accented Speech Recognition Using Data Augmentation Based on Unsupervised Text-to-Speech Synthesis. EUSIPCO 2024: 136-140
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-04047
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04047
Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain:
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis. CoRR abs/2407.04047 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-16423
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16423
Mohan Li, Cong-Thanh Do, Simon Keizer, Youmna Farag, Svetlana Stoyanchev, Rama Doddipatla:
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding. CoRR abs/2408.16423 (2024)
2023
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiZDD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiZDD23
Mohan Li, Catalin Zorila, Cong-Thanh Do, Rama Doddipatla:
Towards a Unified End-to-End Language Understanding System for Speech and Text Inputs. ASRU 2023: 1-8
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiDD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiDD23
Mohan Li, Cong-Thanh Do, Rama Doddipatla:
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and Rescoring. ICASSP 2023: 1-5
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoDLH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoDLH23
Cong-Thanh Do, Rama Doddipatla, Mohan Li, Thomas Hain:
Domain Adaptive Self-supervised Training of Automatic Speech Recognition. INTERSPEECH 2023: 4389-4393
2022
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/sigpro/DoNN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/sigpro/DoNN22
Cong-Thanh Do, Tran Thien Dat Nguyen, Hoa Van Nguyen:
Robust multi-sensor generalized labeled multi-Bernoulli filter. Signal Process. 192: 108368 (2022)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/sigpro/DoNMSC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/sigpro/DoNMSC22
Cong-Thanh Do, Tran Thien Dat Nguyen, Diluka Moratuwage, Changbeom Shim, Yon Dohn Chung:
Multi-object tracking with an adaptive generalized labeled multi-Bernoulli filter. Signal Process. 196: 108532 (2022)
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/iccais/NguyenDN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccais/NguyenDN22
Tran Thien Dat Nguyen, Cong-Thanh Do, Hoa Van Nguyen:
An Adaptive Multi-Sensor Generalised Labelled Multi-Bernoulli Filter for Linear Gaussian Models. ICCAIS 2022: 84-89
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoLD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoLD22
Cong-Thanh Do, Mohan Li, Rama Doddipatla:
Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer. INTERSPEECH 2022: 4446-4450
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-14736
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-14736
Cong-Thanh Do, Mohan Li, Rama Doddipatla:
Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer. CoRR abs/2207.14736 (2022)
2021
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangDDL0R21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangDDL0R21
Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals:
Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers. ICASSP 2021: 2750-2754
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DoDH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DoDH21
Cong-Thanh Do, Rama Doddipatla, Thomas Hain:
Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition. ICASSP 2021: 6978-6982
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/iccais/OngKD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccais/OngKD21
Jonah Ong, Du Yong Kim, Cong-Thanh Do:
A Tractable Multi-target Detection Model for Line-of-Sight Measurements. ICCAIS 2021: 147-152
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-04697
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-04697
Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals:
Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers. CoRR abs/2102.04697 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-15515
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-15515
Cong-Thanh Do, Rama Doddipatla, Thomas Hain:
Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end speech recognition. CoRR abs/2103.15515 (2021)
2020
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/DoZH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DoZH20
Cong-Thanh Do, Shucong Zhang, Thomas Hain:
Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness. EUSIPCO 2020: 321-325
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangDDR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangDDR20
Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Steve Renals:
Learning Noise Invariant Features Through Transfer Learning For Robust End-to-End Speech Recognition. ICASSP 2020: 7024-7028

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/sensors/DoN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/sensors/DoN19
Cong-Thanh Do, Hoa Van Nguyen:
Tracking Multiple Targets from Multistatic Doppler Radar with Unknown Probability of Detection. Sensors 19(7): 1672 (2019)
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/sensors/DoNL19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/sensors/DoNL19a
Cong-Thanh Do, Tran Thien Dat Nguyen, Weifeng Liu:
Tracking Multiple Marine Ships via Multiple Sensors with Unknown Backgrounds. Sensors 19(22): 5025 (2019)
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Do19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Do19
Cong-Thanh Do:
Subband Temporal Envelope Features and Data Augmentation for End-to-end Recognition of Distant Conversational Speech. ICASSP 2019: 6251-6255
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/iccais/DoN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccais/DoN19
Cong-Thanh Do, Tran Thien Dat Nguyen:
Multiple marine ships tracking from multistatic Doppler data with unknown clutter rate. ICCAIS 2019: 1-6
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-01957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-01957
Cong-Thanh Do:
End-to-End Speech Recognition with High-Frame-Rate Features Extraction. CoRR abs/1907.01957 (2019)
2018
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/iccais/DoN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccais/DoN18
Cong-Thanh Do, Hoa Van Nguyen:
Multistatic Doppler-Based Marine Ships Tracking. ICCAIS 2018: 151-156
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoS18
Cong-Thanh Do, Yannis Stylianou:
Weighting Time-Frequency Representation of Speech Using Auditory Saliency for Automatic Speech Recognition. INTERSPEECH 2018: 1591-1595
2017
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoS17
Cong-Thanh Do, Yannis Stylianou:
Improved Automatic Speech Recognition Using Subband Temporal Envelope Features and Time-Delay Neural Network Denoising Autoencoder. INTERSPEECH 2017: 3832-3836
2014
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/SarkarDLB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/SarkarDLB14
Achintya Kumar Sarkar, Cong-Thanh Do, Viet Bac Le, Claude Barras:
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification. IEEE Signal Process. Lett. 21(9): 1040-1044 (2014)
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoELdRC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoELdRC14
Cong-Thanh Do, Marc Evrard, A. Leman, Christophe d'Alessandro, Albert Rilliard, J.-L. Crebouw:
Objective evaluation of HMM-based speech synthesis system using kullback-leibler divergence. INTERSPEECH 2014: 2952-2956
[c6]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/sltu/DoLG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sltu/DoLG14
Cong-Thanh Do, Lori Lamel, Jean-Luc Gauvain:
Speech-to-text development for Slovak, a low-resourced language. SLTU 2014: 176-182
2013
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoBLS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoBLS13
Cong-Thanh Do, Claude Barras, Viet Bac Le, Achintya Kumar Sarkar:
Augmenting short-term cepstral features with long-term discriminative features for speaker verification of telephone data. INTERSPEECH 2013: 2484-2488
2012
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/DoPG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/DoPG12
Cong-Thanh Do, Dominique Pastor, André Goalic:
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech. Speech Commun. 54(1): 119-133 (2012)
[c4]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/DoB12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoB12
Cong-Thanh Do, Claude Barras:
Cochlear implant-like processing of speech signal for speaker verification. SAPA@INTERSPEECH 2012: 17-21
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/DoTG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/DoTG12
Cong-Thanh Do, Mohammad Javad Taghizadeh, Philip N. Garner:
Combining cepstral normalization and cochlear implant-like speech processing for microphone array-based speech recognition. SLT 2012: 137-142
2010
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DoPG10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DoPG10
Cong-Thanh Do, Dominique Pastor, André Goalic:
On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR. IEEE Trans. Speech Audio Process. 18(5): 1065-1068 (2010)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tbe/DoPG10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tbe/DoPG10
Cong-Thanh Do, Dominique Pastor, André Goalic:
On Normalized MSE Analysis of Speech Fundamental Frequency in the Cochlear Implant-Like Spectrally Reduced Speech. IEEE Trans. Biomed. Eng. 57(3): 572-577 (2010)
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoPLG10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoPLG10
Cong-Thanh Do, Dominique Pastor, Gaël Le Lan, André Goalic:
Recognizing cochlear implant-like spectrally reduced speech with HMM-based ASR: experiments with MFCCs and PLP coefficients. INTERSPEECH 2010: 2634-2637

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c1]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/avsp/DoAPG09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/avsp/DoAPG09
Cong-Thanh Do, Abdeldjalil Aïssa-El-Bey, Dominique Pastor, André Goalic:
Area of mouth opening estimation from speech acoustics using blind deconvolution technique. AVSP 2009: 80-85

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.