Chongjia Ni
2020 – today
- 2024
- [c50] Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance. ICASSP 2024: 326-330
- [c49] Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. ICASSP 2024: 10356-10360
- [c48] Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-Shot Learners for Speech Recognition? ICASSP 2024: 10366-10370
- [i15] Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis. CoRR abs/2406.02009 (2024)
- [i14] Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang, Zhangyu Xiao, Zhijie Yan, Yexin Yang, Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng:
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs. CoRR abs/2407.04051 (2024)
- [i13] Kun Zhou, You Zhang, Shengkui Zhao, Hao Wang, Zexu Pan, Dianwen Ng, Chong Zhang, Chongjia Ni, Yukun Ma, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions. CoRR abs/2409.16681 (2024)
- 2023
- [c47] Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
De'hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition. ICASSP 2023: 1-5
- [c46] Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-Resource Keyword Spotting. ICASSP 2023: 1-5
- [c45] Zhao Yang, Dianwen Ng, Chong Zhang, Xiao Fu, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma, Jizhong Zhao:
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition. INTERSPEECH 2023: 72-76
- [c44] Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Qian Chen, Wen Wang, Eng Siong Chng, Bin Ma:
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition. INTERSPEECH 2023: 1319-1323
- [c43] Jia Qi Yip, Duc-Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. INTERSPEECH 2023: 1938-1942
- [c42] Zhao Yang, Dianwen Ng, Xizhe Li, Chong Zhang, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Jizhong Zhao, Bin Ma, Eng Siong Chng:
Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement. INTERSPEECH 2023: 3774-3778
- [c41] Zhao Yang, Dianwen Ng, Chong Zhang, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Jizhong Zhao, Bin Ma, Eng Siong Chng:
A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions. INTERSPEECH 2023: 4953-4957
- [i12] Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition. CoRR abs/2302.14597 (2023)
- [i11] Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-resource Keyword Spotting. CoRR abs/2305.01170 (2023)
- [i10] Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. CoRR abs/2305.12121 (2023)
- [i9] Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-shot Learners for Speech Recognition? CoRR abs/2309.09413 (2023)
- [i8] Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for enhanced speech separation performance. CoRR abs/2309.12608 (2023)
- [i7] Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. CoRR abs/2312.11825 (2023)
- 2022
- [i6] Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization. CoRR abs/2209.06360 (2022)
- [i5] Lei Wang, Rong Tong, Cheung Chi Leung, Sunil Sivadas, Chongjia Ni, Bin Ma:
Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages. CoRR abs/2210.03580 (2022)
- 2021
- [c40] Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. EMNLP (1) 2021: 9339-9349
- [c39] Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Preventing Early Endpointing for Online Automatic Speech Recognition. ICASSP 2021: 6813-6817
- [c38] Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5
- [i4] Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. CoRR abs/2110.08545 (2021)
- 2020
- [c37] Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063
- [c36] Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Speech Transformer with Speaker Aware Persistent Memory. INTERSPEECH 2020: 1261-1265
- [c35] Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Universal Speech Transformer. INTERSPEECH 2020: 5021-5025
- [c34] Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Cross Attention with Monotonic Alignment for Speech Transformer. INTERSPEECH 2020: 5031-5035
- [i3] Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020)
2010 – 2019
- 2019
- [c33] Shengkui Zhao, Chongjia Ni, Rong Tong, Bin Ma:
Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition. INTERSPEECH 2019: 1238-1242
- [c32] Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. INTERSPEECH 2019: 2160-2164
- [i2] Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. CoRR abs/1904.03802 (2019)
- [i1] Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent language modeling architecture for end-to-end ASR. CoRR abs/1912.00863 (2019)
- 2018
- [j6] Ai-Ying Zhang, Chongjia Ni:
Submodular Based Unsupervised Data Selection. IEICE Trans. Inf. Syst. 101-D(6): 1591-1604 (2018)
- [c31] Nguyen Bach, Hongjie Chen, Kai Fan, Cheung-Chi Leung, Bo Li, Chongjia Ni, Rong Tong, Pei Zhang, Boxing Chen, Bin Ma, Fei Huang:
Alibaba Speech Translation Systems for IWSLT 2018. IWSLT 2018: 136-141
- 2017
- [j5] Aiying Zhang, Chongjia Ni:
资源稀缺蒙语语音识别研究 (Research on Low-resource Mongolian Speech Recognition). 计算机科学 44(10): 318-322 (2017)
- [c30] Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. APSIPA 2017: 1322-1327
- [c29] Chang Huai You, Bin Ma, Chongjia Ni:
Modification on LSA speech enhancement for speech recognition. ICASSP 2017: 5475-5479
- [c28] Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, Bin Ma:
Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search. ICASSP 2017: 5650-5654
- 2016
- [j4] Aiying Zhang, Chongjia Ni:
基于音频事件检测和分类的音频监控系统背景模型自适应方法研究 (Research on Background Model Adaptation for Acoustic Event Detection and Classification Based on Acoustic Surveillance System). 计算机科学 43(9): 310-314 (2016)
- [j3] I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, Chin-Hui Lee:
A Keyword-Aware Language Modeling Approach to Spoken Keyword Search. J. Signal Process. Syst. 82(2): 197-206 (2016)
- [c27] Jia Dai, Wenju Liu, Hao Zheng, Wei Xue, Chongjia Ni:
Semi-supervised Learning of Bottleneck Feature for Music Genre Classification. CCPR (2) 2016: 552-562
- [c26] Chongjia Ni, Cheung-Chi Leung, Lei Wang, Haibo Liu, Feng Rao, Li Lu, Nancy F. Chen, Bin Ma, Haizhou Li:
Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search. ICASSP 2016: 6015-6019
- [c25] Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li:
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili. ICASSP 2016: 6040-6044
- [c24] Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions. INTERSPEECH 2016: 1315-1319
- [c23] Chongjia Ni, Lei Wang, Cheung-Chi Leung, Feng Rao, Li Lu, Bin Ma, Haizhou Li:
Rapid Update of Multilingual Deep Neural Network for Low-Resource Keyword Search. INTERSPEECH 2016: 3698-3702
- [c22] Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li:
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis. INTERSPEECH 2016: 3703-3707
- [c21] Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu:
Long short-term memory recurrent neural network based segment features for music genre classification. ISCSLP 2016: 1-5
- [c20] Lei Wang, Chongjia Ni, Cheung-Chi Leung, Changhuai You, Lei Xie, Haihua Xu, Xiong Xiao, Tin Lay Nwe, Eng Siong Chng, Bin Ma, Haizhou Li:
The NNI Vietnamese Speech Recognition System for MediaEval 2016. MediaEval 2016
- 2015
- [c19] Jia Dai, Chongjia Ni, Wei Xue, Wenju Liu:
A novel codebook representation method and encoding strategy for bag-of-words based acoustic event classification. APSIPA 2015: 31-34
- [c18] Chongjia Ni, Lei Wang, Haibo Liu, Cheung-Chi Leung, Li Lu, Bin Ma:
Submodular data selection with acoustic and phonetic features for automatic speech recognition. ICASSP 2015: 4629-4633
- [c17] Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, Bin Ma:
Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search. ICASSP 2015: 4714-4718
- [c16] I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, Chin-Hui Lee:
A keyword-aware grammar framework for LVCSR-based spoken keyword search. ICASSP 2015: 5196-5200
- [c15] Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao, Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina Goh, Engsiong Chng, Bin Ma, Haizhou Li:
Low-resource keyword search strategies for tamil. ICASSP 2015: 5366-5370
- [c14] Andreea I. Niculescu, Ngoc Thuy Huong Thai, Chongjia Ni, Boon Pang Lim, Kheng Hui Yeo, Rafael E. Banchs:
Smarter driving with IDA, the intelligent driving assistant for singapore. INTERSPEECH 2015: 716-717
- [c13] Jia Dai, Wenju Liu, Chongjia Ni, Like Dong, Hong Yang:
"multilingual" deep neural network for music genre classification. INTERSPEECH 2015: 2907-2911
- [c12] Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Engsiong Chng, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2015. MediaEval 2015
- 2014
- [c11] Chongjia Ni, Cheung-Chi Leung:
Investigation of using different Chinese word segmentation standards and algorithms for automatic speech recognition. ISCSLP 2014: 44-48
- [c10] Chongjia Ni, Nancy F. Chen, Bin Ma:
Multiple time-span feature fusion for deep neural network modeling. ISCSLP 2014: 138-142
- [c9] I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, Chin-Hui Lee:
A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search. ISCSLP 2014: 192-196
- [c8] Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Engsiong Chng, Haizhou Li:
System and keyword dependent fusion for spoken term detection. SLT 2014: 430-435
- 2012
- [j2] Chong-Jia Ni, Wenju Liu, Bo Xu:
From English pitch accent detection to Mandarin stress detection, where is the difference? Comput. Speech Lang. 26(3): 127-148 (2012)
- [j1] Chong-Jia Ni, Ai-Ying Zhang, Wenju Liu, Bo Xu:
Automatic Prosodic Break Detection and Feature Analysis. J. Comput. Sci. Technol. 27(6): 1184-1196 (2012)
- [c7] Chong-Jia Ni, Ai-Ying Zhang:
The Comparison between Mandarin Break Detection and English Break Detection. CCPR 2012: 597-605
- 2011
- [c6] Chong-Jia Ni, Wenju Liu, Bo Xu:
Prosody dependent Mandarin speech recognition. IJCNN 2011: 197-201
- [c5] Chong-Jia Ni, Wenju Liu, Bo Xu:
Automatic Prosodic Events Detection by Using Syllable-Based Acoustic, Lexical and Syntactic Features. INTERSPEECH 2011: 2017-2020
- 2010
- [c4] Chong-Jia Ni, Wenju Liu, Bo Xu:
Mandarin stress detection using hierarchical model based boosting classification and regression tree. IJCNN 2010: 1-5
- [c3] Chong-Jia Ni, Wenju Liu, Bo Xu:
Using prosody to improve Mandarin automatic speech recognition. INTERSPEECH 2010: 2690-2693
- [c2] Chong-Jia Ni, Wenju Liu, Bo Xu:
Mandarin prosodic break detection based on complementary model. ISCSLP 2010: 353-357
2000 – 2009
- 2008
- [c1] Chong-Jia Ni, Wenju Liu, Bo Xu:
Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic Information. ISCSLP 2008: 354-357
last updated on 2024-10-22 20:12 CEST by the dblp team
all metadata released as open data under CC0 1.0 license