default search action
Zhijie Yan
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j7]Linghao Zhang, Luqing Wang, Zhijie Yan, Zhentang Jia, Hongjun Wang, Xinyu Tang:
Star Generative Adversarial VGG Network-Based Sample Augmentation for Insulator Defect Detection. Int. J. Comput. Intell. Syst. 17(1): 141 (2024) - [j6]Linghao Zhang, Luqing Wang, Zhijie Yan, Zhentang Jia, Hongjun Wang, Xinyu Tang:
Correction: Star Generative Adversarial VGG Network-Based Sample Augmentation for Insulator Defect Detection. Int. J. Comput. Intell. Syst. 17(1): 149 (2024) - [c61]Shufei Li, Zuoxu Wang, Zhijie Yan, Yiping Gao, Han Jiang, Pai Zheng:
Large Language Model for Humanoid Cognition in Proactive Human-Robot Collaboration. CASE 2024: 540-545 - [c60]Zhijie Yan, Zuoxu Wang, Shufei Li, Mingrui Li, Xinxin Liang, Jihong Liu:
ManufVisSGG: A Vision-Language-Model Approach for Cognitive Scene Graph Generation in Manufacturing Systems. CASE 2024: 1632-1637 - [c59]Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao:
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes. ECCV (18) 2024: 367-384 - [i27]Xiaoji Zheng, Lixiu Wu, Zhijie Yan, Yuanrong Tang, Hao Zhao, Chen Zhong, Bokui Chen, Jiangtao Gong:
Large Language Models Powered Context-aware Motion Prediction. CoRR abs/2403.11057 (2024) - [i26]Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao:
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes. CoRR abs/2403.19589 (2024) - [i25]Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang, Zhangyu Xiao, Zhijie Yan, Yexin Yang, Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng:
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs. CoRR abs/2407.04051 (2024) - [i24]Zhihao Du, Qian Chen, Shiliang Zhang, Kai Hu, Heng Lu, Yexin Yang, Hangrui Hu, Siqi Zheng, Yue Gu, Ziyang Ma, Zhifu Gao, Zhijie Yan:
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens. CoRR abs/2407.05407 (2024) - [i23]Keyu An, Shiliang Zhang, Zhijie Yan:
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study. CoRR abs/2409.17750 (2024) - 2023
- [c58]Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR. ASRU 2023: 1-8 - [c57]Mingrui Li, Zuoxu Wang, Zhijie Yan, Jihong Liu:
Exploiting Patent Documents for Cross-Domain Knowledge Transfer in Innovative Engineering Design: A Doc2Vec-GAT-Based Approach. CASE 2023: 1-6 - [c56]Zhengxiao Han, Zhijie Yan, Yang Li, Pengfei Li, Yifeng Shi, Nairui Luo, Xu Gao, Yongliang Shi, Pengfei Huang, Jiangtao Gong, Guyue Zhou, Yilun Chen, Hang Zhao, Hao Zhao:
M2Sim: A Long-Term Interactive Driving Simulator. CICAI (2) 2023: 172-176 - [c55]Zhengxiao Han, Zhijie Yan, Yang Li, Pengfei Li, Yifeng Shi, Nairui Luo, Xu Gao, Yongliang Shi, Pengfei Huang, Jiangtao Gong, Guyue Zhou, Yilun Chen, Hang Zhao, Hao Zhao:
Long-Term Interactive Driving Simulation: MPC to the Rescue. CICAI (2) 2023: 177-188 - [c54]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). ICASSP 2023: 1-2 - [c53]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. ICASSP 2023: 1-5 - [c52]Zhijie Yan, Pengfei Li, Zheng Fu, Shaocong Xu, Yongliang Shi, Xiaoxue Chen, Yuhang Zheng, Yang Li, Tianyu Liu, Chuxuan Li, Nairui Luo, Xu Gao, Yilun Chen, Zuoxu Wang, Yifeng Shi, Pengfei Huang, Zhengxiao Han, Jirui Yuan, Jiangtao Gong, Guyue Zhou, Hang Zhao, Hao Zhao:
INT2: Interactive Trajectory Prediction at Intersections. ICCV 2023: 8502-8513 - [c51]Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan:
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System. INTERSPEECH 2023: 3247-3251 - [c50]Xiaohuan Zhou, Jiaming Wang, Zeyu Cui, Shiliang Zhang, Zhijie Yan, Jingren Zhou, Chang Zhou:
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for speech recognition. INTERSPEECH 2023: 4943-4947 - [i22]Xian Shi, Yanni Chen, Shiliang Zhang, Zhijie Yan:
Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model. CoRR abs/2301.12343 (2023) - [i21]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). CoRR abs/2303.13932 (2023) - [i20]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. CoRR abs/2303.13939 (2023) - [i19]Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan:
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System. CoRR abs/2305.10680 (2023) - [i18]Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR. CoRR abs/2309.13573 (2023) - [i17]Jiaming Wang, Zhihao Du, Qian Chen, Yunfei Chu, Zhifu Gao, Zerui Li, Kai Hu, Xiaohuan Zhou, Jin Xu, Ziyang Ma, Wen Wang, Siqi Zheng, Chang Zhou, Zhijie Yan, Shiliang Zhang:
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT. CoRR abs/2310.04673 (2023) - [i16]Yunfei Chu, Jin Xu, Xiaohuan Zhou, Qian Yang, Shiliang Zhang, Zhijie Yan, Chang Zhou, Jingren Zhou:
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models. CoRR abs/2311.07919 (2023) - [i15]Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan:
Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures. CoRR abs/2312.14860 (2023) - 2022
- [c49]Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhi-Jie Yan:
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis. EMNLP 2022: 7458-7469 - [c48]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 6167-6171 - [c47]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
Prosospeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech. ICASSP 2022: 7577-7581 - [c46]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c45]Min Zhu, Bingqing Shen, Yan Sun, Chongyu Wang, Guoxin Hou, Zhijie Yan, Hongming Cai:
Surface Defect Detection and Classification Based on Fusing Multiple Computer Vision Techniques. IEA/AIE 2022: 51-62 - [c44]Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan:
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition. INTERSPEECH 2022: 2063-2067 - [i14]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i13]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech. CoRR abs/2202.07816 (2022) - [i12]Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan:
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios. CoRR abs/2203.09767 (2022) - [i11]Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan:
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition. CoRR abs/2206.08317 (2022) - [i10]Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan:
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis. CoRR abs/2211.10243 (2022) - [i9]Xiaohuan Zhou, Jiaming Wang, Zeyu Cui, Shiliang Zhang, Zhijie Yan, Jingren Zhou, Chang Zhou:
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition. CoRR abs/2212.00500 (2022) - 2021
- [c43]Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan:
A Real-Time Speaker Diarization System Based on Spatial Spectrum. ICASSP 2021: 7208-7212 - [c42]Shiliang Zhang, Siqi Zheng, Weilong Huang, Ming Lei, Hongbin Suo, Jinwei Feng, Zhijie Yan:
Investigation of Spatial-Acoustic Features for Overlapping Speech Detection in Multiparty Meetings. Interspeech 2021: 3550-3554 - [i8]Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan:
A Real-time Speaker Diarization System Based on Spatial Spectrum. CoRR abs/2107.09321 (2021) - [i7]Siqi Zheng, Shiliang Zhang, Weilong Huang, Qian Chen, Hongbin Suo, Ming Lei, Jinwei Feng, Zhijie Yan:
BeamTransformer: Microphone Array-based Overlapping Speech Detection. CoRR abs/2109.04049 (2021) - [i6]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. CoRR abs/2110.07393 (2021) - 2020
- [c41]Kai Fan, Bo Li, Jiayi Wang, Shiliang Zhang, Boxing Chen, Niyu Ge, Zhijie Yan:
Neural Zero-Inflated Quality Estimation Model for Automatic Speech Recognition System. INTERSPEECH 2020: 606-610 - [c40]Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie:
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition. INTERSPEECH 2020: 2142-2146 - [i5]Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie:
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition. CoRR abs/2006.01712 (2020)
2010 – 2019
- 2019
- [j5]Zidan Sun, Zhijie Yan, Likai Liang, Ran Wei, Wei Wang:
Dynamic Thermal Rating of Transmission Line Based on Environmental Parameter Estimation. J. Inf. Process. Syst. 15(2): 386-398 (2019) - [c39]Shiliang Zhang, Ming Lei, Zhijie Yan:
Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition. INTERSPEECH 2019: 2180-2184 - [i4]Shiliang Zhang, Ming Lei, Zhijie Yan:
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition. CoRR abs/1904.10045 (2019) - 2018
- [j4]Yanling Wang, Weihua Tao, Zhijie Yan, Ran Wei:
Uncertainty analysis of dynamic thermal rating based on environmental parameter estimation. EURASIP J. Wirel. Commun. Netw. 2018: 167 (2018) - [c38]Mengxiao Bi, Heng Lu, Shiliang Zhang, Ming Lei, Zhijie Yan:
Deep Feed-Forward Sequential Memory Networks for Speech Synthesis. ICASSP 2018: 4794-4798 - [c37]Zhiying Huang, Heng Lu, Ming Lei, Zhijie Yan:
Linear Networks Based Speaker Adaptation for Speech Synthesis. ICASSP 2018: 5319-5323 - [c36]Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. ICASSP 2018: 5869-5873 - [c35]Shaofei Xue, Zhijie Yan, Tao Yu, Zhang Liu:
A Study on Improving Acoustic Model for Robust and Far-Field Speech Recognition. DSP 2018: 1-5 - [i3]Mengxiao Bi, Heng Lu, Shiliang Zhang, Ming Lei, Zhijie Yan:
Deep Feed-forward Sequential Memory Networks for Speech Synthesis. CoRR abs/1802.09194 (2018) - [i2]Zhiying Huang, Heng Lu, Ming Lei, Zhijie Yan:
Linear networks based speaker adaptation for speech synthesis. CoRR abs/1803.02445 (2018) - [i1]Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. CoRR abs/1803.05030 (2018) - 2017
- [j3]Zhijie Yan, Yanling Wang, Likai Liang:
Analysis on Ampacity of Overhead Transmission Lines Being Operated. J. Inf. Process. Syst. 13(5): 1358-1371 (2017) - [c34]Shaofei Xue, Zhijie Yan:
Improving latency-controlled BLSTM acoustic models for online speech recognition. ICASSP 2017: 5340-5344 - 2016
- [c33]Zhiying Huang, Shaofei Xue, Zhijie Yan, Li-Rong Dai:
Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code. ISCSLP 2016: 1-5 - [c32]Shaofei Xue, Zhijie Yan, Zhiying Huang, Li-Rong Dai:
Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR. ISCSLP 2016: 1-5 - 2015
- [c31]Kai Chen, Zhi-Jie Yan, Qiang Huo:
A context-sensitive-chunk BPTT approach to training deep LSTM/BLSTM recurrent neural networks for offline handwriting recognition. ICDAR 2015: 411-415 - [c30]Kai Chen, Zhi-Jie Yan, Qiang Huo:
Training deep bidirectional LSTM acoustic model for LVCSR by a context-sensitive-chunk BPTT approach. INTERSPEECH 2015: 3600-3604 - 2014
- [j2]Jian Xu, Zhi-Jie Yan, Qiang Huo:
An Unsupervised Adaptation Approach to Leveraging Feedback Loop Data by Using i-Vector for Data Clustering and Selection. IEEE ACM Trans. Audio Speech Lang. Process. 22(11): 1581-1589 (2014) - 2013
- [j1]Yao Qian, Frank K. Soong, Zhi-Jie Yan:
A Unified Trajectory Tiling Approach to High Quality Speech Rendering. IEEE Trans. Speech Audio Process. 21(2): 280-290 (2013) - [c29]Zhi-Jie Yan, Qiang Huo, Jian Xu, Yu Zhang:
Tied-state based discriminative training of context-expanded region-dependent feature transforms for LVCSR. ICASSP 2013: 6940-6944 - [c28]Zhi-Jie Yan, Qiang Huo, Jian Xu:
A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR. INTERSPEECH 2013: 104-108 - 2012
- [c27]Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo:
A study of discriminative feature extraction for i-vector based acoustic sniffing in IVN acoustic model training. ICASSP 2012: 4077-4080 - [c26]Jian Xu, Zhi-Jie Yan, Qiang Huo:
A comparative study of fMPE and RDLT approaches to LVCSR. ISCSLP 2012: 21-24 - [c25]Jian Xu, Zhi-Jie Yan, Qiang Huo:
A feature-transform based approach to unsupervised task adaptation and personalization. ISCSLP 2012: 229-232 - [c24]Darren Edge, Kai-Yin Cheng, Michael Whitney, Yao Qian, Zhijie Yan, Frank K. Soong:
Tip tap tones: mobile microtraining of mandarin sounds. Mobile HCI (Companion) 2012: 215-216 - [c23]Darren Edge, Kai-Yin Cheng, Michael Whitney, Yao Qian, Zhijie Yan, Frank K. Soong:
Tip tap tones: mobile microtraining of mandarin sounds. Mobile HCI 2012: 427-430 - 2011
- [c22]Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model. ICASSP 2011: 4520-4523 - [c21]Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo:
A study of an irrelevant variability normalization based discriminative training approach for LVCSR. ICASSP 2011: 5308-5311 - [c20]Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model. INTERSPEECH 2011: 373-376 - [c19]Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo:
An i-vector Based Approach to Training Data Clustering for Improved Speech Recognition. INTERSPEECH 2011: 789-792 - [c18]Jian Xu, Yu Zhang, Zhi-Jie Yan, Qiang Huo:
An i-vector Based Approach to Acoustic Sniffing for Irrelevant Variability Normalization Based Acoustic Model Training and Speech Recognition. INTERSPEECH 2011: 1701-1704 - [c17]Yu Zhang, Zhi-Jie Yan, Qiang Huo:
A new i-vector approach and its application to irrelevant variability normalization based acoustic model training. MLSP 2011: 1-6 - 2010
- [c16]Yao Qian, Zhi-Jie Yan, Yi-Jian Wu, Frank K. Soong, Guoliang Zhang, Lijuan Wang:
An HMM Trajectory Tiling (HTT) Approach to High Quality TTS - Microsoft Entry to Blizzard Challenge 2010. Blizzard Challenge 2010 - [c15]Yu Zhang, Zhi-Jie Yan, Frank K. Soong:
Cross-validation based decision tree clustering for HMM-based TTS. ICASSP 2010: 4602-4605 - [c14]Qingqing Zhang, Frank K. Soong, Yao Qian, Zhijie Yan, Jielin Pan, Yonghong Yan:
Improved modeling for F0 generation and V/U decision in HMM-based TTS. ICASSP 2010: 4606-4609 - [c13]Zhi-Jie Yan, Yao Qian, Frank K. Soong:
RIch-context Unit Selection (RUS) approach to high quality TTS. ICASSP 2010: 4798-4801 - [c12]Yao Qian, Zhi-Jie Yan, Yi-Jian Wu, Frank K. Soong, Xin Zhuang, Shengyi Kong:
An HMM trajectory tiling (HTT) approach to high quality TTS. INTERSPEECH 2010: 422-425 - [c11]Yining Chen, Zhi-Jie Yan, Frank K. Soong:
A perceptual study of acceleration parameters in HMM-based TTS. INTERSPEECH 2010: 426-429
2000 – 2009
- 2009
- [c10]Zhi-Jie Yan, Cong Liu, Yu Hu, Hui Jiang:
A trust region based optimization for maximum mutual information estimation of HMMS in speech recognition. ICASSP 2009: 3757-3760 - [c9]Zhi-Jie Yan, Yao Qian, Frank K. Soong:
Rich context modeling for high quality HMM-based TTS. INTERSPEECH 2009: 1755-1758 - 2008
- [c8]Zhi-Jie Yan, Bo Zhu, Yu Hu, Ren-Hua Wang:
Minimum word classification error training of HMMS for automatic speech recognition. ICASSP 2008: 4521-4524 - [c7]Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
Soft margin estimation with various separation levels for LVCSR. INTERSPEECH 2008: 269-272 - [c6]Bo Zhu, Zhi-Jie Yan, Yu Hu, Zhiguo Wang, Li-Rong Dai, Ren-Hua Wang:
Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and Map. ISCSLP 2008: 93-96 - 2007
- [c5]Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
A study on soft margin estimation for LVCSR. ASRU 2007: 268-271 - [c4]Zhi-Jie Yan, Frank K. Soong, Ren-Hua Wang:
Word Graph Based Feature Enhancement for Noisy Speech Recognition. ICASSP (4) 2007: 373-376 - 2006
- [c3]Cong Liu, Zhijie Yan, Yu Hu, Renhua Wang:
A Comparative Study on Confidence Measure in Mandarin Command Word Recognition. ISCSLP 2006 - [c2]Zhijie Yan, Peng Liu, Jun Du, Frank K. Soong, Renhua Wang:
Training Discriminative HMM by Optimal Allocation of Gaussian Kernels. ISCSLP 2006 - [c1]Zhi-Jie Yan, Jian-Lai Zhou, Frank K. Soong, Ren-Hua Wang:
Signal Trajectory Based Noise Compensation for Robust Speech Recognition. ISCSLP (Selected Papers) 2006: 335-345
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-08 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint