Zhijie Yan

Zhi-Jie Yan

2020 – today

2024
[j7]
  persistent URL:
  - https://dblp.org/rec/journals/ijcisys/ZhangWYJWT24
Linghao Zhang, Luqing Wang, Zhijie Yan, Zhentang Jia, Hongjun Wang, Xinyu Tang:
Star Generative Adversarial VGG Network-Based Sample Augmentation for Insulator Defect Detection. Int. J. Comput. Intell. Syst. 17(1): 141 (2024)
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/ijcisys/ZhangWYJWT24a
Linghao Zhang, Luqing Wang, Zhijie Yan, Zhentang Jia, Hongjun Wang, Xinyu Tang:
Correction: Star Generative Adversarial VGG Network-Based Sample Augmentation for Insulator Defect Detection. Int. J. Comput. Intell. Syst. 17(1): 149 (2024)
[c61]
  persistent URL:
  - https://dblp.org/rec/conf/case/LiWYGJZ24
Shufei Li, Zuoxu Wang, Zhijie Yan, Yiping Gao, Han Jiang, Pai Zheng:
Large Language Model for Humanoid Cognition in Proactive Human-Robot Collaboration. CASE 2024: 540-545
[c60]
  persistent URL:
  - https://dblp.org/rec/conf/case/YanWLLLL24
Zhijie Yan, Zuoxu Wang, Shufei Li, Mingrui Li, Xinxin Liang, Jihong Liu:
ManufVisSGG: A Vision-Language-Model Approach for Cognitive Scene Graph Generation in Manufacturing Systems. CASE 2024: 1632-1637
[c59]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/JinZLLZHLZYSZJLCZ24
Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao:
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes. ECCV (18) 2024: 367-384
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-11057
Xiaoji Zheng, Lixiu Wu, Zhijie Yan, Yuanrong Tang, Hao Zhao, Chen Zhong, Bokui Chen, Jiangtao Gong:
Large Language Models Powered Context-aware Motion Prediction. CoRR abs/2403.11057 (2024)
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-19589
Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao:
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes. CoRR abs/2403.19589 (2024)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04051
Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang, Zhangyu Xiao, Zhijie Yan, Yexin Yang, Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng:
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs. CoRR abs/2407.04051 (2024)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-05407
Zhihao Du, Qian Chen, Shiliang Zhang, Kai Hu, Heng Lu, Yexin Yang, Hangrui Hu, Siqi Zheng, Yue Gu, Ziyang Ma, Zhifu Gao, Zhijie Yan:
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens. CoRR abs/2407.05407 (2024)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-17750
Keyu An, Shiliang Zhang, Zhijie Yan:
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study. CoRR abs/2409.17750 (2024)
2023
[c58]
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiangSYLZDCXQWCLYB23
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR. ASRU 2023: 1-8
[c57]
  persistent URL:
  - https://dblp.org/rec/conf/case/LiWYL23
Mingrui Li, Zuoxu Wang, Zhijie Yan, Jihong Liu:
Exploiting Patent Documents for Cross-Domain Knowledge Transfer in Innovative Engineering Design: A Doc2Vec-GAT-Based Approach. CASE 2023: 1-6
[c56]
  persistent URL:
  - https://dblp.org/rec/conf/cicba/HanYLLSLGSHGZCZZ23
Zhengxiao Han, Zhijie Yan, Yang Li, Pengfei Li, Yifeng Shi, Nairui Luo, Xu Gao, Yongliang Shi, Pengfei Huang, Jiangtao Gong, Guyue Zhou, Yilun Chen, Hang Zhao, Hao Zhao:
M²Sim: A Long-Term Interactive Driving Simulator. CICAI (2) 2023: 172-176
[c55]
  persistent URL:
  - https://dblp.org/rec/conf/cicba/HanYLLSLGSHGZCZZ23a
Zhengxiao Han, Zhijie Yan, Yang Li, Pengfei Li, Yifeng Shi, Nairui Luo, Xu Gao, Yongliang Shi, Pengfei Huang, Jiangtao Gong, Guyue Zhou, Yilun Chen, Hang Zhao, Hao Zhao:
Long-Term Interactive Driving Simulation: MPC to the Rescue. CICAI (2) 2023: 177-188
[c54]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangDLYCWYL0Z23
Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). ICASSP 2023: 1-2
[c53]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangDLYCWYLRZ23
Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. ICASSP 2023: 1-5
[c52]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/YanLFXSCZLLLLGC23
Zhijie Yan, Pengfei Li, Zheng Fu, Shaocong Xu, Yongliang Shi, Xiaoxue Chen, Yuhang Zheng, Yang Li, Tianyu Liu, Chuxuan Li, Nairui Luo, Xu Gao, Yilun Chen, Zuoxu Wang, Yifeng Shi, Pengfei Huang, Zhengxiao Han, Jirui Yuan, Jiangtao Gong, Guyue Zhou, Hang Zhao, Hao Zhao:
INT2: Interactive Trajectory Prediction at Intersections. ICCV 2023: 8502-8513
[c51]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiLGZY23
Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan:
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System. INTERSPEECH 2023: 3247-3251
[c50]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouWCZYZZ23
Xiaohuan Zhou, Jiaming Wang, Zeyu Cui, Shiliang Zhang, Zhijie Yan, Jingren Zhou, Chang Zhou:
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for speech recognition. INTERSPEECH 2023: 4943-4947
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-12343
Xian Shi, Yanni Chen, Shiliang Zhang, Zhijie Yan:
Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model. CoRR abs/2301.12343 (2023)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-13932
Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). CoRR abs/2303.13932 (2023)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-13939
Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. CoRR abs/2303.13939 (2023)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10680
Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan:
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System. CoRR abs/2305.10680 (2023)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13573
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR. CoRR abs/2309.13573 (2023)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-04673
Jiaming Wang, Zhihao Du, Qian Chen, Yunfei Chu, Zhifu Gao, Zerui Li, Kai Hu, Xiaohuan Zhou, Jin Xu, Ziyang Ma, Wen Wang, Siqi Zheng, Chang Zhou, Zhijie Yan, Shiliang Zhang:
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT. CoRR abs/2310.04673 (2023)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-07919
Yunfei Chu, Jin Xu, Xiaohuan Zhou, Qian Yang, Shiliang Zhang, Zhijie Yan, Chang Zhou, Jingren Zhou:
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models. CoRR abs/2311.07919 (2023)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-14860
Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan:
Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures. CoRR abs/2312.14860 (2023)
2022
[c49]
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/DuZZY22
Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhi-Jie Yan:
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis. EMNLP 2022: 7458-7469
[c48]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuZFXZDHGYMXB22
Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 6167-6171
[c47]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RenLHZCYZ22
Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
ProsoSpeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech. ICASSP 2022: 7577-7581
[c46]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuZGFDZHXTWQLYM22
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160
[c45]
  persistent URL:
  - https://dblp.org/rec/conf/ieaaie/ZhuSSWHYC22
Min Zhu, Bingqing Shen, Yan Sun, Chongyu Wang, Guoxin Hou, Zhijie Yan, Hongming Cai:
Surface Defect Detection and Classification Based on Fusing Multiple Computer Vision Techniques. IEA/AIE 2022: 51-62
[c44]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoZ0Y22
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan:
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition. INTERSPEECH 2022: 2063-2067
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-03647
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-07816
Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech. CoRR abs/2202.07816 (2022)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-09767
Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan:
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios. CoRR abs/2203.09767 (2022)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-08317
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan:
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition. CoRR abs/2206.08317 (2022)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-10243
Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan:
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis. CoRR abs/2211.10243 (2022)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-00500
Xiaohuan Zhou, Jiaming Wang, Zeyu Cui, Shiliang Zhang, Zhijie Yan, Jingren Zhou, Chang Zhou:
MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition. CoRR abs/2212.00500 (2022)
2021
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhengHWSFY21
Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan:
A Real-Time Speaker Diarization System Based on Spatial Spectrum. ICASSP 2021: 7208-7212
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangZHLSFY21
Shiliang Zhang, Siqi Zheng, Weilong Huang, Ming Lei, Hongbin Suo, Jinwei Feng, Zhijie Yan:
Investigation of Spatial-Acoustic Features for Overlapping Speech Detection in Multiparty Meetings. Interspeech 2021: 3550-3554
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-09321
Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan:
A Real-time Speaker Diarization System Based on Spatial Spectrum. CoRR abs/2107.09321 (2021)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-04049
Siqi Zheng, Shiliang Zhang, Weilong Huang, Qian Chen, Hongbin Suo, Ming Lei, Jinwei Feng, Zhijie Yan:
BeamTransformer: Microphone Array-based Overlapping Speech Detection. CoRR abs/2109.04049 (2021)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07393
Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. CoRR abs/2110.07393 (2021)
2020
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FanLWZCGY20
Kai Fan, Bo Li, Jiayi Wang, Shiliang Zhang, Boxing Chen, Niyu Ge, Zhijie Yan:
Neural Zero-Inflated Quality Estimation Model for Automatic Speech Recognition System. INTERSPEECH 2020: 606-610
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangGLLGYX20
Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie:
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition. INTERSPEECH 2020: 2142-2146
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-01712
Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie:
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition. CoRR abs/2006.01712 (2020)

2010 – 2019

2019
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/jips/SunYLWW19
Zidan Sun, Zhijie Yan, Likai Liang, Ran Wei, Wei Wang:
Dynamic Thermal Rating of Transmission Line Based on Environmental Parameter Estimation. J. Inf. Process. Syst. 15(2): 386-398 (2019)
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangLY19
Shiliang Zhang, Ming Lei, Zhijie Yan:
Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition. INTERSPEECH 2019: 2180-2184
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-10045
Shiliang Zhang, Ming Lei, Zhijie Yan:
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition. CoRR abs/1904.10045 (2019)
2018
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/ejwcn/WangTYW18
Yanling Wang, Weihua Tao, Zhijie Yan, Ran Wei:
Uncertainty analysis of dynamic thermal rating based on environmental parameter estimation. EURASIP J. Wirel. Commun. Netw. 2018: 167 (2018)
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BiLZLY18
Mengxiao Bi, Heng Lu, Shiliang Zhang, Ming Lei, Zhijie Yan:
Deep Feed-Forward Sequential Memory Networks for Speech Synthesis. ICASSP 2018: 4794-4798
[c37]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangLLY18
Zhiying Huang, Heng Lu, Ming Lei, Zhijie Yan:
Linear Networks Based Speaker Adaptation for Speech Synthesis. ICASSP 2018: 5319-5323
[c36]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangLYD18
Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. ICASSP 2018: 5869-5873
[c35]
  persistent URL:
  - https://dblp.org/rec/conf/icdsp/XueYYL18
Shaofei Xue, Zhijie Yan, Tao Yu, Zhang Liu:
A Study on Improving Acoustic Model for Robust and Far-Field Speech Recognition. DSP 2018: 1-5
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-09194
Mengxiao Bi, Heng Lu, Shiliang Zhang, Ming Lei, Zhijie Yan:
Deep Feed-forward Sequential Memory Networks for Speech Synthesis. CoRR abs/1802.09194 (2018)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-02445
Zhiying Huang, Heng Lu, Ming Lei, Zhijie Yan:
Linear networks based speaker adaptation for speech synthesis. CoRR abs/1803.02445 (2018)
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-05030
Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. CoRR abs/1803.05030 (2018)
2017
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/jips/YanWL17
Zhijie Yan, Yanling Wang, Likai Liang:
Analysis on Ampacity of Overhead Transmission Lines Being Operated. J. Inf. Process. Syst. 13(5): 1358-1371 (2017)
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XueY17
Shaofei Xue, Zhijie Yan:
Improving latency-controlled BLSTM acoustic models for online speech recognition. ICASSP 2017: 5340-5344
2016
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/HuangXYD16
Zhiying Huang, Shaofei Xue, Zhijie Yan, Li-Rong Dai:
Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code. ISCSLP 2016: 1-5
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XueYHD16
Shaofei Xue, Zhijie Yan, Zhiying Huang, Li-Rong Dai:
Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR. ISCSLP 2016: 1-5
2015
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/icdar/ChenYH15
Kai Chen, Zhi-Jie Yan, Qiang Huo:
A context-sensitive-chunk BPTT approach to training deep LSTM/BLSTM recurrent neural networks for offline handwriting recognition. ICDAR 2015: 411-415
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenYH15
Kai Chen, Zhi-Jie Yan, Qiang Huo:
Training deep bidirectional LSTM acoustic model for LVCSR by a context-sensitive-chunk BPTT approach. INTERSPEECH 2015: 3600-3604
2014
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XuYH14
Jian Xu, Zhi-Jie Yan, Qiang Huo:
An Unsupervised Adaptation Approach to Leveraging Feedback Loop Data by Using i-Vector for Data Clustering and Selection. IEEE ACM Trans. Audio Speech Lang. Process. 22(11): 1581-1589 (2014)
2013
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianSY13
Yao Qian, Frank K. Soong, Zhi-Jie Yan:
A Unified Trajectory Tiling Approach to High Quality Speech Rendering. IEEE Trans. Audio Speech Lang. Process. 21(2): 280-290 (2013)
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YanHXZ13
Zhi-Jie Yan, Qiang Huo, Jian Xu, Yu Zhang:
Tied-state based discriminative training of context-expanded region-dependent feature transforms for LVCSR. ICASSP 2013: 6940-6944
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YanHX13
Zhi-Jie Yan, Qiang Huo, Jian Xu:
A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR. INTERSPEECH 2013: 104-108
2012
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangXYH12
Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo:
A study of discriminative feature extraction for i-vector based acoustic sniffing in IVN acoustic model training. ICASSP 2012: 4077-4080
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XuYH12
Jian Xu, Zhi-Jie Yan, Qiang Huo:
A comparative study of fMPE and RDLT approaches to LVCSR. ISCSLP 2012: 21-24
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XuYH12a
Jian Xu, Zhi-Jie Yan, Qiang Huo:
A feature-transform based approach to unsupervised task adaptation and personalization. ISCSLP 2012: 229-232
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/mhci/EdgeCWQYS12
Darren Edge, Kai-Yin Cheng, Michael Whitney, Yao Qian, Zhijie Yan, Frank K. Soong:
Tip tap tones: mobile microtraining of mandarin sounds. Mobile HCI (Companion) 2012: 215-216
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/mhci/EdgeCWQYS12a
Darren Edge, Kai-Yin Cheng, Michael Whitney, Yao Qian, Zhijie Yan, Frank K. Soong:
Tip tap tones: mobile microtraining of mandarin sounds. Mobile HCI 2012: 427-430
2011
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LongYSDG11
Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model. ICASSP 2011: 4520-4523
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangXYH11
Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo:
A study of an irrelevant variability normalization based discriminative training approach for LVCSR. ICASSP 2011: 5308-5311
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LongYSDG11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LongYSDG11
Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model. INTERSPEECH 2011: 373-376
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangXYH11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangXYH11
Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo:
An i-vector Based Approach to Training Data Clustering for Improved Speech Recognition. INTERSPEECH 2011: 789-792
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuZYH11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuZYH11
Jian Xu, Yu Zhang, Zhi-Jie Yan, Qiang Huo:
An i-vector Based Approach to Acoustic Sniffing for Irrelevant Variability Normalization Based Acoustic Model Training and Speech Recognition. INTERSPEECH 2011: 1701-1704
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/0007YH11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/0007YH11
Yu Zhang, Zhi-Jie Yan, Qiang Huo:
A new i-vector approach and its application to irrelevant variability normalization based acoustic model training. MLSP 2011: 1-6
2010
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/QianYWSZW10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/QianYWSZW10
Yao Qian, Zhi-Jie Yan, Yi-Jian Wu, Frank K. Soong, Guoliang Zhang, Lijuan Wang:
An HMM Trajectory Tiling (HTT) Approach to High Quality TTS - Microsoft Entry to Blizzard Challenge 2010. Blizzard Challenge 2010
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangYS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangYS10
Yu Zhang, Zhi-Jie Yan, Frank K. Soong:
Cross-validation based decision tree clustering for HMM-based TTS. ICASSP 2010: 4602-4605
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangSQYPY10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangSQYPY10
Qingqing Zhang, Frank K. Soong, Yao Qian, Zhijie Yan, Jielin Pan, Yonghong Yan:
Improved modeling for F0 generation and V/U decision in HMM-based TTS. ICASSP 2010: 4606-4609
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YanQS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YanQS10
Zhi-Jie Yan, Yao Qian, Frank K. Soong:
Rich-context Unit Selection (RUS) approach to high quality TTS. ICASSP 2010: 4798-4801
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianYWSZK10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianYWSZK10
Yao Qian, Zhi-Jie Yan, Yi-Jian Wu, Frank K. Soong, Xin Zhuang, Shengyi Kong:
An HMM trajectory tiling (HTT) approach to high quality TTS. INTERSPEECH 2010: 422-425
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenYS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenYS10
Yining Chen, Zhi-Jie Yan, Frank K. Soong:
A perceptual study of acceleration parameters in HMM-based TTS. INTERSPEECH 2010: 426-429

2000 – 2009

2009
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YanLHJ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YanLHJ09
Zhi-Jie Yan, Cong Liu, Yu Hu, Hui Jiang:
A trust region based optimization for maximum mutual information estimation of HMMs in speech recognition. ICASSP 2009: 3757-3760
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YanQS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YanQS09
Zhi-Jie Yan, Yao Qian, Frank K. Soong:
Rich context modeling for high quality HMM-based TTS. INTERSPEECH 2009: 1755-1758
2008
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YanZHW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YanZHW08
Zhi-Jie Yan, Bo Zhu, Yu Hu, Ren-Hua Wang:
Minimum word classification error training of HMMs for automatic speech recognition. ICASSP 2008: 4521-4524
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiYLW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiYLW08
Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
Soft margin estimation with various separation levels for LVCSR. INTERSPEECH 2008: 269-272
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhuYHWDW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhuYHWDW08
Bo Zhu, Zhi-Jie Yan, Yu Hu, Zhiguo Wang, Li-Rong Dai, Ren-Hua Wang:
Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and MAP. ISCSLP 2008: 93-96
2007
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiYLW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiYLW07
Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
A study on soft margin estimation for LVCSR. ASRU 2007: 268-271
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YanSW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YanSW07
Zhi-Jie Yan, Frank K. Soong, Ren-Hua Wang:
Word Graph Based Feature Enhancement for Noisy Speech Recognition. ICASSP (4) 2007: 373-376
2006
[c3]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/0006Y0W06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/0006Y0W06
Cong Liu, Zhijie Yan, Yu Hu, Renhua Wang:
A Comparative Study on Confidence Measure in Mandarin Command Word Recognition. ISCSLP 2006
[c2]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/Yan0DSW06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Yan0DSW06
Zhijie Yan, Peng Liu, Jun Du, Frank K. Soong, Renhua Wang:
Training Discriminative HMM by Optimal Allocation of Gaussian Kernels. ISCSLP 2006
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/YanZSW06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YanZSW06
Zhi-Jie Yan, Jian-Lai Zhou, Frank K. Soong, Ren-Hua Wang:
Signal Trajectory Based Noise Compensation for Robust Speech Recognition. ISCSLP (Selected Papers) 2006: 335-345
