default search action

combined dblp search
author search
venue search
publication search

ask others

Xiang Yin 0006

> Home > Persons

Person information

affiliation: ByteDance AI Lab, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/eswa/QiuHZYN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eswa/QiuHZYN24
Yilei Qiu, Zhou He, Wenyu Zhang, Xiang Yin, Chengjie Ni:
MSGCN-ISTL: A multi-scaled self-attention-enhanced graph convolutional network with improved STL decomposition for probabilistic load forecasting. Expert Syst. Appl. 238(Part A): 121737 (2024)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangZRZYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangZRZYL24
Mingyang Zhang, Yi Zhou, Yi Ren, Chen Zhang, Xiang Yin, Haizhou Li:
RefXVC: Cross-Lingual Voice Conversion With Enhanced Reference Leveraging. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4146-4156 (2024)
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiuH0Y024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiuH0Y024
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling. AAAI 2024: 18698-18706
[c27]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0001L0HYJY0WW0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0001L0HYJY0WW0M24
Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis. ICLR 2024
[c26]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YeZ0YLH0HHL00MZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YeZ0YLH0HHL00MZ24
Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. ICLR 2024
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0008H00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0008H00024
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Generative Expressive Conversational Speech Synthesis. ACM Multimedia 2024: 4187-4196
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-08503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-08503
Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. CoRR abs/2401.08503 (2024)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-21491
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-21491
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Generative Expressive Conversational Speech Synthesis. CoRR abs/2407.21491 (2024)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-04708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-04708
Jiawei Huang, Chen Zhang, Yi Ren, Ziyue Jiang, Zhenhui Ye, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency. CoRR abs/2408.04708 (2024)
2023
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/eswa/YinZJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eswa/YinZJ23
Xiang Yin, Wenyu Zhang, Xin Jing:
Static-dynamic collaborative graph convolutional network with meta-learning for node-level traffic flow prediction. Expert Syst. Appl. 227: 120333 (2023)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/isci/YinZZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/isci/YinZZ23
Xiang Yin, Wenyu Zhang, Shuai Zhang:
Spatiotemporal dynamic graph convolutional network for traffic speed forecasting. Inf. Sci. 641: 119056 (2023)
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/QianLSWGYJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/QianLSWGYJ23
Tao Qian, Fan Lou, Jiatong Shi, Yuning Wu, Shuai Guo, Xiang Yin, Qin Jin:
UniLG: A Unified Structure-aware Framework for Lyrics Generation. ACL (1) 2023: 983-1001
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuangLC0LYHZLYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangLC0LYHZLYZ23
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. ACL (1) 2023: 8590-8604
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YeHRJLHYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YeHRJLHYZ23
Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training. ACL (1) 2023: 9317-9331
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangHZZLYM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangHZZLYM23
Chunfeng Wang, Peisong Huang, Yuxiang Zou, Haoyu Zhang, Shichao Liu, Xiang Yin, Zejun Ma:
LiteG2P: A Fast, Light and High Accuracy Model for Grapheme-to-Phoneme Conversion. ICASSP 2023: 1-5
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/LiW0MK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/LiW0MK23
Zhi Li, Pengfei Wei, Xiang Yin, Zejun Ma, Alex C. Kot:
Virtual Try-On with Pose-Garment Keypoints Guided Inpainting. ICCV 2023: 22731-22740
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuangHY0LLYLYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangHY0LLYLYZ23
Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. ICML 2023: 13916-13932
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/QuYWLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/QuYWLM23
Xinghua Qu, Xiang Yin, Pengfei Wei, Lu Lu, Zejun Ma:
AudioQR: Deep Neural Audio Watermarks For QR Code. IJCAI 2023: 6192-6200
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Song0LWW00M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Song0LWW00M23
Kun Song, Yi Ren, Yi Lei, Chunfeng Wang, Kun Wei, Lei Xie, Xiang Yin, Zejun Ma:
StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation. INTERSPEECH 2023: 42-46
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Wei0WLQXM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Wei0WLQXM23
Pengfei Wei, Xiang Yin, Chunfeng Wang, Zhonghao Li, Xinghua Qu, Zhiqiang Xu, Zejun Ma:
S2CD: Self-heuristic Speaker Content Disentanglement for Any-to-Any Voice Conversion. INTERSPEECH 2023: 2288-2292
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CongZL0W00M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CongZL0W00M23
Yahuan Cong, Haoyu Zhang, Haopeng Lin, Shichao Liu, Chunfeng Wang, Yi Ren, Xiang Yin, Zejun Ma:
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech. INTERSPEECH 2023: 5486-5490
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0003ZLYMJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0003ZLYMJ23
Yuchen Liu, Haoyu Zhang, Shichao Liu, Xiang Yin, Zejun Ma, Qin Jin:
Emotionally Situated Text-to-Speech Synthesis in User-Agent Conversation. ACM Multimedia 2023: 5966-5974
[c13]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WeiKQ0X0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WeiKQ0X0023
Pengfei Wei, Lingdong Kong, Xinghua Qu, Yi Ren, Zhiqiang Xu, Jing Jiang, Xiang Yin:
Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective. NeurIPS 2023
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/QuLS0OLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/QuLS0OLM23
Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma:
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions and Prospects. SIGIR 2023: 2701-2711
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-12661
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-12661
Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. CoRR abs/2301.12661 (2023)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01086
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01086
Chunfeng Wang, Peisong Huang, Yuxiang Zou, Haoyu Zhang, Shichao Liu, Xiang Yin, Zejun Ma:
LiteG2P: A fast, light and high accuracy model for grapheme-to-phoneme conversion. CoRR abs/2303.01086 (2023)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-00787
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-00787
Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao:
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR abs/2305.00787 (2023)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10763
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10763
Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training. CoRR abs/2305.10763 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15403
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15403
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. CoRR abs/2305.15403 (2023)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17732
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17732
Kun Song, Yi Ren, Yi Lei, Chunfeng Wang, Kun Wei, Lei Xie, Xiang Yin, Zejun Ma:
StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation. CoRR abs/2305.17732 (2023)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18474
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18474
Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao:
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation. CoRR abs/2305.18474 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02236
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02236
Luping Liu, Zijian Zhang, Yi Ren, Rongjie Huang, Xiang Yin, Zhou Zhao:
Detector Guidance for Multi-Object Text-to-Image Generation. CoRR abs/2306.02236 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03504
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03504
Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis. CoRR abs/2306.03504 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03509
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03509
Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. CoRR abs/2306.03509 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-08219
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-08219
Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma:
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects. CoRR abs/2306.08219 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-15304
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-15304
Yahuan Cong, Haoyu Zhang, Haopeng Lin, Shichao Liu, Chunfeng Wang, Yi Ren, Xiang Yin, Zejun Ma:
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech. CoRR abs/2306.15304 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-07218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-07218
Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. CoRR abs/2307.07218 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-15016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-15016
Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin:
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model. CoRR abs/2308.15016 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-11947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-11947
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling. CoRR abs/2312.11947 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-15946
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-15946
Bo Han, Yi Ren, Hao Peng, Teng Zhang, Zeyu Ling, Xiang Yin, Feilin Han:
EnchantDance: Unveiling the Potential of Music-Driven Dance Movement. CoRR abs/2312.15946 (2023)
2022
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuTWBGYM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuTWBGYM22
Jingning Xu, Benlai Tang, Mingjie Wang, Siyuan Bian, Wenyi Guo, Xiang Yin, Zejun Ma:
Towards Using Clothes Style Transfer for Scenario-Aware Person Video Generation. ICASSP 2022: 1745-1749
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenWP022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenWP022
Zikai Chen, Lin Wu, Junjie Pan, Xiang Yin:
An Automatic Soundtracking System for Text-to-Speech Audiobooks. INTERSPEECH 2022: 476-480
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLT0WYM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLT0WYM22
Chao Wang, Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Yibiao Yu, Zejun Ma:
Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding. INTERSPEECH 2022: 4287-4291
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-06260
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-06260
Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma:
Towards Realistic Visual Dubbing with Heterogeneous Sources. CoRR abs/2201.06260 (2022)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-04922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-04922
Wudi Bao, Junhui Zhang, Junjie Pan, Xiang Yin, Zejun Ma:
A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation. CoRR abs/2206.04922 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-07365
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-07365
Pengfei Wei, Lingdong Kong, Xinghua Qu, Xiang Yin, Zhiqiang Xu, Jing Jiang, Zejun Ma:
Unsupervised Video Domain Adaptation: A Disentanglement Perspective. CoRR abs/2208.07365 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-05805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-05805
Junhui Zhang, Junjie Pan, Xiang Yin, Zejun Ma:
Direct Speech-to-speech Translation without Textual Annotation using Bottleneck Features. CoRR abs/2212.05805 (2022)
2021
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PanWYWXM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PanWYWXM21
Junjie Pan, Lin Wu, Xiang Yin, Pengfei Wu, Chenchang Xu, Zejun Ma:
A Chapter-Wise Understanding System for Text-To-Speech in Chinese Novels. ICASSP 2021: 6069-6073
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiTYWXSM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiTYWXSM21
Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Ling Xu, Chen Shen, Zejun Ma:
PPG-Based Singing Voice Conversion with Adversarial Representation Learning. ICASSP 2021: 7073-7077
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZouLYLWZM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZouLYLWZM21
Yuxiang Zou, Shichao Liu, Xiang Yin, Haopeng Lin, Chunfeng Wang, Haoyu Zhang, Zejun Ma:
Fine-Grained Prosody Modeling in Neural Speech Synthesis Using ToBI Representation. Interspeech 2021: 3146-3150
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/GuYRWTZCWM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/GuYRWTZCWM21
Yu Gu, Xiang Yin, Yonghui Rao, Yuan Wan, Benlai Tang, Yang Zhang, Jitong Chen, Yuxuan Wang, Zejun Ma:
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders. ISCSLP 2021: 1-5
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XieLBTYYWYZM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XieLBTYYWYZM21
Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma:
Towards Realistic Visual Dubbing with Heterogeneous Sources. ACM Multimedia 2021: 1739-1747
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04153
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04153
Pengfei Wu, Junjie Pan, Chenchang Xu, Junhui Zhang, Lin Wu, Xiang Yin, Zejun Ma:
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech. CoRR abs/2110.04153 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04754
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04754
Chao Wang, Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Yibiao Yu, Zejun Ma:
Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding. CoRR abs/2110.04754 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-11894
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-11894
Jingning Xu, Benlai Tang, Mingjie Wang, Siyuan Bian, Wenyi Guo, Xiang Yin, Zejun Ma:
Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation. CoRR abs/2110.11894 (2021)
2020
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/XuCWCZZWCYZJWL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XuCWCZZWCYZJWL20
Runxin Xu, Jun Cao, Mingxuan Wang, Jiaze Chen, Hao Zhou, Ying Zeng, Yuping Wang, Li Chen, Xiang Yin, Xijin Zhang, Songcheng Jiang, Yuxuan Wang, Lei Li:
Xiaomingbot: A Multilingual Robot News Reporter. ACL (demo) 2020: 1-8
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PanYZLZMW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PanYZLZMW20
Junjie Pan, Xiang Yin, Zhiling Zhang, Shichao Liu, Yang Zhang, Zejun Ma, Yuxuan Wang:
A Unified Sequence-to-Sequence Front-End Model for Mandarin Text-to-Speech Synthesis. ICASSP 2020: 6689-6693
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangPY0LZWM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangPY0LZWM20
Junhui Zhang, Junjie Pan, Xiang Yin, Chen Li, Shichao Liu, Yang Zhang, Yuxuan Wang, Zejun Ma:
A Hybrid Text Normalization System Using Multi-Head Self-Attention For Mandarin. ICASSP 2020: 6694-6698
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-11012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-11012
Yu Gu, Xiang Yin, Yonghui Rao, Yuan Wan, Benlai Tang, Yang Zhang, Jitong Chen, Yuxuan Wang, Zejun Ma:
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders. CoRR abs/2004.11012 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09271
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09271
Wenjie Li, Benlai Tang, Xiang Yin, Yushi Zhao, Wei Li, Kang Wang, Hao Huang, Yuxuan Wang, Zejun Ma:
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech. CoRR abs/2005.09271 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-08005
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-08005
Runxin Xu, Jun Cao, Mingxuan Wang, Jiaze Chen, Hao Zhou, Ying Zeng, Yuping Wang, Li Chen, Xiang Yin, Xijin Zhang, Songcheng Jiang, Yuxuan Wang, Lei Li:
Xiaomingbot: A Multilingual Robot News Reporter. CoRR abs/2007.08005 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-14804
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14804
Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Ling Xu, Chen Shen, Zejun Ma:
PPG-based singing voice conversion with adversarial representation learning. CoRR abs/2010.14804 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-04111
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-04111
Junjie Pan, Xiang Yin, Zhiling Zhang, Shichao Liu, Yang Zhang, Zejun Ma, Yuxuan Wang:
A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis. CoRR abs/1911.04111 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-04128
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-04128
Junhui Zhang, Junjie Pan, Xiang Yin, Chen Li, Shichao Liu, Yang Zhang, Yuxuan Wang, Zejun Ma:
A hybrid text normalization system using multi-head self-attention for mandarin. CoRR abs/1911.04128 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.