default search action

combined dblp search
author search
venue search
publication search

ask others

Kou Tanaka

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KameokaKTHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KameokaKTHS24
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion With Annealed Langevin Dynamics. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2213-2226 (2024)
[c37]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/eusipco/KondoKTKH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KondoKTKH24
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko, Noboru Harada:
Learning to Assess Subjective Impressions from Speech. EUSIPCO 2024: 381-385
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KondoKTK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KondoKTK24
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
Selecting N-Lowest Scores for Training MOS Prediction Models. ICASSP 2024: 1451-1455
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KanekoKT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanekoKT24
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka:
Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator. ICASSP 2024: 12561-12565
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-16464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-16464
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka:
Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator. CoRR abs/2403.16464 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-02245
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-02245
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Yuto Kondo:
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation. CoRR abs/2409.02245 (2024)
2023
[j7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/SekiKKT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/SekiKKT23
Shogo Seki, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka:
Non-Parallel Whisper-to-Normal Speaking Style Conversion Using Auxiliary Classifier Variational Autoencoder. IEEE Access 11: 44590-44599 (2023)
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/SekiIKKTH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SekiIKKTH23
Shogo Seki, Kanami Imamura, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Noboru Harada:
W2N-AVSC: Audiovisual Extension For Whisper-To-Normal Speech Conversion. EUSIPCO 2023: 296-300
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KanekoKTS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanekoKTS23
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis. ICASSP 2023: 1-5
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SekiKTK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SekiKTK23
Shogo Seki, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
JSV-VC: Jointly Trained Speaker Verification and Voice Conversion Models. ICASSP 2023: 1-5
[c31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanakaKKS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanakaKKS23
Kou Tanaka, Takuhiro Kaneko, Hirokazu Kameoka, Shogo Seki:
CFVC: Conditional Filtering for Controllable Voice Conversion. INTERSPEECH 2023: 2058-2062
[c30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanekoKTS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanekoKTS23
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN. INTERSPEECH 2023: 4369-4373
[c29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/TanakaKK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/TanakaKK23
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko:
PRVAE-VC: Non-Parallel Many-to-Many Voice Conversion with Perturbation-Resistant Variational Autoencoder. SSW 2023: 88-93
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-13909
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-13909
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis. CoRR abs/2303.13909 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-07117
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-07117
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN. CoRR abs/2308.07117 (2023)
2022
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KanekoTKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanekoTKS22
Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki:
ISTFTNET: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform. ICASSP 2022: 6207-6211
[c27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KameokaKST22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KameokaKST22
Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki, Kou Tanaka:
CAUSE: Crossmodal Action Unit Sequence Estimation from Speech. INTERSPEECH 2022: 506-510
[c26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanekoKTS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanekoKTS22
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks. INTERSPEECH 2022: 1631-1635
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TanakaKKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TanakaKKS22
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki:
Distilling Sequence-to-Sequence Voice Conversion Models for Streaming Conversion Applications. SLT 2022: 1022-1028
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-02395
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-02395
Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki:
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform. CoRR abs/2203.02395 (2022)
2021
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KameokaHTKHT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KameokaHTKHT21
Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda:
Many-to-Many Voice Transformer Network. IEEE ACM Trans. Audio Speech Lang. Process. 29: 656-670 (2021)
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KanekoKTH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanekoKTH21
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
Maskcyclegan-VC: Learning Non-Parallel Voice Conversion with Filling in Frames. ICASSP 2021: 5919-5923
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-12841
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-12841
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames. CoRR abs/2102.12841 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-06900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06900
Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion. CoRR abs/2104.06900 (2021)
2020
[j5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/KameokaTKKH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KameokaTKKH20
Hirokazu Kameoka, Kou Tanaka, Damian Kwasny, Takuhiro Kaneko, Nobukatsu Hojo:
ConvS2S-VC: Fully Convolutional Sequence-to-Sequence Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1849-1863 (2020)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KameokaKTH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KameokaKTH20
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
Nonparallel Voice Conversion With Augmented Classifier Star Generative Adversarial Networks. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2982-2995 (2020)
[c23]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/apsipa/EshghiKTKT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/EshghiKTKT20
Mohammad Eshghi, Kazuhiro Kobayashi, Kou Tanaka, Hirokazu Kameoka, Tomoki Toda:
Phoneme Embeddings on Predicting Fundamental Frequency Pattern for Electrolaryngeal Speech. APSIPA 2020: 572-577
[c22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanekoKTH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanekoKTH20
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion. INTERSPEECH 2020: 2017-2021
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/sigdial/ArimotoHTKSSI20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigdial/ArimotoHTKSSI20
Tsunehiro Arimoto, Ryuichiro Higashinaka, Kou Tanaka, Takahito Kawanishi, Hiroaki Sugiyama, Hiroshi Sawada, Hiroshi Ishiguro:
Collection and Analysis of Dialogues Provided by Two Speakers Acting as One. SIGdial 2020: 323-328
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08445
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08445
Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda:
Many-to-Many Voice Transformer Network. CoRR abs/2005.08445 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-02977
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02977
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics. CoRR abs/2010.02977 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11672
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11672
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion. CoRR abs/2010.11672 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KameokaKTH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KameokaKTH19
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
ACVAE-VC: Non-Parallel Voice Conversion With Auxiliary Classifier Variational Autoencoder. IEEE ACM Trans. Audio Speech Lang. Process. 27(9): 1432-1443 (2019)
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanakaKKH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanakaKKH19
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
ATTS2S-VC: Sequence-to-sequence Voice Conversion with Attention and Context Preservation Mechanisms. ICASSP 2019: 6805-6809
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KanekoKTH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanekoKTH19
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
Cyclegan-VC2: Improved Cyclegan-based Non-parallel Voice Conversion. ICASSP 2019: 6820-6824
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanekoKTH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanekoKTH19
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion. INTERSPEECH 2019: 679-683
[c17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/EshghiTKKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/EshghiTKKT19
Mohammad Eshghi, Kou Tanaka, Kazuhiro Kobayashi, Hirokazu Kameoka, Tomoki Toda:
An Investigation of Features for Fundamental Frequency Pattern Prediction in Electrolaryngeal Speech Enhancement. SSW 2019: 251-256
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-02892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-02892
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation. CoRR abs/1904.02892 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-04540
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04540
Hirokazu Kameoka, Kou Tanaka, Aaron Valero Puche, Yasunori Ohishi, Takuhiro Kaneko:
Crossmodal Voice Conversion. CoRR abs/1904.04540 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-04631
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04631
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion. CoRR abs/1904.04631 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-12279
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-12279
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion. CoRR abs/1907.12279 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1911-01601
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-01601
Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019)
2018
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/HojoKTK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HojoKTK18
Nobukatsu Hojo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
Automatic Speech Pronunciation Correction with Dynamic Frequency Warping-Based Spectral Conversion. EUSIPCO 2018: 2310-2314
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/OyamadaKKTHA18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/OyamadaKKTHA18
Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando:
Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram. EUSIPCO 2018: 2514-2518
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanakaKM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanakaKM18
Kou Tanaka, Hirokazu Kameoka, Kazuho Morikawa:
Vae-Space: Deep Generative Model of Voice Fundamental Frequency Contours. ICASSP 2018: 5779-5783
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/KameokaKTH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/KameokaKTH18
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC: non-parallel many-to-many Voice Conversion Using Star Generative Adversarial Networks. SLT 2018: 266-273
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TanakaKHK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TanakaKHK18
Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka:
Synthetic-to-Natural Speech Waveform Conversion Using Cycle-Consistent Adversarial Networks. SLT 2018: 632-639
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-02181
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-02181
Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando:
Generative adversarial network-based approach to signal reconstruction from magnitude spectrograms. CoRR abs/1804.02181 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1806-02169
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-02169
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks. CoRR abs/1806.02169 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1808-05092
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-05092
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder. CoRR abs/1808.05092 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1809-10288
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-10288
Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka:
WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks. CoRR abs/1809.10288 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-01609
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-01609
Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo:
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion. CoRR abs/1811.01609 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-04076
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-04076
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms. CoRR abs/1811.04076 (2018)
2017
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/TanakaT017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/TanakaT017
Kou Tanaka, Tomoki Toda, Satoshi Nakamura:
A Vibration Control Method of an Electrolarynx Based on Statistical F₀ Pattern Prediction. IEICE Trans. Inf. Syst. 100-D(9): 2165-2173 (2017)
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanakaKT017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanakaKT017
Kou Tanaka, Hirokazu Kameoka, Tomoki Toda, Satoshi Nakamura:
Physically Constrained Statistical F₀ Prediction for Electrolaryngeal Speech Enhancement. INTERSPEECH 2017: 1069-1073
2016
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/TanakaTNN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/TanakaTNN16
Kou Tanaka, Tomoki Toda, Graham Neubig, Satoshi Nakamura:
Real-time vibration control of an electrolarynx based on statistical F0 contour prediction. EUSIPCO 2016: 1333-1337
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanakaKTN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanakaKTN16
Kou Tanaka, Hirokazu Kameoka, Tomoki Toda, Satoshi Nakamura:
Statistical F0 prediction for electrolaryngeal speech enhancement considering generative process of F0 contours within product of experts framework. ICASSP 2016: 5665-5669
2015
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/assets/TanakaTNSN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/assets/TanakaTNSN15
Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction. ASSETS 2015: 435-436
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/TakamichiKTT015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/TakamichiKTT015
Shinnosuke Takamichi, Kazuhiro Kobayashi, Kou Tanaka, Tomoki Toda, Satoshi Nakamura:
The NAIST Text-to-Speech System for the Blizzard Challenge 2015. Blizzard Challenge 2015
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TajiriTTNSN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TajiriTTNSN15
Yusuke Tajiri, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments. INTERSPEECH 2015: 2769-2773
2014
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/TanakaTNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/TanakaTNSN14
Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation. IEICE Trans. Inf. Syst. 97-D(6): 1429-1437 (2014)
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TanakaTNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TanakaTNSN14
Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An inter-speaker evaluation through simulation of electrolarynx control based on statistical F0 prediction. APSIPA 2014: 1-4
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TsurutaTTNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TsurutaTTNSN14
Sakura Tsuruta, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments. APSIPA 2014: 1-4
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanakaTNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanakaTNSN14
Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement. ICASSP 2014: 4488-4492
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanakaTNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanakaTNSN14
Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Direct F₀ control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation. INTERSPEECH 2014: 31-35
2013
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanakaTNSN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanakaTNSN13
Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion. INTERSPEECH 2013: 3067-3071

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.