default search action

combined dblp search
author search
venue search
publication search

ask others

Shigeki Karita

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-06227
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-06227
Min Ma, Yuma Koizumi, Shigeki Karita, Heiga Zen, Jason Riesa, Haruko Ishikawa, Michiel Bacchiani:
FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks. CoRR abs/2408.06227 (2024)
2023
[c25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoizumiZKDYMB0H23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoizumiZKDYMB0H23
Yuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe, Nobuyuki Morioka, Michiel Bacchiani, Yu Zhang, Wei Han, Ankur Bapna:
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus. INTERSPEECH 2023: 5496-5500
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/KoizumiZKDYMZHBB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/KoizumiZKDYMZHBB23
Yuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe, Nobuyuki Morioka, Yu Zhang, Wei Han, Ankur Bapna, Michiel Bacchiani:
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations. WASPAA 2023: 1-5
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01664
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01664
Yuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe, Nobuyuki Morioka, Yu Zhang, Wei Han, Ankur Bapna, Michiel Bacchiani:
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations. CoRR abs/2303.01664 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18802
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18802
Yuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe, Nobuyuki Morioka, Michiel Bacchiani, Yu Zhang, Wei Han, Ankur Bapna:
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus. CoRR abs/2305.18802 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-04530
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-04530
Shigeki Karita, Richard Sproat, Haruko Ishikawa:
Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency. CoRR abs/2306.04530 (2023)
2022
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KuboKB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KuboKB22
Yotaro Kubo, Shigeki Karita, Michiel Bacchiani:
Knowledge Transfer from Large-Scale Pretrained Language Models to End-To-End Speech Recognizers. ICASSP 2022: 8512-8516
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoizumiKNPB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoizumiKNPB22
Yuma Koizumi, Shigeki Karita, Arun Narayanan, Sankaran Panchapagesan, Michiel Bacchiani:
SNRi Target Training for Joint Speech Enhancement and Recognition. INTERSPEECH 2022: 1173-1177
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-07894
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-07894
Yotaro Kubo, Shigeki Karita, Michiel Bacchiani:
Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers. CoRR abs/2202.07894 (2022)
2021
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KaritaKBJ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KaritaKBJ21
Shigeki Karita, Yotaro Kubo, Michiel Adriaan Unico Bacchiani, Llion Jones:
A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition. Interspeech 2021: 2092-2096
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TjandraPZK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TjandraPZK21
Andros Tjandra, Ruoming Pang, Yu Zhang, Shigeki Karita:
Unsupervised Learning of Disentangled Speech Content and Style Representation. Interspeech 2021: 4089-4093
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/KoizumiKWEHJB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/KoizumiKWEHJB21
Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement. WASPAA 2021: 161-165
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-05111
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-05111
Shigeki Karita, Yotaro Kubo, Michiel Adriaan Unico Bacchiani, Llion Jones:
A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition. CoRR abs/2106.05111 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-15813
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-15813
Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement. CoRR abs/2106.15813 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-00764
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-00764
Yuma Koizumi, Shigeki Karita, Arun Narayanan, Sankaran Panchapagesan, Michiel Bacchiani:
SNRi Target Training for Joint Speech Enhancement and Recognition. CoRR abs/2111.00764 (2021)
2020
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/InagumaKDKYHW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/InagumaKDKYHW20
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Yalta, Tomoki Hayashi, Shinji Watanabe:
ESPnet-ST: All-in-One Speech Translation Toolkit. ACL (demo) 2020: 302-311
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoriyaOKSTAMSD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoriyaOKSTAMSD20
Takafumi Moriya, Tsubasa Ochiai, Shigeki Karita, Hiroshi Sato, Tomohiro Tanaka, Takanori Ashihara, Ryo Masumura, Yusuke Shinohara, Marc Delcroix:
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. INTERSPEECH 2020: 546-550
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-10234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-10234
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Enrique Yalta Soplin, Tomoki Hayashi, Shinji Watanabe:
ESPnet-ST: All-in-One Speech Translation Toolkit. CoRR abs/2004.10234 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12973
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12973
Andros Tjandra, Ruoming Pang, Yu Zhang, Shigeki Karita:
Unsupervised Learning of Disentangled Speech Content and Style Representation. CoRR abs/2010.12973 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-13006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-13006
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang:
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans. CoRR abs/2012.13006 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KaritaWWYZCHHIJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KaritaWWYZCHHIJ19
Shigeki Karita, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto:
A Comparative Study on Transformer vs RNN in Speech Applications. ASRU 2019: 449-456
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KaritaWIDON19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KaritaWIDON19
Shigeki Karita, Shinji Watanabe, Tomoharu Iwata, Marc Delcroix, Atsunori Ogawa, Tomohiro Nakatani:
Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders. ICASSP 2019: 6166-6170
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DelcroixWOKKON19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DelcroixWOKKON19
Marc Delcroix, Shinji Watanabe, Tsubasa Ochiai, Keisuke Kinoshita, Shigeki Karita, Atsunori Ogawa, Tomohiro Nakatani:
End-to-End SpeakerBeam for Single Channel Target Speech Recognition. INTERSPEECH 2019: 451-455
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KaritaSWDON19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KaritaSWDON19
Shigeki Karita, Nelson Enrique Yalta Soplin, Shinji Watanabe, Marc Delcroix, Atsunori Ogawa, Tomohiro Nakatani:
Improving Transformer-Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration. INTERSPEECH 2019: 1408-1412
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OgawaDKN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OgawaDKN19
Atsunori Ogawa, Marc Delcroix, Shigeki Karita, Tomohiro Nakatani:
Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders. INTERSPEECH 2019: 3900-3904
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-06317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-06317
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang:
A Comparative Study on Transformer vs RNN in Speech Applications. CoRR abs/1909.06317 (2019)
2018
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HiguchiKIKN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HiguchiKIKN18
Takuya Higuchi, Keisuke Kinoshita, Nobutaka Ito, Shigeki Karita, Tomohiro Nakatani:
Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming. ICASSP 2018: 531-535
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KaritaODN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KaritaODN18
Shigeki Karita, Atsunori Ogawa, Marc Delcroix, Tomohiro Nakatani:
Sequence Training of Encoder-Decoder Model Using Policy Gradient for End-to-End Speech Recognition. ICASSP 2018: 5839-5843
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OgawaDKN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OgawaDKN18
Atsunori Ogawa, Marc Delcroix, Shigeki Karita, Tomohiro Nakatani:
Rescoring N-Best Speech Recognition List Based on One-on-One Hypothesis Comparison Using Encoder-Classifier Model. ICASSP 2018: 6099-6103
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KaritaWIOD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KaritaWIOD18
Shigeki Karita, Shinji Watanabe, Tomoharu Iwata, Atsunori Ogawa, Marc Delcroix:
Semi-Supervised End-to-End Speech Recognition. INTERSPEECH 2018: 2-6
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WatanabeHKHNUSH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WatanabeHKHNUSH18
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. INTERSPEECH 2018: 2207-2211
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DelcroixWOKN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DelcroixWOKN18
Marc Delcroix, Shinji Watanabe, Atsunori Ogawa, Shigeki Karita, Tomohiro Nakatani:
Auxiliary Feature Based Adaptation of End-to-end ASR Systems. INTERSPEECH 2018: 2444-2448
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00015
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. CoRR abs/1804.00015 (2018)
2017
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/hscma/ArakiIDOKHYTKN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hscma/ArakiIDOKHYTKN17
Shoko Araki, Nobutaka Ito, Marc Delcroix, Atsunori Ogawa, Keisuke Kinoshita, Takuya Higuchi, Takuya Yoshioka, Dung T. Tran, Shigeki Karita, Tomohiro Nakatani:
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming. HSCMA 2017: 16-20
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TranDKHON17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TranDKHON17
Dung T. Tran, Marc Delcroix, Shigeki Karita, Michael Hentschel, Atsunori Ogawa, Tomohiro Nakatani:
Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling. INTERSPEECH 2017: 1596-1600
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KaritaODN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KaritaODN17
Shigeki Karita, Atsunori Ogawa, Marc Delcroix, Tomohiro Nakatani:
Forward-Backward Convolutional LSTM for Acoustic Modeling. INTERSPEECH 2017: 1601-1605
2015
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YoshiokaKN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YoshiokaKN15
Takuya Yoshioka, Shigeki Karita, Tomohiro Nakatani:
Far-field speech recognition using CNN-DNN-HMM with convolution in time. ICASSP 2015: 4360-4364
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/KaritaNKIB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/KaritaNKIB15
Shigeki Karita, Kumi Nakamura, Kazuhiro Kono, Yoshimichi Ito, Noboru Babaguchi:
Owner authentication for mobile devices using motion gestures based on multi-owner template update. ICME Workshops 2015: 1-6

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.