Shigeki Sagayama
Person information
- affiliation: Meiji University, Tokyo, Japan
2020 – today
- 2022
- [c175]Keiko Ochi, Nobutaka Ono, Keiho Owada, Miho Kuroda, Shigeki Sagayama, Hidenori Yamasue:
Entrainment Analysis for Assessment of Autistic Speech Prosody Using Bottleneck Features of Deep Neural Network. ICASSP 2022: 8492-8496 - [c174]Keiko Ochi, Nobutaka Ono, Keiho Owada, Miho Kuroda, Shigeki Sagayama, Hidenori Yamasue:
Use of Nods Less Synchronized with Turn-Taking and Prosody During Conversations in Adults with Autism. INTERSPEECH 2022: 1136-1140 - 2021
- [c173]Keiko Ochi, Masaki Kojima, Keiho Owada, Nobutaka Ono, Shigeki Sagayama, Hidenori Yamasue:
Pitch and Volume Stability in the Communicative Response of Adults with Autism. APSIPA ASC 2021: 428-432 - [c172]Yasuyuki Saito, Honoka Fujii, Shigeki Sagayama:
Semi-automatic music piece creation based on impression words extracted from object and background in color image. GCCE 2021: 268-272 - 2020
- [j39]Junya Koguchi, Shinnosuke Takamichi, Masanori Morise, Hiroshi Saruwatari, Shigeki Sagayama:
DNN-Based Full-Band Speech Synthesis Using GMM Approximation of Spectral Envelope. IEICE Trans. Inf. Syst. 103-D(12): 2673-2681 (2020) - [j38]Christoph M. Wilk, Shigeki Sagayama:
A Parameterized Harmony Model for Automatic Music Completion. J. Inf. Process. 28: 258-266 (2020) - [c171]Yasuyuki Saito, Yasuji Sakai, Yuu Igarashi, Suguru Agata, Eita Nakamura, Shigeki Sagayama:
Music Recreation in Nursing Home using Automatic Music Accompaniment System and Score of VLN. LifeTech 2020: 127-131
2010 – 2019
- 2019
- [j37]Christoph M. Wilk, Shigeki Sagayama:
Automatic Music Completion Based on Joint Optimization of Harmony Progression and Voicing. J. Inf. Process. 27: 693-700 (2019) - [c170]Christoph M. Wilk, Shigeki Sagayama:
Polyphonic Voicing Optimization for Automatic Music Completion. APSIPA 2019: 375-382 - [c169]Matsuto Hori, Christoph M. Wilk, Shigeki Sagayama:
Piano Practice Evaluation and Visualization by HMM for Arbitrary Jumps and Mistakes. CISS 2019: 1-5 - [c168]You Li, Christoph M. Wilk, Takeshi Hori, Shigeki Sagayama:
Automatic Piano Reduction of Orchestral Music Based on Musical Entropy. CISS 2019: 1-5 - [c167]Daiki Mitsumoto, Takeshi Hori, Shigeki Sagayama, Hidenori Yamasue, Keiho Owada, Masaki Kojima, Keiko Ochi, Nobutaka Ono:
Autism Spectrum Disorder Discrimination Based on Voice Activities Related to Fillers and Laughter. CISS 2019: 1-6 - 2018
- [c166]Christoph M. Wilk, Shigeki Sagayama:
Harmony and Voicing Interpolation for Automatic Music Composition Assistance. APSIPA 2018: 89-98 - [c165]Takeshi Hori, Kazuyuki Nakamura, Shigeki Sagayama:
Multiresolutional Hierarchical Bayesian NMF for Detailed Audio Analysis of Music Performances. APSIPA 2018: 1626-1635 - [c164]Takuya Takahashi, Takeshi Hori, Christoph M. Wilk, Shigeki Sagayama:
Semi-Supervised NMF in the chroma Domain Applied to Music Harmony Estimation. APSIPA 2018: 1636-1641 - [c163]Junya Koguchi, Shigeki Sagayama:
Composite Wavelet Model for Stability-Oriented Speech Synthesis from Cepstral Features. APSIPA 2018: 1697-1701 - 2017
- [j36]Eita Nakamura, Kazuyoshi Yoshii, Shigeki Sagayama:
Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 794-806 (2017) - [c162]Takeshi Hori, Kazuyuki Nakamura, Shigeki Sagayama:
Music chord recognition from audio data using bidirectional encoder-decoder LSTMs. APSIPA 2017: 1312-1315 - [c161]Gen Hori, Shigeki Sagayama:
Variant of Viterbi algorithm based on p-Norm. DSP 2017: 1-5 - [i4]Eita Nakamura, Kazuyoshi Yoshii, Shigeki Sagayama:
Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices. CoRR abs/1701.08343 (2017) - 2016
- [j35]Hideyuki Tachibana, Yuu Mizuno, Nobutaka Ono, Shigeki Sagayama:
A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques. J. Inf. Process. 24(3): 470-482 (2016) - [j34]Tomohiko Nakamura, Eita Nakamura, Shigeki Sagayama:
Real-Time Audio-to-Score Alignment of Music Performances Containing Errors and Arbitrary Repeats and Skips. IEEE ACM Trans. Audio Speech Lang. Process. 24(2): 329-339 (2016) - [c160]Gen Hori, Shigeki Sagayama:
Minimax Viterbi Algorithm for HMM-Based Guitar Fingering Decision. ISMIR 2016: 448-453 - [c159]Yasuhiro Hamada, Nobutaka Ono, Shigeki Sagayama:
Non-filter waveform generation from cepstrum using spectral phase reconstruction. SSW 2016: 27-31 - 2015
- [j33]Nobutaka Ito, Emmanuel Vincent, Tomohiro Nakatani, Nobutaka Ono, Shoko Araki, Shigeki Sagayama:
Blind Suppression of Nonstationary Diffuse Acoustic Noise Based on Spatial Covariance Matrix Decomposition. J. Signal Process. Syst. 79(2): 145-157 (2015) - [c158]Eita Nakamura, Shigeki Sagayama:
Automatic Piano Reduction from Ensemble Scores Based on Merged-Output Hidden Markov Model. ICMC 2015 - [c157]Eita Nakamura, Philippe Cuvillier, Arshia Cont, Nobutaka Ono, Shigeki Sagayama:
Autoregressive Hidden Semi-Markov Model of Symbolic Music Performance for Score Following. ISMIR 2015: 392-398 - [i3]Tomohiko Nakamura, Eita Nakamura, Shigeki Sagayama:
Real-Time Audio-to-Score Alignment of Music Performances Containing Errors and Arbitrary Repeats and Skips. CoRR abs/1512.07748 (2015) - 2014
- [j32]Hideyuki Tachibana, Nobutaka Ono, Shigeki Sagayama:
Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 228-237 (2014) - [j31]Hideyuki Tachibana, Nobutaka Ono, Hirokazu Kameoka, Shigeki Sagayama:
Harmonic/percussive sound separation based on anisotropic smoothness of spectrograms. IEEE ACM Trans. Audio Speech Lang. Process. 22(12): 2059-2073 (2014) - [c156]Toru Taniguchi, Nobutaka Ono, Akinori Kawamura, Shigeki Sagayama:
An auxiliary-function approach to online independent vector analysis for real-time blind source separation. HSCMA 2014: 107-111 - [c155]Gen Hori, Shigeki Sagayama:
HMM-Based Automatic Arrangement for Guitars with Transposition and its Implementation. ICMC 2014 - [c154]Eita Nakamura, Nobutaka Ono, Yasuyuki Saito, Shigeki Sagayama:
Merged-Output Hidden Markov Model for Score Following of MIDI Performance with Ornaments, Desynchronized Voices, Repeats and Skips. ICMC 2014 - [c153]Eita Nakamura, Nobutaka Ono, Shigeki Sagayama:
Merged-Output HMM for Piano Fingering of Both Hands. ISMIR 2014: 531-536 - [i2]Eita Nakamura, Tomohiko Nakamura, Yasuyuki Saito, Nobutaka Ono, Shigeki Sagayama:
Outer-Product Hidden Markov Model and Polyphonic MIDI Score Following. CoRR abs/1404.2313 (2014) - [i1]Eita Nakamura, Nobutaka Ono, Shigeki Sagayama, Kenji Watanabe:
A Stochastic Temporal Model of Polyphonic MIDI Performance with Ornaments. CoRR abs/1404.2314 (2014) - 2013
- [j30]Hirokazu Kameoka, Misa Sato, Takuma Ono, Nobutaka Ono, Shigeki Sagayama:
Bayesian Nonparametric Approach to Blind Separation of Infinitely Many Sparse Sources. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 96-A(10): 1928-1937 (2013) - [j29]Gen Hori, Hirokazu Kameoka, Shigeki Sagayama:
Input-Output HMM Applied to Automatic Arrangement for Guitars. Inf. Media Technol. 8(2): 477-484 (2013) - [j28]Gen Hori, Hirokazu Kameoka, Shigeki Sagayama:
Input-Output HMM Applied to Automatic Arrangement for Guitars. J. Inf. Process. 21(2): 264-271 (2013) - [j27]Stanislaw Andrzej Raczynski, Emmanuel Vincent, Shigeki Sagayama:
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling. IEEE Trans. Speech Audio Process. 21(9): 1830-1840 (2013) - [c152]Masato Tsuchiya, Kazuki Ochiai, Hirokazu Kameoka, Shigeki Sagayama:
Probabilistic model of two-dimensional rhythm tree structure representation for automatic transcription of polyphonic MIDI signals. APSIPA 2013: 1-6 - [c151]Tatsuma Ishihara, Hirokazu Kameoka, Kota Yoshizato, Daisuke Saito, Shigeki Sagayama:
Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence. INTERSPEECH 2013: 1017-1021 - [c150]Hirokazu Kameoka, Kota Yoshizato, Tatsuma Ishihara, Yasunori Ohishi, Kunio Kashino, Shigeki Sagayama:
Generative modeling of speech F0 contours. INTERSPEECH 2013: 1826-1830 - [c149]Nobutaka Ito, Emmanuel Vincent, Nobutaka Ono, Shigeki Sagayama:
General algorithms for estimating spectrogram and transfer functions of target signal for blind suppression of diffuse noise. MLSP 2013: 1-6 - [c148]Nobukatsu Hojo, Kota Yoshizato, Hirokazu Kameoka, Daisuke Saito, Shigeki Sagayama:
Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models. SSW 2013: 129-134 - [p3]Tae Hun Kim, Satoru Fukayama, Takuya Nishimoto, Shigeki Sagayama:
Statistical Approach to Automatic Expressive Rendition of Polyphonic Piano Music. Guide to Computing for Expressive Music Performance 2013: 145-179 - 2012
- [j26]Dong Yu, Geoffrey E. Hinton, Nelson Morgan, Jen-Tzung Chien, Shigeki Sagayama:
Introduction to the Special Section on Deep Learning for Speech and Language Processing. IEEE Trans. Speech Audio Process. 20(1): 4-6 (2012) - [c147]Kazuki Ochiai, Hirokazu Kameoka, Shigeki Sagayama:
Explicit beat structure modeling for non-negative matrix factorization-based multipitch analysis. ICASSP 2012: 133-136 - [c146]Hideyuki Tachibana, Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama:
Comparative evaluations of various harmonic/percussive sound separation algorithms based on anisotropic continuity of spectrogram. ICASSP 2012: 465-468 - [c145]Takuma Ono, Nobutaka Ono, Shigeki Sagayama:
User-guided independent vector analysis with source activity tuning. ICASSP 2012: 2417-2420 - [c144]Miquel Espi, Masakiyo Fujimoto, Daisuke Saito, Nobutaka Ono, Shigeki Sagayama:
A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection. ICASSP 2012: 4293-4296 - [c143]Hirokazu Kameoka, Masahiro Nakano, Kazuki Ochiai, Yutaka Imoto, Kunio Kashino, Shigeki Sagayama:
Constrained and regularized variants of non-negative matrix factorization incorporating music-specific constraints. ICASSP 2012: 5365-5368 - [c142]Satoru Fukayama, Daisuke Saito, Shigeki Sagayama:
Assistance for Novice Users on Creating Songs from Japanese Lyrics. ICMC 2012 - [c141]Kota Yoshizato, Hirokazu Kameoka, Daisuke Saito, Shigeki Sagayama:
Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech. INTERSPEECH 2012: 390-393 - [c140]Shigeki Matsuda, Naoya Ito, Kosuke Tsujino, Hideki Kashioka, Shigeki Sagayama:
Speaker-Dependent Voice Activity Detection Robust to Background Speech Noise. INTERSPEECH 2012: 2626-2629 - [c139]Takayoshi Oshima, Yutaka Kamamoto, Takehiro Moriya, Nobutaka Ono, Shigeki Sagayama:
Variable-length coding of ACELP gain using Entropy-Constrained VQ. ISCIT 2012: 105-109 - [c138]Hirokazu Kameoka, Kazuki Ochiai, Masahiro Nakano, Masato Tsuchiya, Shigeki Sagayama:
Context-free 2D Tree Structure Model of Musical Notes for Bayesian Modeling of Polyphonic Spectrograms. ISMIR 2012: 307-312 - [c137]Hirokazu Kameoka, Misa Sato, Takuma Ono, Nobutaka Ono, Shigeki Sagayama:
Blind Separation of Infinitely Many Sparse Sources. IWAENC 2012 - 2011
- [j25]Meinard Müller, Daniel P. W. Ellis, Anssi Klapuri, Gaël Richard, Shigeki Sagayama:
Introduction to the Special Issue on Music Signal Processing. IEEE J. Sel. Top. Signal Process. 5(6): 1085-1087 (2011) - [j24]Jun Wu, Emmanuel Vincent, Stanislaw Andrzej Raczynski, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Polyphonic Pitch Estimation and Instrument Identification by Joint Modeling of Sustained and Attack Sounds. IEEE J. Sel. Top. Signal Process. 5(6): 1124-1132 (2011) - [j23]Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama:
Computational auditory induction as a missing-data model-fitting problem with Bregman divergence. Speech Commun. 53(5): 658-676 (2011) - [j22]Emiru Tsunoo, George Tzanetakis, Nobutaka Ono, Shigeki Sagayama:
Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines. IEEE ACM Trans. Audio Speech Lang. Process. 19(4): 1003-1014 (2011) - [j21]Nobutaka Ito, Hikaru Shimizu, Nobutaka Ono, Shigeki Sagayama:
Diffuse Noise Suppression Using Crystal-Shaped Microphone Arrays. IEEE Trans. Speech Audio Process. 19(7): 2101-2110 (2011) - [c136]Jun Wu, Shigeki Sagayama:
Musical Instrument Identification Based on New Boosting Algorithm with Probabilistic Decisions. CMMR/FRSM 2011: 66-78 - [c135]Emmanuel Dupoux, Guillaume Beraud-Sudreau, Shigeki Sagayama:
Templatic features for modeling phoneme acquisition. CogSci 2011 - [c134]Jun Wu, Emmanuel Vincent, Stanislaw Andrzej Raczynski, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Multipitch estimation by joint modeling of harmonic and transient sounds. ICASSP 2011: 25-28 - [c133]Ngoc Q. K. Duong, Hideyuki Tachibana, Emmanuel Vincent, Nobutaka Ono, Rémi Gribonval, Shigeki Sagayama:
Multichannel harmonic and percussive component separation by joint modeling of spatial and spectral continuity. ICASSP 2011: 205-208 - [c132]Masahiro Nakano, Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama:
Infinite-state spectrum model for music signal analysis. ICASSP 2011: 1972-1975 - [c131]Takuho Nakano, Akisato Kimura, Hirokazu Kameoka, Shigeki Miyabe, Shigeki Sagayama, Nobutaka Ono, Kunio Kashino, Takuya Nishimoto:
Automatic video annotation via Hierarchical Topic Trajectory Model considering cross-modal correlations. ICASSP 2011: 2380-2383 - [c130]Tomoyuki Hamamura, Bunpei Irie, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Concurrent Optimization of Context Clustering and GMM for Offline Handwritten Word Recognition Using HMM. ICDAR 2011: 523-527 - [c129]Miquel Espi, Shigeki Miyabe, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Using Spectral Fluctuation of Speech in Multi-Feature HMM-Based Voice Activity Detection. INTERSPEECH 2011: 2613-2616 - [c128]Tae Hun Kim, Satoru Fukayama, Takuya Nishimoto, Shigeki Sagayama:
Polyhymnia: An Automatic Piano Performance System with Statistical Modeling of Polyphonic Expression and Musical Symbol Interpretation. NIME 2011: 96-99 - [c127]Masahiro Nakano, Jonathan Le Roux, Hirokazu Kameoka, Tomohiko Nakamura, Nobutaka Ono, Shigeki Sagayama:
Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model. WASPAA 2011: 325-328 - 2010
- [j20]Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama:
Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency. IEEE Trans. Speech Audio Process. 18(6): 1507-1516 (2010) - [c126]Keisuke Hasegawa, Nobutaka Ono, Shigeki Miyabe, Shigeki Sagayama:
Blind Estimation of Locations and Time Offsets for Distributed Recording Devices. LVA/ICA 2010: 57-64 - [c125]Nobutaka Ito, Emmanuel Vincent, Nobutaka Ono, Rémi Gribonval, Shigeki Sagayama:
Crystal-MUSIC: Accurate Localization of Multiple Sources in Diffuse Noise Environments Using Crystal-Shaped Microphone Arrays. LVA/ICA 2010: 81-88 - [c124]Jonathan Le Roux, Emmanuel Vincent, Yuu Mizuno, Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama:
Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency. LVA/ICA 2010: 89-96 - [c123]Masahiro Nakano, Jonathan Le Roux, Hirokazu Kameoka, Yu Kitano, Nobutaka Ono, Shigeki Sagayama:
Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms. LVA/ICA 2010: 149-156 - [c122]Emiru Tsunoo, Taichi Akase, Nobutaka Ono, Shigeki Sagayama:
Music mood classification by rhythm and bass-line unit pattern analysis. ICASSP 2010: 265-268 - [c121]Hideyuki Tachibana, Takuma Ono, Nobutaka Ono, Shigeki Sagayama:
Melody line estimation in homophonic music audio signals based on temporal-variability of melodic source. ICASSP 2010: 425-428 - [c120]Nobutaka Ono, Shigeki Sagayama:
R-means localization: A simple iterative algorithm for range-difference-based source localization. ICASSP 2010: 2718-2721 - [c119]Nobutaka Ito, Nobutaka Ono, Emmanuel Vincent, Shigeki Sagayama:
Designing the Wiener post-filter for diffuse noise suppression using imaginary parts of inter-channel cross-spectra. ICASSP 2010: 2818-2821 - [c118]Yu Kitano, Hirokazu Kameoka, Yosuke Izumi, Nobutaka Ono, Shigeki Sagayama:
A sparse component model of source signals and its application to blind source separation. ICASSP 2010: 4122-4125 - [c117]Yushi Ueda, Yuuki Uchiyama, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
HMM-based approach for automatic chord detection using refined acoustic features. ICASSP 2010: 5518-5521 - [c116]Jun Wu, Yu Kitano, Stanislaw Andrzej Raczynski, Shigeki Miyabe, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Musical instrument identification based on harmonic temporal timbre features. SAPA@INTERSPEECH 2010: 7-12 - [c115]Halfdan Rump, Shigeki Miyabe, Emiru Tsunoo, Nobutaka Ono, Shigeki Sagayama:
Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation. ISMIR 2010: 87-92 - [c114]Stanislaw Andrzej Raczynski, Emmanuel Vincent, Frédéric Bimbot, Shigeki Sagayama:
Multiple Pitch Transcription using DBN-based Musicological Models. ISMIR 2010: 363-368 - [c113]Kazuma Murao, Masahiro Nakano, Yu Kitano, Nobutaka Ono, Shigeki Sagayama:
Monophonic Instrument Sound Segregation by Clustering NMF Components Based on Basis Similarity and Gain Disjointness. ISMIR 2010: 375-380 - [c112]Emmanuel Vincent, Stanislaw Andrzej Raczynski, Nobutaka Ono, Shigeki Sagayama:
A Roadmap Towards Versatile MIR. ISMIR 2010: 662-664 - [c111]Jun Wu, Yu Kitano, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Flexible Harmonic Temporal Structure for Modeling Musical Instrument. ICEC 2010: 416-418 - [c110]Miquel Espi, Shigeki Miyabe, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Analysis on speech characteristics for robust voice activity detection. SLT 2010: 151-156 - [c109]Takuho Nakano, Shigeki Sagayama, Nobutaka Ono, Akisato Kimura, Hirokazu Kameoka, Kunio Kashino:
Semantic Indexing and Known Item Search Based on a Unified Model with Topic Transition Representation. TRECVID 2010 - [p2]Nobutaka Ono, Kenichi Miyamoto, Hirokazu Kameoka, Jonathan Le Roux, Yuuki Uchiyama, Emiru Tsunoo, Takuya Nishimoto, Shigeki Sagayama:
Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks. Advances in Music Information Retrieval 2010: 213-236
2000 – 2009
- 2009
- [c108]Stanislaw Andrzej Raczynski, Nobutaka Ono, Shigeki Sagayama:
Extending Nonnegative Matrix Factorization - A discussion in the context of multiple frequency estimation of musical signals. EUSIPCO 2009: 934-938 - [c107]Emiru Tsunoo, Nobutaka Ono, Shigeki Sagayama:
Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals. ICASSP 2009: 185-188 - [c106]Hirokazu Kameoka, Nobutaka Ono, Kunio Kashino, Shigeki Sagayama:
Complex NMF: A new sparse representation for acoustic signals. ICASSP 2009: 3437-3440 - [c105]Emiru Tsunoo, George Tzanetakis, Nobutaka Ono, Shigeki Sagayama:
Audio genre classification using percussive pattern clustering combined with timbral features. ICME 2009: 382-385 - [c104]Yosuke Izumi, Kenta Nishiki, Shinji Watanabe, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Stereo-input speech recognition using sparseness-based time-frequency masking in a reverberant environment. INTERSPEECH 2009: 1955-1958 - [c103]Emiru Tsunoo, Nobutaka Ono, Shigeki Sagayama:
Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification. ISMIR 2009: 219-224 - [c102]Jeremy Reed, Yushi Ueda, Sabato Marco Siniscalchi, Yuuki Uchiyama, Shigeki Sagayama, Chin-Hui Lee:
Minimum Classification Error Training to Improve Isolated Chord Recognition. ISMIR 2009: 609-614 - [c101]Satoru Fukayama, Kei Nakatsuma, Shinji Sako, Yuichiro Yonebayashi, Tae Hun Kim, Si Wei Qin, Takuho Nakano, Takuya Nishimoto, Shigeki Sagayama:
Orpheus: Automatic Composition System Considering Prosody of Japanese Lyrics. ICEC 2009: 309-310 - [c100]Stanislaw Andrzej Raczynski, Nobutaka Ono, Shigeki Sagayama:
Note detection with dynamic bayesian networks as a postanalysis step for NMF-based multiple pitch estimation techniques. WASPAA 2009: 49-52 - [c99]Nobutaka Ono, Hitoshi Kohno, Nobutaka Ito, Shigeki Sagayama:
Blind alignment of asynchronously recorded signals for distributed microphone array. WASPAA 2009: 161-164 - 2008
- [j19]Nobutaka Ono, Souichiro Fukamachi, Shigeki Sagayama:
Sound Source Localization with Front-Back Judgement by Two Microphones Asymmetrically Mounted on a Sphere. J. Multim. 3(3): 1-9 (2008) - [j18]Shoichiro Saito, Hirokazu Kameoka, Keigo Takahashi, Takuya Nishimoto, Shigeki Sagayama:
Specmurt Analysis of Polyphonic Music Signals. IEEE Trans. Speech Audio Process. 16(3): 639-650 (2008) - [c98]Nobutaka Ono, Kenichi Miyamoto, Jonathan Le Roux, Hirokazu Kameoka, Shigeki Sagayama:
Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram. EUSIPCO 2008: 1-4 - [c97]Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama:
Auxiliary function approach to parameter estimation of constrained sinusoidal model for monaural speech separation. ICASSP 2008: 29-32 - [c96]Kenichi Miyamoto, Hirokazu Kameoka, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama:
Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of multi-instrument polyphonic music signals. ICASSP 2008: 113-116 - [c95]Nobutaka Ito, Nobutaka Ono, Shigeki Sagayama:
A blind noise decorrelation approach with crystal arrays on designing post-filters for diffuse noise suppression. ICASSP 2008: 317-320 - [c94]Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama, Alain de Cheveigné:
Modulation analysis of speech through orthogonal FIR filterbank optimization. ICASSP 2008: 4189-4192 - [c93]Ikumi Ota, Ryo Yamamoto, Takuya Nishimoto, Shigeki Sagayama:
On-line handwritten Kanji string recognition based on grammar description of character structures. ICPR 2008: 1-5 - [c92]Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama:
Computational auditory induction by missing-data non-negative matrix factorization. SAPA@INTERSPEECH 2008: 1-6 - [c91]Jonathan Le Roux, Nobutaka Ono, Shigeki Sagayama:
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction. SAPA@INTERSPEECH 2008: 23-28 - [c90]Nobutaka Ono, Kenichi Miyamoto, Hirokazu Kameoka, Shigeki Sagayama:
A Real-time Equalizer of Harmonic and Percussive Components in Music Signals. ISMIR 2008: 139-144 - 2007
- [j17]Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering. IEEE Trans. Speech Audio Process. 15(3): 982-994 (2007) - [j16]Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama:
Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments. IEEE Trans. Speech Audio Process. 15(4): 1135-1145 (2007) - [c89]Kenichi Miyamoto, Hirokazu Kameoka, Haruto Takeda, Takuya Nishimoto, Shigeki Sagayama:
Probabilistic Approach to Automatic Music Transcription from Audio Signals. ICASSP (2) 2007: 697-700 - [c88]Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama:
Harmonic-Temporal Clustering of Speech for Single and Multiple F0 Contour Estimation in Noisy Environments. ICASSP (4) 2007: 1053-1056 - [c87]Haruto Takeda, Takuya Nishimoto, Shigeki Sagayama:
Rhythm and Tempo Analysis Toward Automatic Music Transcription. ICASSP (4) 2007: 1317-1320 - [c86]Ikumi Ota, Ryo Yamamoto, Shinji Sako, Shigeki Sagayama:
Online Handwritten Kanji Recognition Based on Inter-stroke Grammar. ICDAR 2007: 1188-1192 - [c85]Yuichiro Yonebayashi, Hirokazu Kameoka, Shigeki Sagayama:
Automatic Decision of Piano Fingering Based on Hidden Markov Models. IJCAI 2007: 2915-2921 - [c84]Stanislaw Andrzej Raczynski, Nobutaka Ono, Shigeki Sagayama:
Multipitch Analysis with Harmonic Nonnegative Matrix Approximation. ISMIR 2007: 381-386 - [c83]Nobutaka Ono, Souichiro Fukamachi, Takuya Nishimoto, Shigeki Sagayama:
Sound Source Localization by Asymmetrically Arrayed 2ch Microphones on a Sphere. MMSP 2007: 56-59 - 2006
- [c82]Takuya Nishimoto, Shinji Sako, Shigeki Sagayama, Kazue Ohshima, Koichi Oda, Takayuki Watanabe:
Effect of Learning on Listening to Ultra-Fast Synthesized Speech. EMBC 2006: 5691-5694 - [c81]Chandra Kant Raut, Takuya Nishimoto, Shigeki Sagayama:
Model Adaptation for Long Convolutional Distortion by Maximum Likelihood Based State Filtering Approach. ICASSP (1) 2006: 1133-1136 - [c80]Hirokazu Kameoka, Jonathan Le Roux, Nobutaka Ono, Shigeki Sagayama:
Speech analyzer using a joint estimation model of spectral envelope and fine structure. INTERSPEECH 2006 - 2005
- [c79]Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
Audio stream segregation of multi-pitch music signal based on time-space clustering using Gaussian kernel 2-dimensional model. ICASSP (3) 2005: 5-8 - [c78]Chandra Kant Raut, Takuya Nishimoto, Shigeki Sagayama:
Model adaptation by state splitting of HMM for long reverberation. INTERSPEECH 2005: 277-280 - [c77]Shoichiro Saito, Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
Specmurt Analysis of Multi-Pitch Music Signals with Adaptive Estimation of Common Harmonic Structure. ISMIR 2005: 84-91 - [c76]Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
Harmonic-Temporal Clustering via Deterministic Annealing EM Algorithm for Audio Feature Extraction. ISMIR 2005: 115-122 - 2004
- [c75]Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama:
Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds. ICASSP (4) 2004: 297-300 - [c74]Shigeki Sagayama, Keigo Takahashi, Hirokazu Kameoka, Takuya Nishimoto:
Specmurt anasylis: a piano-roll-visualization of polyphonic music signal by deconvolution of log-frequency spectrum. SAPA@INTERSPEECH 2004: 128 - [c73]Shigeki Sagayama, Okajima Takashi, Yutaka Kamamoto, Takuya Nishimoto:
Complex spectrum circle centroid for microphone-array-based noisy speech recognition. INTERSPEECH 2004: 825-828 - [c72]Takuya Nishimoto, Shigeki Sagayama, Hirokazu Kameoka:
Multi-pitch trajectory estimation of concurrent speech based on harmonic GMM and nonlinear kalman filtering. INTERSPEECH 2004: 2433-2436 - [c71]Chandra Kant Raut, Takuya Nishimoto, Shigeki Sagayama:
Model composition by lagrange polynomial approximation for robust speech recognition in noisy environment. INTERSPEECH 2004: 2809-2812 - [c70]Haruto Takeda, Takuya Nishimoto, Shigeki Sagayama:
Rhythm and Tempo Recognition of Music Performance from a Probabilistic Approach. ISMIR 2004 - [p1]Shinichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama:
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents. Life-like characters 2004: 187-212 - 2003
- [c69]Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Generation of Hierarchical Dictionary for Stroke-order Free Kanji Handwriting Recognition Based on Substroke HMM. ICDAR 2003: 514-518 - [c68]Hiroshi Shimodaira, Takashi Sudo, Mitsuru Nakai, Shigeki Sagayama:
On-line Overlaid-Handwriting Recognition Based on Substroke HMMs. ICDAR 2003: 1043-1047 - [c67]Haruto Takeda, Takuya Nishimoto, Shigeki Sagayama:
Automatic rhythm transcription from multiphonic MIDI signals. ISMIR 2003 - 2002
- [c66]Haruto Takeda, Naoki Saito, Tomoshi Otsuki, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Hidden Markov model for automatic transcription of MIDI signals. IEEE Workshop on Multimedia Signal Processing 2002: 428-431 - [c65]Hiroshi Shimodaira, Nobuyoshi Sakai, Mitsuru Nakai, Shigeki Sagayama:
Jacobian joint adaptation to noise, channel and vocal tract length. ICASSP 2002: 197-200 - [c64]Junko Tokuno, Nobuhito Inami, Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Context-dependent substroke model for HMM-based on-line handwriting recognition. IWFHR 2002: 78-83 - [c63]Mitsuru Nakai, Takashi Sudo, Hiroshi Shimodaira, Shigeki Sagayama:
Pen Pressure Features for Writer-Independent On-Line Handwriting Recognition Based on Substroke HMM. ICPR (3) 2002: 220-223 - 2001
- [c62]Katsuhisa Fujinaga, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Multiple-regression hidden Markov model. ICASSP 2001: 513-516 - [c61]Mitsuru Nakai, Naoto Akira, Hiroshi Shimodaira, Shigeki Sagayama:
Substroke Approach to HMM-Based On-line Kanji Handwriting Recognition. ICDAR 2001: 491-495 - [c60]Hiroshi Shimodaira, Ken-ichi Noma, Mitsuru Nakai, Shigeki Sagayama:
Support vector machine with dynamic time-alignment kernel for speech recognition. INTERSPEECH 2001: 1841-1844 - [c59]Hiroshi Shimodaira, Ken-ichi Noma, Mitsuru Nakai, Shigeki Sagayama:
Dynamic Time-Alignment Kernel in Support Vector Machine. NIPS 2001: 921-928 - 2000
- [j15]Satoshi Takahashi, Shigeki Sagayama:
Speaker adaptation of acoustic models using correlations of training transfer vectors. Syst. Comput. Jpn. 31(14): 74-82 (2000) - [c58]Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Asynchronous-transition HMM. ICASSP 2000: 1005-1008 - [c57]Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Feature-dependent allophone clustering. INTERSPEECH 2000: 413-416 - [c56]Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano:
Free software toolkit for Japanese large vocabulary continuous speech recognition. INTERSPEECH 2000: 476-479 - [c55]Hiroshi Shimodaira, Yutaka Kato, Toshihiko Akae, Mitsuru Nakai, Shigeki Sagayama:
Jacobian adaptation of HMM with initial model selection for noisy speech recognition. INTERSPEECH 2000: 1003-1006 - [c54]Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Ito, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee:
IPA Japanese Dictation Free Software Project. LREC 2000
1990 – 1999
- 1999
- [j14]Osamu Yoshioka, Kazuhiro Arai, Noboru Sugamura, Shigeki Sagayama:
An address data entry system with a multimodal interface including speech recognition. Syst. Comput. Jpn. 30(9): 64-73 (1999) - 1998
- [c53]Shoichi Matsunaga, Shigeki Sagayama:
Two-step generation of variable-word-length language model integrating local and global constraints. ICASSP 1998: 697-700 - 1997
- [j13]Jun-ichi Takahashi, Shigeki Sagayama:
Vector-field-smoothed Bayesian learning for fast and incremental speaker/telephone-channel adaptation. Comput. Speech Lang. 11(2): 127-146 (1997) - [j12]Kazuo Hakoda, Mikio Kitai, Shigeki Sagayama:
Speech recognition and synthesis technology development at NTT for telecommunications services. Int. J. Speech Technol. 2(2): 145-153 (1997) - [j11]Mikio Kitai, Kazuo Hakoda, Shigeki Sagayama, Tomokazu Yamada, Hajime Tsukada, Satoshi Takahashi, Yoshiaki Noda, Jun-ichi Takahashi, Yuki Yoshida, Kazuhiro Arai, Takashi Imoto, Tomohisa Hirokawa:
ASR and TTS telecommunications applications in Japan. Speech Commun. 23(1-2): 17-30 (1997) - [c52]Shigeki Sagayama, Yoshikazu Yamaguchi, Satoshi Takahashi, Jun-ichi Takahashi:
Jacobian approach to fast acoustic model adaptation. ICASSP 1997: 835-838 - [c51]Satoshi Takahashi, Kiyoaki Aikawa, Shigeki Sagayama:
Discrete mixture HMM. ICASSP 1997: 971-974 - [c50]Shigeru Homma, Kiyoaki Aikawa, Shigeki Sagayama:
Improved estimation of supervision in unsupervised speaker adaptation. ICASSP 1997: 1023-1026 - [c49]Yoshikazu Yamaguchi, Satoshi Takahashi, Shigeki Sagayama:
Fast adaptation of acoustic models to environmental noise using Jacobian adaptation algorithm. EUROSPEECH 1997: 2051-2054 - [c48]Shoichi Matsunaga, Shigeki Sagayama:
Variable-length language modeling integrating global constraints. EUROSPEECH 1997: 2719-2722 - 1996
- [j10]Tetsuo Kosaka, Shoichi Matsunaga, Shigeki Sagayama:
Speaker-independent speech recognition based on tree-structured speaker clustering. Comput. Speech Lang. 10(1): 55-74 (1996) - [j9]Jun-ichi Takami, Shigeki Sagayama:
A speaker-adaptation technique for context-dependent models represented by hidden Markov networks. Syst. Comput. Jpn. 27(2): 75-86 (1996) - [c47]Satoshi Takahashi, Shigeki Sagayama:
Tied-structure HMM based on parameter correlation for efficient model training. ICASSP 1996: 467-470 - [c46]Jun-ichi Takahashi, Shigeki Sagayama:
Minimum classification error training for a small amount of data enhanced by vector-field-smoothed Bayesian learning. ICASSP 1996: 597-600 - [c45]Shigeru Homma, Jun-ichi Takahashi, Shigeki Sagayama:
Iterative unsupervised speaker adaptation for batch dictation. ICSLP 1996: 1141-1144 - [c44]Tomokazu Yamada, Shigeki Sagayama:
LR-parser-driven Viterbi search with hypotheses merging mechanism using context-dependent phone models. ICSLP 1996: 2103-2106 - 1995
- [j8]Tetsuo Kosaka, Shigeki Sagayama:
Automatic Determination of the Number of Mixture Components for Continuous HMMs Based on a Uniform Variance Criterion. IEICE Trans. Inf. Syst. 78-D(6): 642-647 (1995) - [j7]Ryosuke Isotani, Shoichi Matsunaga, Shigeki Sagayama:
Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams. IEICE Trans. Inf. Syst. 78-D(6): 692-697 (1995) - [j6]Kouichi Yamaguchi, Harald Singer, Shoichi Matsunaga, Shigeki Sagayama:
Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition. IEICE Trans. Inf. Syst. 78-D(6): 719-724 (1995) - [j5]Yasunaga Miyazawa, Jun-ichi Takami, Shigeki Sagayama, Shoichi Matsunaga:
Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network. IEICE Trans. Inf. Syst. 78-D(8): 1044-1050 (1995) - [j4]Jun-ichi Takahashi, Noboru Sugamura, Tomohisa Hirokawa, Shigeki Sagayama, Sadaoki Furui:
Interactive voice technology development for telecommunications applications. Speech Commun. 17(3-4): 287-301 (1995) - [c43]Shigeki Sagayama, Satoshi Takahashi:
On the use of scalar quantization for fast HMM computation. ICASSP 1995: 213-216 - [c42]Satoshi Takahashi, Shigeki Sagayama:
Four-level tied-structure for efficient representation of acoustic modeling. ICASSP 1995: 520-523 - [c41]Jun-ichi Takahashi, Shigeki Sagayama:
Vector-field-smoothed Bayesian learning for incremental speaker adaptation. ICASSP 1995: 696-699 - [c40]Takatoshi Jitsuhiro, Tomokazu Yamada, Shigeki Sagayama:
Syllabic duration control for vocabulary-free speech recognition. EUROSPEECH 1995: 15-18 - [c39]Yoshiaki Noda, Shigeki Sagayama:
Fast and accurate beam search using forward heuristic functions in HMM-LR speech recognition. EUROSPEECH 1995: 913-916 - 1994
- [c38]Tetsuo Kosaka, Shigeki Sagayama:
Tree-structured speaker clustering for fast speaker adaptation. ICASSP (1) 1994: 245-248 - [c37]Yasunaga Miyazawa, Jun-ichi Takami, Shigeki Sagayama, Shoichi Matsunaga:
All-phoneme ergodic hidden Markov network for unsupervised speaker adaptation. ICASSP (1) 1994: 249-252 - [c36]Kouichi Yamaguchi, Harald Singer, Shoichi Matsunaga, Shigeki Sagayama:
Speaker-consistent parsing for speaker-independent continuous speech recognition. ICSLP 1994: 791-794 - [c35]Jun-ichi Takahashi, Shigeki Sagayama:
Telephone line characteristic adaptation using vector field smoothing technique. ICSLP 1994: 991-994 - [c34]Tetsuo Kosaka, Shoichi Matsunaga, Shigeki Sagayama:
Tree-structured speaker clustering for speaker-independent continuous speech recognition. ICSLP 1994: 1375-1378 - 1993
- [j3]Yasuhiro Komori, Shigeki Sagayama, Alexander H. Waibel:
A neural fuzzy training approach for improving speech recognition. Syst. Comput. Jpn. 24(8): 82-94 (1993) - [j2]Kazuki Katagishi, Harald Singer, Kiyoaki Aikawa, Shigeki Sagayama:
Feature extraction using a matrix coefficient filter for speech recognition. Speech Commun. 13(3-4): 297-306 (1993) - [j1]Harald Singer, Shigeki Sagayama:
Suprasegmental duration control with matrix parsing in continuous speech recognition. Speech Commun. 13(3-4): 315-322 (1993) - [c33]Akito Nagai, Kouichi Yamaguchi, Shigeki Sagayama, Akira Kurematsu:
ATREUS: a comparative study of continuous speech recognition systems at ATR. ICASSP (2) 1993: 139-142 - [c32]Harald Singer, Shigeki Sagayama:
Matrix parser and its application to HMM-based speech recognition. ICASSP (2) 1993: 295-298 - [c31]Tetsuo Kosaka, Jun-ichi Takami, Shigeki Sagayama:
Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition. ICASSP (2) 1993: 570-573 - [c30]Gen-ichiro Kikui, Mark Seligman, Toshiyuki Takezawa, Masami Suzuki, Kenji Kita, Tsuyoshi Morimoto, Masaaki Nagata, Toshihisa Tashiro, Herbert S. Tropf, Shigeki Sagayama, Jun-ichi Takami, Kazumi Ohkura, Akira Kurematsu:
Spoken Language Translation System. IJCAI 1993: 1705 - [c29]Tetsuo Kosaka, Edward Willems, Jun-ichi Takami, Shigeki Sagayama:
A dynamic approach to speaker adaptation of hidden Markov networks for speech recognition. EUROSPEECH 1993: 363-366 - [c28]Shigeki Sagayama, Jun-ichi Takami, Akito Nagai, Harald Singer, Kouichi Yamaguchi, Kazumi Ohkura, Kenji Kita, Akira Kurematsu:
ATREUS: a speech recognition front-end for a speech translation system. EUROSPEECH 1993: 1287-1290 - [c27]Tsuyoshi Morimoto, Toshiyuki Takezawa, Fumihiro Yato, Shigeki Sagayama, Toshihisa Tashiro, Masaaki Nagata, Akira Kurematsu:
ATR's speech translation system: ASURA. EUROSPEECH 1993: 1291-1294 - [c26]Jin'ichi Murakami, Hiroki Yamatomo, Shigeki Sagayama:
The possibility for acquisition of statistical network grammar using ergodic HMM. EUROSPEECH 1993: 1327-1330 - [c25]Ryosuke Isotani, Shigeki Sagayama:
Speech recognition using particle n-grams and content-word n-grams. EUROSPEECH 1993: 1955-1958 - 1992
- [c24]Harald Singer, Shigeki Sagayama:
Pitch dependent phone modelling for HMM based speech recognition. ICASSP 1992: 273-276 - [c23]Jun-ichi Takami, Shigeki Sagayama:
A successive state splitting algorithm for efficient allophone modeling. ICASSP 1992: 573-576 - [c22]David Rainton, Shigeki Sagayama:
Appropriate error criterion selection for continuous speech HMM minimum error training. ICSLP 1992: 233-236 - [c21]Akito Nagai, Kenji Kita, Toshiyuki Hanazawa, Tadashi Suzuki, Tomohiro Iwasaki, Tsuyoshi Kawabata, Kunio Nakajima, Kiyohiro Shikano, Tsuyoshi Morimoto, Shigeki Sagayama, Akira Kurematsu:
Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition. ICSLP 1992: 237-240 - [c20]Kouichi Yamaguchi, Shigeki Sagayama, Kenji Kita, Frank K. Soong:
Continuous mixture HMM-LR using the A* algorithm for continuous speech recognition. ICSLP 1992: 301-304 - [c19]Kenji Kita, Tsuyoshi Morimoto, Kazumi Ohkura, Shigeki Sagayama:
Continuously spoken sentence recognition by HMM-LR. ICSLP 1992: 305-308 - [c18]Kazumi Ohkura, Masahide Sugiyama, Shigeki Sagayama:
Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs. ICSLP 1992: 369-372 - [c17]Hiroaki Hattori, Shigeki Sagayama:
Vector field smoothing principle for speaker adaptation. ICSLP 1992: 381-384 - [c16]Tsuyoshi Morimoto, Toshiyuki Takezawa, Kazumi Ohkura, Masaaki Nagata, Fumihiro Yato, Shigeki Sagayama, Akira Kurematsu:
Enhancement of ATR's spoken language translation system: SL-TRANS2. ICSLP 1992: 397-400 - [c15]Akito Nagai, Jun-ichi Takami, Shigeki Sagayama:
The SSS-LR continuous speech recognition system: integrating SSS-derived allophone models and a phoneme-context-dependent LR parser. ICSLP 1992: 1511-1514 - 1991
- [c14]Masami Nakamura, Shinichi Tamura, Shigeki Sagayama:
Phoneme recognition by phoneme filter neural networks. ICASSP 1991: 85-88 - [c13]Jun-ichi Takami, Shigeki Sagayama:
A pairwise discriminant approach to robust phoneme recognition by time-delay neural networks. ICASSP 1991: 89-92 - [c12]Shigeki Sagayama:
A matrix representation of HMM-based speech recognition algorithms. EUROSPEECH 1991: 1225-1228 - [c11]Akito Nagai, Shigeki Sagayama, Kenji Kita:
Phoneme-context-dependent LR parsing algorithms for HMM-based continuous speech recognition. EUROSPEECH 1991: 1397-1400 - 1990
- [c10]Shoichi Matsunaga, Shigeki Sagayama, Shigeru Homma, Sadaoki Furui:
A continuous speech recognition system based on a two-level grammar approach. ICASSP 1990: 589-592 - [c9]Hiroaki Hattori, Satoshi Nakamura, Kiyohiro Shikano, Shigeki Sagayama:
Speaker weighted training of HMM using multiple reference speakers. ICSLP 1990: 149-152 - [c8]Masanobu Abe, Shigeki Sagayama:
Statistical study on voice individuality conversion across different languages. ICSLP 1990: 157-160 - [c7]Shigeki Sagayama, Shigeru Homma:
Estimation of unknown context using a phoneme environment clustering algorithm. ICSLP 1990: 361-364 - [c6]Fikret S. Gürgen, Shigeki Sagayama, Sadaoki Furui:
Line spectrum pair frequency-based distance measures for speech recognition. ICSLP 1990: 521-524 - [c5]Satoshi Takahashi, Shoichi Matsunaga, Shigeki Sagayama:
Isolated word recognition using pitch pattern information. ICSLP 1990: 553-556 - [c4]Jun-ichi Takami, Shigeki Sagayama:
Phoneme recognition by pairwise discriminant TDNNs. ICSLP 1990: 677-680 - [c3]Shoichi Matsunaga, Shigeki Sagayama:
Sentence speech recognition using semantic dependency analysis. ICSLP 1990: 929-932
1980 – 1989
- 1989
- [c2]Shigeki Sagayama:
Phoneme environment clustering for speech recognition. ICASSP 1989: 397-400 - 1986
- [c1]Shigeki Sagayama, Fumitada Itakura:
Duality theory of composite sinusoidal modeling and linear prediction. ICASSP 1986: 1261-1264
last updated on 2024-09-02 00:06 CEST by the dblp team
all metadata released as open data under CC0 1.0 license