Emiru Tsunoo

Chaitanya Prasad Narisetty

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Run-and-Back Stitch Search: Novel Block Synchronous Decoding For Streaming Encoder-Decoder ASR.

Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.

Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Speech Recognition and Audio Captioning.

Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.

Proceedings of the IEEE International Conference on Acoustics, 2022

Polyphone Disambiguation and Accent Prediction Using Pre-Trained Language Models in Japanese TTS Front-End.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.

CoRR, 2021

Streaming Transformer ASR With Blockwise Synchronous Beam Search.

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data Augmentation Methods for End-to-End Speech Recognition on Distant-Talk Scenarios.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2021

Making Punctuation Restoration Robust and Fast with Multi-Task Learning and Knowledge Distillation.

Michael Hentschel

Takao Okuda

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Streaming Transformer ASR with Blockwise Synchronous Inference.

CoRR, 2020

2019

Towards Online End-to-end Transformer Automatic Speech Recognition.

CoRR, 2019

End-to-End Adaptation with Backpropagation Through WFST for On-Device Speech Recognition System.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Transformer ASR with Contextual Block Processing.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2017

Hierarchical Recurrent Neural Network for Story Segmentation.

Peter Bell

Steve Renals

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2011

Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines.

IEEE ACM Trans. Audio Speech Lang. Process., 2011

2010

Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks.

Proceedings of the Advances in Music Information Retrieval, 2010

Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation.

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Music mood classification by rhythm and bass-line unit pattern analysis.

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification.

Nobutaka Ono

Shigeki Sagayama

Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Audio genre classification using percussive pattern clustering combined with timbral features.

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals.
