Emiru Tsunoo

According to our database1, Emiru Tsunoo authored at least 41 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens.
CoRR, 2024

Task Arithmetic for Language Expansion in Speech Translation.
CoRR, 2024

Decoder-only Architecture for Streaming End-to-end Speech Recognition.
CoRR, 2024

Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting.
CoRR, 2024

Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model.
CoRR, 2024

UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Phoneme-Aware Encoding for Prefix-Tree-Based Contextual ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.
CoRR, 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation.
CoRR, 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Tensor decomposition for minimization of E2E SLU model toward on-device processing.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Joint Speech Recognition and Disfluency Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Joint Modelling of Spoken Language Understanding Tasks with Integrated Dialog History.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Residual Language Model for End-to-end Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Run-and-Back Stitch Search: Novel Block Synchronous Decoding For Streaming Encoder-Decoder ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Speech Recognition and Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Polyphone Disambiguation and Accent Prediction Using Pre-Trained Language Models in Japanese TTS Front-End.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.
CoRR, 2021

Streaming Transformer Asr With Blockwise Synchronous Beam Search.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data Augmentation Methods for End-to-End Speech Recognition on Distant-Talk Scenarios.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Making Punctuation Restoration Robust and Fast with Multi-Task Learning and Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Streaming Transformer ASR with Blockwise Synchronous Inference.
CoRR, 2020

2019
Towards Online End-to-end Transformer Automatic Speech Recognition.
CoRR, 2019

End-to-End Adaptation with Backpropagation Through WFST for On-Device Speech Recognition System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Transformer ASR with Contextual Block Processing.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2017
Hierarchical Recurrent Neural Network for Story Segmentation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2011
Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

2010
Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks.
Proceedings of the Advances in Music Information Retrieval, 2010

Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Music mood classification by rhythm and bass-line unit pattern analysis.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Audio genre classification using percussive pattern clustering combined with timbral features.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals.
Proceedings of the IEEE International Conference on Acoustics, 2009


  Loading...