We stand with Ukraine

We stand with Ukraine

Yosuke Kashiwagi

According to our database¹, Yosuke Kashiwagi authored at least 33 papers between 2012 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

2012

2014

2016

2018

2020

2022

2024

0

1

2

3

4

5

6

7

8

9

10

5

2

1

1

2

7

4

3

2

2

1

1

1

1

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

,

,

,

Shinji Watanabe

CoRR, 2024

Task Arithmetic for Language Expansion in Speech Translation.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

,

,

,

Shinji Watanabe

CoRR, 2024

Decoder-only Architecture for Streaming End-to-end Speech Recognition.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

CoRR, 2024

Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

,

,

,

Shinji Watanabe

CoRR, 2024

Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

CoRR, 2024

UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.

[BibT_eX]

[DOI]

,

,

,

,

Roshan S. Sharma

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Phoneme-Aware Encoding for Prefix-Tree-Based Contextual ASR.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.

[BibT_eX]

[DOI]

,

,

,

,

Roshan S. Sharma

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

CoRR, 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

CoRR, 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Tensor decomposition for minimization of E2E SLU model toward on-device processing.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

,

,

,

,

,

,

,

Shinji Watanabe

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

,

,

,

,

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Joint Speech Recognition and Disfluency Detection.

[BibT_eX]

[DOI]

,

,

Kentaro Shibata

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge.

[BibT_eX]

[DOI]

,

,

,

,

Yosuke Kashiwagi

,

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge.

[BibT_eX]

[DOI]

,

,

,

,

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Residual Language Model for End-to-end Speech Recognition.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Chaitanya Prasad Narisetty

,

Shinji Watanabe

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Run-and-Back Stitch Search: Novel Block Synchronous Decoding For Streaming Encoder-Decoder ASR.

[BibT_eX]

[DOI]

,

Chaitanya Narisetty

,

Michael Hentschel

,

Yosuke Kashiwagi

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Shusuke Takahashi

Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Speech Recognition and Audio Captioning.

[BibT_eX]

[DOI]

Chaitanya Narisetty

,

,

,

Yosuke Kashiwagi

,

Michael Hentschel

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Streaming Transformer Asr With Blockwise Synchronous Beam Search.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Shinji Watanabe

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data Augmentation Methods for End-to-End Speech Recognition on Distant-Talk Scenarios.

[BibT_eX]

[DOI]

,

Kentaro Shibata

,

Chaitanya Narisetty

,

Yosuke Kashiwagi

,

Shinji Watanabe

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Streaming Transformer ASR with Blockwise Synchronous Inference.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Shinji Watanabe

CoRR, 2020

2019

Towards Online End-to-end Transformer Automatic Speech Recognition.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Toshiyuki Kumakura

,

Shinji Watanabe

CoRR, 2019

End-to-End Adaptation with Backpropagation Through WFST for On-Device Speech Recognition System.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Satoshi Asakawa

,

Toshiyuki Kumakura

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Transformer ASR with Contextual Block Processing.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Toshiyuki Kumakura

,

Shinji Watanabe

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2016

Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

,

,

Yutaka Yamauchi

,

,

Nobuaki Minematsu

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Divergence estimation based on deep neural networks and its use for language identification.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

,

,

Nobuaki Minematsu

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint.

[BibT_eX]

[DOI]

,

Yosuke Kashiwagi

,

Nobuaki Minematsu

,

,

Keikichi Hirose

Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

2014

Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition.

[BibT_eX]

[DOI]

,

,

Yosuke Kashiwagi

,

Nobuaki Minematsu

,

Keikichi Hirose

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

,

Nobuaki Minematsu

,

Keikichi Hirose

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition.

[BibT_eX]

[DOI]

Yosuke Kashiwagi

,

Masayuki Suzuki

,

Nobuaki Minematsu

,

Keikichi Hirose

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Loading...