Yosuke Kashiwagi

According to our database, Yosuke Kashiwagi authored at least 26 papers between 2012 and 2023.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2023

Phoneme-aware Encoding for Prefix-tree-based Contextual ASR.

,

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

CoRR, 2023

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.

,

,

,

,

Roshan S. Sharma

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

CoRR, 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation.

,

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

CoRR, 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition.

,

,

Yosuke Kashiwagi

,

,

Shinji Watanabe

CoRR, 2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding.

,

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

CoRR, 2023

E-Branchformer-Based E2E SLU Toward STOP On-Device Challenge.

Yosuke Kashiwagi

,

,

,

,

,

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Joint Speech Recognition and Disfluency Detection.

,

,

Kentaro Shibata

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

The Pipeline System of ASR and NLU with MLM-based Data Augmentation Toward STOP Low-Resource Challenge.

,

,

,

,

Yosuke Kashiwagi

,

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward STOP Quality Challenge.

,

,

,

,

,

Yosuke Kashiwagi

,

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Residual Language Model for End-to-end Speech Recognition.

,

Yosuke Kashiwagi

,

Chaitanya Prasad Narisetty

,

Shinji Watanabe

Proceedings of the Interspeech 2022, 2022

Run-and-Back Stitch Search: Novel Block Synchronous Decoding For Streaming Encoder-Decoder ASR.

,

Chaitanya Narisetty

,

Michael Hentschel

,

Yosuke Kashiwagi

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models.

,

Yosuke Kashiwagi

,

Shusuke Takahashi

Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Speech Recognition and Audio Captioning.

Chaitanya Narisetty

,

,

,

Yosuke Kashiwagi

,

Michael Hentschel

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Streaming Transformer ASR with Blockwise Synchronous Beam Search.

,

Yosuke Kashiwagi

,

Shinji Watanabe

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data Augmentation Methods for End-to-End Speech Recognition on Distant-Talk Scenarios.

,

Kentaro Shibata

,

Chaitanya Narisetty

,

Yosuke Kashiwagi

,

Shinji Watanabe

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition.

Yosuke Kashiwagi

,

,

Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Streaming Transformer ASR with Blockwise Synchronous Inference.

,

Yosuke Kashiwagi

,

Shinji Watanabe

CoRR, 2020

2019

Towards Online End-to-end Transformer Automatic Speech Recognition.

,

Yosuke Kashiwagi

,

Toshiyuki Kumakura

,

Shinji Watanabe

CoRR, 2019

End-to-End Adaptation with Backpropagation Through WFST for On-Device Speech Recognition System.

,

Yosuke Kashiwagi

,

Satoshi Asakawa

,

Toshiyuki Kumakura

Proceedings of the Interspeech 2019, 2019

Transformer ASR with Contextual Block Processing.

,

Yosuke Kashiwagi

,

Toshiyuki Kumakura

,

Shinji Watanabe

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2016

Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners.

,

Yosuke Kashiwagi

,

,

,

Yutaka Yamauchi

,

,

Nobuaki Minematsu

Proceedings of the Interspeech 2016, 2016

Divergence estimation based on deep neural networks and its use for language identification.

Yosuke Kashiwagi

,

,

,

Nobuaki Minematsu

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint.

,

Yosuke Kashiwagi

,

Nobuaki Minematsu

,

,

Keikichi Hirose

Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

2014

Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition.

,

,

Yosuke Kashiwagi

,

Nobuaki Minematsu

,

Keikichi Hirose

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition.

Yosuke Kashiwagi

,

,

Nobuaki Minematsu

,

Keikichi Hirose

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition.

Yosuke Kashiwagi

,

Masayuki Suzuki

,

Nobuaki Minematsu

,

Keikichi Hirose

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
