Yosuke Kashiwagi

According to our database, Yosuke Kashiwagi authored at least 26 papers between 2012 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2023
Phoneme-aware Encoding for Prefix-tree-based Contextual ASR.
CoRR, 2023

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.
CoRR, 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation.
CoRR, 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition.
CoRR, 2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding.
CoRR, 2023

E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Joint Speech Recognition and Disfluency Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Residual Language Model for End-to-end Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Run-and-Back Stitch Search: Novel Block Synchronous Decoding For Streaming Encoder-Decoder ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Speech Recognition and Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Streaming Transformer ASR with Blockwise Synchronous Beam Search.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data Augmentation Methods for End-to-End Speech Recognition on Distant-Talk Scenarios.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Streaming Transformer ASR with Blockwise Synchronous Inference.
CoRR, 2020

2019
Towards Online End-to-end Transformer Automatic Speech Recognition.
CoRR, 2019

End-to-End Adaptation with Backpropagation Through WFST for On-Device Speech Recognition System.
Proceedings of the Interspeech 2019, 2019

Transformer ASR with Contextual Block Processing.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2016
Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners.
Proceedings of the Interspeech 2016, 2016

Divergence estimation based on deep neural networks and its use for language identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

2014
Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

