Yosuke Kashiwagi

According to our database1, Yosuke Kashiwagi authored at least 33 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens.
CoRR, 2024

Task Arithmetic for Language Expansion in Speech Translation.
CoRR, 2024

Decoder-only Architecture for Streaming End-to-end Speech Recognition.
CoRR, 2024

Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting.
CoRR, 2024

Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model.
CoRR, 2024

UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Phoneme-Aware Encoding for Prefix-Tree-Based Contextual ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.
CoRR, 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation.
CoRR, 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Tensor decomposition for minimization of E2E SLU model toward on-device processing.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Joint Speech Recognition and Disfluency Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Residual Language Model for End-to-end Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Run-and-Back Stitch Search: Novel Block Synchronous Decoding For Streaming Encoder-Decoder ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Speech Recognition and Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Streaming Transformer Asr With Blockwise Synchronous Beam Search.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data Augmentation Methods for End-to-End Speech Recognition on Distant-Talk Scenarios.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Streaming Transformer ASR with Blockwise Synchronous Inference.
CoRR, 2020

2019
Towards Online End-to-end Transformer Automatic Speech Recognition.
CoRR, 2019

End-to-End Adaptation with Backpropagation Through WFST for On-Device Speech Recognition System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Transformer ASR with Contextual Block Processing.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2016
Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Divergence estimation based on deep neural networks and its use for language identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

2014
Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012


  Loading...