Yusuke Kida

According to our database1, Yusuke Kida authored at least 22 papers between 2004 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Target Vocabulary Recognition Based on Multi-Task Learning with Decomposed Teacher Sequences.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Conversation-Oriented ASR with Multi-Look-Ahead CBS Architecture.
Proceedings of the IEEE International Conference on Acoustics, 2023

Neural Diarization with Non-Autoregressive Intermediate Attractors.
Proceedings of the IEEE International Conference on Acoustics, 2023

Mask-CTC-Based Encoder Pre-Training for Streaming End-to-End Speech Recognition.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Tourist Guidance Robot Based on HyperCLOVA.
CoRR, 2022

Multi-sequence Intermediate Conditioning for CTC-based ASR.
CoRR, 2022

Alternate Intermediate Conditioning with Syllable-Level and Character-Level Targets for Japanese ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Better Intermediates Improve CTC Inference.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers.
CoRR, 2021

2019
Simultaneous Detection and Localization of a Wake-Up Word Using Multi-Task Learning of the Duration and Endpoint.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Speaker Selective Beamformer with Keyword Mask Estimation.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

2016
Voice Activity Detection: Merging Source and Filter-based Information.
IEEE Signal Process. Lett., 2016

2010
Using duration and pitch for mandarin digit string recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2006
Evaluation of voice activity detection by combining multiple features with weight adaptation.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Using visual odometry to create 3D maps for online footstep planning.
Proceedings of the IEEE International Conference on Systems, 2005

Online dense local 3D world reconstruction from stereo image sequences.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Voice activity detection based on optimally weighted combination of multiple features.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Minimum Classification Error Interactive Training for Speaker Identification.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
3D map building for a humanoid robot by using visual odometry.
Proceedings of the IEEE International Conference on Systems, 2004

Human finding and body property estimation by using floor segmentation and 3D labelling.
Proceedings of the IEEE International Conference on Systems, 2004


  Loading...