Takanori Ashihara

Orcid: 0009-0003-4322-4127

According to our database1, Takanori Ashihara authored at least 36 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding.
CoRR, 2024

Alignment-Free Training for Transducer-based Multi-Talker ASR.
CoRR, 2024

Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings.
CoRR, 2024

Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation.
CoRR, 2024

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling.
CoRR, 2024

Lightweight Zero-shot Text-to-Speech with Mixture of Adapters.
CoRR, 2024

Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over.
CoRR, 2024

Unveiling the Linguistic Capabilities of a Self-Supervised Speech Model Through Cross-Lingual Benchmark and Layer- Wise Similarity Analysis.
IEEE Access, 2024

Probing Self-Supervised Learning Models With Target Speech Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2024

Noise-Robust Zero-Shot Text-to-Speech Synthesis Conditioned on Self-Supervised Speech-Representation Model with Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2024

What Do Self-Supervised Speech and Speaker Models Learn? New Findings from a Cross Model Layer-Wise Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Leveraging Language Embeddings for Cross-Lingual Self-Supervised Speech Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Scheduled Sampling for Neural Transducer-Based ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Large Text Corpora For End-To-End Speech Summarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Zero-Shot Text-to-Speech Synthesis Conditioned Using Self-Supervised Speech Representation Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Summarize While Translating: Universal Model With Parallel Decoding for Summarization and Translation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Hybrid RNN-T/Attention-Based Streaming ASR with Triggered Chunkwise Attention and Dual Internal Language Model Integration.
Proceedings of the IEEE International Conference on Acoustics, 2022

Customer Satisfaction Estimation Using Unsupervised Representation Learning with Multi-Format Prediction Loss.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Investigating the Impact of Spectral and Temporal Degradation on End-to-End Automatic Speech Recognition Performance.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Simpleflat: A Simple Whole-Network Pre-Training Approach for RNN Transducer-Based End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Speech Emotion Recognition Based on Listener Adaptive Models.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Self-Distillation for Improving CTC-Transformer-Based ASR Systems.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Distilling Attention Weights for CTC-Based ASR Systems.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

End-to-End Automatic Speech Recognition with Deep Mutual Learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Neural Whispered Speech Detection with Imbalanced Learning.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Revisiting Dynamic Adjustment of Language Model Scaling Factor for Automatic Speech Recognition.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019


  Loading...