Keyu An

Orcid: 0000-0003-0040-0883

According to our database, Keyu An authored at least 19 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study.
CoRR, 2024

Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition.
CoRR, 2024

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs.
CoRR, 2024

2023
Analysis of Omni-Channel Evolution Game Strategy for E-Commerce Enterprises in the Context of Online and Offline Integration.
Syst., July, 2023

Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures.
CoRR, 2023

Exploring RWKV for Memory Efficient and Low Latency Streaming ASR.
CoRR, 2023

BAT: Boundary aware transducer for memory-efficient and low-latency ASR.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Dynamic Research on Three-Player Evolutionary Game in Waste Product Recycling Supply Chain System.
Syst., 2022

Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study.
CoRR, 2022

Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

An Empirical Study of Language Model Integration for Transducer based Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Efficient Neural Architecture Search for End-to-End Speech Recognition Via Straight-Through Gradients.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

The SLT 2021 Children Speech Recognition Challenge: Open Datasets, Rules and Baselines.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Deformable TDNN with Adaptive Receptive Fields for Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multilingual and Crosslingual Speech Recognition Using Phonological-Vector Based Phone Embeddings.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
CAT: A CTC-CRF Based ASR Toolkit Bridging the Hybrid and the End-to-End Approaches Towards Data Efficiency and Low Latency.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Sequential Deformation for Accurate Scene Text Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
CAT: CRF-based ASR Toolkit.
CoRR, 2019

