Yuan Shangguan

According to our database¹, Yuan Shangguan authored at least 31 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline: chart of publications per year from 2015 to 2024, broken down by type (book, in proceedings, article, PhD thesis, dataset, other).

Bibliography

2024

Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition.

,

,

,

,

,

Changsheng Zhao

,

,

CoRR, 2024

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs.

Yassir Fathullah

,

,

,

,

,

,

,

,

Christian Fuegen

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model.

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models.

,

,

,

,

Yassir Fathullah

,

,

,

Raghuraman Krishnamoorthi

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

Folding Attention: Memory and Power Optimization for On-Device Transformer-Based Streaming Speech Recognition.

,

,

,

Forrest N. Iandola

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

Prompting Large Language Models with Speech Recognition Abilities.

Yassir Fathullah

,

,

,

,

,

,

,

,

,

,

Christian Fuegen

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data.

Yassir Fathullah

,

,

,

,

,

,

,

Christian Fuegen

,

CoRR, 2023

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition.

,

,

,

Forrest N. Iandola

,

,

,

CoRR, 2023

Towards Selection of Text-to-speech Data to Augment ASR Training.

,

,

,

,

,

,

CoRR, 2023

Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches.

Vinicius Ribeiro

,

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-Head State Space Model for Speech Recognition.

Yassir Fathullah

,

,

,

,

,

,

,

,

,

,

Mark J. F. Gales

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Learning a Dual-Mode Speech Recognition Model VIA Self-Pruning.

,

,

,

,

Raghuraman Krishnamoorthi

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Federated Domain Adaptation for ASR with Full Self-Supervision.

,

,

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Omni-Sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR Via Supernet.

,

,

,

,

,

,

Ganesh Venkatesh

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution.

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Noisy Training Improves E2E ASR for the Edge.

,

,

,

,

,

,

Ganesh Venkatesh

,

,

CoRR, 2021

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios.

,

,

,

,

,

,

,

,

Christian Fuegen

,

Michael L. Seltzer

CoRR, 2021

Alignment Restricted Streaming Recurrent Neural Network Transducer.

,

,

,

,

,

,

,

Christian Fuegen

,

Michael L. Seltzer

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition.

,

Rohit Prabhavalkar

,

,

,

,

,

,

,

,

Christian Fuegen

,

Michael L. Seltzer

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios.

,

,

,

,

,

,

,

,

Christian Fuegen

,

Michael L. Seltzer

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion.

,

,

,

,

,

,

,

,

Christian Fuegen

,

,

,

Michael L. Seltzer

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Memory-Efficient Speech Recognition on Smart Devices.

Ganesh Venkatesh

,

Alagappan Valliappan

,

,

,

Christian Fuegen

,

Michael L. Seltzer

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Efficient Knowledge Distillation for RNN-Transducer Models.

Sankaran Panchapagesan

,

,

Chung-Cheng Chiu

,

,

,

Alexander Gruenstein

Proceedings of the IEEE International Conference on Acoustics, 2021

Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer.

,

,

,

Antoine Bruguier

,

Christian Fuegen

,

Michael L. Seltzer

,

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.

CoRR, 2020

Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer.

,

,

,

,

Françoise Beaufays

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Optimizing Speech Recognition For The Edge.

,

,

,

,

CoRR, 2019

Streaming End-to-end Speech Recognition for Mobile Devices.

Proceedings of the IEEE International Conference on Acoustics, 2019

2015

UMEME: University of Michigan Emotional McGurk Effect Data Set.

Emily Mower Provost

,

,

IEEE Trans. Affect. Comput., 2015

EmoShapelets: Capturing local dynamics of audio-visual affective speech.

,

Emily Mower Provost

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
