Ching-Feng Yeh

According to our database1, Ching-Feng Yeh authored at least 40 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Attention or Convolution: Transformer Encoders in Audio Language Models for Inference Efficiency.
Proceedings of the IEEE International Conference on Acoustics, 2024

Altogether: Image Captioning via Re-aligning Alt-text.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Efficient Speech Representation Learning with Low-Bit Quantization.
CoRR, 2023

Continual Learning for On-Device Speech Recognition Using Disentangled Conformers.
Proceedings of the IEEE International Conference on Acoustics, 2023

Flap: Fast Language-Audio Pre-Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Superb @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

2021
TorchAudio: Building Blocks for Audio and Speech Processing.
CoRR, 2021

Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Streaming Attention-Based Models with Augmented Memory for End-To-End Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Alignment Restricted Streaming Recurrent Neural Network Transducer.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications.
Proceedings of the IEEE International Conference on Acoustics, 2021

Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition.
CoRR, 2020

Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Weak-Attention Suppression for Transformer Based Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Aipnet: Generative Adversarial Pre-Training of Accent-Invariant Networks for End-To-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
RNN-T For Latency Controlled ASR With Improved Beam Search.
CoRR, 2019

Transformer-Transducer: End-to-End Speech Recognition with Self-Attention.
CoRR, 2019

2018
Training Augmentation with Adversarial Examples for Robust Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Domain Adversarial Training for Accented Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
An Efficient Two-Phase ILP-Based Algorithm for Precise CMOS RFIC Layout Generation.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

2016
Novel CMOS RFIC layout generation with concurrent device placement and fixed-length microstrip routing.
Proceedings of the 53rd Annual Design Automation Conference, 2016

2015
A Novel Analog Physical Synthesis Methodology Integrating Existent Design Expertise.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2015

An Improved Framework for Recognizing Highly Imbalanced Bilingual Code-Switched Lectures with Cross-Language Acoustic Modeling and Frame-Level Language Identification.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Personalized speech recognizer with keyword-based personalized lexicon and language model using word vector representations.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Exploring Feasibilities of Symmetry Islands and Monotonic Current Paths in Slicing Trees for Analog Placement.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2014

Spoken Knowledge Organization by Semantic Structuring and a Prototype Course Lecture System for Personalized Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Transcribing code-switched bilingual lectures using deep neural networks with unit merging in acoustic modeling.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Speaking rate normalization with lattice-based context-dependent phoneme duration modeling for personalized speech recognizers on mobile devices.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Minimum Phone Error model training on merged acoustic units for transcribing bilingual code-switched speech.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Recognition of highly imbalanced code-mixed bilingual speech with frame-level language detection based on blurred posteriorgram.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Bilingual Acoustic Model Adaptation by Unit Merging on Different Levels and Cross-Level Integration.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Spoken Lecture Summarization by Random Walk over a Graph Constructed with Automatically Extracted Key Terms.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Bilingual acoustic modeling with state mapping and three-stage adaptation for transcribing unbalanced code-mixed lectures.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
A framework integrating different relevance feedback scenarios and approaches for spoken term detection.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

An integrated framework for transcribing Mandarin-English code-mixed lectures with improved acoustic and language modeling.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Improved spoken term detection by discriminative training of acoustic models based on user relevance feedback.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Improved spoken term detection by feature space pseudo-relevance feedback.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010


  Loading...