Szu-Wei Fu

ORCID: 0000-0002-3487-8212

According to our database, Szu-Wei Fu authored at least 59 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Timeline

[Chart: publications per year, 2014 to 2024, by type: book, in proceedings, article, PhD thesis, dataset, other.]

Bibliography

2024
NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts.
CoRR, 2024

Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data.
CoRR, 2024

Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration.
CoRR, 2024

The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction.
CoRR, 2024

DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment.
CoRR, 2024

An Investigation of Incorporating Mamba for Speech Enhancement.
CoRR, 2024

RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-Based Speech Enhancement.
Proceedings of the 26th IEEE International Workshop on Multimedia Signal Processing, 2024

A Study On Incorporating Whisper For Robust Speech Assessment.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Multi-objective Non-intrusive Hearing-aid Speech Assessment Model.
CoRR, 2023

QuAVF: Quality-aware Audio-Visual Fusion for Ego4D Talking to Me Challenge.
CoRR, 2023

Real-Time Speech Interruption Analysis: from Cloud to Client Deployment.
Proceedings of the IEEE International Conference on Acoustics, 2023

Study on the Correlation Between Objective Evaluations and Subjective Speech Quality and Intelligibility.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application.
IEEE Access, 2022

Improving Meeting Inclusiveness using Speech Interruption Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MTI-Net: A Multi-Target Speech Intelligibility Prediction Model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

OSSEM: one-shot speaker adaptive speech enhancement using meta learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Boosting Self-Supervised Embeddings for Speech Enhancement.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Perceptual Contrast Stretching on Target Feature for Speech Enhancement.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation Based Only on Noisy/ Reverberated Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation.
IEEE Trans. Cogn. Dev. Syst., 2021

SpeechBrain: A General-Purpose Speech Toolkit.
CoRR, 2021

Improving Perceptual Quality by Phone-Fortified Perceptual Loss Using Wasserstein Distance for Speech Enhancement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality.
IEEE Signal Process. Lett., 2020

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement.
CoRR, 2020

Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing.
CoRR, 2020

iMetricGAN: Intelligibility Enhancement for Speech-in-Noise Using Generative Adversarial Network-Based Metric Learning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Boosting Objective Scores of a Speech Enhancement Model by MetricGAN Post-processing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques.
IEEE Signal Process. Lett., 2019

Time-Domain Multi-modal Bone/air Conducted Speech Enhancement.
CoRR, 2019

Seeing Voices in Noise: A Study of Audiovisual-Enhanced Vocoded Speech Intelligibility in Cochlear Implant Simulation.
CoRR, 2019

Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement.
CoRR, 2019

Multichannel Speech Enhancement by Raw Waveform-mapping using Fully Convolutional Networks.
CoRR, 2019

Noise Reduction in ECG Signals Using Fully Convolutional Denoising Autoencoders.
IEEE Access, 2019

Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

A Study on Speech Enhancement Using Exponent-Only Floating Point Quantized Neural Network (EOFP-QNN).
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.
IEEE Trans. Biomed. Eng., 2017

End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
CoRR, 2017

Multi-Metrics Learning for Speech Enhancement.
CoRR, 2017

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Raw waveform-based speech enhancement by fully convolutional networks.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Collagen image compression using the JPEG-based predictive lossless coding scheme.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Maximum Entropy Learning with Deep Belief Networks.
Entropy, 2016

SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Horizontal adaptive disparity estimation scheme for stereoscopic images.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Compression for the feature points with binary descriptors.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Image deblurring using a pyramid-based Richardson-Lucy algorithm.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

A novel compression algorithm for IMFs of Hilbert-Huang transform.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

End-point preserved stroke extraction.
Proceedings of the International Conference on Audio, 2014

