Anderson R. Avila

Orcid: 0000-0002-3088-5116

According to our database, Anderson R. Avila authored at least 28 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.






An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning.
CoRR, 2024

Backdoor Attacks to Deep Neural Networks: A Survey of the Literature, Challenges, and Future Research Directions.
IEEE Access, 2024

VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks.
Proceedings of the IEEE International Conference on Acoustics, 2024

On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild".
CoRR, 2023

Multimodal Audio-textual Architecture for Robust Spoken Language Understanding.
CoRR, 2023

On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications.
CoRR, 2023

An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for "In the Wild" Edge Applications.
CoRR, 2023

How Secure is Code Generated by ChatGPT?
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

Assessing the Vulnerability of Self-Supervised Speech Representations for Keyword Spotting Under White-Box Adversarial Attacks.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement.
CoRR, 2022

Low-bit Shift Network for End-to-End Spoken Language Understanding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Feature Pooling of Modulation Spectrum Features for Improved Speech Emotion Recognition in the Wild.
IEEE Trans. Affect. Comput., 2021

Automatic speaker verification from affective speech using Gaussian mixture model based estimation of neutral speech characteristics.
Speech Commun., 2021

On the use of blind channel response estimation and a residual neural network to detect physical access attacks to speaker verification systems.
Comput. Speech Lang., 2021

Sequential End-to-End Intent and Slot Label Classification and Localization.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Streaming End-to-End Framework For Spoken Language Understanding.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Non-intrusive speech quality prediction based on the blind estimation of clean speech and the i-vector framework.
Qual. User Exp., 2020

On the use of the i-vector speech representation for instrumental quality measurement.
Qual. User Exp., 2020

Intrusive Quality Measurement of Noisy and Enhanced Speech based on i-Vector Similarity.
Proceedings of the 11th International Conference on Quality of Multimedia Experience QoMEX 2019, 2019

Blind Channel Response Estimation for Replay Attack Detection.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Non-intrusive Speech Quality Assessment Using Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech-Based Stress Classification based on Modulation Spectral Features and Convolutional Neural Networks.
Proceedings of the 27th European Signal Processing Conference, 2019

Towards a Neuro-Inspired No-Reference Instrumental Quality Measure for Text-to-Speech Systems.
Proceedings of the Tenth International Conference on Quality of Multimedia Experience, 2018

Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech emotion recognition on mobile devices based on modulation spectral feature pooling and deep neural networks.
Proceedings of the 2017 IEEE International Symposium on Signal Processing and Information Technology, 2017

Performance comparison of intrusive and non-intrusive instrumental quality measures for enhanced speech.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Improving the performance of far-field speaker verification using multi-condition training: the case of GMM-UBM and i-vector systems.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
