Tom Bäckström

Orcid: 0000-0002-5590-2349

Affiliations:
  • Aalto University, Helsinki, Finland


According to our database, Tom Bäckström authored at least 88 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization.
CoRR, 2024

Evaluating privacy, security, and trust perceptions in conversational AI: A systematic review.
Comput. Hum. Behav., 2024

Real-Time Joint Noise Suppression and Bandwidth Extension of Noisy Reverberant Wideband Speech.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

2023
The Internet of Sounds: Convergent Trends, Insights, and Future Directions.
IEEE Internet Things J., July, 2023

Privacy in Speech Technology.
CoRR, 2023

Interpretable Latent Space Using Space-Filling Curves for Phonetic Analysis in Voice Conversion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings Using a Joint Loss Function.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image Processing.
Proceedings of the IEEE International Conference on Acoustics, 2023

Low-Complexity Real-Time Neural Network for Blind Bandwidth Extension of Wideband Speech.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
NSVQ: Noise Substitution in Vector Quantization for Machine Learning.
IEEE Access, 2022

Voice Quality Features for Replay Attack Detection.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Federated Learning for Privacy-Preserving Speaker Recognition.
IEEE Access, 2021

Cancellation of Local Competing Speaker with Near-Field Localization for Distributed ad-hoc Sensor Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

End-to-End Optimized Multi-Stage Vector Quantization of Spectral Envelopes for Speech and Audio Coding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Voice-quality Features for Deep Neural Network Based Speaker Verification Systems.
Proceedings of the 29th European Signal Processing Conference, 2021

PyAWNeS-Codec: Speech and audio codec for ad-hoc acoustic wireless sensor networks.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Enhancement by postfiltering for speech and audio coding in ad-hoc sensor networks.
CoRR, 2020

Users Perceptions about Teleconferencing Applications Collected through Twitter.
CoRR, 2020

Acoustic Fingerprints for Access Management in Ad-Hoc Sensor Networks.
IEEE Access, 2020

Provable Consent for Voice User Interfaces.
Proceedings of the 2020 IEEE International Conference on Pervasive Computing and Communications Workshops, 2020

Perception of Privacy Measured in the Crowd - Paired Comparison on the Effect of Background Noises.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fundamental Frequency Model for Postfiltering at Low Bitrates in a Transform-Domain Speech and Audio Codec.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Evaluation of Zero Frequency Filtering based Method for Multi-pitch Streaming of Concurrent Speech Signals.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Sound Privacy: A Conversational Speech Corpus for Quantifying the Experience of Privacy.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Super-Wideband Spectral Envelope Modeling for Speech Coding.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

End-to-End Optimization of Source Models for Speech and Audio Coding Using a Machine Learning Framework.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Overlap-add Windows with Maximum Energy Concentration for Speech and Audio Processing.
Proceedings of the IEEE International Conference on Acoustics, 2019

Robust and Responsive Acoustic Pairing of Devices Using Decorrelating Time-Frequency Modelling.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Postfiltering with Complex Spectral Correlations for Speech and Audio Coding.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Dithered Quantization for Frequency-Domain Speech and Audio Coding.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speech Coding, Speech Interfaces and IOT - Opportunities and Challenges.
Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

Optimal temporal dynamics of MFCCs for low-complexity VAD systems - a case study.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Quadratic Programming Approach to Glottal Inverse Filtering by Joint Norm-1 and Norm-2 Optimization.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

An Unsupervised Hybrid Approach for Online Detection of Sound Scene Changes in Broadcast Content.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

Estimation of the Probability Distribution of Spectral Fine Structure in the Speech Source.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Modeling formant dynamics in speech spectral envelopes.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Entropy Coding of Spectral Envelopes for Speech and Audio Coding Using Distribution Quantization.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Joint Enhancement and Coding of Speech by Incorporating Wiener Filtering in a CELP Codec.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Blind Recovery of Perceptual Models in Distributed Speech and Audio Coding.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Noise-adaptive perceptual weighting in the AMR-WB encoder for increased speech loudness in adverse far-end noise conditions.
Proceedings of the 24th European Signal Processing Conference, 2016

Spectral Envelope Statistics for Source Modeling in Speech Enhancement.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Coding of Parametric Models with Randomized Quantization in a Distributed Speech and Audio Codec.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
Decorrelating MVDR Filterbanks Using the Non-Uniform Discrete Fourier Transform.
IEEE Signal Process. Lett., 2015

Glottal inverse filtering based on quadratic programming.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Intelligibility evaluation of speech coding standards in severe background noise and packet loss conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Finding line spectral frequencies using the fast fourier transform.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Arithmetic coding of speech and audio spectra using tcx based on linear predictive spectral envelopes.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Harmonic model for MDCT based audio coding with LPC envelope.
Proceedings of the 23rd European Signal Processing Conference, 2015

Envelope modeling for speech and audio processing using distribution quantization.
Proceedings of the 23rd European Signal Processing Conference, 2015

Comparison of windowing schemes for speech coding.
Proceedings of the 23rd European Signal Processing Conference, 2015

An evaluation of stereo speech enhancement methods for different audio-visual scenarios.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Sparse time-frequency representation of speech by the vandermonde transform.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Decorrelated innovative codebooks for ACELP using factorization of autocorrelation matrix.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Automatic estimation of the lip radiation effect in glottal inverse filtering.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Implementation and evaluation of the Vandermonde transform.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Vandermonde Factorization of Toeplitz Matrices and Applications in Filtering and Warping.
IEEE Trans. Signal Process., 2013

Comparison of windowing in speech and audio coding.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Computationally efficient objective function for algebraic codebook optimization in ACELP.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Enumerative Algebraic Coding for ACELP.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2009
Stabilised weighted linear prediction.
Speech Commun., 2009

Parametric AM/FM decomposition for speech and audio coding.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Pitch variation estimation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Simple proofs of root locations of two symmetric linear prediction models.
Signal Process., 2008

DC-constrained linear prediction for glottal inverse filtering.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Minimum Separation of Line Spectral Frequencies.
IEEE Signal Process. Lett., 2007

Effect of White-Noise Correction on Linear Predictive Coding.
IEEE Signal Process. Lett., 2007

Stabilised weighted linear prediction - a robust all-pole method for speech processing.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Properties of line spectrum pair polynomials - A review.
Signal Process., 2006

2005
Group delay function as a means to assess quality of glottal inverse filtering.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A toolkit for voice inverse filtering and parametrisation.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Objective Quality Measures for Glottal Inverse Filtering of Speech Pressure Signals.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Line spectral properties of quadratic models.
Proceedings of the 13th European Signal Processing Conference, 2005

2004
A time-domain interpretation for the LSP decomposition.
IEEE Trans. Speech Audio Process., 2004

Linear predictive method for improved spectral modeling of lower frequencies of speech with small prediction orders.
IEEE Trans. Speech Audio Process., 2004

2003
On line spectral frequencies.
IEEE Signal Process. Lett., 2003

All-pole modeling technique based on weighted sum of LSP polynomials.
IEEE Signal Process. Lett., 2003

A constrained linear predictive model with the minimum-phase property.
Signal Process., 2003

Linear predictive method with low-frequency emphasis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

On the stability of constrained linear predictive models.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

All-pole modeling of wide-band speech with symmetric linear prediction.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range.
IEEE Trans. Speech Audio Process., 2002

All-pole modeling of wide-band speech using weighted sum of the LSP polynomials.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A time domain reformulation of linear prediction equivalent to the LSP decomposition.
Proceedings of the IEEE International Conference on Acoustics, 2002

