Malcolm Slaney

Orcid: 0000-0001-9733-4864

According to our database, Malcolm Slaney authored at least 101 papers between 1990 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Awards

IEEE Fellow 2010, "For contributions to perceptual signal processing and tomographic imaging".

Bibliography

2024
The CARFAC v2 Cochlear Model in Matlab, NumPy, and JAX.
CoRR, 2024

2023
Neural architecture search for energy-efficient always-on audio machine learning.
Neural Comput. Appl., June, 2023

Disentangling Speech from Surroundings with Neural Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Disentangling speech from surroundings in a neural audio codec.
CoRR, 2022

Neural Architecture Search for Energy Efficient Always-on Audio Models.
CoRR, 2022

Multi-Channel Speech Denoising for Machine Ears.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
VHP: Vibrotactile Haptics Platform for On-body Applications.
Proceedings of the UIST '21: The 34th Annual ACM Symposium on User Interface Software and Technology, 2021

2020
Deep Canonical Correlation Analysis For Decoding The Auditory Brain.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

2018
Decoding the auditory brain with canonical component analysis.
NeuroImage, 2018

Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards mobile gaze-directed beamforming: a novel neuro-technology for hearing loss.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

2017
Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers.
CoRR, 2017

CNN architectures for large-scale audio classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2015
A Study of Multimodal Addressee Detection in Human-Human-Computer Interaction.
IEEE Trans. Multim., 2015

Multimodal addressee detection in multiparty dialogue systems.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Probabilistic features for connecting eye gaze to spoken language understanding.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Artificial neural network features for speaker diarization.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Eye gaze for understanding conversational speech.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The influence of pitch and noise on the discriminability of filterbank features.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

The Relation of Eye Gaze and Face Pose: Potential Impact on Speech Recognition.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Eye Gaze for Spoken Language Understanding in Multi-modal Conversational Interactions.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Gaze-enhanced speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Introduction to the special section on the 20th anniversary of the ACM international conference on multimedia.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Micro Stories and Mega Stories.
IEEE Multim., 2013

Data driven suppression rule for speech enhancement.
Proceedings of the 2013 Information Theory and Applications Workshop, 2013

QBT-Extended: An Annotated Dataset of Melodically Contoured Tapped Queries.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Pitch-gesture modeling using subband autocorrelation change detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Characteristic contours of syllabic-level units in laughter.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Optimal Parameters for Locality-Sensitive Hashing.
Proc. IEEE, 2012

Web-Scale Multimedia Processing and Applications [Scanning the Issue].
Proc. IEEE, 2012

Don't Click Here.
IEEE Multim., 2012

Tell Me a Story.
IEEE Multim., 2012

Coulda, woulda, shoulda: 20 years of multimedia opportunities.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Learning Sparse Feature Representations for Music Annotation and Retrieval.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

A model of attention-driven scene analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Audio and Acoustic Signal Processing [In the Spotlight].
IEEE Signal Process. Mag., 2011

Academia Meets Industry at the Multimedia Grand Challenge.
IEEE Multim., 2011

Precision-Recall Is Wrong for Multimedia.
IEEE Multim., 2011

Web-Scale Multimedia Analysis: Does Content Matter?
IEEE Multim., 2011

Identifying authoritative sources of multimedia content: mining specificity and expertise from large-scale multimedia databases.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

A Classification-Based Polyphonic Piano Transcription Approach Using Learned Feature Representations.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Recommender Systems, Missing Data and Statistical Model Estimation.
Proceedings of the IJCAI 2011, 2011

Using gaze patterns to study and predict reading struggles due to distraction.
Proceedings of the International Conference on Human Factors in Computing Systems, 2011

2010
Solving Demodulation as an Optimization Problem.
IEEE Trans. Speech Audio Process., 2010

Scalable Audio-Content Analysis.
EURASIP J. Audio Speech Music. Process., 2010

Processing web-scale multimedia data.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Image classification using the web graph.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multimodal retrieval and ranking: more than waveforms.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

The information content of demodulated speech.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Unsupervised image ranking.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Periodicity Detection and Localization using Spike Timing from the AER EAR.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Reconciliation of human and machine speech recognition performance.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio.
IEEE Trans. Speech Audio Process., 2008

Analysis of Minimum Distances in High-Dimensional Musical Spaces.
IEEE Trans. Speech Audio Process., 2008

Locality-Sensitive Hashing for Finding Nearest Neighbors [Lecture Notes].
IEEE Signal Process. Mag., 2008

Content-Based Music Information Retrieval: Current Directions and Future Challenges.
Proc. IEEE, 2008

Resolving tag ambiguity.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Learning a Metric for Music Similarity.
Proceedings of the ISMIR 2008, 2008

Comparing Local Feature Descriptors in pLSA-Based Image Models.
Proceedings of the Pattern Recognition, 2008

Continuous visual vocabulary models for pLSA-based scene recognition.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

2007
Collaborative Filtering and the Missing at Random Assumption.
Proceedings of the UAI 2007, 2007

Similarity Based on Rating Data.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A Unified System for Chord Transcription and Key Extraction Using Hidden Markov Models.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

PLSA on Large Scale Image Databases.
Proceedings of the IEEE International Conference on Acoustics, 2007

Fast Recognition of Remixed Music Audio.
Proceedings of the IEEE International Conference on Acoustics, 2007

Varying Time Constants and Gain Adaptation in Feature Extraction for Speech Processing.
Proceedings of the IEEE International Conference on Acoustics, 2007

Image retrieval on large-scale image databases.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
Discrimination of speech from nonspeech based on multiscale spectro-temporal Modulations.
IEEE Trans. Speech Audio Process., 2006

Automatic Chord Recognition from Audio Using a HMM with Supervised Learning.
Proceedings of the ISMIR 2006, 2006

Song Intersection by Approximate Nearest Neighbor Search.
Proceedings of the ISMIR 2006, 2006

A statistical model of timbre perception.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

The Importance of Sequences in Musical Similarity.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Being Literate with Large Document Collections: Observational Studies and Cost Structure Tradeoffs.
Proceedings of the 39th Hawaii International Conference on Systems Science (HICSS-39 2006), 2006

2005
A timbre space for speech.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Analytic Worksheets: A Framework to Support Human Analysis of Large Streaming Data Volumes.
Proceedings of the Human-Computer Interaction, 2005

Measuring Information Understanding in Large Document Collections.
Proceedings of the 38th Hawaii International Conference on System Sciences (HICSS-38 2005), 2005

The History and Future of CASA.
Proceedings of the Speech Separation by Humans and Machines, 2005

2004
Low-power audio classification for ubiquitous sensor networks.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Speech discrimination based on multiscale spectro-temporal modulations.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
BabyEars: A recognition system for affective vocalizations.
Speech Commun., 2003

Modeling Multitasking Users.
Proceedings of the User Modeling 2003, 2003

2002
Mixtures of probability experts for audio retrieval and indexing.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Semantic-audio retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Multimedia edges: finding hierarchy in all dimensions.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Hierarchical segmentation using latent semantic indexing in scale space.
Proceedings of the IEEE International Conference on Acoustics, 2001

FastMPEG: time-scale modification of bit-compressed audio information.
Proceedings of the IEEE International Conference on Acoustics, 2001

Temporal Events in All Dimensions and Scales.
Proceedings of the IEEE Workshop on Detection and Recognition of Events in Video, 2001

Principles of computerized tomographic imaging.
Classics in applied mathematics 33, SIAM, ISBN: 978-0-89871-494-4, 2001

2000
FaceSync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

1998
Baby Ears: a recognition system for affective vocalizations.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

MACH1: nonuniform time-scale modification of speech.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Video Rewrite: driving visual speech with audio.
Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, 1997

Construction and evaluation of a robust multifeature speech/music discriminator.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Video rewrite: visual speech synthesis from video.
Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, 1997

1996
Automatic audio morphing.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1994
Pattern Playback in the 90s.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

Auditory model inversion for sound separation.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1990
A perceptual pitch detector.
Proceedings of the 1990 International Conference on Acoustics, 1990

Speaker-independent vowel recognition: spectrograms versus cochleagrams.
Proceedings of the 1990 International Conference on Acoustics, 1990

