Joseph Keshet

Orcid: 0000-0003-2332-5783

Affiliations:
  • Israel Institute of Technology, Haifa, Israel


According to our database, Joseph Keshet authored at least 84 papers between 2001 and 2025.

Bibliography

2025
Enhancing analysis of diadochokinetic speech using deep neural networks.
Comput. Speech Lang., 2025

2024
Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR.
CoRR, 2024

WhisperNER: Unified Open Named Entity and Speech Recognition.
CoRR, 2024

HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing.
CoRR, 2024

Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network.
CoRR, 2024

Keyword-Guided Adaptation of Automatic Speech Recognition.
CoRR, 2024

DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Open-Vocabulary Keyword-Spotting with Adaptive Instance Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Combining Language Models For Specialized Domains: A Colorful Approach.
CoRR, 2023

2022
Speech Time-Scale Modification With GANs.
IEEE Signal Process. Lett., 2022

The Vocal Signature of Social Anxiety: Exploration using Hypothesis-Testing and Machine-Learning Approaches.
CoRR, 2022

THOR: Threshold-Based Ranking Loss for Ordinal Regression.
CoRR, 2022

Correcting Misproducted Speech using Spectrogram Inpainting.
CoRR, 2022

A Baseline for Detecting Out-of-Distribution Examples in Image Captioning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Correcting Mispronunciations in Speech using Spectrogram Inpainting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Formant Estimation and Tracking using Probabilistic Heat-Maps.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

DDKtor: Automatic Diadochokinetic Speech Analysis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Unsupervised Word Segmentation using K Nearest Neighbors.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Self-supervised Speaker Diarization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

DeepFry: Identifying Vocal Fry Using Deep Neural Networks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Pitch Estimation by Multiple Octave Decoders.
IEEE Signal Process. Lett., 2021

Constant Random Perturbations Provide Adversarial Robustness with Minimal Effect on Accuracy.
CoRR, 2021

CNN-Based Spoken Term Detection and Localization without Dynamic Programming.
Proceedings of the IEEE International Conference on Acoustics, 2021

Fairness in the Eyes of the Data: Certifying Machine-Learning Models.
Proceedings of the AIES '21: AAAI/ACM Conference on AI, 2021

Adversarial Robustness for Face Recognition: How to Introduce Ensemble Diversity among Feature Extractors?
Proceedings of the Workshop on Artificial Intelligence Safety 2021 (SafeAI 2021) co-located with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021

2020
Online prediction of time series with assumed behavior.
Eng. Appl. Artif. Intell., 2020

Redesigning the classification layer by randomizing the class representation vectors.
CoRR, 2020

Minimal Modifications of Deep Neural Networks using Verification.
Proceedings of the LPAR 2020: 23rd International Conference on Logic for Programming, 2020

Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Hide and Speak: Towards Deep Neural Networks for Speech Steganography.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Phoneme Boundary Detection Using Learnable Segmental Features.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Hide and Speak: Deep Neural Networks for Speech Steganography.
CoRR, 2019

Dr.VOT: Measuring Positive and Negative Voice Onset Time in the Wild.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

SpeechYOLO: Detection and Localization of Speech Objects.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Adversarial Examples on Discrete Sequences for Beating Whole-Binary Malware Detection.
CoRR, 2018

Fooling End-to-end Speaker Verification by Adversarial Examples.
CoRR, 2018

Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring.
Proceedings of the 27th USENIX Security Symposium, 2018

Out-of-Distribution Detection using Multiple Semantic Label Representations.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Fooling End-To-End Speaker Verification With Adversarial Examples.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Spoken Term Detection Automatically Adjusted for a Given Threshold.
IEEE J. Sel. Top. Signal Process., 2017

Learning Similarity Function for Pronunciation Variations.
CoRR, 2017

Houdini: Fooling Deep Structured Prediction Models.
CoRR, 2017

Houdini: Fooling Deep Structured Visual and Speech Recognition Models with Adversarial Examples.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Automatic Measurement of Pre-Aspiration.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Learning Similarity Functions for Pronunciation Variations.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Sequence segmentation using joint RNN and structured prediction models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
StructED: Risk Minimization in Structured Prediction.
J. Mach. Learn. Res., 2016

Domain Adaptation For Formant Estimation Using Deep Learning.
CoRR, 2016

Automatic measurement of vowel duration via structured prediction.
CoRR, 2016

Formant Estimation and Tracking Using Deep Learning.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Measurement of Voice Onset Time and Prevoicing Using Recurrent Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The relationship of voice onset time and Voice Offset Time to physical age.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Online Prediction of Exponential Decay Time Series with Human-Agent Application.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

2015
Context-Based Prediction of App Usage.
CoRR, 2015

Risk Minimization in Structured Prediction using Orbit Loss.
CoRR, 2015

Vowel duration measurement using deep neural networks.
Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing, 2015

2013
Learning Efficient Random Maximum A-Posteriori Predictors with Non-Decomposable Loss Functions.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Predicting Human Strategic Decisions Using Facial Expressions.
Proceedings of the IJCAI 2013, 2013

Discriminative articulatory models for spoken term detection in low-resource conversational settings.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Discriminative spoken term detection with limited data.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Automatic Measurement of Positive and Negative Voice Onset Time.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Explicit Approximations of the Gaussian Kernel.
CoRR, 2011

Generalization Bounds and Consistency for Latent Structural Probit and Ramp Loss.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

A GPU-tailored approach for training kernelized SVMs.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Direct Error Rate Minimization of Hidden Markov Models.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

PAC-Bayesian approach for minimization of phoneme error rate.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Direct Loss Minimization for Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Automatic discriminative measurement of voice onset time.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Discriminative keyword spotting.
Speech Commun., 2009

Bounded Kernel-Based Online Learning.
J. Mach. Learn. Res., 2009

Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Support Vector Machines with a Reject Option.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

The projectron: a bounded kernel-based Perceptron.
Proceedings of the Machine Learning, 2008

2007
Large margin algorithms for discriminative continuous speech recognition (זיהוי דיבור רציף באמצעות אלגוריתמי שוליים רחבים.).
PhD thesis, 2007

A Large Margin Algorithm for Speech-to-Phoneme and Music-to-Score Alignment.
IEEE Trans. Speech Audio Process., 2007

2006
Online Passive-Aggressive Algorithms.
J. Mach. Learn. Res., 2006

Discriminative kernel-based phoneme sequence recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Phoneme alignment based on discriminative learning.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
An Online Algorithm for Hierarchical Phoneme Classification.
Proceedings of the Machine Learning for Multimodal Interaction, 2004

Learning to Align Polyphonic Music.
Proceedings of the ISMIR 2004, 2004

Large margin hierarchical classification.
Proceedings of the Machine Learning, 2004

2002
Kernel Design Using Boosting.
Proceedings of the Advances in Neural Information Processing Systems 15, 2002

2001
Plosive spotting with margin classifiers.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

