Herman Kamper
Orcid: 0000-0003-2980-3475
According to our database1,
Herman Kamper
authored at least 91 papers
between 2011 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Trans. Assoc. Comput. Linguistics, 2024
IEEE Signal Process. Lett., 2024
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming.
CoRR, 2024
CoRR, 2024
2023
Infant Phonetic Learning as Perceptual Space Learning: A Crosslinguistic Evaluation of Computational Models.
Cogn. Sci., July, 2023
Word Segmentation on Discovered Phone Units With Dynamic Programming and Self-Supervised Scoring.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices.
CoRR, 2023
IEEE Access, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Mitigating Catastrophic Forgetting for Few-Shot Spoken Word Classification Through Meta-Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
IEEE J. Sel. Top. Signal Process., 2022
Comput. Speech Lang., 2022
YFACC: A Yorùbá Speech-Image Dataset for Cross-Lingual Keyword Localisation Through Visual Grounding.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the Artificial Intelligence Research - Third Southern African Conference, 2022
A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Improved Acoustic Word Embeddings for Zero-Resource Languages Using Multilingual Transfer.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Signal Process. Image Commun., 2021
Comput. Speech Lang., 2021
Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel.
CoRR, 2021
CoRR, 2021
A Comparison of Self-Supervised Speech Representations As Input Features For Unsupervised Acoustic Word Embeddings.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Acoustic Word Embeddings for Zero-Resource Languages Using Self-Supervised Contrastive Learning and Multilingual Adaptation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Analyzing Speaker Information in Self-Supervised Models to Improve Zero-Resource Speech Processing.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Towards Unsupervised Phone and Word Segmentation Using Self-Supervised Vector-Quantized Neural Networks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Multilingual Transfer of Acoustic Word Embeddings Improves When Training on Languages Related to the Target Zero-Resource Language.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
2020
IEEE Signal Process. Lett., 2020
On the expected behaviour of noise regularised deep neural networks as Gaussian processes.
Pattern Recognit. Lett., 2020
If dropout limits trainable depth, does critical initialisation still matter? A large-scale statistical analysis on ReLU networks.
Pattern Recognit. Lett., 2020
CoRR, 2020
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
CoRR, 2020
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020
Proceedings of the Artificial Intelligence Research, 2020
Unsupervised vs. Transfer Learning for Multimodal One-Shot Matching of Speech and Images.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Vector-Quantized Neural Networks for Acoustic Unit Discovery in the ZeroSpeech 2020 Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the 42th Annual Meeting of the Cognitive Science Society, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
CoRR, 2019
Pre-training on high-resource speech recognition improves low-resource speech-to-text translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Feature Exploration for Almost Zero-Resource ASR-Free Keyword Spotting Using a Multilingual Bottleneck Extractor and Correspondence Autoencoders.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Unsupervised Acoustic Unit Discovery for Speech Synthesis Using Discrete Latent-Variable Neural Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Truly Unsupervised Acoustic Word Embeddings Using Weak Top-down Constraints in Encoder-decoder Models.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Almost Zero-Resource ASR-free Keyword Spotting using Multilingual Bottleneck Features and Correspondence Autoencoders.
CoRR, 2018
ASR-Free CNN-DTW Keyword Spotting Using Multilingual Bottleneck Features for Almost Zero-Resource Languages.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018
Critical initialisation for deep signal propagation in noisy rectifier neural networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Fast ASR-free and Almost Zero-resource Keyword Spotting Using DTW and CNNs for Humanitarian Monitoring.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
PhD thesis, 2017
Comput. Speech Lang., 2017
CoRR, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017
An embedded segmental K-means model for unsupervised segmentation and clustering of speech.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
A comparison of neural network methods for unsupervised representation learning on the zero resource speech challenge.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Fully unsupervised small-vocabulary speech recognition using a segmental Bayesian model.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Unsupervised neural network based feature extraction using weak top-down constraints.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Capitalising on North American speech resources for the development of a South African English large vocabulary speech recognition system.
Comput. Speech Lang., 2014
Unsupervised lexical clustering of speech segments using fixed-dimensional acoustic embeddings.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
2012
Resource development and experiments in automatic south african broadcast news transcription.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
2011
Multi-Accent Speech Recognition of Afrikaans, Black and White Varieties of South African English.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011