Florian Metze
Orcid: 0000-0002-6663-8600Affiliations:
- Carnegie Mellon University, Pittsburgh, USA
According to our database1,
Florian Metze
authored at least 256 papers
between 1996 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
Machine Listening for Heart Status Monitoring: Introducing and Benchmarking HSS - The Heart Sounds Shenzhen Corpus.
IEEE J. Biomed. Health Informatics, 2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Revisiting Factorizing Aggregated Posterior in Learning Disentangled Representations.
CoRR, 2020
CoRR, 2020
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the Imaging and Multimedia Analytics in a Web and Mobile World 2020, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Automatic word count estimation from daylong child-centered recordings in various language environments using language-independent syllabification of speech.
Speech Commun., 2019
Int. J. Multim. Inf. Retr., 2019
CoRR, 2019
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions.
CoRR, 2019
Proceedings of the 2019 Text Analysis Conference, 2019
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019
Multitask Learning For Different Subword Segmentations In Neural Machine Translation.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019
Proceedings of the 16th International Conference on Spoken Language Translation, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 12th International Conference on Natural Language Generation, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Connectionist Temporal Localization for Sound Event Detection with Sequential Labeling.
Proceedings of the IEEE International Conference on Acoustics, 2019
A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
CoRR, 2018
Proceedings of the 2018 Text Analysis Conference, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018
Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Comparing the Max and Noisy-Or Pooling Functions in Multiple Instance Learning for Weakly Supervised Sequence Learning Tasks.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
CoRR, 2017
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
A Transfer Learning Based Feature Extractor for Polyphonic Sound Event Detection Using Connectionist Temporal Classification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
A first attempt at polyphonic sound event detection using connectionist temporal classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Proceedings of the 13th Web for All Conference, 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Experiences with Shared Resources for Research and Education in Speech and Language Processing.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 12th ITG Symposium on Speech Communication, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
QUESST2014: Evaluating Query-by-Example Speech Search in a zero-resource setting with real-life queries.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Comput. Speech Lang., 2014
Computer, 2014
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Multilingual deep bottle neck features: a study on language selection and training techniques.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
An in-depth comparison of keyword specific thresholding and sum-to-one score normalization.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Improving language-universal feature extraction with deep maxout and convolutional neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
2013
Int. J. Multim. Inf. Retr., 2013
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Multi-layer mutually reinforced random walk with hidden parameters for improved multi-party meeting summarization.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Prosody-Based Unsupervised Speech Summarization with Two-Layer Mutually Reinforced Random Walk.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Learning discriminative basis coefficients for eigenspace MLLR unsupervised adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2013
Subspace mixture model for low-resource speech recognition in cross-lingual settings.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Subword Modeling for Automatic Speech Recognition: Past, Present, and Emerging Approaches.
IEEE Signal Process. Mag., 2012
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012
Integration of language identification into a recognition system for spoken conversations containing code-Switches.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Two-layer mutually reinforced random walk for improved multi-party meeting summarization.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Intra-Speaker Topic Modeling for Improved Multi-Party Meeting Summarization with Integrated Random Walk.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012
AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012
Proceedings of the International Conference on Multimedia Retrieval, 2012
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
Initialization Schemes for Multilayer Perceptron Training and their Impact on ASR Performance using Multilingual Data.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Integrating Intra-Speaker Topic Modeling and Temporal-Based Inter-Speaker Topic Modeling in Random Walk for Improved Multi-Party Meeting Summarization.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the INLG 2012 - Proceedings of the Seventh International Natural Language Generation Conference, 30 May 2012, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Human-Computer Interaction. Interaction Techniques and Environments, 2011
Proceedings of the Spoken Dialogue Systems Technology and Design, 2011
2010
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), 2010
Multimedia content with a speech track: ACM multimedia 2010 workshop on searching spontaneous conversational speech.
Proceedings of the 18th International Conference on Multimedia 2010, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Improvements to generalized discriminative feature transformation for speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Univers. Access Inf. Soc., 2009
Proceedings of the 3rd IEEE International Conference on Semantic Computing (ICSC 2009), 2009
Usability-Evaluation multimodaler Schnittstellen: Ist das Ganze die Summe seiner Teile?
Proceedings of the Mensch & Computer 2009: Grenzenlos frei!?, 2009
Benutzerstudien zur Bewertung multimodaler, interaktiver Anzeigetafeln in unterschiedlichen Entwicklungsstufen.
Proceedings of the Workshop-Proceedings der Tagung Mensch & Computer 2009, 2009
Proceedings of the Workshop-Proceedings der Tagung Mensch & Computer 2009, 2009
Predicting the quality of multimodal systems based on judgments of single modalities.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Influence of training on direct and indirect measures for the evaluation of multimodal systems.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Emotion classification in children's speech using fusion of acoustic and linguistic features.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the Human-Computer Interaction. Novel Interaction Methods and Techniques, 2009
Proceedings of the Human-Computer Interaction. Novel Interaction Methods and Techniques, 2009
2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Detecting trends in social bookmarking systems using a probabilistic generative model and smoothing.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008
Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and International Conference on Intelligent Agent Technology, 2008
2007
Proceedings of the IEEE International Conference on Systems, 2007
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007
Comparison of Four Approaches to Age and Gender Recognition for Telephone Applications.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
Proceedings of the Machine Learning for Multimodal Interaction, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit.
Proceedings of the Pattern Recognition, 26th DAGM Symposium, August 30, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Efficient language model lookahead through polymorphic linguistic context assignment.
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems@ACL 2002, 2002
2001
Proceedings of the First International Conference on Human Language Technology Research, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Generalized radial basis function networks for classification and novelty detection: self-organization of optimal Bayesian decision.
Neural Networks, 2000
Das View4You- System: End-to-End Evaluation.
Proceedings of the KONVENS 2000 / Sprachkommunikation, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
1996
Proceedings of the Seventh International Workshop on Database and Expert Systems Applications, 1996