Frank Zalkow

Orcid: 0000-0003-1383-4541

According to our database, Frank Zalkow authored at least 29 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2024
Audio-visual speech synthesis using vision transformer-enhanced autoencoders with ensemble of loss functions.
Appl. Intell., March, 2024

Meta Learning Text-to-Speech Synthesis in over 7000 Languages.
CoRR, 2024

PAD-VC: A Prosody-Aware Decoder for Any-to-Few Voice Conversion.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

2023

Wagner Ring Dataset: A Complex Opera Scenario for Music Processing and Computational Musicology.
Trans. Int. Soc. Music. Inf. Retr., January, 2023

Theme Transformer: Symbolic Music Generation With Theme-Conditioned Transformer.
IEEE Trans. Multim., 2023

Evaluating Speech-Phoneme Alignment and its Impact on Neural Text-To-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

FMP Notebooks.
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023

The AudioLabs System for the Blizzard Challenge 2023.
Proceedings of the 18th Blizzard Challenge Workshop, Grenoble, France, August 29, 2023

2021

libfmp: A Python Package for Fundamentals of Music Processing.
Dataset, July, 2021

Learning Audio Representations for Cross-Version Retrieval of Western Classical Music (Lernen von Audiodarstellungen für die versionsübergreifende Suche westlicher klassischer Musik)
PhD thesis, 2021

CTC-Based Learning of Chroma Features for Score-Audio Music Retrieval.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

libfmp: A Python Package for Fundamentals of Music Processing.
J. Open Source Softw., 2021

Schubert Winterreise Dataset: A Multimodal Scenario for Music Analysis.
ACM Journal on Computing and Cultural Heritage, 2021

2020



MTD: A Multimodal Dataset of Musical Themes for MIR Research.
Trans. Int. Soc. Music. Inf. Retr., 2020

Using Weakly Aligned Score-Audio Pairs to Train Deep Chroma Models for Cross-Modal Music Retrieval.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Modeling and Estimating Local Tempo: A Case Study on Chopin's Mazurkas.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Classifying Leitmotifs in Recordings of Operas by Richard Wagner.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

2019
A Comparison of Recent Neural Vocoders for Speech Signal Reconstruction.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

FMP Notebooks: Educational Material for Teaching and Learning Fundamentals of Music Processing.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Evaluating Salience Representations for Cross-modal Retrieval of Western Classical Music Recordings.
Proceedings of the IEEE International Conference on Acoustics, 2019

2017
A Multi-Version Approach for Transferring Measure Annotations between Music Recordings.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

Exploring Tonal-Dramatic Relationships in Richard Wagner's Ring Cycle.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Versionsübergreifende Visualisierung harmonischer Verläufe.
Proceedings of the 47. Jahrestagung der Gesellschaft für Informatik, 2017

2016
Musical Style Modification as an Optimization Problem.
Proceedings of the 2016 International Computer Music Conference, 2016

