Zalan Borsos

Orcid: 0000-0003-0007-829X

According to our database, Zalan Borsos authored at least 23 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Data Summarization via Bilevel Optimization.
J. Mach. Learn. Res., 2024

MusicRL: Aligning Music Generation to Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
AudioLM: A Language Modeling Approach to Audio Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision.
Trans. Assoc. Comput. Linguistics, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen.
CoRR, 2023

SoundStorm: Efficient Parallel Audio Generation.
CoRR, 2023

MusicLM: Generating Music From Text.
CoRR, 2023

TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Disentangling Speech from Surroundings with Neural Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2023

LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
AudioLM: a Language Modeling Approach to Audio Generation.
CoRR, 2022

Disentangling speech from surroundings in a neural audio codec.
CoRR, 2022

SpeechPainter: Text-conditioned Speech Inpainting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Data Summarization in Modern Machine Learning.
PhD thesis, 2021

Semi-Supervised Batch Active Learning Via Bilevel Optimization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Micaugment: One-Shot Microphone Style Transfer.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Coresets via Bilevel Optimization for Continual Learning and Streaming.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Transfer NAS: Knowledge Transfer between Search Spaces with Transformer Agents.
CoRR, 2019

Online Variance Reduction with Mixtures.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Dealing with overlap and imbalance: a new metric and approach.
Pattern Anal. Appl., 2018

Inference of the three-dimensional chromatin structure and its temporal behavior.
CoRR, 2018

Online Variance Reduction for Stochastic Optimization.
Proceedings of the Conference On Learning Theory, 2018

2013
Implementing Modular FFTs in FPGAs - A Basic Block for Lattice-Based Cryptography.
Proceedings of the 2013 Euromicro Conference on Digital System Design, 2013
