Tal Remez

Orcid: 0000-0002-9930-8624

According to our database1, Tal Remez authored at least 31 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Discrete Flow Matching.
CoRR, 2024

The Larger the Better? Improved LLM Code-Generation via Budget Reallocation.
CoRR, 2024

Masked Audio Generation using a Single Non-Autoregressive Transformer.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Code Llama: Open Foundation Models for Code.
CoRR, 2023

Textually Pretrained Speech Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Simple and Controllable Music Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Regeneration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement.
CoRR, 2022

Translatotron 2: High-quality direct speech-to-speech translation with voice preservation.
Proceedings of the International Conference on Machine Learning, 2022

AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation.
Proceedings of the Computer Vision - ECCV 2022, 2022

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Translatotron 2: Robust direct speech-to-speech translation.
CoRR, 2021

Improving On-Screen Sound Separation for Open Domain Videos with Audio-Visual Self-attention.
CoRR, 2021

Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds.
Proceedings of the 9th International Conference on Learning Representations, 2021

2019

2018
Class-Aware Fully Convolutional Gaussian and Poisson Denoising.
IEEE Trans. Image Process., 2018

Learning to Segment via Cut-and-Paste.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
ASIST: Automatic semantically invariant scene transformation.
Comput. Vis. Image Underst., 2017

Deep Class Aware Denoising.
CoRR, 2017

Deep Convolutional Denoising of Low-Light Images.
CoRR, 2017

White Matter Fiber Representation Using Continuous Dictionary Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2017, 2017

Deep class-aware image denoising.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Deep Functional Maps: Structured Prediction for Dense Shape Correspondence.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Efficient Deformable Shape Correspondence via Kernel Matching.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2016
FPGA system for real-time computational extended depth of field imaging using phase aperture coding.
CoRR, 2016

Cloud Dictionary: Sparse Coding and Modeling for Point Clouds.
CoRR, 2016

A picture is worth a billion bits: Real-time image reconstruction from dense binary threshold pixels.
Proceedings of the 2016 IEEE International Conference on Computational Photography, 2016

2015
A Picture is Worth a Billion Bits: Real-Time Image Reconstruction from Dense Binary Pixels.
CoRR, 2015

Spatially Coherent Random Forests.
CoRR, 2015

Image reconstruction from dense binary pixels.
CoRR, 2015


  Loading...