Alexander Richard

According to our database1, Alexander Richard authored at least 42 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation.
CoRR, 2024

ScoreDec: A Phase-Preserving High-Fidelity Audio Codec with a Generalized Score-Based Diffusion Post-Filter.
Proceedings of the IEEE International Conference on Acoustics, 2024

Modeling and Driving Human Body Soundfields Through Acoustic Primitives.
Proceedings of the Computer Vision - ECCV 2024, 2024

From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Spatialization Quality Metric for Binaural Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec.
Proceedings of the IEEE International Conference on Acoustics, 2023

Nord: Non-Matching Reference Based Relative Depth Estimation from Binaural Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

Novel-View Acoustic Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Multiface: A Dataset for Neural Face Rendering.
CoRR, 2022

Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Binaural Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022

Conditional Diffusion Probabilistic Model for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space.
Proceedings of the Computer Vision - ECCV 2022, 2022

Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Audio- and Gaze-driven Facial Animation of Codec Avatars.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Neural Synthesis of Binaural Speech From Mono Audio.
Proceedings of the 9th International Conference on Learning Representations, 2021

MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Implicit HRTF Modeling Using Temporal Convolutional Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Basic Distributed Algorithms Visual Simulations for Distributed Systems Students.
Proceedings of the IEEE Global Engineering Education Conference, 2021

2020
A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

On Evaluating Weakly Supervised Action Segmentation Methods.
CoRR, 2020

2019
Temporal Segmentation of Human Actions in Videos
PhD thesis, 2019

Mining YouTube - A dataset for learning fine-grained action concepts from webly supervised video data.
CoRR, 2019

Enhancing Temporal Action Localization with Transfer Learning from Action Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018
Two Stream 3D Semantic Scene Completion.
CoRR, 2018

NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

When Will You Do What? - Anticipating Temporal Occurrences of Activities.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
A bag-of-words equivalent recurrent neural network for action recognition.
Comput. Vis. Image Underst., 2017

Weakly supervised learning of actions from transcripts.
Comput. Vis. Image Underst., 2017

Temporal Action Labeling using Action Sets.
CoRR, 2017

Recurrent Residual Learning for Action Recognition.
Proceedings of the Pattern Recognition - 39th German Conference, 2017

Weakly Supervised Action Learning with RNN Based Fine-to-Coarse Modeling.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Temporal Action Detection Using a Statistical Language Model.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
A BoW-equivalent Recurrent Neural Network for Action Recognition.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Mean-normalized stochastic gradient for large-scale deep learning.
Proceedings of the IEEE International Conference on Acoustics, 2014

RASR/NN: The RWTH neural network toolkit for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A critical evaluation of stochastic algorithms for convex optimization.
Proceedings of the IEEE International Conference on Acoustics, 2013

2011
Feature selection for log-linear acoustic models.
Proceedings of the IEEE International Conference on Acoustics, 2011


  Loading...