Ambuj Mehrish

Orcid: 0000-0003-4240-9915

According to our database, Ambuj Mehrish authored at least 25 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2024
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech.
CoRR, 2024

Leveraging Parameter-Efficient Transfer Learning for Multi-Lingual Text-to-Speech Adaptation.
CoRR, 2024

Improving Text-To-Audio Models with Synthetic Captions.
CoRR, 2024

Reward Steering with Evolutionary Heuristics for Decoding-time Alignment.
CoRR, 2024

Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training.
CoRR, 2024

CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models.
CoRR, 2024

CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

HYPERTTS: Parameter Efficient Adaptation in Text to Speech Using Hypernetworks.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
A review of deep learning techniques for speech processing.
Inf. Fusion, November, 2023

Towards lifelong human assisted speaker diarization.
Comput. Speech Lang., 2023

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model.
CoRR, 2023

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding.
CoRR, 2023

Text-to-Audio Generation using Instruction Guided Latent Diffusion Model.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder.
CoRR, 2022

Learning Accent Representation with Multi-Level VAE Towards Controllable Speech Synthesis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

2021
The LIUM Human Active Correction Platform for Speaker Diarization.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speaker Embeddings for Diarization of Broadcast Data In The Allies Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Egocentric Analysis of Dash-Cam Videos for Vehicle Forensics.
IEEE Trans. Circuits Syst. Video Technol., 2020

2019
Joint Spatial and Discrete Cosine Transform Domain-Based Counter Forensics for Adaptive Contrast Enhancement.
IEEE Access, 2019

2018
Robust PRNU estimation from probabilistic raw measurements.
Signal Process. Image Commun., 2018

2017
Multimedia signatures for vehicle forensics.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Sensor Pattern Noise Estimation Using Probabilistically Estimated RAW Values.
IEEE Signal Process. Lett., 2016

Anti-forensic technique for median filtering using L1-L2 TV model.
Proceedings of the IEEE International Workshop on Information Forensics and Security, 2016

