We stand with Ukraine

We stand with Ukraine

Ambuj Mehrish

Orcid: 0000-0003-4240-9915

According to our database¹, Ambuj Mehrish authored at least 26 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization.

[BibT_eX]

[DOI]

,

Navonil Majumder

,

,

,

,

Bryan Catanzaro

,

CoRR, 2024

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech.

[BibT_eX]

[DOI]

Jan Melechovský

,

,

,

Dorien Herremans

CoRR, 2024

Leveraging Parameter-Efficient Transfer Learning for Multi-Lingual Text-to-Speech Adaptation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Improving Text-To-Audio Models with Synthetic Captions.

[BibT_eX]

[DOI]

,

,

Deepanway Ghosal

,

Navonil Majumder

,

,

,

,

Bryan Catanzaro

CoRR, 2024

Reward Steering with Evolutionary Heuristics for Decoding-time Alignment.

[BibT_eX]

[DOI]

,

Navonil Majumder

,

,

CoRR, 2024

Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training.

[BibT_eX]

[DOI]

Jan Melechovský

,

,

,

Dorien Herremans

CoRR, 2024

CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

HYPERTTS: Parameter Efficient Adaptation in Text to Speech Using Hypernetworks.

[BibT_eX]

[DOI]

,

Rishabh Bhardwaj

,

,

,

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

A review of deep learning techniques for speech processing.

[BibT_eX]

[DOI]

,

Navonil Majumder

,

Rishabh Bharadwaj

,

,

Inf. Fusion, November, 2023

Towards lifelong human assisted speaker diarization.

[BibT_eX]

[DOI]

,

Anthony Larcher

,

,

Sylvain Meignier

,

Yevhenii Prokopalo

,

,

,

Simon Petitrenaud

,

Olivier Galibert

,

,

,

Sébastien Marcel

,

Marta R. Costa-jussà

Comput. Speech Lang., 2023

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model.

[BibT_eX]

[DOI]

Deepanway Ghosal

,

Navonil Majumder

,

,

CoRR, 2023

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding.

[BibT_eX]

[DOI]

,

,

,

Rishabh Bhardwaj

,

,

Navonil Majumder

,

,

CoRR, 2023

Text-to-Audio Generation using Instruction Guided Latent Diffusion Model.

[BibT_eX]

[DOI]

Deepanway Ghosal

,

Navonil Majumder

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation.

[BibT_eX]

[DOI]

,

Abhinav Ramesh Kashyap

,

,

Navonil Majumder

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding.

[BibT_eX]

[DOI]

,

,

Rishabh Bhardwaj

,

Navonil Majumder

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder.

[BibT_eX]

[DOI]

Jan Melechovský

,

,

,

Dorien Herremans

CoRR, 2022

Learning Accent Representation with Multi-Level VAE Towards Controllable Speech Synthesis.

[BibT_eX]

[DOI]

Jan Melechovský

,

,

Dorien Herremans

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

2021

The LIUM Human Active Correction Platform for Speaker Diarization.

[BibT_eX]

[DOI]

Alexandre Flucha

,

Anthony Larcher

,

,

Sylvain Meignier

,

,

,

Yevhenii Prokopalo

,

Adrien Puertolas

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speaker Embeddings for Diarization of Broadcast Data In The Allies Challenge.

[BibT_eX]

[DOI]

Anthony Larcher

,

,

,

Sylvain Meignier

,

,

,

Olivier Galibert

,

Nicholas W. D. Evans

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Egocentric Analysis of Dash-Cam Videos for Vehicle Forensics.

[BibT_eX]

[DOI]

,

,

,

A. Venkata Subramanyam

,

Mohan S. Kankanhalli

IEEE Trans. Circuits Syst. Video Technol., 2020

2019

Joint Spatial and Discrete Cosine Transform Domain-Based Counter Forensics for Adaptive Contrast Enhancement.

[BibT_eX]

[DOI]

,

A. Venkata Subramanyam

,

IEEE Access, 2019

2018

Robust PRNU estimation from probabilistic raw measurements.

[BibT_eX]

[DOI]

,

A. Venkata Subramanyam

,

Signal Process. Image Commun., 2018

2017

Multimedia signatures for vehicle forensics.

[BibT_eX]

[DOI]

,

A. Venkata Subramanyam

,

Mohan S. Kankanhalli

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016

Sensor Pattern Noise Estimation Using Probabilistically Estimated RAW Values.

[BibT_eX]

[DOI]

,

A. Venkata Subramanyam

,

IEEE Signal Process. Lett., 2016

Anti-forensic technique for median filtering using L1-L2 TV model.

[BibT_eX]

[DOI]

,

A. Venkata Subramanyam

,

,

,

Proceedings of the IEEE International Workshop on Information Forensics and Security, 2016

Loading...