Jagadeesh Balam

According to our database¹, Jagadeesh Balam authored at least 39 papers between 2006 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts.

[BibT_eX]

[DOI]

CoRR, 2024

Anticipating Future with Large Language Model for Simultaneous Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2024

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, 2024

Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data.

[BibT_eX]

[DOI]

CoRR, 2024

EMMeTT: Efficient Multimodal Machine Translation Training.

[BibT_eX]

[DOI]

CoRR, 2024

META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR.

[BibT_eX]

[DOI]

CoRR, 2024

Chain-of-Thought Prompting for Speech Translation.

[BibT_eX]

[DOI]

CoRR, 2024

Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens.

[BibT_eX]

[DOI]

CoRR, 2024

NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks.

[BibT_eX]

[DOI]

CoRR, 2024

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations.

[BibT_eX]

[DOI]

CoRR, 2024

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data.

[BibT_eX]

[DOI]

CoRR, 2024

Instruction Data Generation and Unsupervised Adaptation for Speech Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines For Speech Recognition, Speaker Tagging, and Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Longer is (Not Necessarily) Stronger: Punctuated Long-Sequence Training for Enhanced Speech Recognition and Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Bestow: Efficient and Streamable Speech Language Model with The Best of Two Worlds in GPT and T5.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Stateful Conformer with Cache-Based Inference for Streaming Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Investigating End-to-End ASR Architectures for Long Form Audio Transcription.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System.

[BibT_eX]

[DOI]

CoRR, 2023

Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Flexible Multichannel Speech Enhancement for Noise-Robust Frontend.

[BibT_eX]

[DOI]

Ante Jukic

Jagadeesh Balam

Boris Ginsburg

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

A Compact End-to-End Model with Local and Global Context for Spoken Language Identification.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling.

[BibT_eX]

[DOI]

He Huang

Jagadeesh Balam

Boris Ginsburg

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Fast Conformer With Linearly Scalable Attention For Efficient Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

AmberNet: A Compact End-to-End Model for Spoken Language Identification.

[BibT_eX]

[DOI]

CoRR, 2022

NeMo Open Source Speaker Diarization System.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-scale Speaker Diarization with Dynamic Scale Weighting.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

CarneliNet: Neural Mixture Model for Automatic Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

SPGISpeech: 5, 000 Hours of Transcribed Financial Audio for Fully Formatted End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2008

A Transcoding-Free Multiple Description Coder for Voice over Mobile Ad-Hoc Networks.

[BibT_eX]

[DOI]

Jagadeesh Balam

Jerry D. Gibson

Proceedings of the WCNC 2008, IEEE Wireless Communications & Networking Conference, March 31 2008, 2008

2007

Multiple Descriptions and Path Diversity for Voice Communications Over Wireless Mesh Networks.

[BibT_eX]

[DOI]

Jagadeesh Balam

Jerry D. Gibson

IEEE Trans. Multim., 2007

Two-Hop Two-Path Voice Communications Over a Mobile Ad-Hoc Network.

[BibT_eX]

[DOI]

Jagadeesh Balam

Jerry D. Gibson

Proceedings of the Global Communications Conference, 2007

2006

Multiple descriptions and path diversity using the AMR-WB speech codec for voice communication over MANETs.

[BibT_eX]

[DOI]

Jagadeesh Balam

Jerry D. Gibson

Proceedings of the International Conference on Wireless Communications and Mobile Computing, 2006

Jagadeesh Balam

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...