Siddhant Arora

Orcid: 0000-0003-0375-496X

According to our database1, Siddhant Arora authored at least 50 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens.
CoRR, 2024

Task Arithmetic for Language Expansion in Speech Translation.
CoRR, 2024

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration.
CoRR, 2024

Decoder-only Architecture for Streaming End-to-end Speech Recognition.
CoRR, 2024

Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting.
CoRR, 2024

Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model.
CoRR, 2024

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.
CoRR, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024

UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024

Phoneme-Aware Encoding for Prefix-Tree-Based Contextual ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

Semi-Autoregressive Streaming ASR with Label Context.
Proceedings of the IEEE International Conference on Acoustics, 2024

Creation and Analysis of an International Corpus of Privacy Laws.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.
CoRR, 2023

Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech.
CoRR, 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation.
CoRR, 2023

CMU's IWSLT 2023 Simultaneous Speech Translation System.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Tensor decomposition for minimization of E2E SLU model toward on-device processing.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

BASS: Block-wise Adaptation for Speech Summarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Joint Speech Recognition and Disfluency Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Joint Modelling of Spoken Language Understanding Tasks with Integrated Dialog History.
Proceedings of the IEEE International Conference on Acoustics, 2023

Teaching Old DB Neu(ral) Tricks: Learning Embeddings on Multi-tabular Databases.
Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Creation and Analysis of an International Corpus of Privacy Laws.
CoRR, 2022

A Study on the Integration of Pre-Trained SSL, ASR, LM and SLU Models for Spoken Language Understanding.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Tale of Two Regulatory Regimes: Creation and Analysis of a Bilingual Privacy Policy Corpus.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Two-Pass Low Latency End-to-End Spoken Language Understanding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet.
Proceedings of the IEEE International Conference on Acoustics, 2022

BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
BERT Meets Relational DB: Contextual Representations of Relational Databases.
CoRR, 2021

Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
A Survey on Graph Neural Networks for Knowledge Graph Completion.
CoRR, 2020

On Embeddings in Relational Databases.
CoRR, 2020

Capreolus: A Toolkit for End-to-End Neural Ad Hoc Retrieval.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

IterefinE: Iterative KG Refinement Embeddings using Symbolic Knowledge.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

2019
Understanding Community Rivalry on Social Media: A Case Study of Two Footballing Giants.
Proceedings of the Joint Proceedings of the ACM IUI 2019 Workshops co-located with the 24th ACM Conference on Intelligent User Interfaces (ACM IUI 2019), 2019

Investigating Retrieval Method Selection with Axiomatic Features.
Proceedings of the 1st Interdisciplinary Workshop on Algorithm Selection and Meta-Learning in Information Retrieval co-located with the 41st European Conference on Information Retrieval (ECIR 2019), 2019

2018
A Naive Deep Nets Based Approach for Authenticating Viral Textual Content on Social Media.
Proceedings of the Intelligent Systems and Applications, 2018


  Loading...