William Chen

This is a disambiguation page; it contains multiple papers by persons with the same or a similar name.

Bibliography

2024
Indoor and Outdoor 3D Scene Graph Generation Via Language-Enabled Spatial Ontologies.
IEEE Robotics Autom. Lett., June, 2024

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech.
CoRR, 2024

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration.
CoRR, 2024

CMU's IWSLT 2024 Simultaneous Speech Translation System.
CoRR, 2024

Robotic Control via Embodied Chain-of-Thought Reasoning.
CoRR, 2024

Nollywood: Let's Go to the Movies!
CoRR, 2024

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models.
CoRR, 2024

ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets.
CoRR, 2024

Vision-Language Models Provide Promptable Representations for Reinforcement Learning.
CoRR, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024

AugSumm: towards generalizable speech summarization using synthetic labels from large language model.
CoRR, 2024

AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Train Long and Test Long: Leveraging Full Document Contexts in Speech Processing.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Robust Speech Representation Learning for Thousands of Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Evaluating Self-Supervised Speech Representations for Indigenous American Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond.
CoRR, 2023

EFFUSE: Efficient Self-Supervised Feature Fusion for E2E ASR in Multilingual and Low Resource Scenarios.
CoRR, 2023

LaMPP: Language Models as Probabilistic Priors for Perception and Action.
CoRR, 2023

CMU's IWSLT 2023 Simultaneous Speech Translation System.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

QUESPA Submission for the IWSLT 2023 Dialect and Low-resource Speech Translation Tasks.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improving Massively Multilingual ASR with Auxiliary CTC Objectives.
Proceedings of the IEEE International Conference on Acoustics, 2023

Poster: Mujaz: A Summarization-based Approach for Normalized Vulnerability Description.
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security, 2023

Findings of the 2023 ML-Superb Challenge: Pre-Training And Evaluation Over More Languages And Beyond.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Yodas: Youtube-Oriented Dataset for Audio and Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Summarize While Translating: Universal Model With Parallel Decoding for Summarization and Translation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Joint Prediction and Denoising for Large-Scale Multilingual Self-Supervised Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Leveraging Large Language Models for Robot 3D Scene Understanding.
CoRR, 2022

Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding.
CoRR, 2022

The Colour of Horror.
Proceedings of the European Conference on Visual Media Production, 2022

Benchmarking Azerbaijani Neural Machine Translation.
Proceedings of the ALTNLP The International Conference and workshop on Agglutinative Language Technologies as a challenge of Natural Language Processing, 2022

2021
Genetic Algorithms For Extractive Summarization.
CoRR, 2021

In silico model for miRNA-mediated regulatory network in cancer.
Briefings Bioinform., 2021

Analysis of Negative Electricity Price to Identify Demand Management Opportunity for Consumers in Renewable-rich Power Systems.
Proceedings of the 2021 IEEE PES Innovative Smart Grid Technologies, 2021

Longitudinal Data of Cancer Patients with Prior Mental Health Diagnoses Show Differences in Demographics, Emergency Visits, and Suicidality Rates.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

2020
Weakly Supervised Deep Learning for Segmentation of Remote Sensing Imagery.
Remote. Sens., 2020

Audrey: A Personalized Open-Domain Conversational Bot.
CoRR, 2020

An automatic approach to establish clinically desired final dental occlusion for one-piece maxillary orthognathic surgery.
Int. J. Comput. Assist. Radiol. Surg., 2020

2018
Some Results on Tight Stationarity, University of California, Los Angeles, USA, 2016. Supervised by Itay Neeman.
Bull. Symb. Log., 2018

2017
Synergistic drug combinations from electronic health records and gene expression.
J. Am. Medical Informatics Assoc., 2017

2015
Tight stationarity and tree-like scales.
Ann. Pure Appl. Log., 2015

Square principles with tail-end agreement.
Arch. Math. Log., 2015

2014
Osiris: accessible and reproducible phylogenetic and phylogenomic analyses within the Galaxy workflow management system.
BMC Bioinform., 2014

Analyzing Abstract Factory and Strategy Design Patterns using Design Structure Matrix: A Role-Playing Game Case.
Proceedings of the Intelligent Systems and Applications, 2014

Minimizing expected loss for risk-avoiding reinforcement learning.
Proceedings of the International Conference on Data Science and Advanced Analytics, 2014

2010
An analogue of the Gallai-Edmonds Structure Theorem for non-zero roots of the matching polynomial.
J. Comb. Theory B, 2010

2006
Visualization of Remote Hyperspectral Image Data Using Google Earth.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2006

2002
Fast and memory efficient algorithm for DCT-domain inverse motion compensation of low bitrate video.
Proceedings of the 14th International Conference on Digital Signal Processing, 2002

Design of an Auxiliary Power Distribution Network for an Electric Vehicle.
Proceedings of the 1st IEEE International Workshop on Electronic Design, 2002

1991
3-D camera calibration using vanishing point concept.
Pattern Recognit., 1991

