Peidong Wang

Orcid: 0000-0002-7042-0209

According to our database1, Peidong Wang authored at least 41 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation.
CoRR, 2024

Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation.
CoRR, 2024

Variational Optimization for Quantum Problems using Deep Generative Networks.
CoRR, 2024

Diarist: Streaming Speech Translation with Speaker Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024

TIGER: A Unified Generative Model Framework for Multimodal Dialogue Response Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

STICKERCONV: Generating Multimodal Empathetic Responses from Scratch.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

DATA2VEC-SG: Improving Self-Supervised Learning Representations for Speech Generation Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Ambient Formaldehyde over the United States from Ground-Based (AQS) and Satellite (OMI) Observations.
Remote. Sens., 2022

LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers.
CoRR, 2022

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition.
CoRR, 2022

Large-Scale Streaming End-to-End Speech Translation with Neural Transducers.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Continuous Speech Separation with Recurrent Selective Attention Network.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Speaker Separation Using Speaker Inventories and Estimated Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Predicting Atlantic Multidecadal Variability.
CoRR, 2021

Multitask Training with Text Data for End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
A Novel Virtual Space Vector Modulation With Reduced Common-Mode Voltage and Eliminated Neutral Point Voltage Oscillation for Neutral Point Clamped Three-Level Inverter.
IEEE Trans. Ind. Electron., 2020

Complex Spectral Mapping for Single- and Multi-Channel Speech Enhancement and Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Bridging the Gap Between Monaural Speech Enhancement and Recognition With Distortion-Independent Acoustic Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speaker Separation.
CoRR, 2020

2019
Enhanced Spectral Features for Distortion-Independent Acoustic Modeling.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Large Margin Training for Attention Based End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Token-wise Training for Attention Based End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Speech Recognition Error Prediction for Modern and Off-the-shelf Speech Recognizers.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Separation Using Speaker Inventory.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Improving Attention-Based End-to-End ASR Systems with Sequence-Based Loss Functions.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Filter-and-Convolve: A Cnn Based Multichannel Complex Concatenation Acoustic Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Utterance-Wise Recurrent Dropout and Iterative Speaker Adaptation for Robust Monaural Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Second-Order Average Consensus with Buffer Design in Multi-agent System with Time-Varying Delay.
Proceedings of the Intelligent Computing, Networked Control, and Their Engineering Applications, 2017

2013
Path Recognition for Agricultural Robot Vision Navigation under Weed Environment.
Proceedings of the Computer and Computing Technologies in Agriculture VII, 2013

2009
HIT2Lab at CLEF: an Exploration of the TEL Task.
Proceedings of the Working Notes for CLEF 2009 Workshop co-located with the 13th European Conference on Digital Libraries (ECDL 2009) , Corfù, Greece, September 30, 2009

2008
Using Google Translation in Cross-Lingual Information Retrieval.
Proceedings of the 7th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2008


  Loading...