Yang Zhang

Affiliations:
  • MIT-IBM Watson AI Lab, Cambridge, MA, USA
  • University of Illinois at Urbana-Champaign, Urbana, IL, USA (former)


According to our database, Yang Zhang authored at least 79 papers between 2012 and 2024.

Bibliography

2024
VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs.
CoRR, 2024

Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference.
CoRR, 2024

Towards Unsupervised Speech Recognition Without Pronunciation Models.
CoRR, 2024

A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation.
CoRR, 2024

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing.
CoRR, 2024

Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion.
CoRR, 2024

Advancing the Robustness of Large Language Models through Self-Denoised Smoothing.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Speech Self-Supervised Learning Using Diffusion Model Synthetic Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Correcting Diffusion Generation Through Resampling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Certified Robustness for Large Language Models with Self-Denoising.
CoRR, 2023

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning.
CoRR, 2023

Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models.
Proceedings of the International Conference on Machine Learning, 2023

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning.
Proceedings of the International Conference on Machine Learning, 2023

PromptBoosting: Black-Box Text Classification with Ten Forward Passes.
Proceedings of the International Conference on Machine Learning, 2023

TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Audio-Visual Neural Syntax Acquisition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Domain Generalization for Language-Independent Automatic Speech Recognition.
Frontiers Artif. Intell., 2022

Improving Self-Supervised Speech Representations by Disentangling Speakers.
CoRR, 2022

SpeechSplit 2.0: Unsupervised Speech Disentanglement for Voice Conversion without Tuning Autoencoder Bottlenecks.
CoRR, 2022

Topogivity: A Machine-Learned Chemical Rule for Discovering Topological Materials.
CoRR, 2022

Fairness Reprogramming.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers.
Proceedings of the International Conference on Machine Learning, 2022

Data-Efficient Double-Win Lottery Tickets from Robust Pre-training.
Proceedings of the International Conference on Machine Learning, 2022

Linking Emergent and Natural Languages via Corpus Transfer.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Adversarial Support Alignment.
Proceedings of the Tenth International Conference on Learning Representations, 2022

On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

SpeechSplit2.0: Unsupervised Speech Disentanglement for Voice Conversion without Tuning Autoencoder Bottlenecks.
Proceedings of the IEEE International Conference on Acoustics, 2022

Knowledge Graph Guided Simultaneous Forecasting and Network Learning for Multivariate Financial Time Series.
Proceedings of the 3rd ACM International Conference on AI in Finance, 2022

An Adversarial Framework for Generating Unseen Images by Activation Maximization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Global Rhythm Style Transfer Without Text Transcriptions.
CoRR, 2021

Understanding Interlocking Dynamics of Cooperative Rationalization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Speech Denoising with Auditory Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Global Prosody Style Transfer Without Text Transcriptions.
Proceedings of the 38th International Conference on Machine Learning, 2021

Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators.
Proceedings of the 38th International Conference on Machine Learning, 2021

SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Continuous CNN for Nonuniform Time Series.
Proceedings of the IEEE International Conference on Acoustics, 2021

Probabilistic framework for modeling event shocks to financial time series.
Proceedings of the ICAIF'21: 2nd ACM International Conference on AI in Finance, Virtual Event, November 3, 2021

The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Generating Visually Aligned Sound From Videos.
IEEE Trans. Image Process., 2020

Deep Network Perceptual Losses for Speech Denoising.
CoRR, 2020

The Lottery Ticket Hypothesis for Pre-trained BERT Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Unsupervised Speech Decomposition via Triple Information Bottleneck.
Proceedings of the 37th International Conference on Machine Learning, 2020

Invariant Rationalization.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
An Efficient and Margin-Approaching Zero-Confidence Adversarial Attack.
CoRR, 2019

Zero-Shot Voice Style Transfer with Only Autoencoder Loss.
CoRR, 2019

A Game Theoretic Approach to Class-wise Selective Rationalization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss.
Proceedings of the 36th International Conference on Machine Learning, 2019

Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Grounding Spoken Words in Unlabeled Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Deep Learning Based Speech Beamforming.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Geometry-Aware Traffic Flow Analysis by Detection and Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Application of generative models in speech processing tasks.
PhD thesis, 2017

A multidisciplinary approach to designing and evaluating Electronic Medical Record portal messages that support patient self-care.
J. Biomed. Informatics, 2017

Streaming Recommender Systems.
Proceedings of the 26th International Conference on World Wide Web, 2017

Dilated Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Glottal Model Based Speech Beamforming for ad-hoc Microphone Arrays.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Speech Enhancement Using Bayesian Wavenet.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Fast Generation for Convolutional Autoregressive Models.
Proceedings of the 5th International Conference on Learning Representations, 2017



2016
Fast Wavenet Generation Algorithm.
CoRR, 2016

Use of particle filtering and MCMC for inference in Probabilistic Acoustic Tube model.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2016

Positive-Unlabeled Learning in Streaming Networks.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

2015
Incorporating AM-FM effect in voiced speech for probabilistic acoustic tube model.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Multichannel transient acoustic signal classification using task-driven dictionary with joint sparsity and beamforming.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
An iterative approach to decision tree training for context dependent speech synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Improvement of Probabilistic Acoustic Tube model for speech decomposition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2012
Probabilistic acoustic tube: a probabilistic generative model of speech for speech analysis/synthesis.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

