Yuan Gong

Jin Yu

Proceedings of the IEEE International Conference on Acoustics, 2022

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Detecting Dementia from Long Neuropsychological Interviews.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

SSAST: Self-Supervised Audio Spectrogram Transformer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation.

[BibT_eX]

[DOI]

Yu-An Chung

IEEE ACM Trans. Audio Speech Lang. Process., 2021

PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation.

[BibT_eX]

[DOI]

Yu-An Chung

CoRR, 2021

AST: Audio Spectrogram Transformer.

[BibT_eX]

[DOI]

Yu-An Chung

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems.

[BibT_eX]

[DOI]

Dataset, June, 2020

Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method.

[BibT_eX]

[DOI]

Jian Yang

IEEE Signal Process. Lett., 2020

2019

Second-order Non-local Attention Networks for Person Re-identification.

[BibT_eX]

[DOI]

Bryan (Ning) Xia

Yizhe Zhang

CoRR, 2019

ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Real-Time Adversarial Attacks.

[BibT_eX]

[DOI]

Boyang Li

Yiyu Shi

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Second-Order Non-Local Attention Networks for Person Re-Identification.

[BibT_eX]

[DOI]

Bryan Bryan

Yizhe Zhang

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

Towards Learning Fine-Grained Disentangled Representations from Speech.

[BibT_eX]

[DOI]

CoRR, 2018

An Overview of Vulnerabilities of Voice Controlled Systems.

[BibT_eX]

[DOI]

CoRR, 2018

Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Protecting Voice Controlled Systems Using Sound Source Identification Based on Acoustic Cues.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computer Communication and Networks, 2018

Automatic Autism Spectrum Disorder Detection Using Everyday Vocalizations Captured by Smart Devices.

[BibT_eX]

[DOI]

Hasini Yatawatte

Sandra L. Schneider

Susan Latham

Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

Improving LIWC Using Soft Word Matching.

[BibT_eX]

[DOI]

Kevin Shin

Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

2017

Crafting Adversarial Examples For Speech Paralinguistics Applications.

[BibT_eX]

[DOI]

CoRR, 2017

Topic Modeling Based Multi-modal Depression Detection.

[BibT_eX]

[DOI]

Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017

Continuous Assessment of Children's Emotional States Using Acoustic Analysis.

[BibT_eX]

[DOI]