We stand with Ukraine

We stand with Ukraine

Haonan Cheng

Orcid: 0000-0003-3407-4318

According to our database¹, Haonan Cheng authored at least 30 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

2017

2018

2019

2020

2021

2022

2023

2024

2025

0

5

10

1

8

2

1

2

4

7

3

1

1

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Visual primitives as words: Alignment and interaction for compositional zero-shot learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Pattern Recognit., 2025

2024

DiffuseRoll: multi-track multi-attribute music generation based on diffusion model.

[BibT_eX]

[DOI]

,

,

,

Multim. Syst., February, 2024

Domain Generalization via Aggregation and Separation for Audio Deepfake Detection.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Inf. Forensics Secur., 2024

MusicECAN: An Automatic Denoising Network for Music Recordings With Efficient Channel Attention.

[BibT_eX]

[DOI]

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Visual-guided scene-aware audio generation method based on hierarchical feature codec and rendering decision.

[BibT_eX]

[DOI]

,

,

,

Displays, 2024

Artifact feature purification for cross-domain detection of AI-generated images.

[BibT_eX]

[DOI]

,

,

,

,

Comput. Vis. Image Underst., 2024

Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion Strategy.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

FSD: An Initial Chinese Dataset for Fake Song Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially Spoofed Audio Detection.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

Binauralmusic: A Diverse Dataset for Improving Cross-Modal Binaural Audio Generation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

DNIT: Enhancing Day-Night Image-to-Image Translation through Fine-Grained Feature Handling (Student Abstract).

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

SemDM: Task-oriented masking strategy for self-supervised visual learning.

[BibT_eX]

[DOI]

,

,

Displays, September, 2023

PQG-A2SA: Performance Quantification Guided Audio-to-Score Alignment for Orchestral Music.

[BibT_eX]

[DOI]

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Behaviourally-based Synthesis of Scene-aware Footstep Sound.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Lightweight Scene-aware Rain Sound Simulation for Interactive Virtual Environments.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2023

RD-FGFS: A Rule-Data Hybrid Framework for Fine-Grained Footstep Sound Synthesis from Visual Guidance.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning A Self-Supervised Domain-Invariant Feature Representation for Generalized Audio Deepfake Detection.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MABC-Net: Multimodal Mixed Attentional Network with Balanced Class for Temporal Forgery Localization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Digital Multimedia Communications, 2023

CACEE: Computational Aesthetic Classification of Expressive Effects Based on Emotional Consistency.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 4th International Workshop on Human-centric Multimedia Analysis, 2023

Single Domain Generalization for Audio Deepfake Detection.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022

Towards an End-to-End Visual-to-Raw-Audio Generation With GAN.

[BibT_eX]

[DOI]

,

,

IEEE Trans. Circuits Syst. Video Technol., 2022

Emotional Acceptance Measure (EAM): An Objective Evaluation Method Towards Information Communication Effect.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022

Global-Local Similarity Function for Automatic Playlist Generation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Persong: Multi-Modality Driven Music Recommendation System.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022

2019

Physically-based statistical simulation of rain sound.

[BibT_eX]

[DOI]

,

,

ACM Trans. Graph., 2019

Liquid-solid interaction sound synthesis.

[BibT_eX]

[DOI]

,

Graph. Model., 2019

Haptic Force Guided Sound Synthesis in Multisensory Virtual Reality (VR) Simulation for Rigid-Fluid Interaction.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2019

2017

Efficient sound synthesis for natural scenes.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2017 IEEE Virtual Reality, 2017

Loading...