Haonan Cheng

Orcid: 0000-0003-3407-4318

According to our database1, Haonan Cheng authored at least 30 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Visual primitives as words: Alignment and interaction for compositional zero-shot learning.
Pattern Recognit., 2025

2024
DiffuseRoll: multi-track multi-attribute music generation based on diffusion model.
Multim. Syst., February, 2024

Domain Generalization via Aggregation and Separation for Audio Deepfake Detection.
IEEE Trans. Inf. Forensics Secur., 2024

MusicECAN: An Automatic Denoising Network for Music Recordings With Efficient Channel Attention.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Visual-guided scene-aware audio generation method based on hierarchical feature codec and rendering decision.
Displays, 2024

Artifact feature purification for cross-domain detection of AI-generated images.
Comput. Vis. Image Underst., 2024

Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge.
CoRR, 2024

Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion Strategy.
CoRR, 2024

The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio.
CoRR, 2024

FSD: An Initial Chinese Dataset for Fake Song Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially Spoofed Audio Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

Binauralmusic: A Diverse Dataset for Improving Cross-Modal Binaural Audio Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

DNIT: Enhancing Day-Night Image-to-Image Translation through Fine-Grained Feature Handling (Student Abstract).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
SemDM: Task-oriented masking strategy for self-supervised visual learning.
Displays, September, 2023

PQG-A2SA: Performance Quantification Guided Audio-to-Score Alignment for Orchestral Music.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Behaviourally-based Synthesis of Scene-aware Footstep Sound.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Lightweight Scene-aware Rain Sound Simulation for Interactive Virtual Environments.
Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2023

RD-FGFS: A Rule-Data Hybrid Framework for Fine-Grained Footstep Sound Synthesis from Visual Guidance.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning A Self-Supervised Domain-Invariant Feature Representation for Generalized Audio Deepfake Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MABC-Net: Multimodal Mixed Attentional Network with Balanced Class for Temporal Forgery Localization.
Proceedings of the Digital Multimedia Communications, 2023

CACEE: Computational Aesthetic Classification of Expressive Effects Based on Emotional Consistency.
Proceedings of the 4th International Workshop on Human-centric Multimedia Analysis, 2023

Single Domain Generalization for Audio Deepfake Detection.
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022
Towards an End-to-End Visual-to-Raw-Audio Generation With GAN.
IEEE Trans. Circuits Syst. Video Technol., 2022

Emotional Acceptance Measure (EAM): An Objective Evaluation Method Towards Information Communication Effect.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022

Global-Local Similarity Function for Automatic Playlist Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Persong: Multi-Modality Driven Music Recommendation System.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022

2019
Physically-based statistical simulation of rain sound.
ACM Trans. Graph., 2019

Liquid-solid interaction sound synthesis.
Graph. Model., 2019

Haptic Force Guided Sound Synthesis in Multisensory Virtual Reality (VR) Simulation for Rigid-Fluid Interaction.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2019

2017
Efficient sound synthesis for natural scenes.
Proceedings of the 2017 IEEE Virtual Reality, 2017


  Loading...