Jiahao Pan

Orcid: 0009-0007-7370-2467

According to our database1, Jiahao Pan authored at least 17 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DS-HyFA-Net: A Deeply Supervised Hybrid Feature Aggregation Network With Multiencoders for Change Detection in High-Resolution Imagery.
IEEE Trans. Geosci. Remote. Sens., 2024

Wavelet Tree Transformer: Multihead Attention With Frequency-Selective Representation and Interaction for Remote Sensing Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model.
CoRR, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions.
CoRR, 2024

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling.
CoRR, 2024

CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild.
CoRR, 2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs.
CoRR, 2024

FlashSpeech: Efficient Zero-Shot Speech Synthesis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

A Scale-Temporal Interaction Network For Remote Sensing Image Change Detection And A UAV-CD Dataset.
Proceedings of the IGARSS 2024, 2024

MSFA-Net : Multiple Spatial-Channel Feature Aggregation Network for Change Detection and a UAV-CD Dataset.
Proceedings of the IGARSS 2024, 2024

Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
MA-Net: Multi-Scale Adaptive Network for Oriented Object Detection in Remote Sensing Images.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

Bridging Knowledge and Technology: Constructing a Knowledge-Driven Repository with 3D CNN for Interpreting Education.
Proceedings of the Emerging Technologies for Education - 8th International Symposium, 2023

LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Wavelet-Based Frequency-Dividing Interactive CNN for Image Classification.
Proceedings of the IEEE International Conference on Image Processing, 2023

NTIRE 2023 Challenge on Efficient Super-Resolution: Methods and Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2017
Deformable Patterned Fabric Defect Detection With Fisher Criterion-Based Deep Learning.
IEEE Trans Autom. Sci. Eng., 2017


  Loading...