Hongyuan Zhu

Orcid: 0000-0001-5177-8320

According to our database1, Hongyuan Zhu authored at least 95 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2024

Learning Student Network Under Universal Label Noise.
IEEE Trans. Image Process., 2024

Blessing few-shot segmentation via semi-supervised learning with noisy support images.
Pattern Recognit., 2024

Revisiting 3D visual grounding with Context-aware Feature Aggregation.
Neurocomputing, 2024

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image.
CoRR, 2024

Efficient Multi-modal Human-Centric Contrastive Pre-training with a Pseudo Body-Structured Prior.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Robust Variational Contrastive Learning for Partially View-unaligned Clustering.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

G-Former: A Grouping Transformer for Weakly Supervised Point Cloud Segmentation.
Proceedings of the International Joint Conference on Neural Networks, 2024

HCMA'24: The 5th International Workshop on Human-centric Multimedia Analysis Summary.
Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

Direct Distillation Between Different Domains.
Proceedings of the Computer Vision - ECCV 2024, 2024

M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions.
Proceedings of the Computer Vision - ECCV 2024, 2024

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Contributing Dimension Structure of Deep Feature for Coreset Selection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

PrefAce: Face-Centric Pretraining with Self-Structure Aware Distillation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Closer Look at Video Sampling for Sequential Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Unsupervised Contrastive Cross-Modal Hashing.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

A Closer Look at Few-Shot 3D Point Cloud Classification.
Int. J. Comput. Vis., March, 2023

Dual-Stream Contrastive Learning for Channel State Information Based Human Activity Recognition.
IEEE J. Biomed. Health Informatics, 2023

LPCL: Localized prominence contrastive learning for self-supervised dense visual pre-training.
Pattern Recognit., 2023

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts.
CoRR, 2023

Exploit the antenna response consistency to define the alignment criteria for CSI data.
CoRR, 2023

Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study.
CoRR, 2023

An Overview of Challenges in Egocentric Text-Video Retrieval.
CoRR, 2023

Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
CoRR, 2023

HCMA '23: 4th International Workshop on Human-Centric Multimedia Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

ROAD: Robust Unsupervised Domain Adaptation with Noisy Labels.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Language Models can do Zero-Shot Visual Referring Expression Comprehension.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Semi-Supervised Few-Shot Segmentation with Noisy Support Images.
Proceedings of the IEEE International Conference on Image Processing, 2023

Zero-Shot Point Cloud Segmentation by Semantic-Visual Aware Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Image Super Resolution from Long-Tailed Distribution Learning Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

End-to-End 3D Dense Captioning with Vote2Cap-DETR.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
A Survey of Embodied AI: From Simulators to Research Tasks.
IEEE Trans. Emerg. Top. Comput. Intell., 2022

Deep Semisupervised Multiview Learning With Increasing Views.
IEEE Trans. Cybern., 2022

Hierarchical Point Cloud Encoding and Decoding With Lightweight Self-Attention Based Model.
IEEE Robotics Autom. Lett., 2022

Locality-Aware Crowd Counting.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Point Cloud Instance Segmentation With Semi-Supervised Bounding-Box Mining.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

XAI Beyond Classification: Interpretable Neural Clustering.
J. Mach. Learn. Res., 2022

Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.
CoRR, 2022

RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval.
CoRR, 2022

What Makes for Effective Few-shot Point Cloud Classification?
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

HCMA'22: 3rd International Workshop on Human-Centric Multimedia Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Deep Spectral Representation Learning From Multi-View Data.
IEEE Trans. Image Process., 2021

Single-Image Dehazing via Compositional Adversarial Network.
IEEE Trans. Cybern., 2021

Joint Versus Independent Multiview Hashing for Cross-View Retrieval.
IEEE Trans. Cybern., 2021

Cross-modal discriminant adversarial network.
Pattern Recognit., 2021

A comprehensive survey of procedural video datasets.
Comput. Vis. Image Underst., 2021

MusicBERT: A Self-supervised Learning of Music Representation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Semantic Role Aware Correlation Transformer For Text To Video Retrieval.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Spcr: semi-supervised point cloud instance segmentation with perturbation consistency regularization.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

A Diagnostic Study Of Visual Question Answering With Analogical Reasoning.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Learning Cross-Modal Retrieval With Noisy Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

OPQ: Compressing Deep Neural Networks with One-shot Pruning-Quantization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Deep Clustering With Sample-Assignment Invariance Prior.
IEEE Trans. Neural Networks Learn. Syst., 2020

Pop Music Generation: From Melody to Multi-style Arrangement.
ACM Trans. Knowl. Discov. Data, 2020

Zero-Shot Image Dehazing.
IEEE Trans. Image Process., 2020

Combining Faster R-CNN and Model-Driven Clustering for Elongated Object Detection.
IEEE Trans. Image Process., 2020

Holistic Multi-Modal Memory Network for Movie Question Answering.
IEEE Trans. Image Process., 2020

Improving Night-Time Pedestrian Retrieval With Distribution Alignment and Contextual Distance.
IEEE Trans. Ind. Informatics, 2020

A novel hybrid approach for crack detection.
Pattern Recognit., 2020

Partition level multiview subspace clustering.
Neural Networks, 2020

6D Pose Estimation with Correlation Fusion.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Semi-Supervised Multi-Modal Learning with Balanced Spectral Decomposition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
AnomalyNet: An Anomaly Detection Network for Video Surveillance.
IEEE Trans. Inf. Forensics Secur., 2019

Multiple Marginal Fisher Analysis.
IEEE Trans. Ind. Electron., 2019

Clustering with similarity preserving.
Neurocomputing, 2019

6D Pose Estimation with Correlation Fusion.
CoRR, 2019

Efficient Robotic Task Generalization Using Deep Model Fusion Reinforcement Learning.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

Cross-channel Communication Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Multi-view Spectral Clustering Network.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

COMIC: Multi-view Clustering Without Parameter Selection.
Proceedings of the 36th International Conference on Machine Learning, 2019

Spatial Fusion GAN for Image Synthesis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dual Adversarial Neural Transfer for Low-Resource Named Entity Recognition.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Singe Image Rain Removal with Unpaired Information: A Differentiable Programming Perspective.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
k-meansNet: When k-means Meets Differentiable Programming.
CoRR, 2018

XiaoIce Band: A Melody and Arrangement Generation Framework for Pop Music.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

DehazeGAN: When Image Dehazing Meets Differential Programming.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
YoTube: Searching Action Proposal via Recurrent and Static Regression Networks.
CoRR, 2017

Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text.
CoRR, 2017

Search video action proposal with recurrent and static YOLO.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Automatic visual impairment detection system for age-related eye diseases through gaze analysis.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

2016
Multiple Human Identification and Cosegmentation: A Human-Oriented CRF Approach With Poselets.
IEEE Trans. Multim., 2016

Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation.
J. Vis. Commun. Image Represent., 2016

Shape based co-segmentation repairing by segment evaluation and object proposals.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Semantic image segmentation and cosegmentation
PhD thesis, 2015

Diagnosing state-of-the-art object proposal methods.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Multiple foreground recognition and cosegmentation: An object-oriented CRF model with robust higher-order potentials.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Poselet-based multiple human identification and cosegmentation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Object-Level Image Segmentation Using Low Level Cues.
IEEE Trans. Image Process., 2013

Multi-class Cosegmentation with Pairwise Active Learning.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Salient object cutout using Google images.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013


  Loading...