Bin Zhao

Pengfei Han

IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Edge-Aware Network for Flow-Based Video Frame Interpolation.

IEEE Trans. Neural Networks Learn. Syst., January, 2024

Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning.

Artif. Intell., January, 2024

Low-Light Image Enhancement With SAM-Based Structure Priors and Guidance.

Guanlin Li

IEEE Trans. Multim., 2024

Progressive Feature Interleaved Fusion Network for Remote-Sensing Image Salient Object Detection.

Pengfei Han

IEEE Trans. Geosci. Remote. Sens., 2024

Image harmonization with Simple Hybrid CNN-Transformer Network.

Guanlin Li

Neural Networks, 2024

Motion-Aware Video Frame Interpolation.

Neural Networks, 2024

FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives.

CoRR, 2024

Towards Flexible and Efficient Diffusion Low Light Enhancer.

CoRR, 2024

Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning.

CoRR, 2024

COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models.

CoRR, 2024

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding.

CoRR, 2024

Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection.

CoRR, 2024

KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance.

CoRR, 2024

Lensless fiber endomicroscopic phase imaging with speckle-conditioned diffusion model.

CoRR, 2024

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control.

CoRR, 2024

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models.

CoRR, 2024

Learning Manipulation by Predicting Interaction.

CoRR, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.

CoRR, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding.

CoRR, 2024

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.

CoRR, 2024

Optics-driven drone.

Sci. China Inf. Sci., 2024

TAS: Personalized Text-guided Audio Spatialization.

Zhaojian Li

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

A Coarse-to-Fine Reconstruction Framework for Non-Lambertian Photometric Stereo.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Any2Point: Empowering Any-Modality Large Models for Efficient 3D Understanding.

Computer Vision - ECCV 2024, 2024

GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Implicit Event-RGBD Neural SLAM.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cyclic Learning for Binaural Audio Generation and Localization.

Zhaojian Li

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Color Event Enhanced Single-Exposure HDR Imaging.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

AudioVisual Video Summarization.

Maoguo Gong

IEEE Trans. Neural Networks Learn. Syst., August, 2023

Edge-Guided Remote-Sensing Image Compression.

Pengfei Han

IEEE Trans. Geosci. Remote. Sens., 2023

Calibration-free quantitative phase imaging in multi-core fiber endoscopes using end-to-end deep learning.

Nektarios Koukourakis

Jürgen W. Czarske

CoRR, 2023

Implicit Event-RGBD Neural SLAM.

CoRR, 2023

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.

CoRR, 2023

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.

CoRR, 2023

Disentangled Contrastive Image Translation for Nighttime Surveillance.

Guanzhou Lan

CoRR, 2023

On the Value of Myopic Behavior in Policy Reuse.

CoRR, 2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance.

CoRR, 2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bio-Inspired Audiovisual Multi-Representation Integration via Self-Supervised Learning.

Zhaojian Li

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.

Proceedings of the International Conference on Machine Learning, 2023

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fully Self-Supervised Depth Estimation from Defocus Clue.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Affordance-Driven Next-Best-View Planning for Robotic Grasping.

Proceedings of the Conference on Robot Learning, 2023

2022

Video Crowd Localization With Multifocus Gaussian Neighborhood Attention and a Large-Scale Benchmark.

IEEE Trans. Image Process., 2022

Semantics-Consistent Representation Learning for Remote Sensing Image-Voice Retrieval.

Hailong Ning

IEEE Trans. Geosci. Remote. Sens., 2022

Low-Light Hyperspectral Image Enhancement.

Guanlin Li

IEEE Trans. Geosci. Remote. Sens., 2022

Reconstructive Sequence-Graph Network for Video Summarization.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Audio-visual collaborative representation learning for Dynamic Saliency Prediction.

Knowl. Based Syst., 2022

Hierarchical multimodal transformer to summarize videos.

Maoguo Gong

Neurocomputing, 2022

Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

TTH-RNN: Tensor-Train Hierarchical Recurrent Neural Network for Video Summarization.

IEEE Trans. Ind. Electron., 2021

Bio-Inspired Audio-Visual Cues Integration for Visual Attention Prediction.

Hailong Ning

CoRR, 2021

Video Crowd Localization with Multi-focus Gaussian Neighbor Attention and a Large-Scale Benchmark.

CoRR, 2021

EA-Net: Edge-Aware Network for Flow-based Video Frame Interpolation.

CoRR, 2021

Weather GAN: Multi-Domain Weather Translation Using Generative Adversarial Networks.

Kai Kou

CoRR, 2021

2020

Property-Constrained Dual Learning for Video Summarization.

IEEE Trans. Neural Networks Learn. Syst., 2020

2019

CAM-RNN: Co-Attention Model Based RNN for Video Captioning.

IEEE Trans. Image Process., 2019

Weather recognition via classification labels and weather-cue maps.

Pattern Recognit., 2019

C^3 Framework: An Open-source PyTorch Code for Crowd Counting.

CoRR, 2019

2018

Key Frame Extraction in the Summary Space.

IEEE Trans. Cybern., 2018

A CNN-RNN architecture for multi-label weather recognition.

Neurocomputing, 2018

Video Captioning with Tube Features.

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

A General Framework for Edited Video and Raw Video Summarization.

IEEE Trans. Image Process., 2017

Hierarchical Recurrent Neural Network for Video Summarization.

Proceedings of the 2017 ACM on Multimedia Conference, 2017

MAM-RNN: Multi-level Attention Model Based RNN for Video Captioning.
