Bin Zhao

Orcid: 0000-0002-0294-8538

Affiliations:
  • Northwestern Polytechnical University, School of Artificial Intelligence, Optics and Electronics, iOPEN, Xi'an, Shaanxi, China
  • Xidian University, Academy of Advanced Interdisciplinary Research, Xi'an, China


According to our database1, Bin Zhao authored at least 78 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Weather Translation via Weather-Cue Transferring.
IEEE Trans. Neural Networks Learn. Syst., June, 2024

Vehicle Perception From Satellite.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Edge-Aware Network for Flow-Based Video Frame Interpolation.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning.
Artif. Intell., January, 2024

Low-Light Image Enhancement With SAM-Based Structure Priors and Guidance.
IEEE Trans. Multim., 2024

Progressive Feature Interleaved Fusion Network for Remote-Sensing Image Salient Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

Image harmonization with Simple Hybrid CNN-Transformer Network.
Neural Networks, 2024

Motion-Aware Video Frame Interpolation.
Neural Networks, 2024

FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives.
CoRR, 2024

Towards Flexible and Efficient Diffusion Low Light Enhancer.
CoRR, 2024

Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning.
CoRR, 2024

COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models.
CoRR, 2024

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding.
CoRR, 2024

Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection.
CoRR, 2024

KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance.
CoRR, 2024

Lensless fiber endomicroscopic phase imaging with speckle-conditioned diffusion model.
CoRR, 2024

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control.
CoRR, 2024

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models.
CoRR, 2024

Learning Manipulation by Predicting Interaction.
CoRR, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.
CoRR, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding.
CoRR, 2024

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.
CoRR, 2024

Optics-driven drone.
Sci. China Inf. Sci., 2024

TAS: Personalized Text-guided Audio Spatialization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A Coarse-to-Fine Reconstruction Framework for Non-Lambertian Photometric Stereo.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Any2Point: Empowering Any-Modality Large Models for Efficient 3D Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Implicit Event-RGBD Neural SLAM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cyclic Learning for Binaural Audio Generation and Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Color Event Enhanced Single-Exposure HDR Imaging.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
AudioVisual Video Summarization.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

Edge-Guided Remote-Sensing Image Compression.
IEEE Trans. Geosci. Remote. Sens., 2023

Calibration-free quantitative phase imaging in multi-core fiber endoscopes using end-to-end deep learning.
CoRR, 2023

Implicit Event-RGBD Neural SLAM.
CoRR, 2023

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs.
CoRR, 2023

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.
CoRR, 2023

Disentangled Contrastive Image Translation for Nighttime Surveillance.
CoRR, 2023

On the Value of Myopic Behavior in Policy Reuse.
CoRR, 2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance.
CoRR, 2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bio-Inspired Audiovisual Multi-Representation Integration via Self-Supervised Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.
Proceedings of the International Conference on Machine Learning, 2023

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fully Self-Supervised Depth Estimation from Defocus Clue.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Affordance-Driven Next-Best-View Planning for Robotic Grasping.
Proceedings of the Conference on Robot Learning, 2023

2022
Video Crowd Localization With Multifocus Gaussian Neighborhood Attention and a Large-Scale Benchmark.
IEEE Trans. Image Process., 2022

Semantics-Consistent Representation Learning for Remote Sensing Image-Voice Retrieval.
IEEE Trans. Geosci. Remote. Sens., 2022

Low-Light Hyperspectral Image Enhancement.
IEEE Trans. Geosci. Remote. Sens., 2022

Reconstructive Sequence-Graph Network for Video Summarization.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Audio-visual collaborative representation learning for Dynamic Saliency Prediction.
Knowl. Based Syst., 2022

Hierarchical multimodal transformer to summarize videos.
Neurocomputing, 2022

Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
TTH-RNN: Tensor-Train Hierarchical Recurrent Neural Network for Video Summarization.
IEEE Trans. Ind. Electron., 2021

Bio-Inspired Audio-Visual Cues Integration for Visual Attention Prediction.
CoRR, 2021

Video Crowd Localization with Multi-focus Gaussian Neighbor Attention and a Large-Scale Benchmark.
CoRR, 2021

EA-Net: Edge-Aware Network for Flow-based Video Frame Interpolation.
CoRR, 2021

Weather GAN: Multi-Domain Weather Translation Using Generative Adversarial Networks.
CoRR, 2021

2020
Property-Constrained Dual Learning for Video Summarization.
IEEE Trans. Neural Networks Learn. Syst., 2020

2019
CAM-RNN: Co-Attention Model Based RNN for Video Captioning.
IEEE Trans. Image Process., 2019

Weather recognition via classification labels and weather-cue maps.
Pattern Recognit., 2019

C^3 Framework: An Open-source PyTorch Code for Crowd Counting.
CoRR, 2019

2018
Key Frame Extraction in the Summary Space.
IEEE Trans. Cybern., 2018

A CNN-RNN architecture for multi-label weather recognition.
Neurocomputing, 2018

Video Captioning with Tube Features.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
A General Framework for Edited Video and Raw Video Summarization.
IEEE Trans. Image Process., 2017

Hierarchical Recurrent Neural Network for Video Summarization.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

MAM-RNN: Multi-level Attention Model Based RNN for Video Captioning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017


  Loading...