Yandong Guo

Orcid: 0000-0002-4594-8415

According to our database1, Yandong Guo authored at least 129 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection.
IEEE Trans. Intell. Veh., January, 2024

Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation.
IEEE Trans. Multim., 2024

Training-Free Robust Interactive Video Object Segmentation.
CoRR, 2024

RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation.
CoRR, 2024

The SkatingVerse Workshop & Challenge: Methods and Results.
CoRR, 2024

Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision.
CoRR, 2024

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Tag2Text: Guiding Vision-Language Model via Image Tagging.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Recognize Anything: A Strong Image Tagging Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTO3D: Neural Target Object 3D Reconstruction with Segment Anything.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Debiased Novel Category Discovering and Localization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
AdvFAS: A robust face anti-spoofing framework against adversarial examples.
Comput. Vis. Image Underst., October, 2023

Theme-Aware Visual Attribute Reasoning for Image Aesthetics Assessment.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

SuperFast: 200× Video Frame Interpolation via Event Camera.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Anchor-based knowledge embedding for image aesthetics assessment.
Neurocomputing, June, 2023

Improving the robustness of adversarial attacks using an affine-invariant gradient estimator.
Comput. Vis. Image Underst., March, 2023

Learning Personalized Image Aesthetics From Subjective and Objective Attributes.
IEEE Trans. Multim., 2023

Knowledge-Guided Blind Image Quality Assessment With Few Training Samples.
IEEE Trans. Multim., 2023

Grouping by Center: Predicting Centripetal Offsets for the Bottom-up Human Pose Estimation.
IEEE Trans. Multim., 2023

Explainable and Generalizable Blind Image Quality Assessment via Semantic Attribute Reasoning.
IEEE Trans. Multim., 2023

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.
CoRR, 2023

Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.
CoRR, 2023

Seeing through the Mask: Multi-task Generative Mask Decoupling Face Recognition.
CoRR, 2023

NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything.
CoRR, 2023

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.
CoRR, 2023

Recognize Anything: A Strong Image Tagging Model.
CoRR, 2023

The 3rd Anti-UAV Workshop & Challenge: Methods and Results.
CoRR, 2023

SGL: Structure Guidance Learning for Camera Localization.
CoRR, 2023

Video Object Matting via Hierarchical Space-Time Semantic Guidance.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery.
Proceedings of the 33rd Workshop on Network and Operating System Support for Digital Audio and Video, 2023

SELVO: A Semantic-Enhanced Lidar-Visual Odometry.
IROS, 2023

Data-Driven Based Cascading Orientation and Translation Estimation for Inertial Navigation.
IROS, 2023

ContrastMotion: Self-supervised Scene Motion Learning for Large-Scale LiDAR Point Clouds.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Attribute-assisted Multimodal Network for Image Aesthetics Assessment.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Mosaic Representation Learning for Self-supervised Visual Pre-training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Neural Reconstruction of Relightable Human Model from Monocular Video.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Mixed Sample Augmentation for Online Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Comprehensive Comparison of Projections in Omnidirectional Super-Resolution.
Proceedings of the IEEE International Conference on Acoustics, 2023

Ultra Real-Time Portrait Matting via Parallel Semantic Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Box-Level Active Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
BAM: a balanced attention mechanism to optimize single image super-resolution.
J. Real Time Image Process., 2022

A Survey of Face Recognition.
CoRR, 2022

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection.
CoRR, 2022

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection.
CoRR, 2022

A Real-Time Fusion Framework for Long-term Visual Localization.
CoRR, 2022

Online Distillation with Mixed Sample Augmentation.
CoRR, 2022

BANet: Motion Forecasting with Boundary Aware Network.
CoRR, 2022

SEAL: A Large-scale Video Dataset of Multi-grained Spatio-temporally Action Localization.
CoRR, 2022

Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network.
CoRR, 2022

Semantic Distillation Guided Salient Object Detection.
CoRR, 2022

SHREC'22 track: Open-Set 3D Object Retrieval.
Comput. Graph., 2022

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

FloRen: Real-time High-quality Human Performance Rendering via Appearance Flow Using Sparse RGB Cameras.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Situational Perception Guided Image Matting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Transductive Aesthetic Preference Propagation for Personalized Image Aesthetics Assessment.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022


CrossHuman: Learning Cross-guidance from Multi-frame Images for Human Reconstruction.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Pose Refinement with Joint Optimization of Visual Points and Lines.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

ONavi: Data-driven based Multi-sensor Fusion Positioning System in Indoor Environments.
Proceedings of the 12th IEEE International Conference on Indoor Positioning and Indoor Navigation, 2022

Psychology Inspired Model for Hierarchical Image Aesthetic Attribute Prediction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

DARTS-PD: Differentiable Architecture Search with Path-Wise Weight Sharing Derivation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

SDETR: Attention-Guided Salient Object Detection with Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2022

Adaptive Patch Exiting for Scalable Single Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Meta-Tuning for Content-Aware Neural Video Delivery.
Proceedings of the Computer Vision - ECCV 2022, 2022

Structured Local Radiance Fields for Human Avatar Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Personalized Image Aesthetics Assessment with Rich Attributes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CRIS: CLIP-Driven Referring Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MV-TAL: Mulit-view Temporal Action Localization in Naturalistic Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Single-Stage is Enough: Multi-Person Absolute 3D Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards Real-world Shadow Removal with a Shadow Simulation Method and a Two-stage Framework.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Self-Distillation from the Last Mini-Batch for Consistency Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
To See in the Dark: N2DGAN for Background Modeling in Nighttime Scene.
IEEE Trans. Circuits Syst. Video Technol., 2021

Simple and Robust Loss Design for Multi-Label Learning with Missing Labels.
CoRR, 2021

Pose Refinement with Joint Optimization of Visual Points and Lines.
CoRR, 2021

Federated Self-Supervised Contrastive Learning via Ensemble Similarity Distillation.
CoRR, 2021

The 2nd Anti-UAV Workshop & Challenge: Methods and Results.
CoRR, 2021

Single-Machine Rework Rescheduling to Minimize Total Waiting Time With Fixed Sequence of Jobs and Release Times.
IEEE Access, 2021

Retrieval and Localization with Observation Constraints.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Distance Restricted Transformer Encoder for Multi-Label Classification.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

View-Guided Point Cloud Completion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study.
IEEE Trans. Image Process., 2020

Generator Pyramid for High-Resolution Image Inpainting.
CoRR, 2020

Visual Localization Using Semantic Segmentation and Depth Prediction.
CoRR, 2020

Watch to Listen Clearly: Visual Speech Enhancement Driven Multi-modality Speech Recognition.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

The $2^\mathrm{nd}$ 106-Point Lightweight Facial Landmark Localization Grand Challenge.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020


Discriminative Multi-Modality Speech Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Perceptual Extreme Super Resolution Network with Receptive Field Block.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Edge Heuristic GAN for Non-Uniform Blind Deblurring.
IEEE Signal Process. Lett., 2019

Deep class-skewed learning for face recognition.
Neurocomputing, 2019

Dually Supervised Feature Pyramid for Object Detection and Segmentation.
CoRR, 2019

Generative One-Shot Face Recognition.
CoRR, 2019

Learning to Count Objects with Few Exemplar Annotations.
CoRR, 2019

Large Scale Incremental Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Revisit Multinomial Logistic Regression in Deep Learning: Data Dependent Model Initialization for Image Recognition.
CoRR, 2018

Incremental Classifier Learning with Generative Adversarial Networks.
CoRR, 2018

One-Shot Face Recognition via Generative Learning.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

2017
One-shot Face Recognition by Promoting Underrepresented Classes.
CoRR, 2017

Model-Based Iterative Restoration for Binary Document Image Compression with Dictionary Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Single-machine rework rescheduling to minimize maximum waiting-times with fixed sequence of jobs and ready times.
Comput. Ind. Eng., 2016

A Bayesian Approach to Infer Photo Aesthetic Quality Scores From Psychophysical Experiment.
Proceedings of the Imaging and Multimedia Analytics in a Web and Mobile World 2016, 2016

MS-Celeb-1M: Challenge of Recognizing One Million Celebrities in the Real World.
Proceedings of the Imaging and Multimedia Analytics in a Web and Mobile World 2016, 2016

MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Rescheduling rework jobs on single-machine of original jobs with release times.
Comput. Syst. Sci. Eng., 2015

Image quality evaluation using image quality ruler and graphical model.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Online image classification under monotonic decision boundary constraint.
Proceedings of the Color Imaging XX: Displaying, 2015

Text line detection based on cost optimized local text line direction estimation.
Proceedings of the Color Imaging XX: Displaying, 2015

2013
Message passing with l1 penalized KL minimization.
Proceedings of the 30th International Conference on Machine Learning, 2013

Dynamic hierarchical dictionary design for multi-page binary document image compression.
Proceedings of the IEEE International Conference on Image Processing, 2013

Binary image compression using conditional entropy-based dictionary design and indexing.
Proceedings of the Color Imaging XVIII: Displaying, 2013

2012
Message passing with relaxed moment matching
CoRR, 2012

A multi-label incremental learning algorithm.
Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012

A Lightweight RFID Mutual Authentication Protocol with Ownership Transfer.
Proceedings of the Advances in Wireless Sensor Networks, 2012

2011
A hyper ellipsoidal incremental learning algorithm.
Proceedings of the Eighth International Conference on Fuzzy Systems and Knowledge Discovery, 2011

2010
High dimensional regression using the sparse matrix transform (SMT).
Proceedings of the IEEE International Conference on Acoustics, 2010

2008
A Comparative Review of Aspect Ratio Conversion Methods.
Proceedings of the 2008 International Conference on Multimedia and Ubiquitous Engineering (MUE 2008), 2008

2007
Adaptive Video Presentation for Small Display While Maximize Visual Information.
Proceedings of the Advances in Visual Information Systems, 9th International Conference, 2007

Denoising Saliency Map for Region of Interest Extraction.
Proceedings of the Advances in Visual Information Systems, 9th International Conference, 2007


  Loading...