Yanwei Fu

Orcid: 0000-0002-6595-6893

Affiliations:
  • Fudan University, School of Data Science, Shanghai, China
  • Zhejiang Normal University, Fudan ISTBI-ZJNU Algorithm Centre for Brain-inspired Intelligence, Jinhua, China
  • Disney Research, Pittsburgh, PA, USA (2015 - 2016)
  • Queen Mary University of London, UK (PhD 2014)
  • Nanjing University, National Key Laboratory for Novel Software Technology, China (former)


According to our database1, Yanwei Fu authored at least 255 papers between 2010 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Dynamic Routing and Knowledge Re-Learning for Data-Free Black-Box Attack.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

2024
Vision Transformers: From Semantic Segmentation to Dense Prediction.
Int. J. Comput. Vis., December, 2024

Learning a Mixture of Conditional Gating Blocks for Visual Question Answering.
J. Comput. Sci. Technol., July, 2024

DeepSFM: Robust Deep Iterative Refinement for Structure From Motion.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Knockoffs-SPR: Clean Sample Selection in Learning With Noisy Labels.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2024

FS-OreDet: Feature enhancement and relationship exploration for boosting few-shot object detector of ore images.
Eng. Appl. Artif. Intell., 2024

Robust Network Learning via Inverse Scale Variational Sparsification.
CoRR, 2024

fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction.
CoRR, 2024

SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model.
CoRR, 2024

MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing.
CoRR, 2024

Polaris: Open-ended Interactive Robotic Manipulation via Syn2Real Visual Grounding and Large Language Models.
CoRR, 2024

LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion.
CoRR, 2024

Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image.
CoRR, 2024

Unified Lexical Representation for Interpretable Visual-Language Alignment.
CoRR, 2024

TemporalStory: Enhancing Consistency in Story Visualization using Spatial-Temporal Attention.
CoRR, 2024

EFCNet: Every Feature Counts for Small Medical Object Segmentation.
CoRR, 2024

AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection.
CoRR, 2024

Hyper-Transformer for Amodal Completion.
CoRR, 2024

3D StreetUnveiler with Semantic-Aware 2DGS.
CoRR, 2024

VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation.
CoRR, 2024

Image-Text-Image Knowledge Transferring for Lifelong Person Re-Identification with Hybrid Clothing States.
CoRR, 2024

Content and Salient Semantics Collaboration for Cloth-Changing Person Re-Identification.
CoRR, 2024

Towards Global Optimal Visual In-Context Learning Prompt Selection.
CoRR, 2024

A Generalization Theory of Cross-Modality Distillation with Contrastive Learning.
CoRR, 2024

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation.
CoRR, 2024

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT.
CoRR, 2024

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability.
CoRR, 2024

Repositioning the Subject within Image.
CoRR, 2024

Doubly Robust Proximal Causal Learning for Continuous Treatments.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image.
Proceedings of the IEEE International Conference on Acoustics, 2024

Improving Neural Surface Reconstruction with Feature Priors from Multi-view Images.
Proceedings of the Computer Vision - ECCV 2024, 2024

Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment.
Proceedings of the Computer Vision - ECCV 2024, 2024

NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation.
Proceedings of the Computer Vision - ECCV 2024, 2024

MinD-3D: Reconstruct High-Quality 3D Objects in Human Brain.
Proceedings of the Computer Vision - ECCV 2024, 2024

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector.
Proceedings of the Computer Vision - ECCV 2024, 2024

Test-Time Linear Out-of-Distribution Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MemFlow: Optical Flow Estimation and Prediction with Memory.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
H4MER: Human 4D Modeling by Learning Neural Compositional Representation With Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Class-Incremental Generalized Zero-Shot Learning.
Multim. Tools Appl., October, 2023

Faster OreFSDet: A lightweight and effective few-shot object detector for ore images.
Pattern Recognit., September, 2023

Recent Few-shot Object Detection Algorithms: A Survey with Performance Comparison.
ACM Trans. Intell. Syst. Technol., August, 2023

PatchMix Augmentation to Identify Causal Features in Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Clustering by the Probability Distributions From Extreme Value Theory.
IEEE Trans. Artif. Intell., April, 2023

Multi-view Shape Generation for a 3D Human-like Body.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Worst-case Feature Risk Minimization for Data-Efficient Learning.
Trans. Mach. Learn. Res., 2023

Specialized re-ranking: A novel retrieval-verification framework for cloth changing person re-identification.
Pattern Recognit., 2023

Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Exploring Structural Sparsity of Deep Networks Via Inverse Scale Spaces.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Exploring lottery ticket hypothesis in few-shot learning.
Neurocomputing, 2023

Towards Stable and Faithful Inpainting.
CoRR, 2023

Open-DDVM: A Reproduction and Extension of Diffusion Model for Optical Flow Estimation.
CoRR, 2023

fMRI-PTE: A Large-scale fMRI Pretrained Transformer Encoder for Multi-Subject Brain Activity Decoding.
CoRR, 2023

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model.
CoRR, 2023

Rethinking Person Re-identification from a Projection-on-Prototypes Perspective.
CoRR, 2023

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification.
CoRR, 2023

Pushing the Limits of 3D Shape Generation at Scale.
CoRR, 2023

A Unified Prompt-Guided In-Context Inpainting Framework for Reference-based Image Manipulations.
CoRR, 2023

Semantic Neural Decoding via Cross-Modal Generation.
CoRR, 2023

Learning Versatile 3D Shape Generation with Improved AR Models.
CoRR, 2023

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation.
CoRR, 2023

Entity-Level Text-Guided Image Manipulation.
CoRR, 2023

Meta Style Adversarial Training for Cross-Domain Few-Shot Learning.
CoRR, 2023

ImpDet: Exploring Implicit Fields for 3D Object Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Language Guided Robotic Grasping with Fine-Grained Instructions.
IROS, 2023

Object-Centric Multiple Object Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Versatile 3D Shape Generation with Improved Auto-regressive Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Coarse-to-Fine Amodal Segmentation with Shape Prior.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Open-Vocabulary Object Localization in Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Causally-Aware Intraoperative Imputation for Overall Survival Time Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Rethinking Optical Flow from Geometric Matching Consistent Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RankDNN: Learning to Rank for Few-Shot Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Exploring Efficient Few-shot Adaptation for Vision Transformers.
Trans. Mach. Learn. Res., 2022

MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth.
Trans. Mach. Learn. Res., 2022

Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.
IEEE Trans. Image Process., 2022

How to Trust Unlabeled Data? Instance Credibility Inference for Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

HandO: a hybrid 3D hand-object reconstruction model for unknown objects.
Multim. Syst., 2022

Learning the Compositional Domains for Generalized Zero-shot Learning.
Comput. Vis. Image Underst., 2022

RankDNN: Learning to Rank for Few-shot Learning.
CoRR, 2022

MVSFormer: Multi-View Stereo with Pre-trained Vision Transformers and Temperature-based Depth.
CoRR, 2022

Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective.
CoRR, 2022

A Simple Test-Time Method for Out-of-Distribution Detection.
CoRR, 2022

Wavelet Prior Attention Learning in Axial Inpainting Network.
CoRR, 2022

A Framework of Meta Functional Learning for Regularising Knowledge Transfer.
CoRR, 2022

An Empirical Study and Comparison of Recent Few-Shot Object Detection Algorithms.
CoRR, 2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation.
CoRR, 2022

Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning.
CoRR, 2022

Self-supervised Amodal Video Object Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Split-PU: Hardness-aware Training Strategy for Positive-Unlabeled Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Local Slot Attention for Vision and Language Navigation.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

I Know What You Draw: Learning Grasp Detection Conditioned on a Few Freehand Sketches.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Learning 6-DoF Object Poses to Grasp Category-Level Objects by Language Instructions.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

UAST: Uncertainty-Aware Siamese Tracking.
Proceedings of the International Conference on Machine Learning, 2022

High-Fidelity Portrait Editing Via Exploring Differentiable Guided Sketches from the Latent Space.
Proceedings of the IEEE International Conference on Acoustics, 2022

RCLane: Relay Chain Prediction for Lane Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Prior Feature and Attention Enhanced Image Inpainting.
Proceedings of the Computer Vision - ECCV 2022, 2022

ONCE-3DLanes: Building Monocular 3D Lane Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Memorize Feature Hallucination for One-Shot Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DST: Dynamic Substitute Training for Data-free Black-box Attack.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Ranking Distance Calibration for Cross-Domain Few-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

H4D: Human 4D Modeling by Learning Neural Compositional Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Density-preserving Deep Point Cloud Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FFD Augmentor: Towards Few-Shot Oracle Character Recognition from Scratch.
Proceedings of the Computer Vision - ACCV 2022, 2022

Co-attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-identification.
Proceedings of the Computer Vision - ACCV 2022, 2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

The Report on China-Spain Joint Clinical Testing for Rapid COVID-19 Risk Screening by Eye-region Manifestations.
CoRR, 2021

DONet: Learning Category-Level 6D Object Pose and Size Estimation from Depth Observation.
CoRR, 2021

Rapid COVID-19 Risk Screening by Eye-region Manifestations.
CoRR, 2021

Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature Distributions.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Whose hand is this? Person Identification from Egocentric Hand Gestures.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

The Image Local Autoregressive Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Domain-Aware SE Network for Sketch-based Image Retrieval with Multiplicative Euclidean Margin Softmax.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Neural Symbolic Representation Learning for Image Captioning.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Regularising Knowledge Transfer by Meta Functional Learning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Distance Restricted Transformer Encoder for Multi-Label Classification.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Depth-Guided AdaIN and Shift Attention Network for Vision-And-Language Navigation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Global-to-Local Dynamic Feature Aggregation for Unsupervised Person Re-Identification.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

A Unified Efficient Pyramid Transformer for Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Deep Hybrid Self-Prior for Full 3D Mesh Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Simple Feature Augmentation for Domain Generalization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Delving into Data: Effectively Substitute Training for Black-box Attack.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Salient Boundary Feature for Anchor-free Temporal Action Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Compositional Representation for 4D Captures With Neural ODE.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adaptive End-to-End Budgeted Network Learning via Inverse Scale Space.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Learning a Few-shot Embedding Model with Contrastive Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization.
IEEE Trans. Multim., 2020

M$^3$Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening From CT Imaging.
IEEE J. Biomed. Health Informatics, 2020

Pose-Guided Person Image Synthesis in the Non-Iconic Views.
IEEE Trans. Image Process., 2020

Learning Layer-Skippable Inference Network.
IEEE Trans. Image Process., 2020

Deep Ranking for Image Zero-Shot Multi-Label Classification.
IEEE Trans. Image Process., 2020

Needles in a Haystack: Tracking City-Scale Moving Vehicles From Continuously Moving Satellite.
IEEE Trans. Image Process., 2020

Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks.
IEEE Trans. Cybern., 2020

Learning to Score Figure Skating Sport Videos.
IEEE Trans. Circuits Syst. Video Technol., 2020

Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Vocabulary-Informed Zero-Shot and Open-Set Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Extreme vocabulary learning.
Frontiers Comput. Sci., 2020

M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging.
CoRR, 2020

A New Screening Method for COVID-19 based on Ocular Feature Recognition by Machine Learning Tools.
CoRR, 2020

Self-supervised Video Object Segmentation.
CoRR, 2020

When Person Re-identification Meets Changing Clothes.
CoRR, 2020

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition.
CoRR, 2020

Main-Secondary Network for Defect Segmentation of Textured Surface Images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Incrementally Zero-Shot Detection by an Extreme Value Analyzer.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths.
Proceedings of the 37th International Conference on Machine Learning, 2020

DeepSFM: Structure from Motion via Deep Bundle Adjustment.
Proceedings of the Computer Vision - ECCV 2020, 2020

Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking.
Proceedings of the Computer Vision - ECCV 2020, 2020

Instance Credibility Inference for Few-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Pose Transfer by Spatially Adaptive Instance Normalization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

When Person Re-identification Meets Changing Clothes.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

An Embarrassingly Simple Baseline to One-shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Second Order Enhanced Multi-glimpse Attention in Visual Question Answering.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Long-Term Cloth-Changing Person Re-identification.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Self-supervised Learning of Orc-Bert Augmentor for Recognizing Few-Shot Oracle Characters.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Feature Deformation Meta-Networks in Image Captioning of Novel Objects.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-Level Semantic Feature Augmentation for One-Shot Learning.
IEEE Trans. Image Process., 2019

Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models.
J. Comput. Sci. Technol., 2019

A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition.
CoRR, 2019

Parsimonious Deep Learning: A Differential Inclusion Approach with Global Convergence.
CoRR, 2019

S<sup>2</sup>-LBI: Stochastic Split Linearized Bregman Iterations for Parsimonious Deep Learning.
CoRR, 2019

Question Guided Modular Routing Networks for Visual Question Answering.
CoRR, 2019

Learning decomposed subspaces for supervised bidirectional image generation.
Cogn. Comput. Syst., 2019

Meta-Reinforced Synthetic Data for One-Shot Fine-Grained Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Stacked Self-Attention Networks for Visual Question Answering.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Take Goods from Shelves: A Dataset for Class-Incremental Object Detection.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Large-Scale Datasets for Going Deeper in Image Understanding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Wavelet U-Net and the Chromatic Adaptation Transform for Single Image Dehazing.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Parasitic GAN for Semi-Supervised Brain Tumor Segmentation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Large-Scale Attribute Dataset for Zero-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Image Deformation Meta-Networks for One-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Image Block Augmentation for One-Shot Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization.
IEEE Trans. Affect. Comput., 2018

Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content.
IEEE Signal Process. Mag., 2018

Stacked multichannel autoencoder - an efficient way of learning from synthetic data.
Multim. Tools Appl., 2018

Learning Large Euclidean Margin for Sketch-based Image Retrieval.
CoRR, 2018

Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective.
CoRR, 2018

Progressive Deep Neural Networks Acceleration via Soft Filter Pruning.
CoRR, 2018

Detecting Tiny Moving Vehicles in Satellite Videos.
CoRR, 2018

SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners.
CoRR, 2018

Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning.
CoRR, 2018

Semantic Feature Augmentation in Few-shot Learning.
CoRR, 2018

Learning to score and summarize figure skating sport videos.
CoRR, 2018

Stacked Semantics-Guided Attention Model for Fine-Grained Zero-Shot Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

MSplit LBI: Realizing Feature Selection and Dense Estimation Simultaneously in Few-shot and Zero-shot Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

Pose-Normalized Image Generation for Person Re-identification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Dual Skipping Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep learning for video classification and captioning.
Proceedings of the Frontiers of Multimedia Research, 2018

2017
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding.
CoRR, 2017

Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization.
CoRR, 2017

Recent Advances in Zero-shot Recognition.
CoRR, 2017

Semi-Latent GAN: Learning to generate and modify facial images from attributes.
CoRR, 2017

A Jointly Learned Deep Architecture for Facial Attribute Analysis and Face Detection in the Wild.
CoRR, 2017

Learning to Generate and Edit Hairstyles.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adaptively Weighted Multi-task Deep Network for Person Attribute Classification.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Frame-Transformer Emotion Classification Network.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Multi-scale Deep Learning Architectures for Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Robust Subjective Visual Property Prediction from Crowdsourced Pairwise Labels.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Deep Learning for Video Classification and Captioning.
CoRR, 2016

Video Emotion Recognition with Transferred Deep Feature Encodings.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

BigVid at MediaEval 2016: Predicting Interestingness in Images and Videos.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Multi-view Metric Learning for Multi-view Video Summarization.
Proceedings of the 2016 International Conference on Cyberworlds, 2016

Harnessing Object and Scene Semantics for Large-Scale Video Understanding.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Semi-supervised Vocabulary-Informed Learning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Learning to Generate Posters of Scientific Papers.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Transductive Multi-View Zero-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning Classifiers from Synthetic Data Using a Multichannel Autoencoder.
CoRR, 2015

Transductive Multi-class and Multi-label Zero-shot Learning.
CoRR, 2015

Learning from Synthetic Data Using a Stacked Multichannel Autoencoder.
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

2014
Learning Multimodal Latent Attributes.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Multi-view Metric Learning for Multi-view Video Summarization.
CoRR, 2014

Interestingness Prediction by Robust Learning to Rank.
Proceedings of the Computer Vision - ECCV 2014, 2014

Transductive Multi-view Embedding for Zero-Shot Recognition and Annotation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Transductive Multi-label Zero-shot Learning.
Proceedings of the British Machine Vision Conference, 2014

2012
Attribute Learning for Understanding Unstructured Social Activity.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Content-sensitive collection snapping.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

2010
Multi-View Video Summarization.
IEEE Trans. Multim., 2010


  Loading...