Yanwei Fu

Xiangyang Xue

J. Comput. Sci. Technol., July, 2024

DeepSFM: Robust Deep Iterative Refinement for Structure From Motion.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Knockoffs-SPR: Clean Sample Selection in Learning With Noisy Labels.

[BibT_eX]

[DOI]

Yikai Wang

Xinwei Sun

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition.

[BibT_eX]

[DOI]

Satoshi Tsutsui

David J. Crandall

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2024

FS-OreDet: Feature enhancement and relationship exploration for boosting few-shot object detector of ore images.

[BibT_eX]

[DOI]

Eng. Appl. Artif. Intell., 2024

Robust Network Learning via Inverse Scale Variational Sparsification.

[BibT_eX]

[DOI]

CoRR, 2024

fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction.

[BibT_eX]

[DOI]

CoRR, 2024

SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model.

[BibT_eX]

[DOI]

CoRR, 2024

MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing.

[BibT_eX]

[DOI]

CoRR, 2024

Polaris: Open-ended Interactive Robotic Manipulation via Syn2Real Visual Grounding and Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion.

[BibT_eX]

[DOI]

CoRR, 2024

Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image.

[BibT_eX]

[DOI]

CoRR, 2024

Unified Lexical Representation for Interpretable Visual-Language Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

TemporalStory: Enhancing Consistency in Story Visualization using Spatial-Temporal Attention.

[BibT_eX]

[DOI]

Sixiao Zheng

CoRR, 2024

EFCNet: Every Feature Counts for Small Medical Object Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection.

[BibT_eX]

[DOI]

CoRR, 2024

Hyper-Transformer for Amodal Completion.

[BibT_eX]

[DOI]

CoRR, 2024

3D StreetUnveiler with Semantic-Aware 2DGS.

[BibT_eX]

[DOI]

CoRR, 2024

VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation.

[BibT_eX]

[DOI]

CoRR, 2024

Image-Text-Image Knowledge Transferring for Lifelong Person Re-Identification with Hybrid Clothing States.

[BibT_eX]

[DOI]

CoRR, 2024

Content and Salient Semantics Collaboration for Cloth-Changing Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Global Optimal Visual In-Context Learning Prompt Selection.

[BibT_eX]

[DOI]

CoRR, 2024

A Generalization Theory of Cross-Modality Distillation with Contrastive Learning.

[BibT_eX]

[DOI]

CoRR, 2024

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation.

[BibT_eX]

[DOI]

CoRR, 2024

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT.

[BibT_eX]

[DOI]

CoRR, 2024

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability.

[BibT_eX]

[DOI]

CoRR, 2024

Repositioning the Subject within Image.

[BibT_eX]

[DOI]

CoRR, 2024

Doubly Robust Proximal Causal Learning for Continuous Treatments.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo.

[BibT_eX]

[DOI]

Xinlin Ren

Proceedings of the Twelfth International Conference on Learning Representations, 2024

T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Improving Neural Surface Reconstruction with Feature Priors from Multi-view Images.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

MinD-3D: Reconstruct High-Quality 3D Objects in Human Brain.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Test-Time Linear Out-of-Distribution Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MemFlow: Optical Flow Estimation and Prediction with Memory.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

H4MER: Human 4D Modeling by Learning Neural Compositional Representation With Transformer.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Class-Incremental Generalized Zero-Shot Learning.

[BibT_eX]

[DOI]

Zhenfeng Sun

Rui Feng

Multim. Tools Appl., October, 2023

Faster OreFSDet: A lightweight and effective few-shot object detector for ore images.

[BibT_eX]

[DOI]

Pattern Recognit., September, 2023

Recent Few-shot Object Detection Algorithms: A Survey with Performance Comparison.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., August, 2023

PatchMix Augmentation to Identify Causal Features in Few-Shot Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Clustering by the Probability Distributions From Extreme Value Theory.

[BibT_eX]

[DOI]

IEEE Trans. Artif. Intell., April, 2023

Multi-view Shape Generation for a 3D Human-like Body.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Worst-case Feature Risk Minimization for Data-Efficient Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Specialized re-ranking: A novel retrieval-verification framework for cloth changing person re-identification.

[BibT_eX]

[DOI]

Pattern Recognit., 2023

Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Exploring Structural Sparsity of Deep Networks Via Inverse Scale Spaces.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Exploring lottery ticket hypothesis in few-shot learning.

[BibT_eX]

[DOI]

Yu Xie

Neurocomputing, 2023

Towards Stable and Faithful Inpainting.

[BibT_eX]

[DOI]

Yikai Wang

CoRR, 2023

Open-DDVM: A Reproduction and Extension of Diffusion Model for Optical Flow Estimation.

[BibT_eX]

[DOI]

Bo Zhao

Carl-Johann Simon-Gabriel

CoRR, 2023

fMRI-PTE: A Large-scale fMRI Pretrained Transformer Encoder for Multi-Subject Brain Activity Decoding.

[BibT_eX]

[DOI]

CoRR, 2023

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model.

[BibT_eX]

[DOI]

CoRR, 2023

Rethinking Person Re-identification from a Projection-on-Prototypes Perspective.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2023

Pushing the Limits of 3D Shape Generation at Scale.

[BibT_eX]

[DOI]

CoRR, 2023

A Unified Prompt-Guided In-Context Inpainting Framework for Reference-based Image Manipulations.

[BibT_eX]

[DOI]

CoRR, 2023

Semantic Neural Decoding via Cross-Modal Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Versatile 3D Shape Generation with Improved AR Models.

[BibT_eX]

[DOI]

CoRR, 2023

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Entity-Level Text-Guided Image Manipulation.

[BibT_eX]

[DOI]

CoRR, 2023

Meta Style Adversarial Training for Cross-Domain Few-Shot Learning.

[BibT_eX]

[DOI]

CoRR, 2023

ImpDet: Exploring Implicit Fields for 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Language Guided Robotic Grasping with Fine-Grained Instructions.

[BibT_eX]

[DOI]

IROS, 2023

Object-Centric Multiple Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Versatile 3D Shape Generation with Improved Auto-regressive Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring.

[BibT_eX]

[DOI]

Haitao Lin

Carl-Johann Simon-Gabriel

Xiangyang Xue

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Coarse-to-Fine Amodal Segmentation with Shape Prior.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Open-Vocabulary Object Localization in Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Causally-Aware Intraoperative Imputation for Overall Survival Time Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Rethinking Optical Flow from Geometric Matching Consistent Perspective.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RankDNN: Learning to Rank for Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Exploring Efficient Few-shot Adaptation for Vision Transformers.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2022

MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth.

[BibT_eX]

[DOI]

Xinlin Ren

Trans. Mach. Learn. Res., 2022

Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

How to Trust Unlabeled Data? Instance Credibility Inference for Few-Shot Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

HandO: a hybrid 3D hand-object reconstruction model for unknown objects.

[BibT_eX]

[DOI]

Multim. Syst., 2022

Learning the Compositional Domains for Generalized Zero-shot Learning.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2022

RankDNN: Learning to Rank for Few-shot Learning.

[BibT_eX]

[DOI]

CoRR, 2022

MVSFormer: Multi-View Stereo with Pre-trained Vision Transformers and Temperature-based Depth.

[BibT_eX]

[DOI]

Xinlin Ren

CoRR, 2022

Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective.

[BibT_eX]

[DOI]

CoRR, 2022

A Simple Test-Time Method for Out-of-Distribution Detection.

[BibT_eX]

[DOI]

CoRR, 2022

Wavelet Prior Attention Learning in Axial Inpainting Network.

[BibT_eX]

[DOI]

CoRR, 2022

A Framework of Meta Functional Learning for Regularising Knowledge Transfer.

[BibT_eX]

[DOI]

Pan Li

CoRR, 2022

An Empirical Study and Comparison of Recent Few-Shot Object Detection Algorithms.

[BibT_eX]

[DOI]

CoRR, 2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation.

[BibT_eX]

[DOI]

CoRR, 2022

Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Self-supervised Amodal Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Split-PU: Hardness-aware Training Strategy for Positive-Unlabeled Learning.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Local Slot Attention for Vision and Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

I Know What You Draw: Learning Grasp Detection Conditioned on a Few Freehand Sketches.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Learning 6-DoF Object Poses to Grasp Category-Level Objects by Language Instructions.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

UAST: Uncertainty-Aware Siamese Tracking.

[BibT_eX]

[DOI]

Dawei Zhang

Zhonglong Zheng

Proceedings of the International Conference on Machine Learning, 2022

High-Fidelity Portrait Editing Via Exploring Differentiable Guided Sketches from the Latent Space.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

RCLane: Relay Chain Prediction for Lane Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Prior Feature and Attention Enhanced Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

ONCE-3DLanes: Building Monocular 3D Lane Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Memorize Feature Hallucination for One-Shot Image Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DST: Dynamic Substitute Training for Data-free Black-box Attack.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels.

[BibT_eX]

[DOI]

Yikai Wang

Xinwei Sun

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Ranking Distance Calibration for Cross-Domain Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

H4D: Human 4D Modeling by Learning Neural Compositional Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Density-preserving Deep Point Cloud Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FFD Augmentor: Towards Few-Shot Oracle Character Recognition from Scratch.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2022, 2022

Co-attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2022, 2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2022, 2022

2021

Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

The Report on China-Spain Joint Clinical Testing for Rapid COVID-19 Risk Screening by Eye-region Manifestations.

[BibT_eX]

[DOI]

CoRR, 2021

DONet: Learning Category-Level 6D Object Pose and Size Estimation from Depth Observation.

[BibT_eX]

[DOI]

CoRR, 2021

Rapid COVID-19 Risk Screening by Eye-region Manifestations.

[BibT_eX]

[DOI]

CoRR, 2021

Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature Distributions.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Whose hand is this? Person Identification from Egocentric Hand Gestures.

[BibT_eX]

[DOI]

Satoshi Tsutsui

David J. Crandall

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

The Image Local Autoregressive Transformer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Domain-Aware SE Network for Sketch-based Image Retrieval with Multiplicative Euclidean Margin Softmax.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.

[BibT_eX]

[DOI]

Yuqian Fu

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Neural Symbolic Representation Learning for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos.

[BibT_eX]

[DOI]

Yuqian Fu

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Regularising Knowledge Transfer by Meta Functional Learning.

[BibT_eX]

[DOI]

Pan Li

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Distance Restricted Transformer Encoder for Multi-Label Classification.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Depth-Guided AdaIN and Shift Attention Network for Vision-And-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Global-to-Local Dynamic Feature Aggregation for Unsupervised Person Re-Identification.

[BibT_eX]

[DOI]

Wei Li

Jiayuan Fan

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

A Unified Efficient Pyramid Transformer for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Deep Hybrid Self-Prior for Full 3D Mesh Generation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Simple Feature Augmentation for Domain Generalization.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Delving into Data: Effectively Substitute Training for Black-box Attack.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Salient Boundary Feature for Anchor-free Temporal Action Localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Compositional Representation for 4D Captures With Neural ODE.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adaptive End-to-End Budgeted Network Learning via Inverse Scale Space.

[BibT_eX]

[DOI]

Zuyuan Zhong

Chen Liu

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Learning a Few-shot Embedding Model with Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2020

M$^3$Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening From CT Imaging.

[BibT_eX]

[DOI]

IEEE J. Biomed. Health Informatics, 2020

Pose-Guided Person Image Synthesis in the Non-Iconic Views.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Learning Layer-Skippable Inference Network.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Deep Ranking for Image Zero-Shot Multi-Label Classification.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Needles in a Haystack: Tracking City-Scale Moving Vehicles From Continuously Moving Satellite.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2020

Learning to Score Figure Skating Sport Videos.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Vocabulary-Informed Zero-Shot and Open-Set Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Extreme vocabulary learning.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2020

M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging.

[BibT_eX]

[DOI]

CoRR, 2020

A New Screening Method for COVID-19 based on Ocular Feature Recognition by Machine Learning Tools.

[BibT_eX]

[DOI]

CoRR, 2020

Self-supervised Video Object Segmentation.

[BibT_eX]

[DOI]

CoRR, 2020

When Person Re-identification Meets Changing Clothes.

[BibT_eX]

[DOI]

CoRR, 2020

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Main-Secondary Network for Defect Segmentation of Textured Surface Images.

[BibT_eX]

[DOI]

Yu Xie

Fangrui Zhu

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Incrementally Zero-Shot Detection by an Extreme Value Analyzer.

[BibT_eX]

[DOI]

Sixiao Zheng

Yanxi Hou

Proceedings of the 25th International Conference on Pattern Recognition, 2020

DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

DeepSFM: Structure from Motion via Deep Bundle Adjustment.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Instance Credibility Inference for Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Pose Transfer by Spatially Adaptive Instance Normalization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

When Person Re-identification Meets Changing Clothes.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

An Embarrassingly Simple Baseline to One-shot Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Second Order Enhanced Multi-glimpse Attention in Visual Question Answering.

[BibT_eX]

[DOI]

Binghui Xie

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Long-Term Cloth-Changing Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Self-supervised Learning of Orc-Bert Augmentor for Recognizing Few-Shot Oracle Characters.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Feature Deformation Meta-Networks in Image Captioning of Novel Objects.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Multi-Level Semantic Feature Augmentation for One-Shot Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2019

A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition.

[BibT_eX]

[DOI]

CoRR, 2019

Parsimonious Deep Learning: A Differential Inclusion Approach with Global Convergence.

[BibT_eX]

[DOI]

CoRR, 2019

S<sup>2</sup>-LBI: Stochastic Split Linearized Bregman Iterations for Parsimonious Deep Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Question Guided Modular Routing Networks for Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2019

Learning decomposed subspaces for supervised bidirectional image generation.

[BibT_eX]

[DOI]

Cogn. Comput. Syst., 2019

Meta-Reinforced Synthetic Data for One-Shot Fine-Grained Visual Recognition.

[BibT_eX]

[DOI]

Satoshi Tsutsui

David J. Crandall

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Stacked Self-Attention Networks for Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Take Goods from Shelves: A Dataset for Class-Incremental Object Detection.

[BibT_eX]

[DOI]

Yu Hao

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Large-Scale Datasets for Going Deeper in Image Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Wavelet U-Net and the Chromatic Adaptation Transform for Single Image Dehazing.

[BibT_eX]

[DOI]

Hao-Hsiang Yang

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Parasitic GAN for Semi-Supervised Brain Tumor Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Large-Scale Attribute Dataset for Zero-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Image Deformation Meta-Networks for One-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Image Block Augmentation for One-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization.

[BibT_eX]

[DOI]

IEEE Trans. Affect. Comput., 2018

Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content.

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2018

Stacked multichannel autoencoder - an efficient way of learning from synthetic data.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2018

Learning Large Euclidean Margin for Sketch-based Image Retrieval.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective.

[BibT_eX]

[DOI]

CoRR, 2018

Progressive Deep Neural Networks Acceleration via Soft Filter Pruning.

[BibT_eX]

[DOI]

CoRR, 2018

Detecting Tiny Moving Vehicles in Satellite Videos.

[BibT_eX]

[DOI]

Wei Ao

Feng Xu

CoRR, 2018

SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners.

[BibT_eX]

[DOI]

CoRR, 2018

Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Semantic Feature Augmentation in Few-shot Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to score and summarize figure skating sport videos.

[BibT_eX]

[DOI]

CoRR, 2018

Stacked Semantics-Guided Attention Model for Fine-Grained Zero-Shot Learning.

[BibT_eX]

[DOI]

Zhongfei (Mark) Zhang

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

MSplit LBI: Realizing Feature Selection and Dense Estimation Simultaneously in Few-shot and Zero-shot Learning.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Pose-Normalized Image Generation for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Dual Skipping Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep learning for video classification and captioning.

[BibT_eX]

[DOI]

Proceedings of the Frontiers of Multimedia Research, 2018

2017

AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding.

[BibT_eX]

[DOI]

CoRR, 2017

Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization.

[BibT_eX]

[DOI]

CoRR, 2017

Recent Advances in Zero-shot Recognition.

[BibT_eX]

[DOI]

CoRR, 2017

Semi-Latent GAN: Learning to generate and modify facial images from attributes.

[BibT_eX]

[DOI]

CoRR, 2017

A Jointly Learned Deep Architecture for Facial Attribute Analysis and Face Detection in the Wild.

[BibT_eX]

[DOI]

Keke He

Xiangyang Xue

CoRR, 2017

Learning to Generate and Edit Hairstyles.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adaptively Weighted Multi-task Deep Network for Person Attribute Classification.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Frame-Transformer Emotion Classification Network.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Multi-scale Deep Learning Architectures for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

2016

Robust Subjective Visual Property Prediction from Crowdsourced Pairwise Labels.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2016

Deep Learning for Video Classification and Captioning.

[BibT_eX]

[DOI]

CoRR, 2016

Video Emotion Recognition with Transferred Deep Feature Encodings.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

BigVid at MediaEval 2016: Predicting Interestingness in Images and Videos.

[BibT_eX]

[DOI]

Baohan Xu

Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Multi-view Metric Learning for Multi-view Video Summarization.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Cyberworlds, 2016

Harnessing Object and Scene Semantics for Large-Scale Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Semi-supervised Vocabulary-Informed Learning.

[BibT_eX]

[DOI]

Leonid Sigal

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Learning to Generate Posters of Scientific Papers.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Transductive Multi-View Zero-Shot Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning Classifiers from Synthetic Data Using a Multichannel Autoencoder.

[BibT_eX]

[DOI]

CoRR, 2015

Transductive Multi-class and Multi-label Zero-shot Learning.

[BibT_eX]

[DOI]

Yongxin Yang

CoRR, 2015

Learning from Synthetic Data Using a Stacked Multichannel Autoencoder.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

2014

Learning Multimodal Latent Attributes.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2014

Multi-view Metric Learning for Multi-view Video Summarization.

[BibT_eX]

[DOI]

CoRR, 2014

Interestingness Prediction by Robust Learning to Rank.

[BibT_eX]

[DOI]

Yuan Yao

Proceedings of the Computer Vision - ECCV 2014, 2014

Transductive Multi-view Embedding for Zero-Shot Recognition and Annotation.

[BibT_eX]

[DOI]

Zhen-Yong Fu

Proceedings of the Computer Vision - ECCV 2014, 2014

Transductive Multi-label Zero-shot Learning.

[BibT_eX]

[DOI]

Yongxin Yang

Proceedings of the British Machine Vision Conference, 2014

2012

Attribute Learning for Understanding Unstructured Social Activity.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2012, 2012

2011

Content-sensitive collection snapping.

[BibT_eX]

[DOI]