Yu Qiao
Orcid: 0000-0002-1889-2567Affiliations:
- Shanghai AI Laboratory, OpenGVLab, China
- Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, China
- University of Tokyo, Graduate School of Information Science and Technology, Japan (former)
- University of Electro-Communications, Tokyo, Japan (PhD 2006)
According to our database1,
Yu Qiao
authored at least 663 papers
between 2003 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Int. J. Comput. Vis., November, 2024
J. Real Time Image Process., May, 2024
Int. J. Comput. Vis., May, 2024
Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024
Temporally consistent video colorization with deep feature propagation and self-regularization learning.
Comput. Vis. Media, April, 2024
Int. J. Comput. Vis., February, 2024
CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning.
IEEE Trans. Multim., 2024
IEEE Trans. Multim., 2024
IEEE Trans. Image Process., 2024
AdaptBIR: Adaptive Blind Image Restoration with latent diffusion prior for higher fidelity.
Pattern Recognit., 2024
Int. J. Comput. Vis., 2024
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training.
CoRR, 2024
ToMiE: Towards Modular Growth in Enhanced SMPL Skeleton for 3D Human with Animatable Garments.
CoRR, 2024
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation.
CoRR, 2024
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation.
CoRR, 2024
CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation.
CoRR, 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions.
CoRR, 2024
CoRR, 2024
CoRR, 2024
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.
CoRR, 2024
VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge.
CoRR, 2024
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models.
CoRR, 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining.
CoRR, 2024
CoRR, 2024
The Shadow of Fraud: The Emerging Danger of AI-powered Social Engineering and its Possible Cure.
CoRR, 2024
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Building Intelligence Identification System via Large Language Model Watermarking: A Survey and Beyond.
CoRR, 2024
CoRR, 2024
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.
CoRR, 2024
CoRR, 2024
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI.
CoRR, 2024
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model.
CoRR, 2024
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs.
CoRR, 2024
CoRR, 2024
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models.
CoRR, 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality.
CoRR, 2024
CoRR, 2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks.
CoRR, 2024
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.
CoRR, 2024
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving.
CoRR, 2024
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers.
CoRR, 2024
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites.
CoRR, 2024
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD.
CoRR, 2024
CoRR, 2024
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control.
CoRR, 2024
AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition.
CoRR, 2024
RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation.
CoRR, 2024
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning.
CoRR, 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models.
CoRR, 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model.
CoRR, 2024
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities.
CoRR, 2024
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer.
CoRR, 2024
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning.
CoRR, 2024
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning.
Proceedings of the Thirty-Fourth International Conference on Automated Planning and Scheduling, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
A Comparative Study of Image Restoration Networks for General Backbone Network Design.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-End Oriented Object Detection with Single Point Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
COCAS+: Large-Scale Clothes-Changing Person Re-Identification With Clothes Templates.
IEEE Trans. Circuits Syst. Video Technol., April, 2023
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023
ActFloor-GAN: Activity-Guided Adversarial Networks for Human-Centric Floorplan Design.
IEEE Trans. Vis. Comput. Graph., March, 2023
Towards robustness and generalization of point cloud representation: A geometry coding method and a large-scale object-level dataset.
Comput. Vis. Media, February, 2023
Protein engineering via Bayesian optimization-guided evolutionary algorithm and robotic experiments.
Briefings Bioinform., January, 2023
ACM Trans. Multim. Comput. Commun. Appl., 2023
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.
CoRR, 2023
Towards the Unification of Generative and Discriminative Visual Foundation Model: A Survey.
CoRR, 2023
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving.
CoRR, 2023
MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding.
CoRR, 2023
CoRR, 2023
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision.
CoRR, 2023
CoRR, 2023
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models.
CoRR, 2023
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving.
CoRR, 2023
ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models.
CoRR, 2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm.
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.
CoRR, 2023
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation.
CoRR, 2023
CoRR, 2023
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
CoRR, 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning.
CoRR, 2023
MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification.
CoRR, 2023
CoRR, 2023
DiffRoom: Diffusion-based High-Quality 3D Room Reconstruction and Generation with Occupancy Prior.
CoRR, 2023
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory.
CoRR, 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
CoRR, 2023
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model.
CoRR, 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language.
CoRR, 2023
Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving.
CoRR, 2023
CoRR, 2023
STU-Net: Scalable and Transferable Medical Image Segmentation Models Empowered by Large-Scale Supervised Pre-training.
CoRR, 2023
CoRR, 2023
Policy Pre-training for End-to-end Autonomous Driving via Self-supervised Geometric Modeling.
CoRR, 2023
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023
Proceedings of the Neural Information Processing - 30th International Conference, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Learning 3D Representations from 2D Pre-Trained Models via Image-to-Point Masked Autoencoders.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross- Modal Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
IEEE Trans. Image Process., 2022
IEEE Trans. Inf. Forensics Secur., 2022
IEEE Signal Process. Lett., 2022
Unsupervised person re-identification with multi-label learning guided self-paced clustering.
Pattern Recognit., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Author Correction: Development and clinical deployment of a smartphone-based visual field deep learning system for glaucoma detection.
npj Digit. Medicine, 2022
Comput. Vis. Media, 2022
ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation.
CoRR, 2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning.
CoRR, 2022
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE.
CoRR, 2022
CoRR, 2022
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe.
CoRR, 2022
CoRR, 2022
CoRR, 2022
CoRR, 2022
CoRR, 2022
Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning.
CoRR, 2022
CoRR, 2022
Asynchronous feature regularization and cross-modal distillation for OCT based glaucoma diagnosis.
Comput. Biol. Medicine, 2022
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the 26th International Conference on Pattern Recognition, 2022
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach.
Proceedings of the Conference on Robot Learning, 2022
Wider and Higher: Intensive Integration and Global Foreground Perception for Image Matting.
Proceedings of the Advances in Computer Graphics, 2022
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
You Only Need 90K Parameters to Adapt Light: a Light Weight Transformer for Image Enhancement and Exposure Correction.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE Trans. Multim., 2021
Deep Relation Transformer for Diagnosing Glaucoma With Optical Coherence Tomography and Visual Field Function.
IEEE Trans. Medical Imaging, 2021
IEEE Trans. Image Process., 2021
IEEE Trans. Circuits Syst. Video Technol., 2021
Multi-view self-supervised learning for 3D facial texture reconstruction from single image.
Image Vis. Comput., 2021
TTPP: Temporal Transformer with Progressive Prediction for efficient action anticipation.
Neurocomputing, 2021
Int. J. Autom. Comput., 2021
Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results.
CoRR, 2021
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition.
CoRR, 2021
CoRR, 2021
Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021.
CoRR, 2021
Multiple Domain Experts Collaborative Learning: Multi-Source Domain Generalization For Person Re-Identification.
CoRR, 2021
CoRR, 2021
Self-speculation of clinical features based on knowledge distillation for accurate ocular disease classification.
Biomed. Signal Process. Control., 2021
Multi-label ocular disease classification with a dense correlation deep neural network.
Biomed. Signal Process. Control., 2021
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021
A Novel Hybrid Convolutional Neural Network for Accurate Organ Segmentation in 3D Head and Neck CT Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021
Collaborative Multi-View Convolutions With Gating For Accurate And Fast Volumetric Medical Image Segmentation.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
Learning Geometry-Disentangled Representation for Complementary Understanding of 3D Object Point Cloud.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the International Conference on 3D Vision, 2021
2020
IEEE Trans. Parallel Distributed Syst., 2020
Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition.
IEEE Trans. Image Process., 2020
IEEE Trans. Image Process., 2020
Pattern Recognit. Lett., 2020
Development and clinical deployment of a smartphone-based visual field deep learning system for glaucoma detection.
npj Digit. Medicine, 2020
Comput. Vis. Image Underst., 2020
Comput. Vis. Image Underst., 2020
Collaborative Distillation in the Parameter and Spectrum Domains for Video Action Recognition.
CoRR, 2020
Exploring Multi-Scale Feature Propagation and Communication for Image Super Resolution.
CoRR, 2020
CoRR, 2020
Dense Correlation Network for Automated Multi-Label Ocular Disease Detection with Paired Color Fundus Photographs.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020
Classification of Ocular Diseases Employing Attention-Based Unilateral and Bilateral Feature Weighting and Fusion.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020
Learning Discriminative Representation For Facial Expression Recognition From Uncertainties.
Proceedings of the IEEE International Conference on Image Processing, 2020
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax.
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
Attention-Driven Dynamic Graph Convolutional Network for Multi-label Image Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Interactive Multi-dimension Modulation with Dynamic Controllable Residual Learning for Image Restoration.
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Multiple Transfer Learning and Multi-label Balanced Training Strategies for Facial AU Detection In the Wild.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
FD-GAN: Generative Adversarial Networks with Fusion-Discriminator for Single Image Dehazing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
IEEE Trans. Image Process., 2019
A Literature Review: Geometric Methods and Their Applications in Human-Related Analysis.
Sensors, 2019
Pattern Recognit. Lett., 2019
IEEE Trans. Pattern Anal. Mach. Intell., 2019
Pedestrian detection with unsupervised multispectral feature learning using deep neural networks.
Inf. Fusion, 2019
Int. J. Comput. Vis., 2019
Multi-Dimension Modulation for Image Restoration with Dynamic Controllable Residual Learning.
CoRR, 2019
Learning Category Correlations for Multi-label Image Recognition with Graph Networks.
CoRR, 2019
Correction to: Automatic differentiation of Glaucoma visual field from non-glaucoma visual field using deep convolutional neural network.
BMC Medical Imaging, 2019
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019
The Equipment Nameplate Dataset for Scene Text Detection and Recognition<sup>∗</sup>.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Intelligent Glaucoma Diagnosis Via Active Learning And Adversarial Data Augmentation.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition.
Proceedings of the International Conference on Multimodal Interaction, 2019
Proceedings of the International Conference on Multimodal Interaction, 2019
Exploring Regularizations with Face, Body and Image Cues for Group Cohesion Prediction.
Proceedings of the International Conference on Multimodal Interaction, 2019
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Modulating Image Restoration With Continual Levels via Adaptive Feature Modification Layers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
IEEE Trans. Image Process., 2018
IEEE Trans. Image Process., 2018
Deep embedding convolutional neural network for synthesizing CT image from T1-Weighted MR image.
Medical Image Anal., 2018
Transferring Deep Object and Scene Representations for Event Recognition in Still Images.
Int. J. Comput. Vis., 2018
Structured Triplet Learning with POS-tag Guided Attention for Visual Question Answering.
CoRR, 2018
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward.
CoRR, 2018
Automatic differentiation of Glaucoma visual field from non-glaucoma visual filed using deep convolutional neural network.
BMC Medical Imaging, 2018
Structured Triplet Learning with POS-Tag Guided Attention for Visual Question Answering.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Visual Field Based Automatic Diagnosis of Glaucoma Using Deep Convolutional Neural Network.
Proceedings of the Computational Pathology and Ophthalmic Medical Image Analysis, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Deep Recurrent Multi-instance Learning with Spatio-temporal Features for Engagement Intensity Prediction.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Cascade Attention Networks For Group Emotion Recognition with Face, Body and Image Cues.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Proceedings of the IEEE International Conference on Cyborg and Bionic Systems, 2018
Proceedings of the British Machine Vision Conference 2018, 2018
Deep Reinforcement Learning for Unsupervised Video Summarization With Diversity-Representativeness Reward.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition.
IEEE Trans. Image Process., 2017
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs.
IEEE Trans. Image Process., 2017
IEEE Trans. Image Process., 2017
Improving scale invariant feature transform with local color contrastive descriptor for image classification.
J. Electronic Imaging, 2017
Neurocomputing, 2017
Deep auto-context convolutional neural networks for standard-dose PET image estimation from low-dose PET/MRI.
Neurocomputing, 2017
Deep Embedding Convolutional Neural Network for Synthesizing CT Image from T1-Weighted MR Image.
CoRR, 2017
Group emotion recognition with individual facial emotion CNNs and global image based CNNs.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017
Proceedings of the IEEE International Conference on Information and Automation, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
RPAN: An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017
Proceedings of the Working Notes of CLEF 2017, 2017
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017
Proceedings of the Biometric Recognition - 12th Chinese Conference, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
IEEE Trans. Image Process., 2016
IEEE Signal Process. Lett., 2016
IEEE Signal Process. Lett., 2016
Int. J. Comput. Vis., 2016
Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice.
Comput. Vis. Image Underst., 2016
CoRR, 2016
Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images.
CoRR, 2016
Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network.
CoRR, 2016
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016
Deep rehabilitation gait learning for modeling knee joints of lower-limb exoskeleton.
Proceedings of the 2016 IEEE International Conference on Robotics and Biomimetics, 2016
Proceedings of the IEEE International Conference on Information and Automation, 2016
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 International Conference on Advanced Robotics and Mechatronics, 2016
Proceedings of the Computer Vision - ECCV 2016, 2016
Proceedings of the Computer Vision - ECCV 2016, 2016
Proceedings of the Computer Vision - ECCV 2016, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016
Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Local Multi-Grouped Binary Descriptor With Ring-Based Pooling Configuration and Optimization.
IEEE Trans. Image Process., 2015
On feature-specific parameter learning in conditional random field-based approach for interactive object segmentation.
J. Electronic Imaging, 2015
CoRR, 2015
Deep classification of vehicle makers and models: The effectiveness of pre-training and data enhancement.
Proceedings of the 2015 IEEE International Conference on Robotics and Biomimetics, 2015
Proceedings of the 2015 IEEE International Conference on Robotics and Biomimetics, 2015
Proceedings of the 14th IAPR International Conference on Machine Vision Applications, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
2014
IEEE Trans. Image Process., 2014
Common Feature Discriminant Analysis for Matching Infrared Face Images to Optical Face Images.
IEEE Trans. Image Process., 2014
IEEE Signal Process. Lett., 2014
Signal Process. Image Commun., 2014
IEEE Trans. Pattern Anal. Mach. Intell., 2014
Motion boundary based sampling and 3D co-occurrence descriptors for action recognition.
Image Vis. Comput., 2014
Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014
A Joint Evaluation of Dictionary Learning and Feature Encoding for Action Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014
2013
J. Electronic Imaging, 2013
IET Signal Process., 2013
A Study on Unsupervised Dictionary Learning and Feature Encoding for Action Classification.
CoRR, 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the Neural Information Processing - 20th International Conference, 2013
Proceedings of the IEEE International Conference on Image Processing, 2013
Proceedings of the IEEE International Conference on Image Processing, 2013
Proceedings of the IEEE International Conference on Information and Automation, 2013
Proceedings of the IEEE International Conference on Information and Automation, 2013
Exploring dense trajectory feature and encoding methods for human interaction recognition.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013
Proceedings of the British Machine Vision Conference, 2013
Multi-scale Joint Encoding of Local Binary Patterns for Texture and Material Classification.
Proceedings of the British Machine Vision Conference, 2013
Exploring Motion Boundary based Sampling and Spatial-Temporal Context Descriptors for Action Recognition.
Proceedings of the British Machine Vision Conference, 2013
2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Voice conversion using Bayesian mixture of Probabilistic Linear Regressions and dynamic kernel features.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Proceedings of the Sixth International Conference on Distributed Smart Cameras, 2012
Proceedings of the Sixth International Conference on Distributed Smart Cameras, 2012
A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition.
Proceedings of the Computer Vision - ACCV 2012, 2012
2011
Regularized Maximum Likelihood Linear Regression Adaptation for Computer-Assisted Language Learning Systems.
IEICE Trans. Inf. Syst., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Neural Information Processing - 18th International Conference, 2011
Proceedings of the Neural Information Processing - 18th International Conference, 2011
Proceedings of the Neural Information Processing - 18th International Conference, 2011
BioSecure Signature Evaluation Campaign (ESRA'2011): evaluating systems on quality-based categories of skilled forgeries.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011
Structure-constrained distribution matching using quadratic programming and its application to pronunciation evaluation.
Proceedings of the First Asian Conference on Pattern Recognition, 2011
2010
IEEE Trans. Signal Process., 2010
New Gener. Comput., 2010
Face recognition based on gradient gabor feature and Efficient Kernel Fisher analysis.
Neural Comput. Appl., 2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Integration of multilayer regression analysis with structure-based pronunciation assessment.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
A Theory of Phase Singularities for Image Representation and its Applications to Object Tracking and Image Matching.
IEEE Trans. Image Process., 2009
Optimal event search using a structural cost function - improvement of structure to speech conversion.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
On invariant structural representation for speech recognition: theoretical validation and experimental improvement.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Analysis and utilization of MLLR speaker adaptation technique for learners' pronunciation evaluation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Mixture of Probabilistic Linear Regressions: A unified view of GMM-based mapping techiques.
Proceedings of the IEEE International Conference on Acoustics, 2009
Free hand sketch understanding using SVMs-chain modeling for spatial and temporal patterns.
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the International Conference on Image Processing, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2007
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007
Random discriminant structure analysis for automatic recognition of connected vowels.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
A Framework Toward Restoration of Writing Order from Single-Stroked Handwriting Image.
IEEE Trans. Pattern Anal. Mach. Intell., 2006
Recover Writing Trajectory from Multiple Stroked Image Using Bidirectional Dynamic Search.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Affine Invariant Dynamic Time Warping and its Application to Online Rotated Handwriting Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Recovering Drawing Order from Offline Handwritten Image Using Direction Context and Optimal Euler Path.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
A Novel Approach to Recover Writing Order From Single Stroke Offline Handwritten Images.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005
2004
Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition, 2004
2003
Vehicle Detection on Highway Based on Direction-Fractal Dimension.
Proceedings of the Wavelet Analysis and Its Applications, 2003