Shijian Lu

Orcid: 0000-0002-6766-2506

According to our database1, Shijian Lu authored at least 311 papers between 1997 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
S2Match: Self-paced sampling for data-limited semi-supervised learning.
Pattern Recognit., 2025

2024
A Survey of Label-Efficient Deep Learning for 3D Point Clouds.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Domain Adaptive LiDAR Point Cloud Segmentation via Density-Aware Self-Training.
IEEE Trans. Intell. Transp. Syst., October, 2024

Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking With Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Exploring Prototype-Anchor Contrast for Semantic Segmentation.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Vision-Language Models for Vision Tasks: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion.
Int. J. Comput. Vis., August, 2024

Self-Supervised 3D Action Representation Learning With Skeleton Cloud Colorization.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

MorphNeRF: Text-Guided 3D-Aware Editing via Morphing Generative Neural Radiance Fields.
IEEE Trans. Multim., 2024

Domain Adaptive LiDAR Point Cloud Segmentation With 3D Spatial Consistency.
IEEE Trans. Multim., 2024

Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation.
IEEE Trans. Multim., 2024

One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

Open-vocabulary object detection via debiased curriculum self-training.
Expert Syst. Appl., 2024

Open-Vocabulary Object Detection via Language Hierarchy.
CoRR, 2024

Historical Test-time Prompt Tuning for Vision Foundation Models.
CoRR, 2024

Foundation Models for Remote Sensing and Earth Observation: A Survey.
CoRR, 2024

Mitigating Object Hallucination via Concentric Causal Attention.
CoRR, 2024

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio.
CoRR, 2024

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models.
CoRR, 2024

Segment Anything with Multiple Modalities.
CoRR, 2024

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention.
CoRR, 2024

MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era.
CoRR, 2024

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders.
CoRR, 2024

MixLight: Borrowing the Best of both Spherical Harmonics and Gaussian Models.
CoRR, 2024

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting.
CoRR, 2024

DivAvatar: Diverse 3D Avatar Generation with a Single Prompt.
CoRR, 2024

Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model.
CoRR, 2024

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception.
CoRR, 2024

Domain Adaptation for Large-Vocabulary Object Detectors.
CoRR, 2024

Learning to Prompt Segment Anything Models.
CoRR, 2024

Efficient MAE towards Large-Scale Vision Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting.
Proceedings of the SIGGRAPH Asia 2024 Technical Communications, 2024

Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unlearnable Examples Detection via Iterative Filtering.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception.
Proceedings of the Computer Vision - ECCV 2024, 2024

FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Efficient Test-Time Adaptation of Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Weakly Supervised Monocular 3D Detection with a Single-View Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Masked AutoDecoder is Effective Multi-Task Vision Generalist.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Modeling Continuous Motion for 3D Point Cloud Object Tracking.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Audio-driven talking face generation with diverse yet realistic facial animations.
Pattern Recognit., December, 2023

Multimodal Image Synthesis and Editing: The Generative AI Era.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Meta-DETR: Image-Level Few-Shot Detection With Inter-Class Correlation Exploitation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Unsupervised Point Cloud Representation Learning With Deep Neural Networks: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification.
IEEE Trans. Multim., 2023

POCE: Pose-Controllable Expression Editing.
IEEE Trans. Image Process., 2023

Cross-Domain Facial Expression Recognition via Contrastive Warm up and Complexity-Aware Self-Training.
IEEE Trans. Image Process., 2023

Investigating Pose Representations and Motion Contexts Modeling for 3D Motion Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey.
CoRR, 2023

AI-Generated Images as Data Source: The Dawn of Synthetic Era.
CoRR, 2023

Noise-Tolerant Unsupervised Adapter for Vision-Language Models.
CoRR, 2023

Bridging Semantic Gaps for Language-Supervised Semantic Segmentation.
CoRR, 2023

Prompt Ensemble Self-training for Open-Vocabulary Domain Adaptation.
CoRR, 2023

3D Open-vocabulary Segmentation with Foundation Models.
CoRR, 2023

TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Online Map Vectorization for Autonomous Driving: A Rasterization Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Weakly Supervised 3D Open-vocabulary Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pose-Free Neural Radiance Fields via Implicit Pose Regularization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Domain Generalization via Balancing Training Difficulty and Model Capability.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin Memory.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Regularized Vector Quantization for Tokenized Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DA-DETR: Domain Adaptive Detection Transformer with Information Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FAC: 3D Representation Learning via Foreground Aware Feature Contrast.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

KD-DLGAN: Data Limited Image Generation via Knowledge Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Face Transformer: Towards High Fidelity and Accurate Face Swapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Class-Independent Regularization for Learning with Noisy Labels.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection.
IEEE Trans. Multim., 2022

Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection.
IEEE Trans. Multim., 2022

GMLight: Lighting Estimation via Geometric Distribution Approximation.
IEEE Trans. Image Process., 2022

AppFuse: An Appearance Fusion Framework for Saliency Cues.
IEEE Trans. Circuits Syst. Video Technol., 2022

Detection and rectification of arbitrary shaped scene texts by using text keypoints and links.
Pattern Recognit., 2022

Domain consistency regularization for unsupervised multi-source domain adaptive classification.
Pattern Recognit., 2022

Multi-level adversarial network for domain adaptive semantic segmentation.
Pattern Recognit., 2022

GCDB-UNet: A novel robust cloud detection approach for remote sensing images.
Knowl. Based Syst., 2022

DETR4D: Direct Multi-View 3D Object Detection with Sparse Attention.
CoRR, 2022

Domain Adaptive Scene Text Detection via Subcategorization.
CoRR, 2022

Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors.
CoRR, 2022

Latent Multi-Relation Reasoning for GAN-Prior based Image Super-Resolution.
CoRR, 2022

Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion.
CoRR, 2022

Contextual Text Block Detection towards Scene Text Understanding.
CoRR, 2022

VMRF: View Matching Neural Radiance Fields.
CoRR, 2022

Hierarchical Mask Calibration for Unified Domain Adaptive Panoptic Segmentation.
CoRR, 2022

Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting.
CoRR, 2022

Unsupervised Representation Learning for Point Clouds: A Survey.
CoRR, 2022

PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Masked Generative Adversarial Networks are Data-Efficient Generation Learners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

VMRF: View Matching Neural Radiance Fields.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Counterfactual Image Manipulation via CLIP.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

D-LC-Nets: Robust Denoising and Loop Closing Networks for LiDAR SLAM in Complicated Circumstances with Noisy Point Clouds.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Music-to-Dance Generation with Optimal Transport.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Auto-regressive Image Synthesis with Integrated Quantization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Bi-level Feature Alignment for Versatile Image Translation and Manipulation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting.
Proceedings of the Computer Vision - ECCV 2022, 2022

Contextual Text Block Detection Towards Scene Text Understanding.
Proceedings of the Computer Vision - ECCV 2022, 2022

Domain Adaptive Video Segmentation via Temporal Pseudo Supervision.
Proceedings of the Computer Vision - ECCV 2022, 2022

PTTR: Relational 3D Point Cloud Object Tracking with Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Accelerating DETR Convergence via Semantic-Aligned Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Spectral Unsupervised Domain Adaptation for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Modulated Contrast for Versatile Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Marginal Contrastive Correspondence for Guided Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Fourier Document Restoration for Robust Document Dewarping and Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Category Contrast for Unsupervised Domain Adaptation in Visual Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

GenCo: Generative Co-training for Generative Adversarial Networks with Limited Data.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Visual Navigation With Multiple Goals Based on Deep Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., 2021

Part-aware Progressive Unsupervised Domain Adaptation for Person Re-Identification.
IEEE Trans. Multim., 2021

Salient Object Detection by Fusing Local and Global Contexts.
IEEE Trans. Multim., 2021

PoT-GAN: Pose Transform GAN for Person Image Synthesis.
IEEE Trans. Image Process., 2021

Single-Image Dehazing via Compositional Adversarial Network.
IEEE Trans. Cybern., 2021

Scale variance minimization for unsupervised domain adaptation in image segmentation.
Pattern Recognit., 2021

Brain MRI super-resolution using coupled-projection residual network.
Neurocomputing, 2021

Multimodal Image Synthesis and Editing: A Survey.
CoRR, 2021

GenCo: Generative Co-training on Data-Limited Image Generation.
CoRR, 2021

SynLiDAR: Learning From Synthetic LiDAR Sequential Point Cloud for Semantic Segmentation.
CoRR, 2021

FBC-GAN: Diverse and Flexible Image Synthesis via Foreground-Background Composition.
CoRR, 2021

Bi-level Feature Alignment for Versatile Image Translation and Manipulation.
CoRR, 2021

Blind Image Super-Resolution via Contrastive Representation Learning.
CoRR, 2021

Spectral Unsupervised Domain Adaptation for Visual Recognition.
CoRR, 2021

Semi-Supervised Domain Adaptation via Adaptive and Progressive Feature Alignment.
CoRR, 2021

I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition.
CoRR, 2021

DA-DETR: Domain Adaptive Detection Transformer by Hybrid Attention.
CoRR, 2021

MLAN: Multi-Level Adversarial Network for Domain Adaptive Semantic Segmentation.
CoRR, 2021

Meta-DETR: Image-Level Few-Shot Object Detection with Inter-Class Correlation Exploitation.
CoRR, 2021

FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation.
CoRR, 2021

GMLight: Lighting Estimation via Geometric Distribution Approximation.
CoRR, 2021

The Evolution and Determinants of Interorganizational Coinvention Networks in New Energy Vehicles: Evidence from Shenzhen, China.
Complex., 2021

PNPDet: Efficient Few-shot Detection without Forgetting via Plug-and-Play Sub-networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Defect-GAN: High-Fidelity Defect Synthesis for Automated Defect Inspection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Model Adaptation: Historical Contrastive Learning for Unsupervised Domain Adaptation without Source Data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Diverse Image Inpainting with Bidirectional and Autoregressive Transformers.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Dual Learning Music Composition and Dance Choreography.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Sparse Needlets for Lighting Estimation with Spherical Transport Loss.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

WaveFill: A Wavelet-based Generation Network for Image Inpainting.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Domain Adaptive Video Segmentation via Temporal Consistency Regularization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RDA: Robust Domain Adaptation via Fourier Adversarial Attacking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unbalanced Feature Transport for Exemplar-Based Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Cross-View Regularization for Domain Adaptive Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FSDR: Frequency Space Domain Randomization for Domain Generalization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

EMLight: Lighting Estimation via Spherical Distribution Approximation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Matching on Sets: Conquer Occluded Person Re-identification Without Alignment.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Synergistic 2D/3D Convolutional Neural Network for Hyperspectral Image Classification.
Remote. Sens., 2020

Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection.
CoRR, 2020

A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Training Lightweight yet Competent Network via Transferring Complementary Features.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

AMLN: Adversarial-Based Mutual Learning Network for Online Knowledge Distillation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Multiple Expert Brainstorming for Domain Adaptive Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-order Feature Analysis.
Proceedings of the Computer Vision - ECCV 2020, 2020

LEED: Label-Free Expression Editing via Disentanglement.
Proceedings of the Computer Vision - ECCV 2020, 2020

Contextual-Relation Consistent Domain Adaptation for Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Suppressing Uncertainties for Large-Scale Facial Expression Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Adversarial Image Composition with Auxiliary Illumination.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

RF-GAN: A Light and Reconfigurable Network for Unpaired Image-to-Image Translation.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
SS-HCNN: Semi-Supervised Hierarchical Convolutional Neural Network for Image Classification.
IEEE Trans. Image Process., 2019

CAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing Imagery.
IEEE Trans. Geosci. Remote. Sens., 2019

Attention driven person re-identification.
Pattern Recognit., 2019

A pooling based scene text proposal technique for scene text reading in the wild.
Pattern Recognit., 2019

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard.
CoRR, 2019

Spatial-Aware GAN for Unsupervised Person Re-identification.
CoRR, 2019

Coupled-Projection Residual Network for MRI Super-Resolution.
CoRR, 2019

MSR: Multi-Scale Shape Regression for Scene Text Detection.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Exploring the Task Cooperation in Multi-goal Visual Navigation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Spatial Fusion GAN for Image Synthesis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards Natural and Accurate Future Motion Prediction of Humans and Animals.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Superpixel Guided Deep-Sparse-Representation Learning for Hyperspectral Image Classification.
IEEE Trans. Circuits Syst. Video Technol., 2018

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

CCUS evaluation and simulation in a Chinese oil field.
Int. J. Simul. Process. Model., 2018

Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes.
Proceedings of the Computer Vision - ECCV 2018, 2018

Accurate Scene Text Detection Through Border Semantics Awareness and Bootstrapping.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Unsupervised Feature Learning for Land-Use Scene Recognition.
IEEE Trans. Geosci. Remote. Sens., 2017

Subcategory-Aware Feature Selection and SVM Optimization for Automatic Aerial Image-Based Oil Spill Inspection.
IEEE Trans. Geosci. Remote. Sens., 2017

Object-Level Motion Detection From Moving Cameras.
IEEE Trans. Circuits Syst. Video Technol., 2017

Robust Vehicle Detection and Viewpoint Estimation With Soft Discriminative Mixture Model.
IEEE Trans. Circuits Syst. Video Technol., 2017

Accurate recognition of words in scenes without character segmentation using recurrent neural network.
Pattern Recognit., 2017

The research of net carbon reduction model for CCS-EOR projects and cases study.
Int. J. Simul. Process. Model., 2017

YoTube: Searching Action Proposal via Recurrent and Static Regression Networks.
CoRR, 2017

Accurate HEp-2 cell classification based on sparse bag of words coding.
Comput. Medical Imaging Graph., 2017

Text-Edge-Box: An Object Proposal Approach for Scene Texts Localization.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Enriched Deep Recurrent Visual Attention Model for Multiple Object Recognition.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

An integrated approach to visual attention modelling using spatial-temporal saliency and objectness.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Search video action proposal with recurrent and static YOLO.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Saliency-based change detection for aerial and remote sensing imageries.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Wordfence: Text detection in natural images with border awareness.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Max-Pooling Based Scene Text Proposal for Scene Text Detection.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

ICDAR2017 Competition on Reading Chinese Text in the Wild (RCTW-17).
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal.
Proceedings of the IEEE International Conference on Computer Vision, 2017

WeText: Scene Text Detection under Weak Supervision.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Accurate and Efficient Traffic Sign Detection Using Discriminative AdaBoost and Support Vector Regression.
IEEE Trans. Veh. Technol., 2016

Multiple Human Identification and Cosegmentation: A Human-Oriented CRF Approach With Poselets.
IEEE Trans. Multim., 2016

Accurate HEp-2 cell classification based on Sparse Coding of Superpixels.
Pattern Recognit. Lett., 2016

Multilingual scene character recognition with co-occurrence of histogram of oriented gradients.
Pattern Recognit., 2016

Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation.
J. Vis. Commun. Image Represent., 2016

Modelling and evaluating CCUS: a survey.
Int. J. Comput. Appl. Technol., 2016

Classification of HEp-2 cells using distributed dictionary learning.
Proceedings of the 24th European Signal Processing Conference, 2016

Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Context-aware vocabulary tree for mobile landmark recognition.
J. Vis. Commun. Image Represent., 2015

Scene text extraction based on edges and support vector regression.
Int. J. Document Anal. Recognit., 2015

Vegetation coverage detection from very high resolution satellite imagery.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Detection of high-grade atypia nuclei in breast cancer imaging.
Proceedings of the Medical Imaging 2015: Digital Pathology, 2015

Multimodal Dictionary Learning and Joint Sparse Representation for HEp-2 Cell Classification.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015 - 18th International Conference Munich, Germany, October 5, 2015

Sparse non-parametric Bayesian model for HEP-2 cell image classification.
Proceedings of the 12th IEEE International Symposium on Biomedical Imaging, 2015

Context-aware lane marking detection on urban roads.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

DPM revisited: Utilizing root-part spatial distribution for vehicle viewpoint estimation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Robust text segmentation using graph cut.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Segmented handwritten text recognition with recurrent neural network classifiers.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

ICDAR 2015 competition on Robust Reading.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Text Flow: A Unified Text Detection System in Natural Scene Images.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Diagnosing state-of-the-art object proposal methods.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Robust and Efficient Saliency Modeling from Image Co-Occurrence Histograms.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

An engineering-economic model for CO<sub>2</sub> pipeline transportation in China.
Int. J. Comput. Appl. Technol., 2014

A System for Parking Lot Marking Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Scene Text Segmentation with Multi-level Maximally Stable Extremal Regions.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Character Recognition in Natural Scenes Using Convolutional Co-occurrence HOG.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Automated Prediction of Glasgow Outcome Scale for Traumatic Brain Injury.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

A Bag of Words Based Approach for Classification of HEp-2 Cell Images.
Proceedings of the 1st Workshop on Pattern Recognition Techniques for Indirect Immunofluorescence Images, 2014

Automatic CAD System for HEp-2 Cell Image Classification.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Context-aware codebook learning for mobile landmark recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

3D reconstruction of neurons in electron microscopy images.
Proceedings of the 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2014

Accurate Scene Text Recognition Based on Recurrent Neural Network.
Proceedings of the Computer Vision - ACCV 2014, 2014

Search Guided Saliency.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Robust Document Image Binarization Technique for Degraded Document Images.
IEEE Trans. Image Process., 2013

Gradient Vector Flow and Grouping-Based Method for Arbitrarily Oriented Scene Text Detection in Video Images.
IEEE Trans. Circuits Syst. Video Technol., 2013

Scene Text Recognition Using Co-occurrence of Histogram of Oriented Gradients.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Self Learning Classification for Degraded Document Images by Sparse Representation.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Adaptive picture-in-picture technology based on visual saliency.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Multioriented Video Scene Text Detection Through Bayesian Classification and Boundary Growing.
IEEE Trans. Circuits Syst. Video Technol., 2012

Restoration of motion blurred document images.
Proceedings of the ACM Symposium on Applied Computing, 2012

Autonomous Viewpoint Control from Saliency.
Proceedings of the Biomimetic and Biohybrid Systems - First International Conference, 2012

A learning framework for degraded document image binarization using Markov Random Field.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Character extraction in web image for text recognition.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Saliency Modeling from Image Histograms.
Proceedings of the Computer Vision - ECCV 2012, 2012

New Spatial-Gradient-Features for Video Script Identification.
Proceedings of the 10th IAPR International Workshop on Document Analysis Systems, 2012

An Effective Staff Detection and Removal Technique for Musical Documents.
Proceedings of the 10th IAPR International Workshop on Document Analysis Systems, 2012

Visual Attention is Attracted by Text Features Even in Scenes without Text.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012

2011
Accurate and Efficient Optic Disc Detection and Segmentation by a Circular Transformation.
IEEE Trans. Medical Imaging, 2011

Automatic Optic Disc Detection From Retinal Images by a Line Operator.
IEEE Trans. Biomed. Eng., 2011

Blurred image region detection and classification.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Combination of Document Image Binarization Techniques.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Video Character Recognition through Hierarchical Classification.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

A New Fourier-Moments Based Video Word and Character Extraction Method for Recognition.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Video Script Identification Based on Text Lines.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

2010
Automated Layer Segmentation of Optical Coherence Tomography Images.
IEEE Trans. Biomed. Eng., 2010

Identification of scripts and orientations of degraded document images.
Pattern Anal. Appl., 2010

Document image binarization using background estimation and stroke edges.
Int. J. Document Anal. Recognit., 2010

Harvesting discourse strategies for rapid prototyping of tailored information delivery systems.
Proceedings of the 2010 International Symposium on Visual Information Communication, 2010

Automatic optic disc segmentation based on image brightness and contrast.
Proceedings of the Medical Imaging 2010: Image Processing, 2010

Enhancement of optic cup detection through an improved vessel kink detection framework.
Proceedings of the Medical Imaging 2010: Computer-Aided Diagnosis, San Diego, 2010

Classification of left and right eye retinal images.
Proceedings of the Medical Imaging 2010: Computer-Aided Diagnosis, San Diego, 2010

Automatic classification of pathological myopia in retinal fundus images using PAMELA.
Proceedings of the Medical Imaging 2010: Computer-Aided Diagnosis, San Diego, 2010

A Self-Training Learning Document Binarization Framework.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Automatic macula detection from retinal images by a line operator.
Proceedings of the International Conference on Image Processing, 2010

Automatic optic disc detection through background estimation.
Proceedings of the International Conference on Image Processing, 2010

Binarization of historical document images using the local maximum and minimum.
Proceedings of the Ninth IAPR International Workshop on Document Analysis Systems, 2010

2009
Photometric correction of retinal images by polynomial interpolation.
Proceedings of the International Conference on Image Processing, 2009

Neuro-Retinal Optic Cup Detection in Glaucoma Diagnosis.
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009

Computerized Systems for Cataract Grading.
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009

2008
Retrieval of machine-printed Latin documents through Word Shape Coding.
Pattern Recognit., 2008

Script and Language Identification in Noisy and Degraded Document Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Document Image Retrieval through Word Shape Coding.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Learning adaptive subject-independent P300 models for EEG-based brain-computer interfaces.
Proceedings of the International Joint Conference on Neural Networks, 2008

Subject-independent brain computer interface through boosting.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

2007
Specifying documents in an adaptive hypermedia generation environment: an authoring tool prototype.
Int. J. Learn. Technol., 2007

Fast and Accurate Detection of Document Skew and Orientation.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Keyword Spotting and Retrieval of Document Images Captured by a Digital Camera.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Binarization of Badly Illuminated Document Images through Shading Estimation and Compensation.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Automatic Detection of Document Script and Orientation.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Identification of Latin-Based Languages through Character Stroke Categorization.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

A Fast Keyword-Spotting Technique.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

A Figure Image Processing System.
Proceedings of the Graphics Recognition. Recent Advances and New Opportunities, 2007

Thresholding of badly illuminated document images through photometric correction.
Proceedings of the 2007 ACM Symposium on Document Engineering, 2007

2006
A partition approach for the restoration of camera images of planar and curled document.
Image Vis. Comput., 2006

Information Access Efficiency: A Measure and Case Study.
Aust. J. Intell. Inf. Process. Syst., 2006

Automatic document orientation detection and categorization through document vectorization.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Camera Text Recognition based on Perspective Invariants.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Document Flattening through Grid Modeling and Regularization.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Language Identification in Degraded and Distorted Document Images.
Proceedings of the Document Analysis Systems VII, 7th International Workshop, 2006

The Restoration of Camera Documents Through Image Segmentation.
Proceedings of the Document Analysis Systems VII, 7th International Workshop, 2006

Script and Language Identification in Degraded and Distorted Document Images.
Proceedings of the Proceedings, 2006

2005
Perspective rectification of document images using fuzzy set and morphological operations.
Image Vis. Comput., 2005

A Novel Approach for Web-based Data Input Panel Design.
Proceedings of the Fifth International Conference on Computer and Information Technology (CIT 2005), 2005

2004
Document image rectification using fuzzy sets and morphological operators.
Proceedings of the 2004 International Conference on Image Processing, 2004

Myriad: An Architecture for Contextualized Information Retrieval and Delivery.
Proceedings of the Adaptive Hypermedia and Adaptive Web-Based Systems, 2004

2003
Generating UML diagrams from task models.
Proceedings of the 4th Annual Conference of the ACM Special Interest Group on Computer-Human Interaction, 2003

2002
Automated knowledge acquisition for instructional text generation.
Proceedings of the 20st annual international conference on Documentation, 2002

2000
Generating Personal Travel Guides from Discourse Plans.
Proceedings of the Adaptive Hypermedia and Adaptive Web-Based Systems, 2000

1999
Automatic Acquisition of Task Models from Object Oriented Design Specifications: A Case Study.
Proceedings of the Object-Oriented Technology, ECOOP'99 Workshop Reader, 1999

1998
Incorporating work, process and task analysis into commercial and industrial object-oriented systems development.
ACM SIGCHI Bull., 1998

The Visualisation of Web Usage.
Proceedings of the Engineering for Human-Computer Interaction, 1998

Toward the Automatic Construction of Task Models from Object-Oriented Diagrams.
Proceedings of the Engineering for Human-Computer Interaction, 1998

1997
Groupware Support Tools for Collaborative Software Engineering.
Proceedings of the 30th Annual Hawaii International Conference on System Sciences (HICSS-30), 1997

A Distributed Multimedia News Archive Service for Interactive Television.
Proceedings of the 30th Annual Hawaii International Conference on System Sciences (HICSS-30), 1997


  Loading...