Tong Lu

Orcid: 0009-0005-5368-5319

According to our database, Tong Lu authored at least 321 papers between 1992 and 2024.

Bibliography

2024
GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions.
Int. J. Comput. Vis., October, 2024

Revisiting of AlphaStar.
IEEE Trans. Games, June, 2024

A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

Development and Calibration of 532 nm Standard Aerosol Lidar with Low Blind Area.
Remote. Sens., February, 2024

Restoring vision in hazy weather with hierarchical contrastive learning.
Pattern Recognit., January, 2024

A new deep CNN for 3D text localization in the wild through shadow removal.
Comput. Vis. Image Underst., January, 2024

TTS: Hilbert Transform-Based Generative Adversarial Network for Tattoo and Scene Text Spotting.
IEEE Trans. Multim., 2024

Feature Selection Based on Intrusive Outliers Rather Than All Instances.
IEEE Trans. Image Process., 2024

WF-Transformer: Learning Temporal Features for Accurate Anonymous Traffic Identification by Using Transformer Networks.
IEEE Trans. Inf. Forensics Secur., 2024

Robust Lidar-Radar Composite Cloud Boundary Detection Method With Rainfall Pixels Removal.
IEEE Trans. Geosci. Remote. Sens., 2024

An extended model for crowded evacuation considering stampede on inclined staircases.
Simul. Model. Pract. Theory, 2024

Similarity search on social networks with incremental graph indexing based on probabilistic inference.
Int. J. Web Inf. Syst., 2024

A robust script independent handwriting system for gender identification.
Expert Syst. Appl., 2024

Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance.
CoRR, 2024

MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding.
CoRR, 2024

EAR: Edge-Aware Reconstruction of 3-D vertebrae structures from bi-planar X-ray images.
CoRR, 2024

MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity.
CoRR, 2024

EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation.
CoRR, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks.
CoRR, 2024

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites.
CoRR, 2024

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding.
CoRR, 2024

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures.
CoRR, 2024

PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal.
CoRR, 2024

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer.
CoRR, 2024

Joint Alignment Networks For Few-Shot Website Fingerprinting Attack.
Comput. J., 2024

Multi-objective constraints for path planning in screw fixation of scaphoid fractures.
Comput. Biol. Medicine, 2024

Task Scheduling in Vehicular Networks: A Multi-Agent Reinforcement Learning Based Reverse Auction Mechanism.
Proceedings of the 2024 The 6th World Symposium on Software Engineering (WSSE), 2024

Few-shot Semantic Segmentation via Perceptual Attention and Spatial Control.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Evaluation of Cloud Recovery System for Financial Applications Based on Hadoop.
Proceedings of the 7th IEEE International Symposium on Telecommunication Technologies, 2024

SVT: Spectral Video Transformer for Video Restoration in Under-Display Camera.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Improved Current Tracking Performance of PMSM at Low Switching Frequency with Double-updating Scheme and Perfect Tracking Control.
Proceedings of the IEEE International Conference on Advanced Intelligent Mechatronics, 2024

CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

AVSegFormer: Audio-Visual Segmentation with Transformer.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Digital Twin of Intelligent Small Surface Defect Detection with Cyber-manufacturing Systems.
ACM Trans. Internet Techn., November, 2023

BasicTAD: An astounding RGB-Only baseline for temporal action detection.
Comput. Vis. Image Underst., July, 2023

Few-shot website fingerprinting attack with cluster adaptation.
Comput. Networks, June, 2023

Writer age estimation through handwriting.
Multim. Tools Appl., May, 2023

Integration- and separation-aware adversarial model for cerebrovascular segmentation from TOF-MRA.
Comput. Methods Programs Biomed., May, 2023

A learnable Gabor Convolution kernel for vessel segmentation.
Comput. Biol. Medicine, May, 2023

Classification of aesthetic natural scene images using statistical and semantic features.
Multim. Tools Appl., April, 2023

A new ontology-based multimodal classification system for social media images of personality traits.
Signal Image Video Process., March, 2023

Fuzzy Assessment of Ecological Security on the Qinghai-Tibet Plateau Based on Pressure-State-Response Framework.
Remote. Sens., March, 2023

Coupled Thorens and Soil Conservation Service Models for Soil Erosion Assessment in a Loess Plateau Watershed, China.
Remote. Sens., February, 2023

AGV-Based Vehicle Transportation in Automated Container Terminals: A Survey.
IEEE Trans. Intell. Transp. Syst., January, 2023

Statistical Network Analysis of High-Dimensional Neuroimaging Data With Complex Topological Structures.
PhD thesis, 2023

A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images.
IEEE Trans. Image Process., 2023

Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.
CoRR, 2023

Deep Video Restoration for Under-Display Camera.
CoRR, 2023

AVSegFormer: Audio-Visual Segmentation with Transformer.
CoRR, 2023

VideoLLM: Modeling Video Sequence with Large Language Models.
CoRR, 2023

Champion Solution for the WSDM2023 Toloka VQA Challenge.
CoRR, 2023

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Graph Propagation Transformer for Graph Representation Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

A Novel Gold Futures Price Prediction Model based on PCA-AGRU.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

MRSN: Multi-Relation Support Network for Video Action Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

ELAN: Enhancing Temporal Action Detection with Location Awareness.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Vision Transformer Adapter for Dense Predictions.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ICDAR 2023 Competition on Born Digital Video Text Question Answering.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Memory-and-Anticipation Transformer for Online Action Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FB-BEV: BEV Representation from Forward-Backward View Transformations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DDP: Diffusion Model for Dense Visual Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Lightweight Website Fingerprinting Defense Method Based on Distribution Distance Padding.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Probabilistic Inference Based Incremental Graph Index for Similarity Search on Social Networks.
Proceedings of the Collaborative Computing: Networking, Applications and Worksharing, 2023

End-to-End Recognition Algorithm for Degraded Invoice Text.
Proceedings of the 16th International Congress on Image and Signal Processing, 2023

Fine-grained IoT device identification method based on self-supervised ViT.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023

A New Transformer-Based Approach for Text Detection in Shaky and Non-shaky Day-Night Video.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
A Knowledge Enforcement Network-Based Approach for Classifying a Photographer's Images.
Int. J. Pattern Recognit. Artif. Intell., December, 2022

A New Deep Wavefront Based Model for Text Localization in 3D Video.
IEEE Trans. Circuits Syst. Video Technol., 2022

An Episodic Learning Network for Text Detection on Human Bodies in Sports Images.
IEEE Trans. Circuits Syst. Video Technol., 2022

Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning.
IEEE Trans. Games, 2022

A new method for detection and prediction of occluded text in natural scene images.
Signal Process. Image Commun., 2022

Oil palm tree counting in drone images.
Pattern Recognit. Lett., 2022

Discriminative feature selection with directional outliers correcting for data classification.
Pattern Recognit., 2022

A novel forget-update module for few-shot domain generalization.
Pattern Recognit., 2022

PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

A new deep model for family and non-family photo identification.
Multim. Tools Appl., 2022

Fuzzy and genetic algorithm based approach for classification of personality traits oriented social media images.
Knowl. Based Syst., 2022

Rotating Machinery Fault Identification via Adaptive Convolutional Neural Network.
J. Sensors, 2022

On Efficient Reinforcement Learning for Full-length Game of StarCraft II.
J. Artif. Intell. Res., 2022

Text line segmentation from struck-out handwritten document images.
Expert Syst. Appl., 2022

PVT v2: Improved baselines with Pyramid Vision Transformer.
Comput. Vis. Media, 2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges.
CoRR, 2022

Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022.
CoRR, 2022

Exploring Detection-based Method For Speaker Diarization @ Ego4D Audio-only Diarization Challenge 2022.
CoRR, 2022

A Survey of Deep Face Restoration: Denoise, Super-Resolution, Deblur, Artifact Removal.
CoRR, 2022

Uncertainty-based Network for Few-shot Image Classification.
CoRR, 2022

BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers.
CoRR, 2022

Incremental Few-Shot Semantic Segmentation via Embedding Adaptive-Update and Hyper-class Representation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Uncertainty-Based Network for Few-Shot Image Classification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Anomaly Handwritten Text Detection for Automatic Descriptive Answer Evaluation.
Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition, 2022

SeedFormer: Patch Seeds Based Point Cloud Completion with Upsample Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

A Modified Generative Adversarial Network for Reconstruction of Compressed Sensing MRI.
Proceedings of the IEEE Intl. Conf. on Dependable, 2022

An improved attention mechanism based YOLOv4 structure for lung nodule detection.
Proceedings of the IEEE Intl. Conf. on Dependable, 2022

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dynamics and transformation control of a wheeled inverted pendulum mobile robot.
Proceedings of the IEEE/ASME International Conference on Advanced Intelligent Mechatronics, 2022

DCAN: Improving Temporal Action Detection via Dual Context Aggregation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A New Foreground-Background based Method for Behavior-Oriented Social Media Image Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Arbitrarily-Oriented Text Detection in Low Light Natural Scene Images.
IEEE Trans. Multim., 2021

A new DCT-PCM method for license plate number detection in drone images.
Pattern Recognit. Lett., 2021

A new context-based feature for classification of emotions in photographs.
Multim. Tools Appl., 2021

A New Method for Detecting Altered Text in Document Images.
Int. J. Pattern Recognit. Artif. Intell., 2021

A New Hybrid Method for Caption and Scene Text Classification in Action Video Images.
Int. J. Pattern Recognit. Artif. Intell., 2021

Improved Ring Radius Transform-Based Reconstruction for Video Character Recognition.
Int. J. Pattern Recognit. Artif. Intell., 2021

DCT-phase statistics for forged IMEI numbers and air ticket detection.
Expert Syst. Appl., 2021

FAST: Searching for a Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation.
CoRR, 2021

ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter.
CoRR, 2021

Panoptic SegFormer.
CoRR, 2021

Learning Class-level Prototypes for Few-shot Learning.
CoRR, 2021

PVTv2: Improved Baselines with Pyramid Vision Transformer.
CoRR, 2021

PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text.
CoRR, 2021

An Introduction of mini-AlphaStar.
CoRR, 2021

When is it permissible for artificial intelligence to lie? A trust-based approach.
CoRR, 2021

The impact of CCT on driving safety in the normal and accident situation: A VR-based experimental study.
Adv. Eng. Informatics, 2021

Spectrum-to-Kernel Translation for Accurate Blind Image Super-Resolution.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Time Window Based Genetic Algorithm for Multi-AGVs Conflict-free Path Planning in Automated Container Terminals.
Proceedings of the IEEE International Conference on Industrial Engineering and Engineering Management, 2021

A Study of Logistics Efficiency Prediction in Dalian Port Based on Gray-BP Neural Network.
Proceedings of the ICMSS 2021: The 5th International Conference on Management Engineering, 2021

A Novel Attention Enhanced Residual-In-Residual Dense Network for Text Image Super-Resolution.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

ARNet: Active-Reference Network for Few-Shot Image Semantic Segmentation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

A Connected Component-Based Deep Learning Model for Multi-type Struck-Out Component Classification.
Proceedings of the Document Analysis and Recognition, 2021

DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Adaptive Graph Convolution for Point Cloud Analysis.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TAM: Temporal Adaptive Module for Video Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Hardware Trojan Detection Method for Gate-Level Netlists Based on the Idea of Few-Shot Learning.
Proceedings of the 21st International Conference on Communication Technology, 2021

Classification of Chest X-ray Images Using Deep Convolutional Neural Network.
Proceedings of the IEEE Intl Conf on Dependable, 2021

Frequency Consistent Adaptation for Real World Super Resolution.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Dynamic Sampling Networks for Efficient Action Recognition in Videos.
IEEE Trans. Image Process., 2020

A new Fractal Series Expansion based enhancement model for license plate recognition.
Signal Process. Image Commun., 2020

Delaunay triangulation based text detection from multi-view images of natural scene.
Pattern Recognit. Lett., 2020

Graph attention network for detecting license plates in crowded street scenes.
Pattern Recognit. Lett., 2020

A new unified method for detecting text from marathon runners and sports players in video (PR-D-19-01078R2).
Pattern Recognit., 2020

A new augmentation-based method for text detection in night and day license plate images.
Multim. Tools Appl., 2020

Application of Back Propagation Neural Network Model in Prediction and Diagnosis of Osteoporosis.
J. Medical Imaging Health Informatics, 2020

Fault detection for switched systems with all modes unstable based on interval observer.
Inf. Sci., 2020

Forged text detection in video, scene, and document images.
IET Image Process., 2020

Rotation invariant angle-density based features for an ice image classification system.
Expert Syst. Appl., 2020

Channel Relationship Prediction with Forget-Update Module for Few-shot Classification.
CoRR, 2020

A New Unified Method for Detecting Text from Marathon Runners and Sports Players in Video.
CoRR, 2020

A New Multiple-Distribution GAN Model to Solve Complexity in End-to-End Chromosome Karyotyping.
Complex., 2020

Graphology based handwritten character analysis for human behaviour identification.
CAAI Trans. Intell. Technol., 2020

Research on Privacy Paradox in Social Networks Based on Evolutionary Game Theory and Data Mining.
Proceedings of the 19th Wuhan International Conference on E-Business, 2020

SEE-LPR: A Semantic Segmentation Based End-to-End System for Unconstrained License Plate Detection and Recognition.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Multi-scale Comparison Network for Few-Shot Learning.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

A New DCT-FFT Fusion Based Method for Caption and Scene Text Classification in Action Video Images.
Proceedings of the Pattern Recognition and Artificial Intelligence, 2020

Local Gradient Difference Features for Classification of 2D-3D Natural Scene Text Images.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Chebyshev-Harmonic-Fourier-Moments and Deep CNNs for Detecting Forged Handwriting.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Multi-scale Relational Reasoning with Regional Attention for Visual Question Answering.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Dynamic Low-Light Image Enhancement for Object Detection via End-to-End Training.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

IFSM: An Iterative Feature Selection Mechanism for Few-Shot Image Classification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Logistics Efficiency Analysis of Dalian Port in China Considering Environmental Factors and Random Errors: Measurement based on Three-Stage DEA-Tobit Model.
Proceedings of the ICMSS 2020: 2020 4th International Conference on Management Engineering, 2020

The Effect of Studying with Concept Maps Online on Scientific Argumentation in Higher Education.
Proceedings of the ICETC'20: 12th International Conference on Education Technology and Computers, 2020

Small Area Configurable Deep Neural Network Accelerator for IoT System.
Proceedings of the 20th IEEE International Conference on Communication Technology, 2020

DexFus: An Android Obfuscation Technique Based on Dalvik Bytecode Translation.
Proceedings of the Frontiers in Cyber Security - Third International Conference, 2020

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting.
Proceedings of the Computer Vision - ECCV 2020, 2020

A New Common Points Detection Method for Classification of 2D and 3D Texts in Video/Scene Images.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

A New Context-Based Method for Restoring Occluded Text in Natural Scene Images.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

TEINet: Towards an Efficient Architecture for Video Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images.
IEEE Trans. Circuits Syst. Video Technol., 2019

Curved text detection in blurred/non-blurred video/scene images.
Multim. Tools Appl., 2019

A geometric and fractional entropy-based method for family photo classification.
Expert Syst. Appl. X, 2019

Fractional means based method for multi-oriented keyword spotting in video/scene/license plate images.
Expert Syst. Appl., 2019

A novel character segmentation-reconstruction approach for license plate recognition.
Expert Syst. Appl., 2019

An automatic zone detection system for safe landing of UAVs.
Expert Syst. Appl., 2019

Shape Robust Text Detection with Progressive Scale Expansion Network.
CoRR, 2019

Efficient Reinforcement Learning with a Mind-Game for Full-Length StarCraft II.
CoRR, 2019

Channel-wise attention model-based fire and rating level detection in video.
CAAI Trans. Intell. Technol., 2019

An Automatic System for Generating Artificial Fake Character Images.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Hierarchical Bayesian Network Based Incremental Model for Flood Prediction.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

A Novel Group-Aware Pruning Method for Few-shot Learning.
Proceedings of the International Joint Conference on Neural Networks, 2019

A Novel Two-Factor Attention Encoder-Decoder Network through Combining Temporal and Prior Knowledge for Weather Forecasting.
Proceedings of the International Joint Conference on Neural Networks, 2019

Cropout: A General Mechanism for Reducing Overfitting on Convolutional Neural Networks.
Proceedings of the International Joint Conference on Neural Networks, 2019

Multimodal Image Captioning Through Combining Reinforced Cross Entropy Loss and Stochastic Deprecation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

A Text-Context-Aware CNN Network for Multi-oriented and Multi-language Scene Text Detection.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

CRNN Based Jersey-Bib Number/Text Recognition in Sports and Marathon Images.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

GARN: A Novel Generative Adversarial Recognition Network for End-to-End Scene Character Recognition.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Age Estimation using Disconnectedness Features in Handwriting.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Shape Robust Text Detection With Progressive Scale Expansion Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

New Moments Based Fuzzy Similarity Measure for Text Detection in Distorted Social Media Images.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

A New Forged Handwriting Detection Method Based on Fourier Spectral Density and Variation.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

Structure Function Based Transform Features for Behavior-Oriented Social Media Image Classification.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

A Spatial Density and Phase Angle Based Correlation for Multi-type Family Photo Identification.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

A New U-Net Based License Plate Enhancement Model in Night and Day Images.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

On Reinforcement Learning for Full-Length Game of StarCraft.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Riesz Fractional Based Model for Enhancing License Plate Detection and Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2018

Rough-fuzzy based scene categorization for text detection and recognition in video.
Pattern Recognit., 2018

A New COLD Feature based Handwriting Analysis for Ethnicity/Nationality Identification.
CoRR, 2018

Shape Robust Text Detection with Progressive Scale Expansion Network.
CoRR, 2018

CNN-RNN based method for license plate recognition.
CAAI Trans. Intell. Technol., 2018

Symmetry features for license plate classification.
CAAI Trans. Intell. Technol., 2018

New Fusion Based Enhancement for Text Detection in Night Video Footage.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Text Component Reconstruction for Tracking in Video.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Context and Temporal Aware Attention Model for Flood Prediction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Hand Pose Estimation with Attention-and-Sequence Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

A Novel 3D Human Action Recognition Framework for Video Content Analysis.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Cloud of Line Distribution and Random Forest Based Text Detection from Natural/Video Scene Images.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Mixed Link Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Word-Wise Handwriting Based Gender Identification Using Multi-Gabor Response Fusion.
Proceedings of the Document Analysis and Recognition, 2018

Local and Global Bayesian Network based Model for Flood Prediction.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Fourier Transform based Features for Clean and Polluted Water Image Classification.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Context-Aware Attention LSTM Network for Flood Prediction.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Em-SLAM: a Fast and Robust Monocular SLAM Method for Embedded Systems.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Multi-Gradient Directional Features for Gender Identification.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Weighted-Gradient Features for Handwritten Line Segmentation.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

End-To-End Chromosome Karyotyping with Data Augmentation Using GAN.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A New RGB Based Fusion for Forged IMEI Number Detection in Mobile Images.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Adaptive Multi-Gradient Kernels for Handwriting Based Gender Identification.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

New COLD Feature Based Handwriting Analysis for Ethnicity/Nationality Identification.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

A New Shadow Detection and Depth Removal Method for 3D Text Recognition in Scene Images.
Proceedings of the 2018 2nd International Conference on Computer Science and Artificial Intelligence, 2018

2017
FreeScup: A Novel Platform for Assisting Sculpture Pose Design.
IEEE Trans. Multim., 2017

Learning discriminated and correlated patches for multi-view object detection using sparse coding.
Pattern Recognit., 2017

Fractals based multi-oriented text detection system for recognition in mobile video images.
Pattern Recognit., 2017

A new multi-modal approach to bib number/text detection and recognition in Marathon images.
Pattern Recognit., 2017

Preoperative surgical Planning for robot-Assisted Liver tumour Ablation Therapy based on Collision-Free reachable Workspaces.
Int. J. Robotics Autom., 2017

Script independent approach for multi-oriented text detection in scene image.
Neurocomputing, 2017

Cloud of Line Distribution for Arbitrary Text Detection in Scene/Video/License Plate Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Visual Robotic Object Grasping Through Combining RGB-D Data and 3D Meshes.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Robust Scene Text Detection for Multi-script Languages Using Deep Learning.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Deep-dense Conditional Random Fields for Object Co-segmentation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Valuation of Personal Information in the E-commerce Websites based on Contingent Valuation Method.
Proceedings of the 8th International Conference on E-business, Management and Economics, 2017

A Robust Symmetry-Based Method for Scene/Video Text Detection through Neural Network.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Fourier-Residual for Printer Identification.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Temporal Integration for Word-Wise Caption and Scene Text Identification.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

New Fuzzy-Mass Based Features for Video Image Type Categorization.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Temporal Action Localization by Structured Maximal Sums.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Compressing YOLO Network by Compressive Sensing.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

A New GVF Arrow Pattern for Character Segmentation from Double Line License Plate Images.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

Sharpness and Contrast Based Features for Word-Wise Video Type Classification.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

2016
Contour Restoration of Text Components for Recognition in Video/Scene Images.
IEEE Trans. Image Process., 2016

A new method for multi-oriented graphics-scene-3D text classification in video.
Pattern Recognit., 2016

Fractional poisson enhancement model for text detection and recognition in video frames.
Pattern Recognit., 2016

Weakly-supervised region annotation for understanding scene images.
Multim. Tools Appl., 2016

Modeling spatial layout for scene image understanding via a novel multiscale sum-product network.
Expert Syst. Appl., 2016

EvaToon: A novel graph matching system for evaluating cartoon drawings.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Video scene text frames categorization for text detection and recognition.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

A quad tree based method for blurred and non-blurred video text frames classification through quality metrics.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

New Tampered Features for Scene and Caption Text Classification in Video Frame.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

Fourier Coefficients for Fraud Handwritten Document Classification through Age Analysis.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

New Sharpness Features for Image Type Classification Based on Textual Information.
Proceedings of the 12th IAPR Workshop on Document Analysis Systems, 2016

2015
A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video.
IEEE Trans. Multim., 2015

Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images.
IEEE Trans. Image Process., 2015

Content-oriented multimedia document understanding through cross-media correlation.
Multim. Tools Appl., 2015

Context-based environmental audio event recognition for scene understanding.
Multim. Syst., 2015

Character shape restoration system through medial axis points in video.
Neurocomputing, 2015

A new ring radius transform-based thinning method for multi-oriented video characters.
Int. J. Document Anal. Recognit., 2015

Bayesian classifier for multi-oriented video text recognition system.
Expert Syst. Appl., 2015

New Gradient-Spatial-Structural Features for video script identification.
Comput. Vis. Image Underst., 2015

HIRM: A handle-independent reduced model for incremental mesh editing.
Comput. Aided Geom. Des., 2015

A New Multi-spectral Fusion Method for Degraded Video Text Frame Enhancement.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

A New Multi-modal Technique for Bib Number/Text Detection in Natural Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

A fast color barcode detection method through cross identification on mobile platforms.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Natural Scene character recognition using Markov Random Field.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

A new wavelet-Laplacian method for arbitrarily-oriented character segmentation in video text lines.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

A new method based on bag of filters for character recognition in scene images by learning.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

iSkin: Flexible, Stretchable and Visually Customizable On-Body Touch Sensors for Mobile Computing.
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

Text detection in born-digital images by mass estimation.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

New texture-spatial features for keyword spotting in video images.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Video Text Detection.
Advances in Computer Vision and Pattern Recognition, Springer, ISBN: 978-1-4471-6515-6, 2014

Sketching Interfaces.
Proceedings of the Handbook of Document Image Processing and Recognition, 2014

Spectral 3D mesh segmentation with a novel single segmentation field.
Graph. Model., 2014

2D and 3D Video Scene Text Classification.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Graphics and Scene Text Classification in Video.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Audiotory Movie Summarization by Detecting Scene Changes and Sound Events.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Anomaly Detection through Spatio-temporal Context Modeling in Crowded Scenes.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Optical flow based dynamic curved video text detection.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

A Novel Topic-Level Random Walk Framework for Scene Image Co-segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Text Detection Using Delaunay Triangulation in Video Sequence.
Proceedings of the 11th IAPR International Workshop on Document Analysis Systems, 2014

A Novel Context-Aware Topic Model for Category Discovery in Natural Scenes.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Online stroke segmentation by quick penalty-based dynamic programming.
IET Comput. Vis., 2013

Incremental 3D reconstruction using Bayesian learning.
Appl. Intell., 2013

Robust Object Tracking Using Motion Context in Crowded Scenes.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

A Real-Time Mesh Animation Framework Using Kinect.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Soft-matter capacitive sensor for measuring shear and pressure deformation.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

A Novel Multi-view Object Class Detection Framework for Document Image Content Analysis.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Discriminative Weighting and Subspace Learning for Ensemble Symbol Recognition.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Recognition of Video Text through Temporal Integration.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

2012
Integration of the Image-Guided Surgery Toolkit (IGSTK) into the Medical Imaging Interaction Toolkit (MITK).
J. Digit. Imaging, 2012

A Novel Multi-modal Integration and Propagation Model for Cross-Media Information Retrieval.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Movie Keyframe Retrieval Based on Cross-Media Correlation Detection and Context Model.
Proceedings of the Advanced Research in Applied Artificial Intelligence, 2012

Ensemble symbol recognition with Hough forest.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Anomaly detection with spatio-temporal context using depth images.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

2011
Multiclass object detection by combining local appearances and context.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Environmental sound classification for scene recognition using local discriminant bases and HMM.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

A Robust Color-Independent Text Detection Method from Complex Videos.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Symbol Recognition by Multiresolution Shape Context Matching.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

A Parts-Based Multi-scale Method for Symbol Recognition.
Proceedings of the Graphics Recognition. New Trends and Challenges, 2011

2010
A New Text Detection Algorithm for Content-Oriented Line Drawing Image Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

A Novel Approach for Robust Surveillance Video Content Abstraction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Robust Shape Retrieval through a Novel Statistical Descriptor.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

A New Shape Descriptor for Object Recognition and Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

3D Similarity Search Using a Weighted Structural Histogram Representation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

3D Model Comparison through Kernel Density Matching.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Symbol Recognition Combining Vectorial and Pixel-Level Features for Line Drawings.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
A Novel Knowledge-Based System for Interpreting Complex Engineering Drawings: Theory, Representation, and Implementation.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Saliency Regions for 3D Mesh Abstraction.
Proceedings of the Advances in Multimedia Information Processing, 2009

3D Scene Analysis Using UIMA Framework.
Proceedings of the Next-Generation Applied Intelligence, 2009

Semi-automatic Roof Reconstruction.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

QuickDiagram: A System for Online Sketching and Understanding of Diagrams.
Proceedings of the Graphics Recognition. Achievements, 2009

2008
A New Performance Benchmark for Content-Based 3D Model Retrieval.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

A Needle-Holding Robot for Ultrasound Guided Percutaneous Hepatic Microwave Ablation and Initial Experiments.
Proceedings of the Intelligent Robotics and Applications, First International Conference, 2008

Knowledge Extraction from Structured Engineering Drawings.
Proceedings of the Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008

2007
Automatic analysis and integration of architectural drawings.
Int. J. Document Anal. Recognit., 2007

A Dynamic-Rule-Based Framework of Engineering Drawing Recognition and Interpretation System.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2007

2005
A new recognition model for electronic architectural drawings.
Comput. Aided Des., 2005

1998
Evaluation of Declarative n-Queens Recursion: A Deductive Database Approach.
Inf. Sci., 1998

1993
Normalization of Linear Recursions in Deductive Databases.
Proceedings of the Ninth International Conference on Data Engineering, 1993

1992
N-Queens Problem Revisited: A Deductive Database Approach.
Proceedings of the Workshop on Deductive Databases held in conjunction with the Joint International Conference and Symposium on Logic Programming, 1992

