Thanh Duc Ngo

Orcid: 0000-0001-6882-0070

Affiliations:
  • Vietnam National University Ho Chi Minh City, Vietnam
  • Graduate University for Advanced Studies, Tokyo, Japan (PhD)


According to our database1, Thanh Duc Ngo authored at least 94 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SignboardText.
Dataset, May, 2024

Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition.
CoRR, 2024

The Art of Camouflage: Few-Shot Learning for Animal Detection and Segmentation.
IEEE Access, 2024

SignboardText: Text Detection and Recognition in In-the-Wild Signboard Images.
IEEE Access, 2024

Controllable Base Class Synthesis with Generative Diffusion Model for Few-Shot Object Detection.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2024

Contrastive Learning with Weakly Pair Images for Traffic Image Deraining.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2024

Weakly Supervised Object Detection Using Class Activation Map.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2024

InstSynth: Instance-wise Prompt-guided Style Masked Conditional Data Synthesis for Scene Understanding.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2024

Robust Motorcycle Helmet Detection in Real-World Scenarios: Using Co-DETR and Minority Class Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing Road Object Detection in Fisheye Cameras: An Effective Framework Integrating SAHI and Hybrid Inference.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
SignboardText.
Dataset, December, 2023

An Accurate Platform for Investigating TCP Performance in Wi-Fi Networks.
Future Internet, July, 2023

Instance-Level Few-Shot Learning With Class Hierarchy Mining.
IEEE Trans. Image Process., 2023

Abstraction-perception preserving cartoon face synthesis.
Multim. Tools Appl., 2023

Few-shot Camouflaged Animal Detection and Segmentation.
CoRR, 2023

Diverse Search Methods and Multi-Modal Fusion for High-Performance Video Retrieval.
Proceedings of the 12th International Symposium on Information and Communication Technology, 2023

Integrating Multiple Models For Effective Video Retrieval and Multi-stage Search.
Proceedings of the 12th International Symposium on Information and Communication Technology, 2023

News Event Retrieval from Large Video Collection in Ho Chi Minh City AI Challenge 2023.
Proceedings of the 12th International Symposium on Information and Communication Technology, 2023

CE-OST: Contour Emphasis for One-Stage Transformer-based Camouflage Instance Segmentation.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2023

Information Extraction from Rich Text Images with RoBERTa and LION Optimizer.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2023

HTC-BI: Hybrid Task Cascade with Boundary Information for Instance Segmentation.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2023

Masked Face Recognition Using EUM Feature Extraction from Unobstructed Region.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2023

Unsupervised Domain Adaptation with Imbalanced Character Distribution for Scene Text Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2023

2022
Few-shot object detection via baby learning.
Image Vis. Comput., 2022

UIT at VBS 2022: An Unified and Interactive Video Retrieval System with Temporal Search.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

A Crowdsourcing Data Annotation System For Vietnamese Scene Text Detection.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2022

Antique Photo Restoration and Colorization via Generative Model.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2022

2021
MC-OCR Challenge 2021: An end-to-end recognition framework for Vietnamese Receipts.
Proceedings of the RIVF International Conference on Computing and Communication Technologies, 2021

DF-FSOD: A Novel Approach for Few-shot Object Detection via Distinguished Features.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2021

A robust framework for mathematical formula detection.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2021

Multilingual-GAN: A Multilingual GAN-based Approach for Handwritten Generation.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2021

Unweighted Bipartite Matching For Robust Vehicle Counting.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2021

Dictionary-Guided Scene Text Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
An Evaluation of Deep Learning Methods for Small Object Detection.
J. Electr. Comput. Eng., 2020

Single-image crowd counting: a comparative survey on deep learning-based approaches.
Int. J. Multim. Inf. Retr., 2020

NII_UIT AT TRECVID 2020.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Searching For Desired Person Doing Desired Action based on Visual and Audio Feature in Large Scale Video Database.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2020

U15-Logos: Unconstrained Logo Dataset with Evaluation by Deep learning Methods.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2020

Interpolation based Anime Face Style Transfer.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2020

2019
Video instance search via spatial fusion of visual words and object proposals.
Int. J. Multim. Inf. Retr., 2019


A Software Defined Networking Approach for Guaranteeing Delay in Wi-Fi Networks.
Proceedings of the Tenth International Symposium on Information and Communication Technology, 2019

Targeting Bufferbloat in Wi-Fi Networks: An Emulator-based Approach.
Proceedings of the 19th International Symposium on Communications and Information Technologies, 2019

2018
Video Search Based on Semantic Extraction and Locally Regional Object Proposal.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Measuring Crowd Collectiveness with Trajectory Smoothing.
Proceedings of the 1st International Conference on Multimedia Analysis and Pattern Recognition, 2018

How to Choose Deep Face Models for Surveillance System?
Proceedings of the Modern Approaches for Intelligent Information and Database Systems,, 2018

2017
Person re-identification with mutual re-ranking.
Vietnam. J. Comput. Sci., 2017

Scalable Face Track Retrieval in Video Archives Using Bag-of-Faces Sparse Representation.
IEEE Trans. Circuits Syst. Video Technol., 2017

Persons-In-Places: a Deep Features Based Approach for Searching a Specific Person in a Specific Location.
Informatica (Slovenia), 2017

Efficient large-scale multi-class image classification by learning balanced trees.
Comput. Vis. Image Underst., 2017


Semantic Extraction and Object Proposal for Video Search.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Evaluation of Deep Models for Real-Time Small Object Detection.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

2016
Human Action Recognition from Depth Videos Using Pool of Multiple Projections with Greedy Selection.
IEICE Trans. Inf. Syst., 2016

When face-tracking meets social networks: a story of politics in news videos.
Appl. Netw. Sci., 2016


Searching a specific person in a specific location using deep features.
Proceedings of the Seventh Symposium on Information and Communication Technology, 2016

News Archive Exploration Combining Face Detection and Tracking with Network Visual Analytics.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Using node relationships for hierarchical classification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Computational optimization for violent scenes detection.
Proceedings of the 2016 International Conference on Computer, 2016

Efficient Large Scale Image Classification via Prediction Score Decomposition.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
A Combination of Spatial Pyramid and Inverted Index for Large-Scale Image Retrieval.
Int. J. Multim. Data Eng. Manag., 2015

A Social Network Analysis of Face Tracking in News Video.
Proceedings of the 11th International Conference on Signal-Image Technology & Internet-Based Systems, 2015

Cross-View Action Recognition by Projection-Based Augmentation.
Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Large scale multi-class classification using latent classifiers.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Human Action recognition from depth videos using multi-projection based representation.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

AttRel: An Approach to Person Re-Identification by Exploiting Attribute Relationships.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

NII-UIT Browser: A Multimodal Video Search System.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Generalized Max Pooling for Action Recognition.
Proceedings of the 2015 Seventh International Conference on Knowledge and Systems Engineering, 2015

Using Textual Semantic Similarity to Improve Clustering Quality of Web Video Search Results.
Proceedings of the 2015 Seventh International Conference on Knowledge and Systems Engineering, 2015

Transfer AdaBoost SVM for Link Prediction in Newly Signed Social Networks using Explicit and PNR Features.
Proceedings of the 19th International Conference in Knowledge Based and Intelligent Information and Engineering Systems, 2015

Learning Balanced Trees for Large Scale Image Classification.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

2014
Multimedia Event Detection Using Segment-Based Approach for Motion Feature.
J. Signal Process. Syst., 2014

National Institute of Informatics, Japan at TRECVID 2014.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

<i>Recommend-Me</i>: recommending query regions for image search.
Proceedings of the Symposium on Applied Computing, 2014

NII-UIT: A Tool for Known Item Search by Sequential Pattern Filtering.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

Using Attribute Relationships for Person Re-Identification.
Proceedings of the Knowledge and Systems Engineering, 2014

Integrating Spatial Information into Inverted Index for Large-Scale Image Retrieval.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

2013
Scalable Approaches for Content -based Video Retrieval.
PhD thesis, 2013

Face Retrieval in Large-Scale News Video Datasets.
IEICE Trans. Inf. Syst., 2013

Violent scene detection using mid-level feature.
Proceedings of the 4th International Symposium on Information and Communication Technology, 2013

Re-ranking for person re-identification.
Proceedings of the 2013 International Conference on Soft Computing and Pattern Recognition, 2013

Evaluation of low-level features for detecting violent scenes in videos.
Proceedings of the 2013 International Conference on Soft Computing and Pattern Recognition, 2013

NII-UIT-VBS: A Video Browsing Tool for Known Item Search.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

2012
National Institute of Informatics, Japan at TRECVID 2012.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Robust eye localization in video by combining eye detector and eye tracker.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

A Codeword Visualization Tool for Dense Trajectory Feature.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

2011
NII-KAORI-PERSON-SEARCH: A General Framework for Indexing and Retrieving People's Appearance in Large Video Archives.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

Fast face sequence matching in large-scale video databases.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Boosting global scene classification accuracy by discriminative region localization.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Improving Image Categorization by Using Multiple Instance Learning with Spatial Relation.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

2010
An efficient method for face retrieval from large video datasets.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2008
Robust Face Track Finding in Video Using Tracked Points.
Proceedings of the 4th IEEE International Conference on Signal Image Technology and Internet Based Systems, 2008

A text segmentation based approach to video shot boundary detection.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008


  Loading...