Duy-Dinh Le

Orcid: 0000-0003-0356-5501

According to our database, Duy-Dinh Le authored at least 120 papers between 2004 and 2024.

Bibliography

2024
SignboardText.
Dataset, May, 2024

SignboardText: Text Detection and Recognition in In-the-Wild Signboard Images.
IEEE Access, 2024

Robust Motorcycle Helmet Detection in Real-World Scenarios: Using Co-DETR and Minority Class Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing Road Object Detection in Fisheye Cameras: An Effective Framework Integrating SAHI and Hybrid Inference.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
SignboardText.
Dataset, December, 2023

An Accurate Platform for Investigating TCP Performance in Wi-Fi Networks.
Future Internet, July, 2023

Abstraction-perception preserving cartoon face synthesis.
Multim. Tools Appl., 2023

Information Extraction from Rich Text Images with RoBERTa and LION Optimizer.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2023

Masked Face Recognition Using EUM Feature Extraction from Unobstructed Region.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2023

2022
Adaptive multi-vehicle motion counting.
Signal Image Video Process., 2022

UIT at VBS 2022: An Unified and Interactive Video Retrieval System with Temporal Search.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

A Framework for Evaluating Video Summary Approaches.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2022

2021
MC-OCR Challenge 2021: An end-to-end recognition framework for Vietnamese Receipts.
Proceedings of the RIVF International Conference on Computing and Communication Technologies, 2021

2020
An Evaluation of Deep Learning Methods for Small Object Detection.
J. Electr. Comput. Eng., 2020

NII_UIT AT TRECVID 2020.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Searching For Desired Person Doing Desired Action based on Visual and Audio Feature in Large Scale Video Database.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2020

U15-Logos: Unconstrained Logo Dataset with Evaluation by Deep learning Methods.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2020

2019
YADA: you always dream again for better object detection.
Multim. Tools Appl., 2019

You always look again: Learning to detect the unseen objects.
J. Vis. Commun. Image Represent., 2019

Video instance search via spatial fusion of visual words and object proposals.
Int. J. Multim. Inf. Retr., 2019


Noise Removal Based Query Pre-processing to Improve Face Search Performance in Large Scale Video Databases.
Proceedings of the Tenth International Symposium on Information and Communication Technology, 2019

A Software Defined Networking Approach for Guaranteeing Delay in Wi-Fi Networks.
Proceedings of the Tenth International Symposium on Information and Communication Technology, 2019

Targeting Bufferbloat in Wi-Fi Networks: An Emulator-based Approach.
Proceedings of the 19th International Symposium on Communications and Information Technologies, 2019

2018
NII_HITACHI_UIT at TRECVID 2018.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Video Search Based on Semantic Extraction and Locally Regional Object Proposal.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

2017
Scalable Face Track Retrieval in Video Archives Using Bag-of-Faces Sparse Representation.
IEEE Trans. Circuits Syst. Video Technol., 2017

Evaluation of multiple features for violent scenes detection.
Multim. Tools Appl., 2017

Persons-In-Places: a Deep Features Based Approach for Searching a Specific Person in a Specific Location.
Informatica (Slovenia), 2017

Efficient large-scale multi-class image classification by learning balanced trees.
Comput. Vis. Image Underst., 2017


Semantic Extraction and Object Proposal for Video Search.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Video Indexing, Search, Detection, and Description with Focus on TRECVID.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Graph-based visual instance mining with geometric matching and nearest candidates selection.
Proceedings of the 9th International Conference on Knowledge and Systems Engineering, 2017

Evaluation of Deep Models for Real-Time Small Object Detection.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

2016
Visual Analytics of Political Networks From Face-Tracking of News Video.
IEEE Trans. Multim., 2016

Human Action Recognition from Depth Videos Using Pool of Multiple Projections with Greedy Selection.
IEICE Trans. Inf. Syst., 2016

When face-tracking meets social networks: a story of politics in news videos.
Appl. Netw. Sci., 2016


Searching a specific person in a specific location using deep features.
Proceedings of the Seventh Symposium on Information and Communication Technology, 2016

News Archive Exploration Combining Face Detection and Tracking with Network Visual Analytics.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

NII-UIT at MediaEval 2016 Predicting Media Interestingness Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Using node relationships for hierarchical classification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Computational optimization for violent scenes detection.
Proceedings of the 2016 International Conference on Computer, 2016

Efficient Large Scale Image Classification via Prediction Score Decomposition.
Proceedings of the Computer Vision - ECCV 2016, 2016

Video Event Detection by Exploiting Word Dependencies from Image Captions.
Proceedings of the COLING 2016, 2016

2015
A Combination of Spatial Pyramid and Inverted Index for Large-Scale Image Retrieval.
Int. J. Multim. Data Eng. Manag., 2015

A Social Network Analysis of Face Tracking in News Video.
Proceedings of the 11th International Conference on Signal-Image Technology & Internet-Based Systems, 2015

Cross-View Action Recognition by Projection-Based Augmentation.
Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Query-adaptive late fusion with neural network for instance search.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Large scale multi-class classification using latent classifiers.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Human Action recognition from depth videos using multi-projection based representation.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

AttRel: An Approach to Person Re-Identification by Exploiting Attribute Relationships.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

NII-UIT Browser: A Multimodal Video Search System.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Multimedia Event Detection Using Event-Driven Multiple Instance Learning.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

NII-UIT at MediaEval 2015 Affective Impact of Movies Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Learning Balanced Trees for Large Scale Image Classification.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

2014
Multimedia Event Detection Using Segment-Based Approach for Motion Feature.
J. Signal Process. Syst., 2014

The Video Browser Showdown: a live evaluation of interactive video search tools.
Int. J. Multim. Inf. Retr., 2014

ERI-MAC: An Energy-Harvested Receiver-Initiated MAC Protocol for Wireless Sensor Networks.
Int. J. Distributed Sens. Networks, 2014

National Institute of Informatics, Japan at TRECVID 2014.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Image Annotation Fusing Content-Based and Tag-Based Technique Using Support Vector Machine and Vector Space Model.
Proceedings of the Tenth International Conference on Signal-Image Technology and Internet-Based Systems, 2014

Recommend-Me: recommending query regions for image search.
Proceedings of the Symposium on Applied Computing, 2014

NII-UIT: A Tool for Known Item Search by Sequential Pattern Filtering.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

NII-UIT at MediaEval 2014 Violent Scenes Detection Affect Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Using Attribute Relationships for Person Re-Identification.
Proceedings of the Knowledge and Systems Engineering, 2014

Integrating Spatial Information into Inverted Index for Large-Scale Image Retrieval.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Sum-max video pooling for complex event recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Face Retrieval in Large-Scale News Video Datasets.
IEICE Trans. Inf. Syst., 2013

Violent scene detection using mid-level feature.
Proceedings of the 4th International Symposium on Information and Communication Technology, 2013

Re-ranking for person re-identification.
Proceedings of the 2013 International Conference on Soft Computing and Pattern Recognition, 2013

Evaluation of low-level features for detecting violent scenes in videos.
Proceedings of the 2013 International Conference on Soft Computing and Pattern Recognition, 2013

NII-UIT-VBS: A Video Browsing Tool for Known Item Search.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

NII-UIT at MediaEval 2013 Violent Scenes Detection Affect Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

A Classification-Based Approach for Retake and Scene Detection in Rushes Video.
Proceedings of the Neural Information Processing - 20th International Conference, 2013

Person Re-identification Using Deformable Part Models.
Proceedings of the Neural Information Processing - 20th International Conference, 2013

Efficient Traffic Sign Detection Using Bag of Visual Words and Multi-scales SIFT.
Proceedings of the Neural Information Processing - 20th International Conference, 2013

2012
National Institute of Informatics, Japan at TRECVID 2012.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Robust eye localization in video by combining eye detector and eye tracker.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

A Codeword Visualization Tool for Dense Trajectory Feature.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Auto face re-ranking by mining the web and video archives.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
National Institute of Informatics, Japan at TRECVID 2011.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

NTT Communication Science Laboratories and NII at TRECVID 2011 Instance Search Task.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

A Comprehensive Study of Feature Representations for Semantic Concept Detection.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

Summarizing Large News Video Archives by Event Ranking.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

NII-KAORI-PERSON-SEARCH: A General Framework for Indexing and Retrieving People's Appearance in Large Video Archives.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

NII, Japan at MediaEval 2011 Violent Scenes Detection Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Fast face sequence matching in large-scale video databases.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Boosting global scene classification accuracy by discriminative region localization.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Improving Image Categorization by Using Multiple Instance Learning with Spatial Relation.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

Improving Retake Detection by Adding Motion Feature.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

Indexing Faces in Broadcast News Video Archives.
Proceedings of the Data Mining Workshops (ICDMW), 2011

NII, Japan at ImageCLEF 2011 Photo Annotation Task.
Proceedings of the CLEF 2011 Labs and Workshop, 2011

2010
National Institute of Informatics, Japan at TRECVID 2010.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

NTT Communication Science Laboratories and NII in TRECVID 2010 Instance Search Task.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

An efficient method for face retrieval from large video datasets.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2009
National Institute of Informatics, Japan at TRECVID 2009.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Efficient concept detection by fusing simple visual features.
Proceedings of the 2009 ACM Symposium on Applied Computing (SAC), 2009

2008
Face Detection, Tracking, and Recognition for Broadcast Video.
Proceedings of the Encyclopedia of Multimedia, 2nd Ed., 2008

National Institute of Informatics, Japan at TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Robust Face Track Finding in Video Using Tracked Points.
Proceedings of the 4th IEEE International Conference on Signal Image Technology and Internet Based Systems, 2008

A text segmentation based approach to video shot boundary detection.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Rushes summarization using different redundancy elimination approaches.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Unsupervised Face Annotation by Mining the Web.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

2007
Ent-Boost: Boosting using entropy measures for robust object detection.
Pattern Recognit. Lett., 2007

NII-ISM, Japan at TRECVID 2007: High Level Feature Extraction.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

National Institute of Informatics, Japan at TRECVID 2007: BBC rushes summarization.
Proceedings of the 1st ACM Workshop on Video Summarization, 2007

Boosting Face Retrieval by using Relevant Set Correlation Clustering.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Finding Important People in Large News Video Databases Using Multimodal and Clustering Analysis.
Proceedings of the 23rd International Conference on Data Engineering Workshops, 2007

Video search by multi-modal and clustering analysis.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
Human face processing techniques with application to large scale video indexing.
PhD thesis, 2006

A Multi-Stage Approach to Fast Face Detection.
IEICE Trans. Inf. Syst., 2006

Concept Detection Using Local Binary Patterns and SVM.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Hand gesture classification using boosted cascade of classifiers.
Proceedings of the 4th International Conference on Computer Sciences: Research, 2006

Ent-Boost: Boosting Using Entropy Measure for Robust Object Detection.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Robust Object Detection using Fast Feature Selection from Huge Feature Sets.
Proceedings of the International Conference on Image Processing, 2006

Face Retrieval in Broadcasting News Video by Fusing Temporal and Intensity Information.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

2005
An Efficient Feature Selection Method for Object Detection.
Proceedings of the Pattern Recognition and Data Mining, 2005

2004
Person X Detector.
Proceedings of the 2004 TREC Video Retrieval Evaluation, 2004

