C. V. Jawahar

ORCID: 0000-0001-6767-7057

Affiliations:
  • IIIT Hyderabad, Centre for Visual Information Technology (CVIT), India


According to our database, C. V. Jawahar authored at least 465 papers between 1995 and 2024.

Bibliography

2024
Explaining Deep Face Algorithms Through Visualization: A Survey.
IEEE Trans. Biom. Behav. Identity Sci., January, 2024

Advancing Question Answering on Handwritten Documents: A State-of-the-Art Recognition-Based Model for HW-SQuAD.
CoRR, 2024

Multiple Instance Learning for Glioma Diagnosis using Hematoxylin and Eosin Whole Slide Images: An Indian Cohort Study.
CoRR, 2024

Semantic Labels-Aware Transformer Model for Searching over a Large Collection of Lecture-Slides.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Enhancing Road Safety: Predictive Modeling of Accident-Prone Zones with ADAS-Equipped Vehicle Fleet Data.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Unconstrained Camera Captured Indic Offline Handwritten Dataset.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Towards Deployable OCR Models for Indic Languages.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

ICPR 2024 Competition on Word Image Recognition from Indic Scene Images.
Proceedings of the Pattern Recognition. Competitions - 27th International Conference, 2024

ICPR 2024 Competition on Rider Intention Prediction.
Proceedings of the Pattern Recognition. Competitions - 27th International Conference, 2024

CHART-Info 2024: A Dataset for Chart Analysis and Recognition.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Indic Scene Text on the Roadside.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Bridging the Gap in Resource for Offline English Handwritten Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

ICDAR 2024 Competition on Recognition and VQA on Handwritten Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

ICDAR 2024 Competition on Reading Documents Through Aria Glasses.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Early Anticipation of Driving Maneuvers.
Proceedings of the Computer Vision - ECCV 2024, 2024

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

IndicOCR: A Pipeline for Recognizing Printed Documents for Indian Languages.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

An Approach for Speech Enhancement in Low SNR Environments using Granular Speaker Embedding.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

Unlocking the Potential of Unstructured Data in Business Documents Through Document Intelligence.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HWNet v3: a joint embedding framework for recognition and retrieval of handwritten text.
Int. J. Document Anal. Recognit., December, 2023

Dataset agnostic document object detection.
Pattern Recognit., October, 2023

Generating Personalized Summaries of Day Long Egocentric Videos.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Deep semantic binarization for document images.
Multim. Tools Appl., February, 2023

Document Image Analysis Using Deep Multi-modular Features.
SN Comput. Sci., 2023

ScribbleNet: Efficient interactive annotation of urban city scenes for semantic segmentation.
Pattern Recognit., 2023

Towards Real-Time Analysis of Broadcast Badminton Videos.
CoRR, 2023

Reading Between the Lanes: Text VideoQA on the Road.
CoRR, 2023

Unsupervised Audio-Visual Lecture Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Watching the News: Towards VideoQA Models that can Read.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Towards Generating Ultra-High Resolution Talking-Face Videos with Lip synchronization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

IDD-3D: Indian Driving Dataset for 3D Unstructured Road Scenes.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

FaceOff: A Video-to-Video Face Swapping System.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Audio-Visual Face Reenactment.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Towards Accurate Lip-to-Speech Synthesis in-the-Wild.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CueCAn: Cue-driven Contextual Attention for Identifying Missing Traffic Signs on Unconstrained Roads.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Reading Between the Lanes: Text VideoQA on the Road.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Visual Question Answering on Business Document Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Indic Handwriting Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

IndicSTR12: A Dataset for Indic Scene Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Efficient Semantic Segmentation Compression via Meta Pruning.
Proceedings of the Computer Vision and Image Processing - 8th International Conference, 2023

2022
A Fine-Grained Vehicle Detection (FGVD) Dataset for Unconstrained Roads.
Dataset, December, 2022

INR-V: A Continuous Representation Space for Video-based Generative Tasks.
Trans. Mach. Learn. Res., 2022

Removing Atmospheric Turbulence via Deep Adversarial Learning.
IEEE Trans. Image Process., 2022

Improving Scene Text Recognition for Indian Languages with Transfer Learning and Font Diversity.
J. Imaging, 2022

DeepPocket: Ligand Binding Site Detection and Segmentation using 3D Convolutional Neural Networks.
J. Chem. Inf. Model., 2022

Information Retrieval from the Digitized Books.
CoRR, 2022

Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale.
CoRR, 2022

An empirical study of CTC based models for OCR of Indian languages.
CoRR, 2022

Visual Understanding of Complex Table Structures from Document Images.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

FLUID: Few-Shot Self-Supervised Image Deraining.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

InfographicVQA.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

To miss-attend is to misalign! Residual Self-Attentive Feature Alignment for Adapting Object Detectors.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Multi-Domain Incremental Learning for Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Transductive Weakly-Supervised Player Detection using Soccer Broadcast Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

ETL: Efficient Transfer Learning for Face Tasks.
Proceedings of the 17th International Joint Conference on Computer Vision, 2022

Grounded Video Situation Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

New Objects on the Road? No Problem, We'll Learn Them Too.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Generalized Keyword Spotting using ASR embeddings.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Automatic Annotation of Handwritten Document Images at Word Level.
Proceedings of the Thirteenth Indian Conference on Computer Vision, 2022

Towards Robust Handwritten Text Recognition with On-the-fly User Participation.
Proceedings of the Thirteenth Indian Conference on Computer Vision, 2022

A Fine-Grained Vehicle Detection (FGVD) Dataset for Unconstrained Roads.
Proceedings of the Thirteenth Indian Conference on Computer Vision, 2022

Overcoming Label Noise for Source-free Unsupervised Video Domain Adaptation.
Proceedings of the Thirteenth Indian Conference on Computer Vision, 2022

Enhancing Indic Handwritten Text Recognition Using Global Semantic Information.
Proceedings of the Frontiers in Handwriting Recognition - 18th International Conference, 2022

TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments.
Proceedings of the Computer Vision - ECCV 2022, 2022

My View is the Best View: Procedure Learning from Egocentric Videos.
Proceedings of the Computer Vision - ECCV 2022, 2022

Read While You Drive - Multilingual Text Tracking on the Road.
Proceedings of the Document Analysis Systems - 15th IAPR International Workshop, 2022


Detecting, Tracking and Counting Motorcycle Rider Traffic Violations on Unconstrained Roads.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Plant Disease Classification Using Hybrid Features.
Proceedings of the Computer Vision and Image Processing - 7th International Conference, 2022

Mobile Captured Glass Board Image Enhancement.
Proceedings of the Computer Vision and Image Processing - 7th International Conference, 2022

Compressing Video Calls using Synthetic Talking Heads.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Canonical Saliency Maps: Decoding Deep Face Models.
IEEE Trans. Biom. Behav. Identity Sci., 2021

Asking questions on handwritten document collections.
Int. J. Document Anal. Recognit., 2021

Classification of histopathology images using ConvNets to detect Lupus Nephritis.
CoRR, 2021

ICDAR 2021 Competition on Document Visual Question Answering.
CoRR, 2021

Ego4D: Around the World in 3,000 Hours of Egocentric Video.
CoRR, 2021

Evaluating Computer Vision Techniques for Urban Mobility on Large-Scale, Unconstrained Roads.
CoRR, 2021

DocVQA: A Dataset for VQA on Document Images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Visual Speech Enhancement Without A Real Visual Stream.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Efficient and Generic Interactive Segmentation Framework to Correct Mispredictions During Clinical Evaluation of Medical Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

MMBERT: Multimodal BERT Pretraining for Improved Medical VQA.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Exploring Genetic-histologic Relationships in Breast Cancer.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Towards Automatic Speech to Sign Language Generation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Translating sign language videos to talking faces.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Intelligent video editing: incorporating modern talking face generation algorithms in a video editor.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Automatic quantification and visualization of street trees.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Looking Farther in Parametric Scene Parsing with Ground and Aerial Imagery.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

ICDAR 2021 Competition on Document Visual Question Answering.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Towards Boosting the Accuracy of Non-latin Scene Text Recognition.
Proceedings of the Document Analysis and Recognition, 2021

Transfer Learning for Scene Text Recognition in Indian Languages.
Proceedings of the Document Analysis and Recognition, 2021

iiit-indic-hw-words: A Dataset for Indic Handwritten Text Recognition.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Evaluation of Detection and Segmentation Tasks on Driving Datasets.
Proceedings of the Computer Vision and Image Processing - 6th International Conference, 2021

Classroom Slide Narration System.
Proceedings of the Computer Vision and Image Processing - 6th International Conference, 2021

Handwritten Text Retrieval from Unlabeled Collections.
Proceedings of the Computer Vision and Image Processing - 6th International Conference, 2021

Towards Label-Free Few-Shot Learning: How Far Can We Go?
Proceedings of the Computer Vision and Image Processing - 6th International Conference, 2021

Data-Efficient Training Strategies for Neural TTS Systems.
Proceedings of the CODS-COMAD 2021: 8th ACM IKDD CODS and 26th COMAD, 2021

Revisiting Low Resource Status of Indian Languages in Machine Translation.
Proceedings of the CODS-COMAD 2021: 8th ACM IKDD CODS and 26th COMAD, 2021

Personalized One-Shot Lipreading for an ALS Patient.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Audio-Visual Speech Super-Resolution.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Interactive Learning for Assisting Whole Slide Image Annotation.
Proceedings of the Pattern Recognition - 6th Asian Conference, 2021

More Parameters? No Thanks!
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Evaluation and Visualization of Driver Inattention Rating From Facial Features.
IEEE Trans. Biom. Behav. Identity Sci., 2020

Bringing semantics into word image representation.
Pattern Recognit., 2020

Guest Editorial: Special Issue on ACCV 2018.
Int. J. Comput. Vis., 2020

Few Shot Learning With No Labels.
CoRR, 2020

Document Visual Question Answering Challenge 2020.
CoRR, 2020

DocVQA: A Dataset for VQA on Document Images.
CoRR, 2020

Munich to Dubai: How far is it for Semantic Segmentation?
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

A Multi-Space Approach to Zero-Shot Object Detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Region Pooling with Adaptive Feature Fusion for End-to-End Person Recognition.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

IndicSpeech: Text-to-Speech Corpus for Indian Languages.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Multilingual Parallel Corpora Collection Effort for Indian Languages.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

DGAZE: Driver Gaze Mapping on Road.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

RoadText-1K: Text Detection & Recognition Dataset for Driving Videos.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Context Aware Group Activity Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Improving Word Recognition using Multiple Hypotheses and Deep Embeddings.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

PhraseOut: A Code Mixed Data Augmentation Method for Multilingual Neural Machine Translation.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

Exploring Pair-Wise NMT for Indian Languages.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

Table Structure Recognition Using Top-Down and Bottom-Up Cues.
Proceedings of the Computer Vision - ECCV 2020, 2020

Recurrent Image Annotation with Explicit Inter-label Dependencies.
Proceedings of the Computer Vision - ECCV 2020, 2020

Weakly Supervised Instance Segmentation by Learning Annotation Consistent Instances.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Benchmark System for Indian Language Text Recognition.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

IIIT-AR-13K: A New Dataset for Graphical Object Detection in Documents.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

Adapting OCR with Limited Supervision.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Human-Machine Collaboration for Face Recognition.
Proceedings of the CoDS-COMAD 2020: 7th ACM IKDD CoDS and 25th COMAD, 2020

Spatial Feedback Learning to Improve Semantic Segmentation in Hot Weather.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Graph Representation Ensemble Learning.
Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2020

2019
Spotting words in silent speech videos: a retrieval-based approach.
Mach. Vis. Appl., 2019

HWNet v2: an efficient word image representation for handwritten documents.
Int. J. Document Anal. Recognit., 2019

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard.
CoRR, 2019

A Baseline Neural Machine Translation System for Indian Languages.
CoRR, 2019

Low-Cost Transfer Learning of Face Tasks.
CoRR, 2019

Technology interventions for road safety and beyond.
Commun. ACM, 2019

IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Region-based active learning for efficient labeling in semantic segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

A Deep Learning Approach for Robust Corridor Following from an Arbitrary Pose.
Proceedings of the 27th Signal Processing and Communications Applications Conference, 2019

Generating 1 Minute Summaries of Day Long Egocentric Videos.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Towards Automatic Face-to-Face Translation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Towards Increased Accessibility of Meme Images with the Help of Rich Face Emotion Captions.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Self-Supervised Visual Representations for Cross-Modal Retrieval.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

A Deep Learning Approach for Robust Corridor Following.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Graphical Object Detection in Document Images.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Towards Automated Evaluation of Handwritten Assessments.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Textual Description for Mathematical Equations.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

DocFigure: A Dataset for Scientific Document Figure Classification.
Proceedings of the 13th IAPR International Workshop on Graphics Recognition, 2019

ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

A Cost Efficient Approach to Correct OCR Errors in Large Document Collections.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

ICDAR 2019 Competition on Scene Text Visual Question Answering.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Universal Semi-Supervised Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Scene Text Visual Question Answering.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cross-language Speech Dependent Lip-synchronization.
Proceedings of the IEEE International Conference on Acoustics, 2019

AutoRate: How attentive is the driver?
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Improved Road Connectivity by Joint Learning of Orientation and Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dissimilarity Coefficient Based Weakly Supervised Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

CVIT's submissions to WAT-2019.
Proceedings of the 6th Workshop on Asian Translation, 2019

2018
Semi-supervised annotation of faces in image collection.
Signal Image Video Process., 2018

Improving multiclass classification by deep networks using DAGSVM and Triplet Loss.
Pattern Recognit. Lett., 2018

Automatic image annotation: the quirks and what works.
Multim. Tools Appl., 2018

Efficient Query Specific DTW Distance for Document Retrieval with Unlimited Vocabulary.
J. Imaging, 2018

Cross-specificity: modelling data semantics for cross-modal matching and retrieval.
Int. J. Multim. Inf. Retr., 2018

Connecting Visual Experiences using Max-flow Network with Application to Visual Localization.
CoRR, 2018

TextTopicNet - Self-Supervised Learning of Visual Features Through Embedding Images on Semantic Text Spaces.
CoRR, 2018

Automated Top View Registration of Broadcast Football Videos.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Word Spotting in Silent Lip Videos.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Towards Structured Analysis of Broadcast Badminton Videos.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

City-Scale Road Audit System using Deep Learning.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Class2Str: End to End Latent Hierarchy Learning.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Enhancing OCR Accuracy with Super Resolution.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Augment and Adapt: A Simple Approach to Image Tampering Detection.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Investigating the generalizability of EEG-based cognitive load estimation across visualizations.
Proceedings of the International Conference on Multimodal Interaction: Adjunct, 2018

Localizing and Recognizing Text in Lecture Videos.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Improving CNN-RNN Hybrid Networks for Handwriting Recognition.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Towards Spotting and Recognition of Handwritten Words in Indic Scripts.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Unsupervised Learning of Face Representations.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

Word Spotting and Recognition Using Deep Embedding.
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

Offline Handwriting Recognition on Devanagari Using a New Benchmark Dataset.
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

Efficient Semantic Segmentation Using Gradual Grouping.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Scaling Handwritten Student Assessments With a Document Image Workflow System.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Efficient Optimization for Rank-Based Loss Functions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Self-Supervised Feature Learning for Semantic Segmentation of Overhead Imagery.
Proceedings of the British Machine Vision Conference 2018, 2018

Improved Visual Relocalization by Discovering Anchor Points.
Proceedings of the British Machine Vision Conference 2018, 2018

Neuro-IoU: Learning a Surrogate Loss for Semantic Segmentation.
Proceedings of the British Machine Vision Conference 2018, 2018

Learning Human Poses from Actions.
Proceedings of the British Machine Vision Conference 2018, 2018

Learning Optimal Redistribution Mechanisms Through Neural Networks.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Learning to Round for Discrete Labeling Problems.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

CVIT-MT Systems for WAT-2018.
Proceedings of the 32nd Pacific Asia Conference on Language, 2018

2017
Automatic analysis of broadcast football videos using contextual priors.
Signal Image Video Process., 2017

Trajectory aligned features for first person action recognition.
Pattern Recognit., 2017

Human pose search using deep networks.
Image Vis. Comput., 2017

Unsupervised refinement of color and stroke features for text binarization.
Int. J. Document Anal. Recognit., 2017

Image Annotation by Propagating Labels from Semantic Neighbourhoods.
Int. J. Comput. Vis., 2017

A support vector approach for cross-modal search of images and texts.
Comput. Vis. Image Underst., 2017

Collaborative Contributions for Better Annotations.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 6: VISAPP, Porto, Portugal, February 27, 2017

An EEG-Based Image Annotation System.
Proceedings of the Computer Vision, Pattern Recognition, Image Processing, and Graphics, 2017

Document Image Segmentation Using Deep Features.
Proceedings of the Computer Vision, Pattern Recognition, Image Processing, and Graphics, 2017

SmartTennisTV: Automatic Indexing of Tennis Videos.
Proceedings of the Computer Vision, Pattern Recognition, Image Processing, and Graphics, 2017

Towards Accurate Handwritten Word Recognition for Hindi and Bangla.
Proceedings of the Computer Vision, Pattern Recognition, Image Processing, and Graphics, 2017

Unsupervised Learning Based Approach for Plagiarism Detection in Programming Assignments.
Proceedings of the 10th Innovations in Software Engineering Conference, 2017

Unsupervised Learning of Deep Feature Representation for Clustering Egocentric Actions.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

An Empirical Study of Effectiveness of Post-Processing in Indic Scripts.
Proceedings of the 6th International Workshop on Multilingual OCR, 2017

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam.
Proceedings of the 6th International Workshop on Multilingual OCR, 2017

An Interactive Tour Guide for a Heritage Site.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Eye Contact Detection via Deep Neural Networks.
Proceedings of the HCI International 2017 - Posters' Extended Abstracts, 2017

Pose-Aware Person Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unconstrained scene text and video text recognition for Arabic script.
Proceedings of the 1st International Workshop on Arabic Script Analysis and Recognition, 2017

Plagiarism Detection in Programming Assignments Using Deep Features.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

Sequence-to-Sequence Learning for Human Pose Correction in Videos.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

Compressing Deep Neural Networks for Recognizing Places.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

Improving Small Object Detection.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

Unconstrained OCR for Urdu Using Deep CNN-RNN Hybrid Networks.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

2016
Single and Multiple View Support Order Prediction in Clutter for Manipulation.
J. Intell. Robotic Syst., 2016

Enhancing energy minimization framework for scene text recognition with top-down cues.
Comput. Vis. Image Underst., 2016

Generating Synthetic Data for Text Recognition.
CoRR, 2016

Learning multiple experiences useful visual features for active maps localization in crowded environments.
Adv. Robotics, 2016

Align Me: A framework to generate Parallel Corpus Using OCRs and Bilingual Dictionaries.
Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Efficient object annotation for surveillance and automotive applications.
Proceedings of the 2016 IEEE Winter Applications of Computer Vision Workshops, 2016

Fine-tuning human pose estimations in videos.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Face fiducial detection by consensus of exemplars.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

A Robust Distance with Correlated Metric Learning for Multi-Instance Multi-Label Data.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Diverse Yet Efficient Retrieval using Locality Sensitive Hashing.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Discriminative learning based visual servoing across object instances.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Frame level annotations for tennis videos.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Visual Aesthetic Analysis for Handwritten Document Images.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

Deep Feature Embedding for Accurate Recognition and Retrieval of Handwritten Text.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

Partial Linearization Based Optimization for Multi-class SVM.
Proceedings of the Computer Vision - ECCV 2016, 2016

IIIT-CFW: A Benchmark Database of Cartoon Faces in the Wild.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Matching Handwritten Document Images.
Proceedings of the Computer Vision - ECCV 2016, 2016

Dynamic Narratives for Heritage Tour.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Error Detection in Indic OCRs.
Proceedings of the 12th IAPR Workshop on Document Analysis Systems, 2016

A Simple and Effective Solution for Script Identification in the Wild.
Proceedings of the 12th IAPR Workshop on Document Analysis Systems, 2016

Multilingual OCR for Indic Scripts.
Proceedings of the 12th IAPR Workshop on Document Analysis Systems, 2016

First Person Action Recognition Using Deep Learned Descriptors.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Optimizing Average Precision Using Weakly Supervised Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Path Planning for Visual servoing and Navigation using Convex Optimization.
Int. J. Robotics Autom., 2015

Diverse Yet Efficient Retrieval using Hash Functions.
CoRR, 2015

Document Retrieval with Unlimited Vocabulary.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Generic action recognition from egocentric videos.
Proceedings of the 2015 Fifth National Conference on Computer Vision, 2015

Learning metrics for diversity in instance retrieval.
Proceedings of the 2015 Fifth National Conference on Computer Vision, 2015

Efficient face frontalization in unconstrained images.
Proceedings of the 2015 Fifth National Conference on Computer Vision, 2015

Active learning based image annotation.
Proceedings of the 2015 Fifth National Conference on Computer Vision, 2015

A Probabilistic Approach for Image Retrieval Using Descriptive Textual Queries.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Servoing across object instances: Visual servoing for object category.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Can RNNs reliably separate script and language at word and line level?
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Unsupervised feature learning for optical character recognition.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Efficient word image retrieval using fast DTW distance.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Online handwriting recognition using depth sensors.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Multi-label Cross-Modal Retrieval.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Visual Phrases for Exemplar Face Detection.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Domain adaptation by aligning locality preserving subspaces.
Proceedings of the Eighth International Conference on Advances in Pattern Recognition, 2015

Multi-label annotation of music.
Proceedings of the Eighth International Conference on Advances in Pattern Recognition, 2015

Human pose search using deep poselets.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Accurate localization by fusing images and GPS signals.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Fast approximate dynamic warping kernels.
Proceedings of the Second ACM IKDD Conference on Data Sciences, 2015

Exploring Locally Rigid Discriminative Patches for Learning Relative Attributes.
Proceedings of the British Machine Vision Conference 2015, 2015

TennisVid2Text: Fine-grained Descriptions for Domain Specific Videos.
Proceedings of the British Machine Vision Conference 2015, 2015

Semantic Classification of Boundaries of an RGBD Image.
Proceedings of the British Machine Vision Conference 2015, 2015

Fine-grain annotation of cricket videos.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Large scale document image retrieval by automatic word annotation.
Int. J. Document Anal. Recognit., 2014

Efficient Optimization for Average Precision SVM.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Monocular vision based road marking recognition for driver assistance and safety.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2014

Reactionless visual servoing of a dual-arm space robot.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Currency Recognition on Mobile Phones.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Efficient Evaluation of SVM Classifiers Using Error Space Encoding.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Enhancing Word Image Retrieval in Presence of Font Variations.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Identifying Ragas in Indian Music.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Face Recognition in Videos by Label Propagation.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Estimating Floor Regions in Cluttered Indoor Scenes from First Person Camera View.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Providing services on demand by user action modeling on smart phones.
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

Learning to Rank Using High-Order Information.
Proceedings of the Computer Vision - ECCV 2014, 2014

Towards a Robust OCR System for Indic Scripts.
Proceedings of the 11th IAPR International Workshop on Document Analysis Systems, 2014

Parsing World's Skylines Using Shape-Constrained MRFs.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Relative Parts: Distinctive Parts for Learning Relative Attributes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Optimizing Average Precision Using Weakly Supervised Data.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Im2Text and Text2Im: Associating Images and Texts for Cross-Modal Retrieval.
Proceedings of the British Machine Vision Conference, 2014

Scene Text Recognition and Retrieval for Large Lexicons.
Proceedings of the Computer Vision - ACCV 2014, 2014

Learning Partially Shared Dictionaries for Domain Adaptation.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

Optimizing Storage Intensive Vision Applications to Device Capacity.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Image Annotation in Presence of Noisy Labels.
Proceedings of the Pattern Recognition and Machine Intelligence, 2013

Semi-supervised Clustering by Selecting Informative Constraints.
Proceedings of the Pattern Recognition and Machine Intelligence, 2013

Learning Semantic Interaction among Graspable Objects.
Proceedings of the Pattern Recognition and Machine Intelligence, 2013

Indian Movie Face Database: A benchmark for face recognition under wide variations.
Proceedings of the Fourth National Conference on Computer Vision, 2013

Near real-time face parsing.
Proceedings of the Fourth National Conference on Computer Vision, 2013

Learning support order for manipulation in clutter.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Visual localization in highly crowded urban environments.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Multibody VSLAM with relative scale solution for curvilinear motion reconstruction.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Document Specific Sparse Coding for Word Retrieval.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Devanagari Text Recognition: A Transcription Based Formulation.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Error Detection in Highly Inflectional Languages.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Character N-Gram Spotting on Handwritten Documents Using Weakly-Supervised Segmentation.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Sparse Document Image Coding for Restoration.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Bringing Semantics in Word Image Retrieval.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Whole is Greater than Sum of Parts: Recognizing Scene Text Words.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Detection of Cut-and-Paste in Document Images.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Offline Mobile Instance Retrieval with a Small Memory Footprint.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Image Retrieval Using Textual Cues.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Decomposing Bag of Words Histograms.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Generating Image Descriptions Using Semantic Similarities in the Output Space.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

Blocks That Shout: Distinctive Parts for Scene Classification.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Efficient Category Mining by Leveraging Instance Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

Learning Multiple Non-linear Sub-spaces Using K-RBMs.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Exploring SVM for Image Annotation in Presence of Confusing Labels.
Proceedings of the British Machine Vision Conference, 2013

Parsing Clothes in Unrestricted Images.
Proceedings of the British Machine Vision Conference, 2013

Depth really Matters: Improving Visual Salient Region Detection with Depth.
Proceedings of the British Machine Vision Conference, 2013

Compacting Large and Loose Communities.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Efficient and Rich Annotations for Large Photo Collections.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Sparse Representation Based Face Recognition with Limited Labeled Samples.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

2012
Video retrieval by mimicking poses.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Neti Neti: in search of deity.
Proceedings of the Eighth Indian Conference on Vision, Graphics and Image Processing, 2012

Heritage app: annotating images on mobile phones.
Proceedings of the Eighth Indian Conference on Vision, Graphics and Image Processing, 2012

Automatic localization and correction of line segmentation errors.
Proceedings of the Workshop on Document Analysis and Recognition, 2012

Content level access to digital library of India pages.
Proceedings of the Eighth Indian Conference on Vision, Graphics and Image Processing, 2012

A non-local MRF model for heritage architectural image completion.
Proceedings of the Eighth Indian Conference on Vision, Graphics and Image Processing, 2012

Are buildings only instances?: exploration in architectural style categories.
Proceedings of the Eighth Indian Conference on Vision, Graphics and Image Processing, 2012

Sparse discriminative Fisher vectors in visual classification.
Proceedings of the Eighth Indian Conference on Vision, Graphics and Image Processing, 2012

Motion segmentation of multiple objects from a freely moving monocular camera.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Recognition of printed Devanagari text using BLSTM Neural Network.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Partial Least Squares kernel for computing similarities between video sequences.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Logical Itemset Mining.
Proceedings of the 12th IEEE International Conference on Data Mining Workshops, 2012

Image Annotation Using Metric Learning in Semantic Neighbourhoods.
Proceedings of the Computer Vision - ECCV 2012, 2012

Towards Exhaustive Pairwise Matching in Large Image Collections.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Has My Algorithm Succeeded? An Evaluator for Human Pose Estimators.
Proceedings of the Computer Vision - ECCV 2012, 2012

Word Image Retrieval Using Bag of Visual Words.
Proceedings of the 10th IAPR International Workshop on Document Analysis Systems, 2012

Robust Recognition of Degraded Documents Using Character N-Grams.
Proceedings of the 10th IAPR International Workshop on Document Analysis Systems, 2012

Cats and dogs.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Top-down and bottom-up cues for scene text recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Scene Text Recognition using Higher Order Language Priors.
Proceedings of the British Machine Vision Conference, 2012

Image Retrieval Using Eigen Queries.
Proceedings of the Computer Vision, 2012

Action Recognition Using Canonical Correlation Kernels.
Proceedings of the Computer Vision - ACCV 2012, 2012

Learning Hierarchical Bag of Words Using Naive Bayes Clustering.
Proceedings of the Computer Vision - ACCV 2012, 2012

Choosing Linguistics over Vision to Describe Images.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Video Scene Segmentation with a Semantic Similarity.
Proceedings of the 5th Indian International Conference on Artificial Intelligence, 2011

Large scale visual localization in urban environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Privacy Preserving Outlier Detection Using Locality Sensitive Hashing.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Character n-Gram Spotting in Document Images.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

An MRF Model for Binarization of Natural Scene Text.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

BLSTM Neural Network Based Word Retrieval for Hindi Documents.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

The truth about cats and dogs.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Realtime multibody visual SLAM with a smoothly moving monocular camera.
Proceedings of the IEEE International Conference on Computer Vision, 2011

LSH based outlier detection and its application in distributed setting.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Blind authentication: a secure crypto-biometric verification protocol.
IEEE Trans. Inf. Forensics Secur., 2010

Oxford-IIIT TRECVID 2010 - Notebook paper.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Realtime moving object detection from a freely moving monocular camera.
Proceedings of the 2010 IEEE International Conference on Robotics and Biomimetics, 2010

Efficient Privacy Preserving K-Means Clustering.
Proceedings of the Intelligence and Security Informatics, Pacific Asia Workshop, 2010

An adaptive outdoor terrain classification methodology using monocular camera.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Multiple plane tracking using Unscented Kalman Filter.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Towards recognition of degraded words by probabilistic parsing.
Proceedings of the Seventh Indian Conference on Computer Vision, 2010

Realtime motion segmentation based multibody visual SLAM.
Proceedings of the Seventh Indian Conference on Computer Vision, 2010

An indexing approach for speeding-up image classification.
Proceedings of the Seventh Indian Conference on Computer Vision, 2010

Characteristic pattern discovery in videos.
Proceedings of the Seventh Indian Conference on Computer Vision, 2010

Efficient Semantic Indexing for Image Retrieval.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Fast and Spatially-Smooth Terrain Classification Using Monocular Camera.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Nearest neighbor based collection OCR.
Proceedings of the Ninth IAPR International Workshop on Document Analysis Systems, 2010

A post-processing scheme for Malayalam using statistical sub-character language models.
Proceedings of the Ninth IAPR International Workshop on Document Analysis Systems, 2010

Towards more effective distance functions for word image matching.
Proceedings of the Ninth IAPR International Workshop on Document Analysis Systems, 2010

Multi modal semantic indexing for image retrieval.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Generalized RBF feature maps for Efficient Detection.
Proceedings of the British Machine Vision Conference, 2010

Image-based walkthroughs from incremental and partial scene reconstructions.
Proceedings of the British Machine Vision Conference, 2010

Tripartite Graph Models for Multi Modal Retrieval.
Proceedings of the British Machine Vision Conference, 2010

2009
Retrieval of online handwriting by synthesis and matching.
Pattern Recognit., 2009

Oxford-IIIT TRECVID 2009 Notebook paper.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

A Bayesian Approach to Hybrid Image Retrieval.
Proceedings of the Pattern Recognition and Machine Intelligence, 2009

Robust Recognition of Documents by Fusing Results of Word Clusters.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

Managing multilingual OCR project using XML.
Proceedings of the International Workshop on Multilingual OCR, 2009

Efficient privacy preserving video surveillance.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Efficient Biometric Verification in Encrypted Domain.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Empirical Evaluation of Character Classification Schemes.
Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009

Incremental on-line semantic indexing for image retrieval in dynamic databases.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Contextual restoration of severely degraded document images.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Example based video filters.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

Subtitle-free Movie to Script Alignment.
Proceedings of the British Machine Vision Conference, 2009

Planar Scene Modeling from Quasiconvex Subproblems.
Proceedings of the Computer Vision, 2009

2008
Recognition-free search in graphics stream of PDF.
World Digit. Libr., 2008

Matching word images for content-based retrieval from printed document images.
Int. J. Document Anal. Recognit., 2008

Oxford/IIIT TRECVID 2008 - Notebook paper.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

On-line convex optimization based solution for mapping in VSLAM.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Attention-Based Super Resolution from Videos.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Fast and Secure Real-Time Video Encryption.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Document Image Segmentation as a Spectral Partitioning Problem.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Frequency Domain Visual Servoing Using Planar Contours.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Adaptation and Learning for Image Based Navigation.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Autonomous image-based exploration for mobile robot navigation.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Visual servoing based on Gaussian mixture models.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Recognition of books by verification and retraining.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Efficient implementation of SVM for large class problems.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

A novel video encryption technique based on secret sharing.
Proceedings of the International Conference on Image Processing, 2008

Robust image registration with illumination, blur and noise variations for super-resolution.
Proceedings of the IEEE International Conference on Acoustics, 2008

Super-Resolution of Text Images Using Edge-Directed Tangent Field.
Proceedings of the Eighth IAPR International Workshop on Document Analysis Systems, 2008

Private Content Based Image Retrieval.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

FISH: a practical system for fast interactive image search in huge databases.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

2007
Optical Character Recognition of Amharic Documents.
Afr. J. Inf. Commun. Technol., 2007

A Vision System for Monitoring Intermodal Freight Trains.
Proceedings of the 8th IEEE Workshop on Applications of Computer Vision (WACV 2007), 2007

Self Adaptable Recognizer for Document Image Collections.
Proceedings of the Pattern Recognition and Machine Intelligence, 2007

Efficient Search with Changing Similarity Measures on Large Multimedia Datasets.
Proceedings of the Advances in Multimedia Modeling, 2007

Path planning approach to visual servoing with feature visibility constraints: A convex optimization based solution.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Optimizing image and camera trajectories in robot vision control using on-line boosting.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Visual Servoing in Non-Rigid Environments: A Space-Time Approach.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Visual Servoing by Optimization of a 2D/3D Hybrid Objective Function.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Combining Texture and Edge Planar Trackers based on a local Quality Metric.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

On Using Classical Poetry Structure for Indian Language Post-Processing.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

On Segmentation of Documents in Complex Scripts.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Content-level Annotation of Large Collection of Printed Document Images.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Modeling Time-Varying Population for Biometric Authentication.
Proceedings of the 2007 International Conference on Computing: Theory and Applications (ICCTA 2007), 2007

Probabilistic Reverse Annotation for Large Scale Image Retrieval.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Efficient Search in Document Image Collections.
Proceedings of the Computer Vision, 2007

2006
Integration Framework for Improved Visual Servoing in Image and Cartesian Spaces.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Text Driven Temporal Segmentation of Cricket Videos.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Enabling Search over Large Collections of Telugu Document Images - An Automatic Annotation Based Approach.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Learning Segmentation of Documents with Complex Scripts.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Task Specific Factors for Video Characterization.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Robust Homography-Based Control for Camera Positioning in Piecewise Planar Environments.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Computing Eigen Space from Limited Number of Views for Recognition.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Discriminative Actions for Recognising Events.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Dynamic Events as Mixtures of Spatial and Temporal Features.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

Visual Servoing in Presence of Non-Rigid Motion.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Efficient Region Based Indexing and Retrieval for Images with Elastic Bucket Tries.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Target Model Estimation using Particle Filters for Visual Servoing.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Learning Mixtures of Offline and Online features for Handwritten Stroke Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Analysis of Relevance Feedback in Content Based Image Retrieval.
Proceedings of the Ninth International Conference on Control, 2006

Improvement to the Minimization of Hybrid Error Functions for Pose Alignment.
Proceedings of the Ninth International Conference on Control, 2006

Probabilistic Integration of 2D and 3D Cues for Visual Servoing.
Proceedings of the Ninth International Conference on Control, 2006

Digitizing a Million Books: Challenges for Document Analysis.
Proceedings of the Document Analysis Systems VII, 7th International Workshop, 2006

A Semi-automatic Adaptive OCR for Digital Libraries.
Proceedings of the Document Analysis Systems VII, 7th International Workshop, 2006

Retrieval from Document Image Collections.
Proceedings of the Document Analysis Systems VII, 7th International Workshop, 2006

2005
Learning to Segment Document Images.
Proceedings of the Pattern Recognition and Machine Intelligence, 2005

Design of Hierarchical Classifier with Hybrid Architectures.
Proceedings of the Pattern Recognition and Machine Intelligence, 2005

Recognition of Printed Amharic Documents.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

Configurable Hybrid Architectures for Character Recognition Applications.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

Discriminant Substrokes for Online Handwriting Recognition.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

2004
Fourier domain representation of planar curves for recognition in multiple views.
Pattern Recognit., 2004

Discrete contours in multiple views: approximation and recognition.
Image Vis. Comput., 2004

Geometric Structure Computation from Conics.
Proceedings of the ICVGIP 2004, 2004

Searching in Document Images.
Proceedings of the ICVGIP 2004, 2004

Building blocks for autonomous navigation using contour correspondences.
Proceedings of the 2004 International Conference on Image Processing, 2004

Representation and annotation of online handwritten data.
Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition, 2004

Constraints on Coplanar Moving Points.
Proceedings of the Computer Vision, 2004

2003
A Bilingual OCR for Hindi-Telugu Documents and its Applications.
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2003

Tools for Developing OCRs for Indian Scripts.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2003

2002
An adaptive multifeature correspondence algorithm for stereo using dynamic programming.
Pattern Recognit. Lett., 2002

Generalised correlation for multi-feature correspondence.
Pattern Recognit., 2002

Multiview Constraints for Recognition of Planar Curves in Fourier Domain.
Proceedings of the ICVGIP 2002, 2002

Algebraic Constraints on Moving Points in Multiple Views.
Proceedings of the ICVGIP 2002, 2002

Polygonal Approximation of Closed Curves across Multiple Views.
Proceedings of the ICVGIP 2002, 2002

Planar Shape Recognition across Multiple Views.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Video frame alignment in multiple views.
Proceedings of the 2002 International Conference on Image Processing, 2002

Towards Fuzzy Calibration.
Proceedings of the Advances in Soft Computing, 2002

2000
Analysis of fuzzy thresholding schemes.
Pattern Recognit., 2000

1997
Investigations on fuzzy thresholding based on fuzzy clustering.
Pattern Recognit., 1997

1996
Fuzzy statistics of digital images.
IEEE Signal Process. Lett., 1996

Incorporation of gray-level imprecision in representation and processing of digital images.
Pattern Recognit. Lett., 1996

1995
Detection of clusters of distinct geometry: A step towards generalised fuzzy clustering.
Pattern Recognit. Lett., 1995

