Dhiraj Joshi

According to our database1, Dhiraj Joshi authored at least 61 papers between 2002 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Data-Prep-Kit: getting your data ready for LLM application development.
CoRR, 2024

2023
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Contrastive Mean Teacher for Domain Adaptive Object Detectors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.
CoRR, 2020

2019
Automatic Curation of Sports Highlights Using Multimodal Excitement Features.
IEEE Trans. Multim., 2019

Affective Computing for Large-Scale Heterogeneous Multimedia Data: A Survey.
CoRR, 2019

Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Grounding Spoken Words in Unlabeled Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection.
CoRR, 2018

The Excitement of Sports: Automatic Highlights Using Audio/Visual Cues.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Harnessing A.I. for Augmenting Creativity: Application to Movie Trailer Creation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

IBM High-Five: Highlights From Intelligent Video Engine.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Image-based user profiling of frequent and regular venue categories.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Automatic Curation of Golf Highlights Using Multimodal Excitement Features.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Using business-aware latent topics for image captioning in social media.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Business-Aware Visual Concept Discovery from Social Media for Multimodal Business Venue Recognition.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Building User Profiles from Shared Photos.
Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Inferring crowd-sourced venues for tweets.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
Finding selfies of users in microblogged photos.
Proceedings of the SoMeRA'14, 2014

Social Media-based Profiling of Business Locations.
Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014

Multi-modal Language Models for Lecture Video Retrieval.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Scalable Image Search with Multiple Index Tables.
Proceedings of the International Conference on Multimedia Retrieval, 2014

2013
Reinforced Similarity Integration in Image-Rich Information Networks.
IEEE Trans. Knowl. Data Eng., 2013

2012
Inferring photographic location using geotagged web images.
Multim. Tools Appl., 2012

Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-task Learning.
Proceedings of the 2012 IEEE International Symposium on Multimedia, 2012

Tag Cloud++ - Scalable Tag Clouds for Arbitrary Layouts.
Proceedings of the 2012 IEEE International Symposium on Multimedia, 2012

2011
Aesthetics and Emotions in Images.
IEEE Signal Process. Mag., 2011

Geotagging in multimedia and computer vision - a survey.
Multim. Tools Appl., 2011

Reliving on demand: a total viewer experience.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Dynamic media show drivable by semantics.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Finding geographically representative music via social media.
Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies, Scottsdale, AZ, USA, November 28, 2011

Using Geotags to Derive Rich Tag-Clouds for Image Annotation.
Proceedings of the Social Media Modeling and Computing., 2011

2010
iRIN: image retrieval in image-rich information networks.
Proceedings of the 19th International Conference on World Wide Web, 2010

Semantic understanding of geotagged pictures.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Suggesting Songs for Media Creation Using Semantics.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Exploring user image tags for geo-location inference.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
<i>Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval</i>.
J. Electronic Imaging, 2009

Connecting people in photo-sharing sites by photo content and user annotations.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Geo-location inference from image content and user tags.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

2008
<i>Multimedia Retrieval</i>.
J. Electronic Imaging, 2008

Image retrieval: Ideas, influences, and trends of the new age.
ACM Comput. Surv., 2008

Event recognition: viewing the world with a third eye.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Recognizing picture-taking environment from satellite images: A feasibility study.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Inferring generic activities and events from image content and bags of geo-tags.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

2007
Tagging over time: real-world image annotation by lightweight meta-learning.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

A Greedy Performance Driven Algorithm for Decision Fusion Learning.
Proceedings of the International Conference on Image Processing, 2007

Semantics reinforcement and fusion learning for multimedia streams.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
The Story Picturing Engine - a system for automatic text illustration.
ACM Trans. Multim. Comput. Commun. Appl., 2006

A Computationally Efficient Approach to the Estimation of Two- and Three-Dimensional Hidden Markov Models.
IEEE Trans. Image Process., 2006

PARAgrab: A Comprehensive Architecture for Web Image Management and Multimodal Querying.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

IBM Research TRECVID-2006 Video Retrieval System.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Discovering groups of people in Google news.
Proceedings of the 1st ACM international workshop on Human-centered multimedia, 2006

Studying Aesthetics in Photographic Images Using a Computational Approach.
Proceedings of the Computer Vision, 2006

2005
Multimedia Systems and Content-Based Image Retrieval. By Sagarmay Deb, Idea Group Publishing, 2004, $79.95 ISBN 1-59140-156-9.
Inf. Process. Manag., 2005

Parameter estimation of multi-dimensional hidden Markov models - a scalable approach.
Proceedings of the 2005 International Conference on Image Processing, 2005

2004
The story picturing engine: finding elite images to illustrate a story using mutual reinforcement.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Stochastic modeling of volume images with a 3-d hidden markov model.
Proceedings of the 2004 International Conference on Image Processing, 2004

2002
A Computationally Efficient Evolutionary Algorithm for Real-Parameter Optimization.
Evol. Comput., 2002

Real-coded evolutionary algorithms with parent-centric recombination.
Proceedings of the 2002 Congress on Evolutionary Computation, 2002


  Loading...