Yu-Gang Jiang
Orcid: 0000-0002-1907-8567
According to our database1,
Yu-Gang Jiang
authored at least 385 papers
between 2006 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025
2024
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024
Mach. Learn., May, 2024
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition.
ACM Trans. Multim. Comput. Commun. Appl., February, 2024
Int. J. Comput. Vis., February, 2024
Locate Before Answering: Answer Guided Question Localization for Video Question Answering.
IEEE Trans. Multim., 2024
Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2024
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks.
CoRR, 2024
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders.
CoRR, 2024
Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Models.
CoRR, 2024
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation.
CoRR, 2024
EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models.
CoRR, 2024
ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack.
CoRR, 2024
Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers.
CoRR, 2024
CoRR, 2024
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations.
CoRR, 2024
CoRR, 2024
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning.
CoRR, 2024
CoRR, 2024
Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models.
CoRR, 2024
CoRR, 2024
FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model.
CoRR, 2024
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios.
CoRR, 2024
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Highly Transferable Diffusion-based Unrestricted Adversarial Attack on Pre-trained Vision-Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Pattern Recognit., October, 2023
IEEE Trans. Multim., 2023
FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting.
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
IEEE Trans. Image Process., 2023
CoRR, 2023
VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model.
CoRR, 2023
CoRR, 2023
CoRR, 2023
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
CoRR, 2023
CoRR, 2023
Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Suspected Objects Matter: Rethinking Model's Prediction for One-stage Visual Grounding.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Bi-directional Feature Fusion Generative Adversarial Network for Ultra-high Resolution Pathological Image Virtual Re-staining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition.
IEEE Trans. Multim., 2022
IEEE Trans. Multim., 2022
Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.
IEEE Trans. Image Process., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Fighting Malicious Media Data: A Survey on Tampering Detection and Deepfake Detection.
CoRR, 2022
Locate before Answering: Answer Guided Question Localization for Video Question Answering.
CoRR, 2022
Incorporating Locality of Images to Generate Targeted Transferable Adversarial Examples.
CoRR, 2022
MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.
CoRR, 2022
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling.
CoRR, 2022
Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning.
CoRR, 2022
Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding.
CoRR, 2022
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE Trans. Knowl. Data Eng., 2021
IEEE Trans. Image Process., 2021
Predicting Content Similarity via Multimodal Modeling for Video-In-Video Advertising.
IEEE Trans. Circuits Syst. Video Technol., 2021
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Neurocomputing, 2021
Int. J. Comput. Vis., 2021
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation.
CoRR, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization.
IEEE Trans. Multim., 2020
IEEE Trans. Image Process., 2020
IEEE Trans. Image Process., 2020
IEEE Trans. Image Process., 2020
IEEE Trans. Circuits Syst. Video Technol., 2020
IEEE Trans. Circuits Syst. Video Technol., 2020
IEEE Trans. Pattern Anal. Mach. Intell., 2020
IEEE Trans. Pattern Anal. Mach. Intell., 2020
IEEE Trans. Pattern Anal. Mach. Intell., 2020
Colonoscopy Polyp Detection: Domain Adaptation From Medical Report Images to Real-time Videos.
CoRR, 2020
CoRR, 2020
Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition.
CoRR, 2020
Recurrent Memory Reasoning Network for Expert Finding in Community Question Answering.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020
Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2019
IEEE Trans. Image Process., 2019
Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging.
IEEE Trans. Pattern Anal. Mach. Intell., 2019
Sci. China Inf. Sci., 2019
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019
An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
ACM Trans. Multim. Comput. Commun. Appl., 2018
Editorial IEEE Transactions on Multimedia Special Section on Video Analytics: Challenges, Algorithms, and Applications.
IEEE Trans. Multim., 2018
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification.
IEEE Trans. Multim., 2018
IEEE Trans. Knowl. Data Eng., 2018
IEEE Trans. Image Process., 2018
IEEE Trans. Circuits Syst. Video Technol., 2018
Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization.
IEEE Trans. Affect. Comput., 2018
Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content.
IEEE Signal Process. Mag., 2018
Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018
Multim. Tools Appl., 2018
Neurocomputing, 2018
Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective.
CoRR, 2018
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Generating Keyword Queries for Natural Language Queries to Alleviate Lexical Chasm Problem.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
Proceedings of the Frontiers of Multimedia Research, 2018
2017
Comput. Vis. Image Underst., 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
IEEE Trans. Multim., 2016
IEEE Trans. Big Data, 2016
Neurocomputing, 2016
Web video categorization using category-predictive classifiers and category-specific concept classifiers.
Neurocomputing, 2016
Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional, and Quality Clues.
IEEE Multim., 2016
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016
On Stochastic Primal-Dual Hybrid Gradient Approach for Compositely Regularized Minimization.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the British Machine Vision Conference 2016, 2016
2015
IEEE Trans. Image Process., 2015
CHCF: A Cloud-Based Heterogeneous Computing Framework for Large-Scale Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2015
Multim. Tools Appl., 2015
Data Min. Knowl. Discov., 2015
Fudan at TRECVID 2015: Adaptive Feature Fusion for Multimedia Event Detection in Videos.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015
Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015
2014
ACM Trans. Multim. Comput. Commun. Appl., 2014
IEEE Trans. Multim., 2014
IEEE Trans. Multim., 2014
IEEE Trans. Image Process., 2014
Mach. Vis. Appl., 2014
Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues.
J. Comput. Sci. Technol., 2014
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014
Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014
Which Looks Like Which: Exploring Inter-class Relationships in Fine-Grained Visual Categorization.
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
Int. J. Multim. Inf. Retr., 2013
Proceedings of the ACM Multimedia Conference, 2013
Beauty is here: evaluating aesthetics in videos using multimodal features and free training data.
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
Proceedings of the IJCAI 2013, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013
2012
IEEE Trans. Multim., 2012
IEEE Trans. Image Process., 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the International Conference on Multimedia Retrieval, 2012
Proceedings of the International Conference on Multimedia Retrieval, 2012
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
Proceedings of the Computer Vision - ECCV 2012, 2012
Proceedings of the Computer Vision - ECCV 2012, 2012
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
2011
IEEE Trans. Circuits Syst. Video Technol., 2011
IEEE Trans. Circuits Syst. Video Technol., 2011
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Towards textually describing complex video contents with audio-visual concept classifiers.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Consumer video understanding: a benchmark database and an evaluation of human and machine performance.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011
Lost in binarization: query-adaptive ranking for similar image search with compact codes.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011
2010
IEEE Trans. Multim., 2010
Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010
2009
Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval.
Comput. Vis. Image Underst., 2009
VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Semantic context transfer across heterogeneous sources for domain adaptive video search.
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009
2008
Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces.
IEEE Trans. Multim., 2008
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008
Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
2007
Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and search.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007
Towards optimal bag-of-features for object categorization and semantic video retrieval.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007
2006
Modeling Local Interest Points for Semantic Detection and Video Search at TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006
Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation.
Proceedings of the 14th ACM International Conference on Multimedia, 2006
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006