Rongrong Ji

CoRR, 2024

CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method.

[BibT_eX]

[DOI]

CoRR, 2024

Multi-Modal Prompt Learning on Blind Image Quality Assessment.

[BibT_eX]

[DOI]

CoRR, 2024

NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation.

[BibT_eX]

[DOI]

CoRR, 2024

ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model.

[BibT_eX]

[DOI]

CoRR, 2024

Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization.

[BibT_eX]

[DOI]

CoRR, 2024

DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

DMAD: Dual Memory Bank for Real-World Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Autoregressive Queries for Adaptive Tracking with Spatio-TemporalTransformers.

[BibT_eX]

[DOI]

CoRR, 2024

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling.

[BibT_eX]

[DOI]

CoRR, 2024

EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

Unified-Width Adaptive Dynamic Network for All-In-One Image Restoration.

[BibT_eX]

[DOI]

CoRR, 2024

Feature Denoising Diffusion Model for Blind Image Quality Assessment.

[BibT_eX]

[DOI]

CoRR, 2024

Cross-Modality Perturbation Synergy Attack for Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2024

Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Adaptive Selection based Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

3D-GRES: Generalized 3D Referring Expression Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multimodal Inplace Prompt Tuning for Open-set Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Deep Instruction Tuning for Segment Anything Model.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Prompting to Adapt Foundational Segmentation Models.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

QueryMatch: A Query-based Contrastive Learning Framework for Weakly Supervised Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ERQ: Error Reduction for Post-Training Quantization of Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Outlier-aware Slicing for Post-Training Quantization in Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

CaM: Cache Merging for Memory-efficient LLMs Inference.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

AffineQuant: Affine Transformation Quantization for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Exploring Target Representations for Masked Autoencoders.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

GreedyAgent: Crafting Efficient Agents for Meta-learning from Learning Curves via Greedy Algorithm Selection.

[BibT_eX]

[DOI]

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Functionally Similar Multi-Label Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Code Membership Inference for Detecting Unauthorized Data Use in Code Pre-trained Language Models.

[BibT_eX]

[DOI]

Sheng Zhang

Hui Li

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AnyTrans: Translate AnyText in the Image with Large Scale Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-spoofing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Multi-branch Collaborative Learning Network for 3D Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

AccDiffusion: An Accurate Method for Higher-Resolution Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

CamoTeacher: Dual-Rotation Consistency Learning for Semi-supervised Camouflaged Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

DiffuMatting: Synthesizing Arbitrary Objects with Matting-Level Annotation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Textual Grounding for Open-Vocabulary Visual Information Extraction in Layout-Diversified Documents.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GraCo: Granularity-Controllable Interactive Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

UniPTS: A Unified Framework for Proficient Post-Training Sparsity.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Aligning and Prompting Everything All at Once for Universal Visual Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PortraitBooth: A Versatile Portrait Model for Fast Identity-Preserved Personalization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FocSAM: Delving Deeply into Focused Objects in Segmenting Anything.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Learning Image Demoiréing from Unpaired Real Data.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Toward Open-Set Human Object Interaction Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Lottery Jackpots Exist in Pre-Trained Models.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Super Vision Transformer.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., December, 2023

Pruning Networks With Cross-Layer Ranking & k-Reciprocal Nearest Filters.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., November, 2023

Distilling a Powerful Student Model via Online Knowledge Distillation.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., November, 2023

Carrying Out CNN Channel Pruning in a White Box.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., October, 2023

Prioritized Subnet Sampling for Resource-Adaptive Supernet Training.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Training Compact CNNs for Image Classification Using Dynamic-Coded Filter Fusion.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Towards local visual modeling for image captioning.

[BibT_eX]

[DOI]

Pattern Recognit., June, 2023

SiMaN: Sign-to-Magnitude Network Binarization.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Generating Hypergraph-Based High-Order Representations of Whole-Slide Histopathological Images for Survival Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

DDPNAS: Efficient Neural Architecture Search via Dynamic Distribution Pruning.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., May, 2023

Leveraging Local and Global Cues for Visual Tracking via Parallel Interaction Network.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., April, 2023

Robust Tracking via Uncertainty-Aware Semantic Consistency.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., April, 2023

1xN Pattern for Pruning Convolutional Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

SiamBAN: Target-Aware Tracking With Siamese Box Adaptive Network.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Dynamic Support Network for Few-Shot Class Incremental Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

HGNN<sup>+</sup>: General Hypergraph Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2023

Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Knowing What it is: Semantic-Enhanced Dual Attention Transformer.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Learning Efficient GANs for Image Translation via Differentiable Masks and Co-Attention Distillation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Semantically Consistent Visual Representation for Adversarial Robustness.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2023

Adaptive Feature Selection for No-Reference Image Quality Assessment using Contrastive Mitigating Semantic Noise Sensitivity.

[BibT_eX]

[DOI]

CoRR, 2023

Boosting the Cross-Architecture Generalization of Dataset Distillation through an Empirical Study.

[BibT_eX]

[DOI]

CoRR, 2023

Less is More: Learning Reference Knowledge Using No-Reference Image Quality Assessment.

[BibT_eX]

[DOI]

CoRR, 2023

X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation.

[BibT_eX]

[DOI]

CoRR, 2023

I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization.

[BibT_eX]

[DOI]

CoRR, 2023

NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning.

[BibT_eX]

[DOI]

CoRR, 2023

JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Unified Token Learning for Vision-Language Tracking.

[BibT_eX]

[DOI]

CoRR, 2023

DLIP: Distilling Language-Image Pre-training.

[BibT_eX]

[DOI]

CoRR, 2023

M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce.

[BibT_eX]

[DOI]

CoRR, 2023

Continual Face Forgery Detection via Historical Distribution Preserving.

[BibT_eX]

[DOI]

CoRR, 2023

Towards General Visual-Linguistic Face Forgery Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer.

[BibT_eX]

[DOI]

CoRR, 2023

Approximated Prompt Tuning for Vision-Language Pre-trained Models.

[BibT_eX]

[DOI]

CoRR, 2023

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Spatial Re-parameterization for N: M Sparsity.

[BibT_eX]

[DOI]

CoRR, 2023

Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting.

[BibT_eX]

[DOI]

CoRR, 2023

CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

MultiQuant: A Novel Multi-Branch Topology Method for Arbitrary Bit-width Network Quantization.

[BibT_eX]

[DOI]

CoRR, 2023

Distribution-Flexible Subset Quantization for Post-Quantizing Super-Resolution Networks.

[BibT_eX]

[DOI]

CoRR, 2023

Latent Feature Relation Consistency for Adversarial Robustness.

[BibT_eX]

[DOI]

CoRR, 2023

CAT: Collaborative Adversarial Training.

[BibT_eX]

[DOI]

CoRR, 2023

Attention Disturbance and Dual-Path Constraint Network for Occluded Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2023

Towards End-to-end Semi-supervised Learning for One-stage Object Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Efficient Visual Adaption via Structural Re-parameterization.

[BibT_eX]

[DOI]

CoRR, 2023

Spectral Aware Softmax for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Invariant Representation for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2023

Unsupervised Domain Adaptation on Person Re-Identification via Dual-level Asymmetric Mutual Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Self-supervised Graph Representation Learning for Black Market Account Detection.

[BibT_eX]

[DOI]

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Two-Stage Deep Learning Segmentation for Tiny Brain Regions.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Adversarial Robustness via Information Bottleneck Distillation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Semi-Supervised Panoptic Narrative Grounding.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Occlusion Disentanglement with Fine-grained Localization for Occluded Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Improving Human-Object Interaction Detection via Virtual Image Learning.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

PixelFace+: Towards Controllable Face Generation and Manipulation with Text Descriptions and Segmentation Masks.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

EALink: An Efficient and Accurate Pre-Trained Framework for Issue-Commit Link Recovery.

[BibT_eX]

[DOI]

Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

RefBERT: A Two-Stage Pre-trained Framework for Automatic Rename Refactoring.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

Interactive Object Placement with Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Bi-directional Masks for Efficient N: M Sparse Training.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Real-Time Image Demoiréing on Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

InterFormer Real-time Interactive Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Pseudo-label Alignment for Semi-supervised Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffRate : Differentiable Compression Rate for Efficient Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SMMix: Self-Motivated Image Mixing for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Category-aware Allocation Transformer for Weakly Supervised Object Localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DistilPose: Tokenized Pose Regression with Heatmap Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Meta Architecture for Point Cloud Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Clover: Towards A Unified Video-Language Alignment and Fusion Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Discriminator-Cooperated Feature Map Distillation for GAN Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

You Only Segment Once: Towards Real-Time Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OMPQ: Orthogonal Mixed Precision Quantization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

CF-ViT: A General Coarse-to-Fine Method for Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Attention-Based Neural Architecture Search for Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2022

Network Pruning Using Adaptive Exemplar Filters.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2022

Filter Sketch for Network Pruning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2022

Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Knowing What to Learn: A Metric-Oriented Focal Mechanism for Image Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Disentangling Task-Oriented Representations for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Plenty is Plague: Fine-Grained Learning for Visual Question Answering.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Fast Class-Wise Updating for Online Hashing.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Towards Robust Adversarial Training via Dual-label Supervised and Geometry Constraint.

[BibT_eX]

[DOI]

Int. J. Softw. Informatics, 2022

Exploring Content Relationships for Distilling Efficient GANs.

[BibT_eX]

[DOI]

CoRR, 2022

Shadow Removal by High-Quality Shadow Synthesis.

[BibT_eX]

[DOI]

CoRR, 2022

Meta Architecure for Point Cloud Analysis.

[BibT_eX]

[DOI]

CoRR, 2022

Exploiting the Partly Scratch-off Lottery Ticket for Quantization-Aware Training.

[BibT_eX]

[DOI]

CoRR, 2022

LAB-Net: LAB Color-Space Oriented Lightweight Network for Shadow Removal.

[BibT_eX]

[DOI]

CoRR, 2022

CycleTrans: Learning Neutral yet Discriminative Features for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2022

Clover: Towards A Unified Video-Language Alignment and Fusion Model.

[BibT_eX]

[DOI]

CoRR, 2022

Super Vision Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

Shadow-Aware Dynamic Convolution for Shadow Removal.

[BibT_eX]

[DOI]

CoRR, 2022

What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study.

[BibT_eX]

[DOI]

CoRR, 2022

End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2022

PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation.

[BibT_eX]

[DOI]

CoRR, 2022

Global2Local: A Joint-Hierarchical Attention for Video Captioning.

[BibT_eX]

[DOI]

CoRR, 2022

Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation.

[BibT_eX]

[DOI]

CoRR, 2022

Differentiated Relevances Embedding for Group-based Referring Expression Comprehension.

[BibT_eX]

[DOI]

CoRR, 2022

Coarse-to-Fine Vision Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

Optimizing Gradient-driven Criteria in Network Sparsity: Gradient is All You Need.

[BibT_eX]

[DOI]

CoRR, 2022

What Hinders Perceptual Quality of PSNR-oriented Methods?

[BibT_eX]

[DOI]

CoRR, 2022

Deepwalk-aware graph convolutional networks.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2022

Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Best Combination for Efficient N: M Sparsity.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Dynamic Prototype Mask for Occluded Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Open-Ended Text-to-Face Generation, Combination and Manipulation.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Learning Dynamic Prior Knowledge for Text-to-Face Pixel Synthesis.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cycle Encoding of a StyleGAN Encoder for Improved Reconstruction and Editability.

[BibT_eX]

[DOI]

Xudong Mao

Aurele Tohokantche Gnanha

Zhenguo Yang

Qing Li

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Searching Lightweight Neural Network for Image Signal Processing.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Visual Tempo Contrastive Learning for Few-Shot Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

MDNet: Motion Distinction Network for Effective Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

SeqTR: A Simple Yet Universal Network for Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Fine-grained Data Distribution Alignment for Post-Training Quantization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

An Information Theoretic Approach for Attention-Driven Face Forgery Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.

[BibT_eX]

[DOI]

Joni-Kristian Kämäräinen

Alireza Memarmoghadam

Christian Micheloni

Payman Moallem

Le Thanh Nguyen-Meidine

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

ARM: Any-Time Super-Resolution Method.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Training-free Transformer Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Architecture Search with Representation Mutual Information.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DIFNet: Boosting Visual Information Flow for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Active Teacher for Semi-Supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Boosting Crowd Counting via Multifaceted Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dual Contrastive Learning for General Face Forgery Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Learning to Learn Transferable Attack.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Aggregating Global and Local Visual Representation for Vehicle Re-IDentification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Uncovering Media Bias via Social Network Learning.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2021

Beyond Universal Person Re-Identification Attack.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2021

Bio-Inspired Deep Attribute Learning Towards Facial Aesthetic Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Affect. Comput., 2021

Joint segmentation and detection of COVID-19 via a sequential region generation network.

[BibT_eX]

[DOI]

Pattern Recognit., 2021

Evolving Fully Automated Machine Learning via Life-Long Knowledge Anchors.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

MIGO-NAS: Towards Fast and Generalizable Neural Architecture Search.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Winning Solutions and Post-Challenge Analyses of the ChaLearn AutoDL Challenge 2019.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Cauchy loss induced block diagonal representation for robust multi-view subspace clustering.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Real-time semantic segmentation via sequential knowledge distillation.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Binarized Neural Architecture Search for Efficient Object Recognition.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2021

Towards Language-guided Visual Recognition via Dynamic Convolutions.

[BibT_eX]

[DOI]

CoRR, 2021

OMPQ: Orthogonal Mixed Precision Quantization.

[BibT_eX]

[DOI]

CoRR, 2021

Prioritized Subnet Sampling for Resource-Adaptive Supernet Training.

[BibT_eX]

[DOI]

CoRR, 2021

Fine-grained Data Distribution Alignment for Post-Training Quantization.

[BibT_eX]

[DOI]

CoRR, 2021

An Information Theory-inspired Strategy for Automatic Network Pruning.

[BibT_eX]

[DOI]

CoRR, 2021

Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion.

[BibT_eX]

[DOI]

CoRR, 2021

GuidedMix-Net: Learning to Improve Pseudo Masks Using Labeled Images as Reference.

[BibT_eX]

[DOI]

CoRR, 2021

You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient.

[BibT_eX]

[DOI]

CoRR, 2021

1×N Block Pattern for Network Sparsity.

[BibT_eX]

[DOI]

CoRR, 2021

ISTR: End-to-End Instance Segmentation with Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

Black-Box Dissector: Towards Erasing-based Hard-Label Model Stealing Attack.

[BibT_eX]

[DOI]

CoRR, 2021

Carrying out CNN Channel Pruning in a White Box.

[BibT_eX]

[DOI]

CoRR, 2021

DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Lottery Jackpots Exist in Pre-trained Models.

[BibT_eX]

[DOI]

CoRR, 2021

Learnable Expansion-and-Compression Network for Few-shot Class-Incremental Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Distilling a Powerful Student Model via Online Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2021

On Evolving Attention Towards Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2021

DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results.

[BibT_eX]

[DOI]

CoRR, 2021

SiMaN: Sign-to-Magnitude Network Binarization.

[BibT_eX]

[DOI]

CoRR, 2021

Aurora Guard: Reliable Face Anti-Spoofing via Mobile Lighting System.

[BibT_eX]

[DOI]

CoRR, 2021

Non-Parametric Adaptive Network Pruning.

[BibT_eX]

[DOI]

CoRR, 2021

Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

CDP: Towards Optimal Filter Pruning via Class-wise Discriminative Power.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Long-Range Feature Propagating for Natural Image Matting.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

RecycleNet: An Overlapped Text Instance Recovery Approach.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

E2Net: Excitative-Expansile Learning for Weakly Supervised Object Localization.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Towards Robustness Against Natural Language Word Substitutions.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

The Ninth Visual Object Tracking VOT2021 Challenge Results.

[BibT_eX]

[DOI]

Joni-Kristian Kämäräinen

Mohamed H. Abdelpakey

Alireza Memarmoghadam

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

A Dual-stream Framework for 3D Mask Face Presentation Attack Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

EC-DARTS: Inducing Equalized and Consistent Optimization into DARTS.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

TRAR: Routing the Attention Spans in Transformer for Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ReCU: Reviving the Dead Weights in Binary Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Aha! Adaptive History-driven Attack for Decision-based Black-box Models.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Architecture Disentanglement for Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Image-to-Image Translation via Hierarchical Style Disentanglement.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Compact CNNs via Collaborative Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Domain General Face Forgery Detection by Learning to Weight.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dual-level Collaborative Transformer for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Local Relation Learning for Face Forgery Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dual Distribution Alignment Network for Generalizable Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

A New Transfer Function for Volume Visualization of Aortic Stent and Its Application to Virtual Endoscopy.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2020

Toward Compact ConvNets via Structure-Sparsity Regularized Filter Pruning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2020

Fine-Grained Spatial Alignment Model for Person Re-Identification With Focal Triplet Loss.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Category-Aware Spatial Constraint for Weakly Supervised Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Similarity-Preserving Linkage Hashing for Online Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Every node counts: Self-ensembling graph convolutional networks for semi-supervised learning.

[BibT_eX]

[DOI]

Pattern Recognit., 2020

Semi-Supervised Adversarial Monocular Depth Estimation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Link-aware semi-supervised hypergraph.

[BibT_eX]

[DOI]

Inf. Sci., 2020

Hadamard Matrix Guided Online Hashing.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2020

Learning Efficient GANs using Differentiable Masks and co-Attention Distillation.

[BibT_eX]

[DOI]

CoRR, 2020

PAMS: Quantized Super-Resolution via Parameterized Max Scale.

[BibT_eX]

[DOI]

CoRR, 2020

Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion.

[BibT_eX]

[DOI]

CoRR, 2020

Learning Task-oriented Disentangled Representations for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

CoRR, 2020

Dual Distribution Alignment Network for Generalizable Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2020

Architecture Disentanglement for Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2020

ASFD: Automatic and Scalable Face Detector.

[BibT_eX]

[DOI]

CoRR, 2020

Distribution Distillation Loss: Generic Approach for Improving Face Recognition from Hard Samples.

[BibT_eX]

[DOI]

CoRR, 2020

Filter Sketch for Network Pruning.

[BibT_eX]

[DOI]

CoRR, 2020

UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Rotated Binary Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Exploring Language Prior for Mode-Sensitive Visual Attention Modeling.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cascade Grouped Attention Network for Referring Expression Segmentation.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Attacking Image Captioning Towards Accuracy-Preserving Target Words Removal.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Channel Hypergraph Collaborative Filtering.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Channel Pruning via Automatic Structure Search.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Polynomial Universal Adversarial Perturbations for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Multiple Expert Brainstorming for Domain Adaptive Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Enabling Deep Residual Networks for Weakly Supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

PAMS: Quantized Super-Resolution via Parameterized Max Scale.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Interpretable Neural Network Decoupling.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Improving Face Recognition from Hard Samples via Distribution Distillation Loss.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

API-Net: Robust Generative Classifier via a Single Discriminator.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

SSCGAN: Facial Attribute Editing via Style Skip Connections.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Anti-bandit Neural Architecture Search for Model Defense.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Cogradient Descent for Bilinear Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Rethinking Performance Estimation in Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Noise-Aware Fully Webly Supervised Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Filter Grafting for Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

HRank: Filter Pruning Using High-Rank Feature Map.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Projection & Probability-Driven Black-Box Attack.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Siamese Box Adaptive Network for Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

One-Shot Adversarial Attacks on Visual Tracking With Dual Attention.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Salience-Guided Cascaded Suppression Network for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Asymmetric Co-Teaching for Unsupervised Cross-Domain Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Fast Learning of Temporal Action Proposal via Dense Boundary Generator.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Binarized Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Face Sketch Synthesis by Multidomain Adversarial Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2019

Cross-Modality Microblog Sentiment Prediction via Bi-Layer Multimodal Hypergraph Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Deep Manifold Structure Transfer for Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Correntropy-Induced Robust Low-Rank Hypergraph.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Exploring High-Order Correlations for Industry Anomaly Detection.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Electron., 2019

Ordinal Constraint Binary Coding for Approximate Nearest Neighbor Search.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Holistic CNN Compression via Low-Rank Decomposition with Knowledge Transfer.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Font generation based on least squares conditional generative adversarial nets.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2019

Do Hotel Responses Matter?: A Comprehensive Perspective on Investigating Online Reviews.

[BibT_eX]

[DOI]

Wenlong Liu

Inf. Resour. Manag. J., 2019

Universal Adversarial Perturbations Against Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2019

Hadamard Codebook Based Deep Hashing.

[BibT_eX]

[DOI]

CoRR, 2019

Semantic-aware Image Deblurring.

[BibT_eX]

[DOI]

CoRR, 2019

Scene-based Factored Attention for Image Captioning.

[BibT_eX]

[DOI]

CoRR, 2019

Dynamic Neural Network Decoupling.

[BibT_eX]

[DOI]

CoRR, 2019

Dynamic Distribution Pruning for Efficient Network Architecture Search.

[BibT_eX]

[DOI]

CoRR, 2019

Supervised Online Hashing via Similarity Distribution Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Aurora Guard: Real-Time Face Anti-Spoofing via Light Reflection.

[BibT_eX]

[DOI]

CoRR, 2019

Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning.

[BibT_eX]

[DOI]

CoRR, 2019

Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method.

[BibT_eX]

[DOI]

IEEE Access, 2019

FreeAnchor: Learning to Match Anchors for Visual Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Information Competing Process for Learning Diversified Representations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Variational Structured Semantic Inference for Diverse Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Session details: Brave New Idea.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Multi-scale Features for Weakly Supervised Lesion Detection of Cerebral Hemorrhage with Collaborative Learning.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Multi-modal Multi-layer Fusion Network with Average Binary Center Loss for Face Anti-spoofing.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Part Power Set Model for Scale-Free Person Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Hypergraph Induced Convolutional Manifold Networks.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Generalized Zero-Shot Vehicle Detection in Remote Sensing Imagery via Coarse-to-Fine Framework.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Colloquial Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Multi-scale Gem Pooling with N-Pair Center Loss for Fine-Grained Image Search.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Visual-Textual Sentiment Analysis in Product Reviews.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Multinomial Distribution Learning for Effective Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Universal Perturbation Attack Against Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Bayesian Optimized 1-Bit CNNs.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Scoot: A Perceptual Metric for Facial Sketches.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Towards Cross-modality Topic Modelling via Deep Topical Correlation Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Learning Similarity-specific Dictionary for Zero-shot Fine-grained Recognition.

[BibT_eX]

[DOI]

Hong Chen

Proceedings of the IEEE International Conference on Acoustics, 2019

DSNET: Accelerate Indoor Scene Semantic Segmentation.

[BibT_eX]

[DOI]

Feng Jiang

Feng Guo

Proceedings of the IEEE International Conference on Acoustics, 2019

Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Circulant Binary Convolutional Networks: Enhancing the Performance of 1-Bit DCNNs With Circulant Back Propagation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards Optimal Structured CNN Pruning via Generative Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards Visual Feature Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dynamic Capsule Attention for Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Free VQA Models from Knowledge Inertia by Pairwise Inconformity Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Towards Optimal Fine Grained Retrieval via Decorrelated Centralized Loss with Normalize-Scale Layer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

PVRNet: Point-View Relation Neural Network for 3D Shape Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning Neural Bag-of-Matrix-Summarization with Riemannian Network.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Towards Optimal Discrete Online Hashing with Balanced Similarity.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Hypergraph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Weakly Supervised Object Detection via Object-Specific Pixel Gradient.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2018

Predicting Microblog Sentiments via Weakly Supervised Multimodal Deep Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

A Stacked Sparse Autoencoder-Based Detector for Automatic Identification of Neuromagnetic High Frequency Oscillations in Epilepsy.

[BibT_eX]

[DOI]

IEEE Trans. Medical Imaging, 2018

Inductive Multi-Hypergraph Learning and Its Application on View-Based 3D Object Classification.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Action-Attending Graphic Neural Network.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Body Structure Aware Deep Crowd Counting.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Image Quality Assessment for Color Correction Based on Color Contrast Similarity and Color Value Difference.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Face sketch aging via aging oriented principal component analysis.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2018

AAM Based Face Sketch Synthesis.

[BibT_eX]

[DOI]

Shengchuan Zhang

Neural Process. Lett., 2018

Less is More: Unified Model for Unsupervised Multi-Domain Image-to-Image Translation.

[BibT_eX]

[DOI]

CoRR, 2018

Face Sketch Synthesis Style Similarity: A New Structure Co-occurrence Texture Measure.

[BibT_eX]

[DOI]

CoRR, 2018

Topically-informed bilingually-constrained recursive autoencoders for statistical machine translation.

[BibT_eX]

[DOI]

Zhiwei Ruan

Commun. Inf. Syst., 2018

Surface Saliency Detection Based on Curvature Co-Occurrence Histograms.

[BibT_eX]

[DOI]

IEEE Access, 2018

Context-Aware Phrase Representation for Statistical Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Topic-Guided Automatical Human-Simulated Tweeting System.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Dense Auto-Encoder Hashing for Robust Cross-Modality Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Supervised Online Hashing via Hadamard Codebook Learning.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Multimedia-2 (Socical & Emotional Multimedia).

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Robust Face Sketch Synthesis via Generative Adversarial Fusion of Priors and Parametric Sigmoid.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Accelerating Convolutional Networks via Global & Dynamic Filter Pruning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Cross-Modality Person Re-Identification with Generative Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Depth-assisted RefineNet for Indoor Semantic Segmentation.

[BibT_eX]

[DOI]

Manyu Chang

Feng Guo

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Towards Compact Visual Descriptor via Deep Fisher Network with Binary Embedding.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Gamma Mixture Models for Outlier Removal.

[BibT_eX]

[DOI]

Xin Wu

Ling Cai

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Modulated Convolutional Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Generative Adversarial Learning Towards Fast Weakly Supervised Detection.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Asynchronous Bidirectional Decoding for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Output Constraint Transfer for Kernelized Correlation Filter in Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Syst. Man Cybern. Syst., 2017

Continuous Probability Distribution Prediction of Image Emotions via Multitask Shared Sparse Regression.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2017

Mobile Social Multimedia Analytics in the Big Data Era: An Introduction to the Special Issue.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2017

Learning-Based Shadow Recognition and Removal From Monochromatic Natural Images.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Toward Optimal Manifold Hashing via Discrete Locally Linear Embedding.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Exploring Coherent Motion Patterns via Structured Trajectory Learning for Crowd Mood Modeling.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2017

Weakly supervised vehicle detection in satellite images via multi-instance discriminative learning.

[BibT_eX]

[DOI]

Pattern Recognit., 2017

Learning high-dimensional multimedia data.

[BibT_eX]

[DOI]

Xiaofeng Zhu

Zhi Jin

Multim. Syst., 2017

Special issue on "visual semantic analysis with weak supervision".

[BibT_eX]

[DOI]

Multim. Syst., 2017

Deep Spatio-temporal Manifold Network for Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2017

More Than An Answer: Neural Pivot Network for Visual Qestion Answering.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

StructCap: Structured Semantic Embedding for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Deep-based fisher vector for mobile visual search.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Sensitive Information Detection on Cyber-Space.

[BibT_eX]

[DOI]

Proceedings of the Image and Graphics - 9th International Conference, 2017

Optimization Algorithm Toward Deep Features Based Camera Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Image and Graphics - 9th International Conference, 2017

Cross-Modality Binary Code Learning via Fusion Similarity Hashing.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Lattice-Based Recurrent Neural Network Encoders for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Ordinal Constrained Binary Code Learning for Nearest Neighbor Search.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

ESPACE: Accelerating Convolutional Neural Networks via Eliminating Spatial and Channel Redundancy.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Image Categorization by Learning a Propagated Graphlet Path.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2016

Joint Depth and Semantic Inference from a Single Image via Elastic Conditional Random Field.

[BibT_eX]

[DOI]

Yan Wang

Pattern Recognit., 2016

Towards perceptual video cropping with curve fitting.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

On application-unbiased benchmarking of web videos from a social network perspective.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Visual sentiment topic model based microblog image sentiment analysis.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Fast verification via statistical geometric for mobile visual search.

[BibT_eX]

[DOI]

Multim. Syst., 2016

Spectral-spatial co-clustering of hyperspectral image data based on bipartite graph.

[BibT_eX]

[DOI]

Multim. Syst., 2016

Decomposed human localization from social photo album.

[BibT_eX]

[DOI]

Multim. Syst., 2016

Special issue: When social media meets physical world.

[BibT_eX]

[DOI]

Multim. Syst., 2016

A cross-media public sentiment analysis system for microblog.

[BibT_eX]

[DOI]

Multim. Syst., 2016

Discriminative local collaborative representation for online object tracking.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2016

Special issue on weakly supervised learning.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2016

Local consistent hierarchical Hough Match for image re-ranking.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2016

A novel features ranking metric with application to scalable visual and bioinformatics data classification.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Advanced learning for large-scale heterogeneous computing.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Learning for medical imaging.

[BibT_eX]

[DOI]

Neurocomputing, 2016

3D object retrieval with multi-feature collaboration and bipartite graph matching.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Detection based object labeling of 3D point cloud for indoor scenes.

[BibT_eX]

[DOI]

Neurocomputing, 2016

The distributed system for inverted multi-index visual retrieval.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Masked face detection via a modified LeNet.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Multimodal learning for view-based 3D object classification.

[BibT_eX]

[DOI]

Fuhai Chen

Neurocomputing, 2016

Web video topics discovery and structuralization with social network.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Dynamic programming based optimized product quantization for approximate nearest neighbor search.

[BibT_eX]

[DOI]

Yuanzheng Cai

Shaozi Li

Neurocomputing, 2016

Bounding Multiple Gaussians Uncertainty with Application to Object Tracking.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Face recognition by decision fusion of two-dimensional linear discriminant analysis and local binary pattern.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2016

Survey of visual sentiment prediction for social media analysis.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2016

Predicting Personalized Emotion Perceptions of Social Images.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Supervised Matrix Factorization for Cross-Modality Hashing.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Towards Convolutional Neural Networks Compression via Global Error Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Crowd video retrieval via deep attribute-embedding graph ranking.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Towards Building Abstraction by Using Line Segment Descriptor.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Variational Neural Discourse Relation Recognizer.

[BibT_eX]

[DOI]

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Search-Based Depth Estimation via Coupled Dictionary Learning with Large-Margin Structure Inference.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

A spatial-temporal visual mid-level ontology for GIF sentiment analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE Congress on Evolutionary Computation, 2016

Towards Optimal Binary Code Learning via Ordinal Embedding.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Towards Domain Adaptive Vehicle Detection in Satellite Image by Supervised Super-Resolution Transfer.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

3D Object Retrieval with Multimodal Views.

[BibT_eX]

[DOI]

Proceedings of the 9th Eurographics Workshop on 3D Object Retrieval, 2016

2015

Learning a Probabilistic Topology Discovering Model for Scene Categorization.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2015

Probabilistic Skimlets Fusion for Summarizing Multiple Consumer Landmark Videos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

When Location Meets Social Multimedia: A Survey on Vision-Based Recognition and Mining for Geo-Social Multimedia Analytics.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2015

On-Device Mobile Landmark Recognition Using Binarized Descriptor with Multifeature Fusion.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2015

Spatial-Aware Object-Level Saliency Prediction by Learning Graphlet Hierarchies.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Electron., 2015

Social Attribute-Aware Force Model: Exploiting Richness of Interaction for Abnormal Crowd Detection.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

High-capacity reversible watermarking scheme of 2D-vector data.

[BibT_eX]

[DOI]

Chaoguang Men

Signal Image Video Process., 2015

Sparse auto-encoder based feature learning for human body detection in depth image.

[BibT_eX]

[DOI]

Signal Process., 2015

Signal processing and learning methods for 3D semantic analysis.

[BibT_eX]

[DOI]

Signal Process., 2015

Robust infrared target tracking based on particle filter with embedded saliency detection.

[BibT_eX]

[DOI]

Inf. Sci., 2015

Localizing web videos using social images.

[BibT_eX]

[DOI]

Inf. Sci., 2015

Learning for visual semantic understanding in big data.

[BibT_eX]

[DOI]

Neurocomputing, 2015

Feature learning based on SAE-PCA network for human gesture recognition in RGBD images.

[BibT_eX]

[DOI]

Neurocomputing, 2015

Learning for 3D understanding.

[BibT_eX]

[DOI]

Neurocomputing, 2015

Video (GIF) Sentiment Analysis using Large-Scale Mid-Level Ontology.

[BibT_eX]

[DOI]

Zheng Cai

Donglin Cao

CoRR, 2015

A Cross-media Sentiment Analytics Platform For Microblog.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Modeling Inter- and Intra-Part Deformations for Object Structure Parsing.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Multimodal hypergraph learning for microblog sentiment prediction.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

An effective eye states detection method based on the projection of the gray interval distribution.

[BibT_eX]

[DOI]

Xianming Lin

Ling Cai

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Sentiment analysis of Chinese micro-blog based on multi-modal correlation model.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Top Rank Supervised Binary Coding for Visual Search.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Forward stereo obstacle detection with Weighted Hough Transform and local temporal correlation.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Interactive on-device Mobile Landmark Recognition with compact binary codes.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Rank Preserving Hashing for Rapid Image Search.

[BibT_eX]

[DOI]

Proceedings of the 2015 Data Compression Conference, 2015

Understanding image structure via hierarchical shape parsing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Towards 3D object detection with bimodal deep Boltzmann machines over RGBD imagery.

[BibT_eX]

[DOI]

Wei Liu

Shaozi Li

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Cross-Modality Sentiment Analysis for Social Multimedia.

[BibT_eX]

[DOI]

Donglin Cao

Dazhen Lin

Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Low-Rank Similarity Metric Learning in High Dimensions.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

3D Object Retrieval with Multimodal Views.

[BibT_eX]

[DOI]

Proceedings of the 8th Eurographics Workshop on 3D Object Retrieval, 2015

2014

Representative Discovery of Structure Cues for Weakly-Supervised Image Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Towards Mobile Document Image Retrieval for Digital Library.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Weakly Supervised Multi-Graph Learning for Robust Image Reranking.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Learning High-Level Feature by Deep Belief Networks for 3-D Model Retrieval and Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Actively Learning Human Gaze Shifting Paths for Semantics-Aware Photo Cropping.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Toward Statistical Modeling of Saccadic Eye-Movement and Visual Saliency.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Learning-Based Bipartite Graph Matching for View-Based 3D Model Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Spatiotemporal Grid Flow for Video Retargeting.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Mining Compact Bag-of-Patterns for Low Bit Rate Mobile Visual Search.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Hyperspectral Image Classification Through Bilayer Graph-Based Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

3-D Object Retrieval With Hausdorff Distance Learning.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Electron., 2014

Spectral-Spatial Constraint Hyperspectral Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2014

Symbiotic Tracker Ensemble Toward A Unified Tracking Framework.

[BibT_eX]

[DOI]

Yue Gao

Longfei Zhang

Alexander G. Hauptmann

IEEE Trans. Circuits Syst. Video Technol., 2014

Improved and Promising Identificationof Human MicroRNAs by Incorporatinga High-Quality Negative Set.

[BibT_eX]

[DOI]

IEEE ACM Trans. Comput. Biol. Bioinform., 2014

Visual tracking via weakly supervised learning from multiple imperfect oracles.

[BibT_eX]

[DOI]

Pattern Recognit., 2014

Where should I stand? Learning based human position recommendation for mobile photographing.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2014

Online semi-supervised compressive coding for robust visual tracking.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2014

Structured partial least squares for simultaneous object tracking and segmentation.

[BibT_eX]

[DOI]

Neurocomputing, 2014

Robust tracking via patch-based appearance model and local background estimation.

[BibT_eX]

[DOI]

Neurocomputing, 2014

Single/cross-camera multiple-person tracking by graph matching.

[BibT_eX]

[DOI]

Neurocomputing, 2014

Online MIL tracking with instance-level semi-supervised learning.

[BibT_eX]

[DOI]

Neurocomputing, 2014

Large-Scale Geosocial Multimedia [Guest editorial].

[BibT_eX]

[DOI]

IEEE Multim., 2014

Discriminative Orthogonal Nonnegative matrix factorization with flexibility for data representation.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2014

Efficient semantic image segmentation with multi-class ranking prior.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2014

Pursuing Detector Efficiency for Simple Scene Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

Hacking Chinese Touclick CAPTCHA by Multi-Scale Corner Structure Model with Fast Pattern Matching.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Robust nonnegative matrix factorization via L1 norm regularization by multiplicative updating rules.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Microblog Sentiment Analysis Based on Cross-media Bag-of-words Model.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

RGBD Salient Object Detection: A Benchmark and Algorithms.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

2013

Nonlinear scrambling-based reversible watermarking for 2D-vector maps.

[BibT_eX]

[DOI]

Chaoguang Men

Vis. Comput., 2013

Image retrieval with query-adaptive hashing.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2013

Learning to Distribute Vocabulary Indexing for Scalable Visual Search.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2013

Learning from mobile contexts to minimize the mobile location search latency.

[BibT_eX]

[DOI]

Signal Process. Image Commun., 2013

Weakly supervised codebook learning by iterative label propagation with graph quantization.

[BibT_eX]

[DOI]

Signal Process., 2013

Bidirectional-isomorphic manifold learning at image semantic understanding & representation.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2013

Visual attention modeling based on short-term environmental adaption.

[BibT_eX]

[DOI]

Xiaoshuai Sun

J. Vis. Commun. Image Represent., 2013

Background subtraction driven seeds selection for moving objects segmentation and matting.

[BibT_eX]

[DOI]

Neurocomputing, 2013

A Bayesian framework for dense depth estimation based on spatial-temporal correlation.

[BibT_eX]

[DOI]

Neurocomputing, 2013

Mining spatiotemporal video patterns towards robust action retrieval.

[BibT_eX]

[DOI]

Neurocomputing, 2013

Learning Compact Visual Descriptors for Low Bit Rate Mobile Landmark Search.

[BibT_eX]

[DOI]

AI Mag., 2013

Seeing actions through scene context.

[BibT_eX]

[DOI]

Proceedings of the 2013 Visual Communications and Image Processing, 2013

Decomposed human localization in personal photo albums.

[BibT_eX]

[DOI]

Proceedings of the 2013 Visual Communications and Image Processing, 2013

A new camera self-calibration method based on CSA.

[BibT_eX]

[DOI]

Proceedings of the 2013 Visual Communications and Image Processing, 2013

Saliency detection by adaptive clustering.

[BibT_eX]

[DOI]

Proceedings of the 2013 Visual Communications and Image Processing, 2013

Geographical Retagging.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Stereotime: a wireless 2D and 3D switchable video communication system.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Query-dependent visual dictionary adaptation for image reranking.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Large-scale visual sentiment ontology and detectors using adjective noun pairs.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

SentiBank: large-scale ontology and classifiers for detecting sentiment and emotions in visual content.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Semi-Supervised Learning with Manifold Fitted Graphs.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2013, 2013

Spectral-spatial classification of hyperspectral imagery based on Random Forests.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Visual Reranking through Weakly Supervised Multi-graph Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

On the interoperability of local descriptors compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Label Propagation from ImageNet to 3D Point Clouds.

[BibT_eX]

[DOI]

Yan Wang

Shih-Fu Chang

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Salient Object Detection via Low-Rank and Structured Sparse Matrix Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Localizing Web Videos from Heterogeneous Images.

[BibT_eX]

[DOI]

Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

2012

Active query sensing: Suggesting the best query view for mobile visual search.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2012

Context-Aware Semi-Local Feature Detector.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2012

Task-Dependent Visual-Codebook Compression.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2012

3-D Object Retrieval and Recognition With Hypergraph Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2012

Cross-View Down/Up-Sampling Method for Multiview Depth Video Coding.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2012

k-Partite graph reinforcement and its application in multimedia information retrieval.

[BibT_eX]

[DOI]

Inf. Sci., 2012

Location Discriminative Vocabulary Coding for Mobile Landmark Search.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2012

Robust Nonnegative Matrix Factorization via L<sub>1</sub> Norm Regularization

[BibT_eX]

[DOI]

CoRR, 2012

Symbiotic Black-Box Tracker.

[BibT_eX]

[DOI]

Longfei Zhang

Yue Gao

Alexander G. Hauptmann

Gangyi Ding

Boaz J. Super

Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

View-based 3D object retrieval by bipartite graph matching.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Estimating viewing angles in mobile street view search.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Weakly supervised topic grouping of YouTube search results.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Multi-stage vector quantization towards low bit rate visual search.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Hyperspectral image classification with hypergraph modelling.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Learning multiple codebooks for low bit rate mobile visual search.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Predicting the effectiveness of queries for visual search.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Pruning tree-structured vector quantizer towards low bit rate mobile visual search.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Weak attributes for large-scale image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

What are we looking for: Towards statistical modeling of saccadic eye movements and visual saliency.

[BibT_eX]

[DOI]

Xiaoshuai Sun

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Supervised hashing with kernels.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Towards compact topical descriptors.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Weakly supervised sparse coding with geometric consistency pooling.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Mining flickr landmarks by modeling reconstruction sparsity.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2011

Actor-independent action search using spatiotemporal vocabulary with appearance hashing.

[BibT_eX]

[DOI]

Xiaoshuai Sun

Pattern Recognit., 2011

Building descriptive and discriminative visual codebook for large-scale image applications.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2011

Vocabulary Hierarchy Optimization and Transfer for Scalable Image Search.

[BibT_eX]

[DOI]

IEEE Multim., 2011

Grid-Based Retargeting with Transformation Consistency Smoothing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2011

Video indexing and recommendation based on affective analysis of viewers.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

A mobile location search system with active query sensing.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Active query sensing for mobile location search.

[BibT_eX]

[DOI]

Felix X. Yu

Shih-Fu Chang

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Unsupervised fast anomaly detection in crowds.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning heterogeneous data for hierarchical web video classification.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards low bit rate mobile visual search with multiple-channel coding.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning Compact Visual Descriptor for Low Bit Rate Mobile Landmark Search.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2011, 2011

Sparse representation based visual element analysis.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning the trip suggestion from landmark photos on the web.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

PKUBench: A context rich mobile visual search benchmark.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Video stabilization based on saliency driven SIFT matching and discriminative RANSAC.

[BibT_eX]

[DOI]

Proceedings of the ICIMCS 2011, 2011

Contextual dictionaries for image super resolution.

[BibT_eX]

[DOI]

Proceedings of the ICIMCS 2011, 2011

A spatiotemporal context phrase description for general dynamic texture.

[BibT_eX]

[DOI]

Proceedings of the ICIMCS 2011, 2011

When codeword frequency meets geographical location.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

A lowbit rate vocabulary coding scheme for mobile landmark search.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Sorting local descriptors for lowbit rate mobile visual search.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Nonnegative Spectral Clustering with Discriminative Regularization.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Topic level sampling towards optimized locality sensitive vocabulary coding.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Information, 2011

2010

A rotation and scale invariant texture description approach.

[BibT_eX]

[DOI]

Proceedings of the Visual Communications and Image Processing 2010, 2010

3D silhouette tracking with occlusion inference.

[BibT_eX]

[DOI]

Proceedings of the Visual Communications and Image Processing 2010, 2010

Saliency detection based on short-term sparse representation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2010

Visual saliency as sequential eye fixation probability.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2010

A robust texture descriptor using multifractal analysis with Gabor filter.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Visual topic model for web image annotation.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Mining actor correlations with hierarchical concurrence parsing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

SIGMA: Spatial Integrated Matching Association algorithm for logo detection.

[BibT_eX]

[DOI]

Pengfei Xu

Proceedings of the IEEE International Conference on Acoustics, 2010

Exploring statistical properties for semantic annotation: sparse distributed and convergent assumptions for keywords.

[BibT_eX]

[DOI]

Xianming Liu

Proceedings of the IEEE International Conference on Acoustics, 2010

Visual tracking via weakly supervised learning from multiple imperfect oracles.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Novel observation model for probabilistic object tracking.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Towards semantic embedding in visual vocabulary.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Visual and textual fusion for semantically supervised region-based retrieval.

[BibT_eX]

[DOI]

Multim. Syst., 2009

Photo assessment based on computational visual attention model.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

What is a complete set of keywords for image description & annotation on the web.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Location sensitive indexing for image-based advertising.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Mining city landmarks from blogs by graph modeling.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

VisualCor system: search actor correlations in TV series.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Vocabulary hierarchy optimization for effective and transferable retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008

DRM: dynamic region matching for image retrieval using probabilistic fuzzy matching and boosting feature selection.

[BibT_eX]

[DOI]

Dawei Liang

Signal Image Video Process., 2008

Vision-Based Semi-supervised Homecare with Spatial Constraint.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing, 2008

Attention-driven action retrieval with DTW-based 3d descriptor matching.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Place retrieval with graph-based place-view model.

[BibT_eX]

[DOI]

Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Cross-media manifold learning for image retrieval & annotation.

[BibT_eX]

[DOI]

Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Flexible sub block ordering based intra 4/SPL times/4 prediction.

[BibT_eX]

[DOI]

Hu Wei

Tao Lin

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Clustering-based subspace SVM ensemble for relevance feedback learning.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Directional correlation analysis of local Haar binary pattern for text detection.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Vocabulary tree incremental indexing for scalable location recognition.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Text Particles Multi-band Fusion for Robust Text Detection.

[BibT_eX]

[DOI]

Proceedings of the Image Analysis and Recognition, 5th International Conference, 2008

2007

Visual & textual fusion for region retrieval: from both fuzzy matching and bayesian reasoning aspects.

[BibT_eX]

[DOI]