Serena Yeung-Levy

CoRR, 2024

Apollo: An Exploration of Video Understanding in Large Multimodal Models.

Philippe Hansen-Estruch

CoRR, 2024

DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery.

CoRR, 2024

Motion Diffusion-Guided 3D Global HMR from a Dynamic Camera.

CoRR, 2024

Zero-shot Action Localization via the Confidence of Large Vision-Language Models.

Josiah Aklilu

Xiaohan Wang

CoRR, 2024

Ask, Pose, Unite: Scaling Data Acquisition for Close Interactions with Vision Language Models.

CoRR, 2024

How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities.

CoRR, 2024

Continuous Perception Benchmark.

Zeyu Wang

CoRR, 2024

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision.

CoRR, 2024

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding.

CoRR, 2024

Why are Visually-Grounded Language Models Bad at Image Classification?

CoRR, 2024

Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models.

Elaine Sui

Xiaohan Wang

CoRR, 2024

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging.

CoRR, 2024

Multi-Human Mesh Recovery with Transformers.

Zeyu Wang

CoRR, 2024

AdaEmbed: Semi-supervised Domain Adaptation in the Embedding Space.

Ali Mottaghi

Muhammad Abdullah Jamal

Omid Mohareri

CoRR, 2024

Single-View 3D Human Digitalization with Large Reconstruction Models.

CoRR, 2024

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data.

Yuhui Zhang

Elaine Sui

Proceedings of the Twelfth International Conference on Learning Representations, 2024

VideoAgent: Long-Form Video Understanding with Large Language Model as Agent.

Proceedings of the Computer Vision - ECCV 2024, 2024

Depth-Guided NeRF Training via Earth Mover's Distance.

Anita Rau

Josiah Aklilu

F. Christopher Holsinger

Proceedings of the Computer Vision - ECCV 2024, 2024

Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models.

James Burgess

Proceedings of the Computer Vision - ECCV 2024, 2024

Describing Differences in Image Sets with Natural Language.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains.

Laura Bravo Sánchez

Proceedings of the International Conference on 3D Vision, 2024

2023

Self-supervised learning for medical image classification: a systematic review and implementation guidelines.

npj Digit. Medicine, 2023

Author Correction: Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.

npj Digit. Medicine, 2023

Open World Object Detection in the Era of Foundation Models.

CoRR, 2023

INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis.

CoRR, 2023

Viewpoint Textual Inversion: Unleashing Novel View Synthesis with Pretrained 2D Diffusion Models.

James Burgess

CoRR, 2023

ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image.

Zeyu Wang

CoRR, 2023

Diffusion-HPC: Generating Synthetic Images with Realistic Humans.

Laura Bravo Sánchez

CoRR, 2023

LOVM: Language-Only Vision Model Selection.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DataPerf: Benchmarks for Data-Centric AI Development.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

INSPECT: A Multimodal Dataset for Patient Outcome Prediction of Pulmonary Embolisms.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Semi-supervised Detection of Hands in Diverse Open Surgery Environments.

Pranav Vaid

Anita Rau

Proceedings of the Machine Learning for Healthcare Conference, 2023

Video pretraining advances 3D deep learning on chest CT tasks.

Proceedings of the Medical Imaging with Deep Learning, 2023

Diagnosing and Rectifying Vision Models using Language.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generalizable Neural Fields as Partially Observed Neural Processes.

Jeffrey Gu

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PROB: Probabilistic Objectness for Open World Object Detection.

Orr Zohar

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.

npj Digit. Medicine, 2022

DataPerf: Benchmarks for Data-Centric AI Development.

CoRR, 2022

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ALGES: Active Learning with Gradient Embeddings for Semantic Segmentation of Laparoscopic Surgical Images.

Josiah Aklilu

Proceedings of the Machine Learning for Healthcare Conference, 2022

Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation.

Proceedings of the Machine Learning for Health, 2022

Adaptation of Surgical Activity Recognition Models Across Operating Rooms.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Domain Adaptive 3D Pose Augmentation for In-the-Wild Human Mesh Recovery.

Proceedings of the International Conference on 3D Vision, 2022

2021

Deep learning-enabled medical computer vision.

npj Digit. Medicine, 2021

A real-time spatiotemporal AI model analyzes skill in open surgical videos.

CoRR, 2021

Staying in shape: learning invariant shape representations using contrastive learning.

Jeffrey Gu

Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Achieving Trustworthy Biomedical Data Solutions.

Peter Washington

Bethany Percha

Nicholas P. Tatonetti

Jan T. Liphardt

Dennis P. Wall

Proceedings of the Biocomputing 2021: Proceedings of the Pacific Symposium, 2021

Capturing implicit hierarchical structure in 3D biomedical images with self-supervised hyperbolic representations.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Personalized Federated Learning with First Order Model Optimization.

Proceedings of the 9th International Conference on Learning Representations, 2021

GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Holistic 3D Human and Scene Mesh Estimation From Single View Images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DARCNN: Domain Adaptive Region-Based Convolutional Neural Network for Unsupervised Instance Segmentation in Biomedical Images.

Joy Hsu

Wah Chiu

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation.

Julia Gong

F. Christopher Holsinger

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Automatic detection of hand hygiene using computer vision technology.

J. Am. Medical Informatics Assoc., 2020

Learning Hyperbolic Representations for Unsupervised 3D Segmentation.

CoRR, 2020

Medical symptom recognition from patient text: An active learning approach for long-tailed multilabel distributions.

CoRR, 2020

Rapidly Personalizing Mobile Health Treatment Policies with Limited Data.

CoRR, 2020

Using Computer Vision to Automate Hand Detection and Tracking of Surgeon Movements in Videos of Open Surgery.

Proceedings of the AMIA 2020, 2020

2019

A computer vision system for deep learning-based detection of patient mobilization activities in the ICU.

npj Digit. Medicine, 2019

Adversarial Representation Active Learning.

Ali Mottaghi

CoRR, 2019

2018

Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos.

Int. J. Comput. Vis., 2018

Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference.

CoRR, 2018

Scaling Human-Object Interaction Recognition Through Zero-Shot Learning.

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks.

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

3D Point Cloud-Based Visual Prediction of ICU Mobility Care Activities.

Proceedings of the Machine Learning for Healthcare Conference, 2018

Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos.

Proceedings of the Computer Vision - ECCV 2018, 2018

Dynamic Task Prioritization for Multitask Learning.

Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Graph Matching Networks for Fewshot 3D Action Recognition.

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

Tackling Over-pruning in Variational Autoencoders.

CoRR, 2017

Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance.

Proceedings of the Machine Learning for Health Care Conference, 2017

Learning to Learn from Noisy Web Videos.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

ITOP Dataset.

Dataset, October, 2016

Viewpoint Invariant 3D Human Pose Estimation with Recurrent Error Feedback.

CoRR, 2016

Towards Viewpoint Invariant 3D Human Pose Estimation.

Proceedings of the Computer Vision - ECCV 2016, 2016

End-to-End Learning of Action Detection from Frame Glimpses in Videos.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Vision-Based Hand Hygiene Monitoring in Hospitals.

Proceedings of the AMIA 2016, 2016

2014

VideoSET: Video Summary Evaluation through Text.
