Serena Yeung-Levy

Orcid: 0000-0003-0529-0628

Affiliations:
  • Stanford University, CA, USA


According to our database1, Serena Yeung-Levy authored at least 81 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Hyperbolic Deep Learning in Computer Vision: A Survey.
Int. J. Comput. Vis., September, 2024

Revisiting Active Learning in the Era of Vision Foundation Models.
Trans. Mach. Learn. Res., 2024

Zero-shot Action Localization via the Confidence of Large Vision-Language Models.
CoRR, 2024

Ask, Pose, Unite: Scaling Data Acquisition for Close Interactions with Vision Language Models.
CoRR, 2024

How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities.
CoRR, 2024

Continuous Perception Benchmark.
CoRR, 2024

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision.
CoRR, 2024

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding.
CoRR, 2024

Why are Visually-Grounded Language Models Bad at Image Classification?
CoRR, 2024

Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models.
CoRR, 2024

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging.
CoRR, 2024

Multi-Human Mesh Recovery with Transformers.
CoRR, 2024

AdaEmbed: Semi-supervised Domain Adaptation in the Embedding Space.
CoRR, 2024

Single-View 3D Human Digitalization with Large Reconstruction Models.
CoRR, 2024

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

VideoAgent: Long-Form Video Understanding with Large Language Model as Agent.
Proceedings of the Computer Vision - ECCV 2024, 2024

Depth-Guided NeRF Training via Earth Mover's Distance.
Proceedings of the Computer Vision - ECCV 2024, 2024

Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Describing Differences in Image Sets with Natural Language.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains.
Proceedings of the International Conference on 3D Vision, 2024

2023
Self-supervised learning for medical image classification: a systematic review and implementation guidelines.
npj Digit. Medicine, 2023

Author Correction: Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.
npj Digit. Medicine, 2023

Open World Object Detection in the Era of Foundation Models.
CoRR, 2023

INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis.
CoRR, 2023

Viewpoint Textual Inversion: Unleashing Novel View Synthesis with Pretrained 2D Diffusion Models.
CoRR, 2023

ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image.
CoRR, 2023

Diffusion-HPC: Generating Synthetic Images with Realistic Humans.
CoRR, 2023

LOVM: Language-Only Vision Model Selection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


INSPECT: A Multimodal Dataset for Patient Outcome Prediction of Pulmonary Embolisms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Semi-supervised Detection of Hands in Diverse Open Surgery Environments.
Proceedings of the Machine Learning for Healthcare Conference, 2023

Video pretraining advances 3D deep learning on chest CT tasks.
Proceedings of the Medical Imaging with Deep Learning, 2023

Diagnosing and Rectifying Vision Models using Language.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generalizable Neural Fields as Partially Observed Neural Processes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PROB: Probabilistic Objectness for Open World Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.
npj Digit. Medicine, 2022

DataPerf: Benchmarks for Data-Centric AI Development.
CoRR, 2022

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ALGES: Active Learning with Gradient Embeddings for Semantic Segmentation of Laparoscopic Surgical Images.
Proceedings of the Machine Learning for Healthcare Conference, 2022

Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation.
Proceedings of the Machine Learning for Health, 2022

Adaptation of Surgical Activity Recognition Models Across Operating Rooms.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Domain Adaptive 3D Pose Augmentation for In-the-Wild Human Mesh Recovery.
Proceedings of the International Conference on 3D Vision, 2022

2021
Deep learning-enabled medical computer vision.
npj Digit. Medicine, 2021

A real-time spatiotemporal AI model analyzes skill in open surgical videos.
CoRR, 2021

Staying in shape: learning invariant shape representations using contrastive learning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Achieving Trustworthy Biomedical Data Solutions.
Proceedings of the Biocomputing 2021: Proceedings of the Pacific Symposium, 2021

Capturing implicit hierarchical structure in 3D biomedical images with self-supervised hyperbolic representations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Personalized Federated Learning with First Order Model Optimization.
Proceedings of the 9th International Conference on Learning Representations, 2021

GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Holistic 3D Human and Scene Mesh Estimation From Single View Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DARCNN: Domain Adaptive Region-Based Convolutional Neural Network for Unsupervised Instance Segmentation in Biomedical Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Automatic detection of hand hygiene using computer vision technology.
J. Am. Medical Informatics Assoc., 2020

Learning Hyperbolic Representations for Unsupervised 3D Segmentation.
CoRR, 2020

Medical symptom recognition from patient text: An active learning approach for long-tailed multilabel distributions.
CoRR, 2020

Rapidly Personalizing Mobile Health Treatment Policies with Limited Data.
CoRR, 2020

Using Computer Vision to Automate Hand Detection and Tracking of Surgeon Movements in Videos of Open Surgery.
Proceedings of the AMIA 2020, 2020

2019
A computer vision system for deep learning-based detection of patient mobilization activities in the ICU.
npj Digit. Medicine, 2019

Adversarial Representation Active Learning.
CoRR, 2019

2018
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos.
Int. J. Comput. Vis., 2018

Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference.
CoRR, 2018

Scaling Human-Object Interaction Recognition Through Zero-Shot Learning.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

3D Point Cloud-Based Visual Prediction of ICU Mobility Care Activities.
Proceedings of the Machine Learning for Healthcare Conference, 2018

Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

Dynamic Task Prioritization for Multitask Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Graph Matching Networks for Fewshot 3D Action Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Tackling Over-pruning in Variational Autoencoders.
CoRR, 2017

Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance.
Proceedings of the Machine Learning for Health Care Conference, 2017

Learning to Learn from Noisy Web Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Viewpoint Invariant 3D Human Pose Estimation with Recurrent Error Feedback.
CoRR, 2016

Towards Viewpoint Invariant 3D Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2016, 2016

End-to-End Learning of Action Detection from Frame Glimpses in Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Vision-Based Hand Hygiene Monitoring in Hospitals.
Proceedings of the AMIA 2016, 2016

2014
VideoSET: Video Summary Evaluation through Text.
CoRR, 2014

2011
Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011


  Loading...