Ser-Nam Lim

Affiliations:
  • Facebook AI, New York, NY, USA
  • Avitas Systems, GE Venture, Boston, MA, USA
  • GE Global Research, Niskayuna, NY, USA
  • Cognex Corp., Natick, MA, USA
  • University of Maryland, College Park, MD, USA (PhD 2006)


According to our database1, Ser-Nam Lim authored at least 157 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models.
CoRR, 2024

Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning.
CoRR, 2024

DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks.
CoRR, 2024

AirSketch: Generative Motion to Sketch.
CoRR, 2024

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence.
CoRR, 2024

Distilling Vision-Language Pretraining for Efficient Cross-Modal Retrieval.
CoRR, 2024

Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models.
CoRR, 2024

Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval.
CoRR, 2024

Mitigating Dialogue Hallucination for Large Multi-modal Models via Adversarial Instruction Tuning.
CoRR, 2024

Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions.
CoRR, 2024

FSViewFusion: Few-Shots View Generation of Novel Objects.
CoRR, 2024

Video Decomposition Prior: Editing Videos Layer by Layer.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Language-Free Compositional Action Generation via Decoupling Refinement.
Proceedings of the IEEE International Conference on Acoustics, 2024

uCAP: An Unsupervised Prompting Method for Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Fast Encoding and Decoding for Implicit Video Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Object Recognition as Next Token Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Composing Object Relations and Attributes for Image-Text Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

UniMODE: Unified Monocular 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Visual Delta Generator with Large Multi-Modal Models for Semi-Supervised Composed Image Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

UVIS: Unsupervised Video Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Few-Shot Object Detection with Foundation Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On the Robustness of Large Multimodal Models Against Image Adversarial Attacks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Universal Pyramid Adversarial Training for Improved ViT Performance.
CoRR, 2023

CLAMP: Contrastive LAnguage Model Prompt-tuning.
CoRR, 2023

Label Delay in Continual Learning.
CoRR, 2023

From Categories to Classifier: Name-Only Continual Learning by Exploring the Web.
CoRR, 2023

Riemannian Residual Neural Networks.
CoRR, 2023

Language-free Compositional Action Generation via Decoupling Refinement.
CoRR, 2023

LASER: Neuro-Symbolic Learning of Semantic Video Representations.
CoRR, 2023

VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection.
CoRR, 2023

ScribbleSeg: Scribble-based Interactive Image Segmentation.
CoRR, 2023

Distribution Normalization: An "Effortless" Test-Time Augmentation for Contrastively Learned Visual-language Models.
CoRR, 2023

Online Backfilling with No Regret for Large-Scale Image Retrieval.
CoRR, 2023

Test-Time Distribution Normalization for Contrastively Learned Visual-language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Riemannian Residual Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Graph Inductive Biases in Transformers without Message Passing.
Proceedings of the International Conference on Machine Learning, 2023

Raising the Bar on the Evaluation of Out-of-Distribution Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BT<sup>2</sup>: Backward-compatible Training with Basis Transformation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Open-vocabulary Panoptic Segmentation with Embedding Modulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Computationally Budgeted Continual Learning: What Does Matter?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TIPI: Test Time Adaptation with Transformation Invariance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

HNeRV: A Hybrid Neural Representation for Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Detecting Everything in the Open World: Towards Universal Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Scalable Neural Representation for Diverse Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unifying the Harmonic Analysis of Adversarial Attacks and Robustness.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Sample-Dependent Adaptive Temperature Scaling for Improved Calibration.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
PyTorch Adapt.
CoRR, 2022

A Unified Model for Tracking and Image-Video Detection Has More Power.
CoRR, 2022

Benchmarking Validation Methods for Unsupervised Domain Adaptation.
CoRR, 2022

RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness.
CoRR, 2022

VRAG: Region Attention Graphs for Content-Based Video Retrieval.
CoRR, 2022

Task-Agnostic Robust Representation Learning.
CoRR, 2022

GAPX: Generalized Autoregressive Paraphrase-Identification X.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Few-Shot Fast-Adaptive Anomaly Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Spartan: Differentiable Sparsity via Regularized Transportation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Using Mixup as a Regularizer Can Surprisingly Improve Accuracy & Out-of-Distribution Robustness.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FedSR: A Simple and Effective Domain Generalization Method for Federated Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Object-Centric Unsupervised Image Captioning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Totems: Physical Objects for Verifying Visual Integrity.
Proceedings of the Computer Vision - ECCV 2022, 2022

Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions.
Proceedings of the Computer Vision - ECCV 2022, 2022

Visual Prompt Tuning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Diversified Dynamic Routing for Vision Tasks.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

ObjectFormer for Image Manipulation Detection and Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

AdaViT: Adaptive Vision Transformers for Efficient Image Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Image Recolorization for Creative Domains.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

CNeRV: Content-adaptive Neural Representation for Visual Data.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Rethinking Nearest Neighbors for Visual Classification.
CoRR, 2021

Unsupervised Domain Adaptation: A Reality Check.
CoRR, 2021

A Frequency Perspective of Adversarial Robustness.
CoRR, 2021

Learning to Ground Multi-Agent Communication with Autoencoders.
CoRR, 2021

MixNorm: Test-Time Adaptation Through Online Normalization Estimation.
CoRR, 2021

Self-appearance-aided Differential Evolution for Motion Transfer.
CoRR, 2021

Edge Proposal Sets for Link Prediction.
CoRR, 2021

Multimodal Fusion Refiner Networks.
CoRR, 2021

New Benchmarks for Learning on Non-Homophilous Graphs.
CoRR, 2021

THAT: Two Head Adversarial Training for Improving Robustness at Scale.
CoRR, 2021

A Continuous Mapping For Augmentation Design.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning to Ground Multi-Agent Communication with Autoencoders.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Equivariant Manifold Flows.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

NeRV: Neural Representations for Videos.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Combining Label Propagation and Simple Models out-performs Graph Neural Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Analyzing and Mitigating JPEG Compression Defects in Deep Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Joint Audio-Visual Deepfake Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Robustness and Generalization via Generative Adversarial Training.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Exploring Visual Engagement Signals for Representation Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

When in Doubt: Improving Classification Performance with Alternating Normalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Cross-Modal Retrieval Augmentation for Multi-Modal Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

On Feature Normalization and Data Augmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Intentonomy: A Dataset and Study Towards Human Intent Understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Efficient Object Embedding for Spliced Image Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

GTA: Global Temporal Attention for Video Action Understanding.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Deep Video Inpainting Detection.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Analyzing and Mitigating Compression Defects in Deep Learning.
CoRR, 2020

PyTorch Metric Learning.
CoRR, 2020

MiCo: Mixup Co-Training for Semi-Supervised Domain Adaptation.
CoRR, 2020

Set-Structured Latent Representations.
CoRR, 2020

Deep Multi-Modal Sets.
CoRR, 2020

Detecting Deep-Fake Videos from Appearance and Behavior.
Proceedings of the 12th IEEE International Workshop on Information Forensics and Security, 2020

Neural Manifold Ordinary Differential Equations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Better Set Representations For Relational Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Differentiating through the Fréchet Mean.
Proceedings of the 37th International Conference on Machine Learning, 2020

Curriculum Manager for Source Selection in Multi-source Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Metric Learning Reality Check.
Proceedings of the Computer Vision - ECCV 2020, 2020

Quantization Guided JPEG Artifact Correction.
Proceedings of the Computer Vision - ECCV 2020, 2020

What Makes Fake Images Detectable? Understanding Properties that Generalize.
Proceedings of the Computer Vision - ECCV 2020, 2020

One-Shot Domain Adaptation for Face Generation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Generate, Segment, and Refine: Towards Generic Manipulation Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Measuring Dataset Granularity.
CoRR, 2019

Unconstrained Facial Expression Transfer using Style-based Generator.
CoRR, 2019

Fine-grained Synthesis of Unrestricted Adversarial Examples.
CoRR, 2019

Unsupervised Deep Metric Learning via Auxiliary Rotation Loss.
CoRR, 2019

An Analysis of Object Embeddings for Image Retrieval.
CoRR, 2019

Cross-X Learning for Fine-Grained Visual Categorization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Enhancing Adversarial Example Transferability With an Intermediate Level Attack.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Adversarial Example Decomposition.
CoRR, 2018

Intermediate Level Adversarial Attack for Enhanced Transferability.
CoRR, 2018

Explain Black-box Image Classifications Using Superpixel-based Interpretation.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

DCAN: Dual Channel-Wise Alignment Networks for Unsupervised Scene Adaptation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning From Synthetic Data: Addressing Domain Shift for Semantic Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Regularizing Deep Networks Using Efficient Layerwise Adversarial Training.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Unsupervised Domain Adaptation for Semantic Segmentation with GANs.
CoRR, 2017

Guided Perturbations: Self-Corrective Behavior in Convolutional Neural Networks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Adaptive RNN Tree for Large-Scale Human Action Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2017

A Reinforcement Learning Approach to the View Planning Problem.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
A Reinforcement Learning Approach to Sensor Planning for 3D Models.
CoRR, 2016

Tooth guard: A vision system for detecting missing tooth in rope mine shovel.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Multiscale fully convolutional network with application to industrial inspection.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Visual tracking based on object appearance and structure preserved local patches matching.
Proceedings of the 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2016

2013
Automatic Registration of Smooth Object Image to 3D CAD Model for Industrial Inspection Applications.
Proceedings of the 2013 International Conference on 3D Vision, 2013

2012
Simultaneous image segmentation and 3D plane fitting for RGB-D sensors - An iterative framework.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011
Multi-class Object Layout with Unsupervised Image Classification and Object Localization.
Proceedings of the Advances in Visual Computing - 7th International Symposium, 2011

Automatic surveillance video matting using a shape prior.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

2010
Group Level Activity Recognition in Crowded Environments across Multiple Cameras.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

2009
Monitoring, recognizing and discovering social networks.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Collaborative Control of Active Cameras in Large-Scale Surveillance.
Proceedings of the Multi-Camera Networks, 2009

2007
An Ease-of-Use Stereo-Based Particle Filter for Tracking Under Occlusion.
Proceedings of the Human Motion, 2007

Task Scheduling in Large Camera Networks.
Proceedings of the Computer Vision, 2007

2006
Sensor, Motion and Temporal Planning.
PhD thesis, 2006

Constructing task visibility intervals for video surveillance.
Multim. Syst., 2006

2005
Constructing task visibility intervals for a surveillance system.
Proceedings of the Third ACM International Workshop on Video Surveillance & Sensor Networks, 2005

Fast Illumination-Invariant Background Subtraction Using Two Views: Error Analysis, Sensor Placement and Applications.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Multi-level fast multipole method for thin plate spline evaluation.
Proceedings of the 2004 International Conference on Image Processing, 2004

Uncalibrated stereo rectification for automatic 3d surveillance.
Proceedings of the 2004 International Conference on Image Processing, 2004

2003
Image-based pan-tilt camera control in a multi-camera surveillance environment.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A Scalable Image-Based Multi-Camera Visual Surveillance System.
Proceedings of the 2003 IEEE Conference on Advanced Video and Signal Based Surveillance (AVSS 2003), 2003


  Loading...