We stand with Ukraine

We stand with Ukraine

Sangdoo Yun

Orcid: 0000-0002-0417-8450

According to our database¹, Sangdoo Yun authored at least 92 papers between 2012 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion.

[BibT_eX]

[DOI]

,

,

,

,

,

Trans. Mach. Learn. Res., 2024

Code-Switching Curriculum Learning for Multilingual Transfer in LLMs.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Probabilistic Language-Image Pre-Training.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Versatile Motion Language Models for Multi-Turn Interactive Agents.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Targeted Cause Discovery with Data-Driven Learning.

[BibT_eX]

[DOI]

,

Claudia Skok Gibbs

,

,

,

CoRR, 2024

Unveiling Disparities in Web Task Handling Between Human and Web Agent.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Direct Unlearning Optimization for Robust and Safe Text-to-Image Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Towards Calibrated Robust Fine-Tuning of Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Alexander Hauptmann

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Toward Interactive Regional Understanding in Vision-Large Language Models.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Compressed Context Memory for Online Language Model Interaction.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Model Stock: All We Need Is Just a Few Fine-Tuned Models.

[BibT_eX]

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Rotary Position Embedding for Vision Transformer.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Language-only Efficient Training of Zero-shot Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Calibrating Large Language Models Using Their Generations Only.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Who Wrote this Code? Watermarking for Code Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Match Me If You Can: Semi-supervised Semantic Correspondence Learning with Unpaired Images.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ACCV 2024, 2024

2023

Match me if you can: Semantic Correspondence Learning with Unpaired Images.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Augmenting Sub-model to Improve Main Model.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2023

RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Neural Relation Graph for Identifying Problematic Data.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Observations on K-Image Expansion of Image-Mixing Augmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Access, 2023

Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ProPILE: Probing Privacy Leakage in Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

What Do Self-Supervised Vision Transformers Learn?

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Exploring Temporally Dynamic Data Augmentation for Video Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Neglected Free Lunch - Learning Image Classifiers Using Annotation Byproducts.

[BibT_eX]

[DOI]

,

,

Seonghyeok Chun

,

John Joon Young Chung

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MPCHAT: Towards Multimodal Persona-Grounded Conversation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Group Generalized Mean Pooling for Vision Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2022

How Much a Model be Trained by Passive Learning Before Active Learning?

[BibT_eX]

[DOI]

,

,

IEEE Access, 2022

A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Dataset Condensation with Contrastive Signals.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Dataset Condensation via Efficient Synthetic-Data Parameterization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

OCR-Free Document Understanding Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

The Majority Can Help the Minority: Context-rich Minority Oversampling for Long-tailed Classification.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Hypergraph-Induced Semantic Tuplet Loss for Deep Metric Learning.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly Supervised Semantic Segmentation using Out-of-Distribution Data.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Region-based dropout with attention prior for weakly supervised object localization.

[BibT_eX]

[DOI]

,

,

,

,

,

Pattern Recognit., 2021

Donut: Document Understanding Transformer without OCR.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2021

Observations on K-image Expansion of Image-Mixing Augmentation for Classification.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2021

Detecting and Removing Text in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Access, 2021

Progressive Transmission and Inference of Deep Learning Models.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Normalization Matters in Weakly Supervised Object Localization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking Spatial Dimensions of Vision Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Re-Labeling ImageNet: From Single to Multi-Labels, From Global to Localized Labels.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking Channel Dimensions for Efficient Model Design.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

VideoMix: Rethinking Data Augmentation for Video Classification.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2020

ReXNet: Diminishing Representational Bottleneck on Convolutional Neural Network.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Slowing Down the Weight Norm Increase in Momentum-based Optimizers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2020

An Empirical Evaluation on Robustness and Uncertainty of Regularization Methods.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2020

Learning De-biased Representations with Biased Representations.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Neural Approximation of an Auto-Regressive Process through Confidence Guided Sampling.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2019

EXTD: Extremely Tiny Face Detector via Iterative Filter Reuse.

[BibT_eX]

[DOI]

,

,

CoRR, 2019

CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Comprehensive Overhaul of Feature Distillation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Character Region Awareness for Text Detection.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Knowledge Distillation with Adversarial Samples Supporting Decision Boundary.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Action-Driven Visual Object Tracking With Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Neural Networks Learn. Syst., 2018

Concentrated-Comprehensive Convolutions for lightweight semantic segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2018

Selective Ensemble Network for Accurate Crowd Density Estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 24th International Conference on Pattern Recognition, 2018

VisDrone-SOT2018: The Vision Meets Drone Single-Object Tracking Challenge Results.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Asanka G. Perera

,

,

,

,

,

Emmanouil Michail

,

,

,

,

Ioannis Kompatsiaris

,

,

,

,

,

,

,

,

,

,

,

,

,

Konstantinos Avgerinakis

,

,

,

,

Panagiotis Giannakeris

,

,

,

,

,

,

Robert Laganière

,

,

,

,

,

Stefanos Vrochidis

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Unsupervised Holistic Image Generation from Key Local Patches.

[BibT_eX]

[DOI]

,

,

,

,

Ming-Hsuan Yang

,

Proceedings of the Computer Vision - ECCV 2018, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.

[BibT_eX]

[DOI]

,

,

,

Michael Felsberg

,

Roman P. Pflugfelder

,

Luka Cehovin Zajc

,

,

,

,

Abdelrahman Eldesokey

,

Gustavo Fernández

,

Álvaro García-Martín

,

Álvaro Iglesias-Arias

,

A. Aydin Alatan

,

Abel González-García

,

Alfredo Petrosino

,

Alireza Memarmoghadam

,

,

,

,

Arnold W. M. Smeulders

,

Asanka G. Perera

,

,

,

,

,

Changzhen Xiong

,

,

,

,

,

,

,

,

,

,

Efstratios Gavves

,

,

Erik Velasco-Salido

,

Fahad Shahbaz Khan

,

,

,

,

Francesco Battistone

,

,

Gorthi R. K. Sai Subrahmanyam

,

Guilherme Sousa Bastos

,

,

Hamed Kiani Galoogahi

,

,

,

,

,

,

Horst Possegger

,

,

,

,

,

,

Hyung Jin Chang

,

Isabela Drummond

,

,

Jaime Spencer Martin

,

Javaan Singh Chahl

,

,

,

,

,

,

Joakim Johnander

,

João F. Henriques

,

,

Joost van de Weijer

,

Jorge Rodríguez Herranz

,

José M. Martínez

,

,

,

,

,

,

,

,

,

,

Luca Bertinetto

,

,

,

Mario Edoardo Maresca

,

Martin Danelljan

,

Ming-Hsuan Yang

,

Mohamed H. Abdelpakey

,

Mohamed S. Shehata

,

,

,

,

,

,

Pablo Vicente-Moñivar

,

,

,

Philip H. S. Torr

,

Priya Mariam Raju

,

,

,

,

,

Rafael Martin Nieto

,

Rama Krishna Sai Subrahmanyam Gorthi

,

,

,

Richard M. Everson

,

,

,

,

,

,

Shuangping Huang

,

,

,

,

Stuart Golodetz

,

,

,

,

,

Vincenzo Santopietro

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Yiannis Demiris

,

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Context-Aware Deep Feature Compression for High-Speed Visual Tracking.

[BibT_eX]

[DOI]

,

Hyung Jin Chang

,

,

,

,

,

Yiannis Demiris

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Butterfly Effect: Bidirectional Control of Classification Performance by Small Additive Perturbation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2017

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold.

[BibT_eX]

[DOI]

,

,

Hyung Jin Chang

,

Yiannis Demiris

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Attentional Correlation Filter Network for Adaptive Visual Tracking.

[BibT_eX]

[DOI]

,

Hyung Jin Chang

,

,

,

Yiannis Demiris

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

PaletteNet: Image Recolorization with Given Color Palette.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016

Voting-based 3D object cuboid detection robust to partial occlusion from RGB-D images.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Attention-inspired moving object detection in monocular dashcam videos.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Density-Aware Pedestrian Proposal Networks for Robust People Detection in Crowded Scenes.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Visual Path Prediction in Complex Scenes with Crowded Moving Objects.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Category Attentional Search for Fast Object Detection by Mimicking Human Visual Perception.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

2014

Self-Organizing Cascaded Structure of Deformable Part Models for Fast Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Visual surveillance briefing system: Event-based video retrieval and summarization.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

Multi-task learning with over-sampled time-series representation of a trajectory for traffic motion pattern recognition.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

2012

Multiple ground plane estimation for 3D scene understanding using a monocular camera.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Image and Vision Computing New Zealand, 2012

Loading...