Sangdoo Yun

Orcid: 0000-0002-0417-8450

According to our database1, Sangdoo Yun authored at least 85 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion.
Trans. Mach. Learn. Res., 2024

Targeted Cause Discovery with Data-Driven Learning.
CoRR, 2024

Direct Unlearning Optimization for Robust and Safe Text-to-Image Models.
CoRR, 2024

Unveiling Disparities in Web Task Handling Between Human and Web Agent.
CoRR, 2024

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts.
CoRR, 2024

Rotary Position Embedding for Vision Transformer.
CoRR, 2024

Toward Interactive Regional Understanding in Vision-Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Compressed Context Memory for Online Language Model Interaction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Model Stock: All We Need Is Just a Few Fine-Tuned Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Language-only Efficient Training of Zero-shot Composed Image Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Calibrating Large Language Models Using Their Generations Only.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Who Wrote this Code? Watermarking for Code Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Match me if you can: Semantic Correspondence Learning with Unpaired Images.
CoRR, 2023

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models.
CoRR, 2023

Augmenting Sub-model to Improve Main Model.
CoRR, 2023

Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models.
CoRR, 2023

RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models.
CoRR, 2023

Neural Relation Graph for Identifying Problematic Data.
CoRR, 2023

Observations on K-Image Expansion of Image-Mixing Augmentation.
IEEE Access, 2023

Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ProPILE: Probing Privacy Leakage in Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

What Do Self-Supervised Vision Transformers Learn?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Exploring Temporally Dynamic Data Augmentation for Video Recognition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Neglected Free Lunch - Learning Image Classifiers Using Annotation Byproducts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MPCHAT: Towards Multimodal Persona-Grounded Conversation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Group Generalized Mean Pooling for Vision Transformer.
CoRR, 2022

How Much a Model be Trained by Passive Learning Before Active Learning?
IEEE Access, 2022

A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Dataset Condensation with Contrastive Signals.
Proceedings of the International Conference on Machine Learning, 2022

Dataset Condensation via Efficient Synthetic-Data Parameterization.
Proceedings of the International Conference on Machine Learning, 2022

Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective.
Proceedings of the Tenth International Conference on Learning Representations, 2022

OCR-Free Document Understanding Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Majority Can Help the Minority: Context-rich Minority Oversampling for Long-tailed Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Hypergraph-Induced Semantic Tuplet Loss for Deep Metric Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly Supervised Semantic Segmentation using Out-of-Distribution Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Region-based dropout with attention prior for weakly supervised object localization.
Pattern Recognit., 2021

Donut: Document Understanding Transformer without OCR.
CoRR, 2021

Observations on K-image Expansion of Image-Mixing Augmentation for Classification.
CoRR, 2021

Detecting and Removing Text in the Wild.
IEEE Access, 2021

Progressive Transmission and Inference of Deep Learning Models.
Proceedings of the 20th IEEE International Conference on Machine Learning and Applications, 2021

AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights.
Proceedings of the 9th International Conference on Learning Representations, 2021

Normalization Matters in Weakly Supervised Object Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking Spatial Dimensions of Vision Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Re-Labeling ImageNet: From Single to Multi-Labels, From Global to Localized Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking Channel Dimensions for Efficient Model Design.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
VideoMix: Rethinking Data Augmentation for Video Classification.
CoRR, 2020

ReXNet: Diminishing Representational Bottleneck on Convolutional Neural Network.
CoRR, 2020

Slowing Down the Weight Norm Increase in Momentum-based Optimizers.
CoRR, 2020

An Empirical Evaluation on Robustness and Uncertainty of Regularization Methods.
CoRR, 2020

Learning De-biased Representations with Biased Representations.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Neural Approximation of an Auto-Regressive Process through Confidence Guided Sampling.
CoRR, 2019

EXTD: Extremely Tiny Face Detector via Iterative Filter Reuse.
CoRR, 2019

CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Comprehensive Overhaul of Feature Distillation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Character Region Awareness for Text Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Knowledge Distillation with Adversarial Samples Supporting Decision Boundary.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Action-Driven Visual Object Tracking With Deep Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., 2018

Concentrated-Comprehensive Convolutions for lightweight semantic segmentation.
CoRR, 2018

Selective Ensemble Network for Accurate Crowd Density Estimation.
Proceedings of the 24th International Conference on Pattern Recognition, 2018


Unsupervised Holistic Image Generation from Key Local Patches.
Proceedings of the Computer Vision - ECCV 2018, 2018

The Sixth Visual Object Tracking VOT2018 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Context-Aware Deep Feature Compression for High-Speed Visual Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Butterfly Effect: Bidirectional Control of Classification Performance by Small Additive Perturbation.
CoRR, 2017

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Attentional Correlation Filter Network for Adaptive Visual Tracking.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

PaletteNet: Image Recolorization with Given Color Palette.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Voting-based 3D object cuboid detection robust to partial occlusion from RGB-D images.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Attention-inspired moving object detection in monocular dashcam videos.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Density-Aware Pedestrian Proposal Networks for Robust People Detection in Crowded Scenes.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Visual Path Prediction in Complex Scenes with Crowded Moving Objects.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Category Attentional Search for Fast Object Detection by Mimicking Human Visual Perception.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

2014
Self-Organizing Cascaded Structure of Deformable Part Models for Fast Object Detection.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Visual surveillance briefing system: Event-based video retrieval and summarization.
Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

Multi-task learning with over-sampled time-series representation of a trajectory for traffic motion pattern recognition.
Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

2012
Multiple ground plane estimation for 3D scene understanding using a monocular camera.
Proceedings of the Image and Vision Computing New Zealand, 2012


  Loading...