Wonjae Kim

Orcid: 0000-0002-6616-7685

According to our database1, Wonjae Kim authored at least 31 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion.
Trans. Mach. Learn. Res., 2024

Probabilistic Language-Image Pre-Training.
CoRR, 2024

Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval.
CoRR, 2024

STELLA: Continual Audio-Video Pre-training with SpatioTemporal Localized Alignment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts.
Proceedings of the Computer Vision - ECCV 2024, 2024

Language-only Efficient Training of Zero-shot Composed Image Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment.
CoRR, 2023

Computational Approaches for App-to-App Retrieval and Design Consistency Check.
CoRR, 2023

Unified Chest X-ray and Radiology Report Generation Model with Multi-view Chest X-rays.
CoRR, 2023

What Do Self-Supervised Vision Transformers Learn?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Pivotal Role of Language Modeling in Recommender Systems: Enriching Task-specific and Task-agnostic Representation Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Discrete Infomax Codes for Supervised Representation Learning.
Entropy, 2022

Group Generalized Mean Pooling for Vision Transformer.
CoRR, 2022

Pivotal Role of Language Modeling in Recommender Systems: Enriching Task-specific and Task-agnostic Representation Learning.
CoRR, 2022

An Extendable, Efficient and Effective Transformer-based Object Detector.
CoRR, 2022

ViDT: An Efficient and Effective Fully Transformer-based Object Detector.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO.
Proceedings of the Computer Vision - ECCV 2022, 2022

Speeding up Inference with User Simulators throughPolicy Modulation.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Correlation between Alignment-Uniformity and Performance of Dense Contrastive Representations.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Conditional Generation of Periodic Signals with Fourier-Based Decoder.
CoRR, 2021

ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Diversified Mutual Learning for Deep Metric Learning.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

2019
Discrete Infomax Codes for Meta-Learning.
CoRR, 2019

Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Bluff Forwarding: A Practical Protocol for Delivering Refreshed Symmetric Keys on a Multi-Path Big Data Ingestion System.
IEEE Access, 2018

A Platform for Choreography of Heterogeneous Healthcare Services.
Proceedings of the 12th ACM International Conference on Distributed and Event-based Systems, 2018

2017
Decision Matrix Analysis of Impact Sounding Test Method to Determine Interlayer Condition of Concrete Bridge Deck.
J. Sensors, 2017

ChartSense: Interactive Data Extraction from Chart Images.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

SwiftTuna: Responsive and incremental visual exploration of large-scale multidimensional data.
Proceedings of the 2017 IEEE Pacific Visualization Symposium, 2017

2011
Electrical properties of CVD-graphene FETs.
Proceedings of the 2011 NORCHIP, Lund, Sweden, November 14-15, 2011, 2011


  Loading...