Tae Hyun Oh

Orcid: 0000-0003-0468-1571

According to our database1, Tae Hyun Oh authored at least 122 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A unified framework for unsupervised action learning via global-to-local motion transformer.
Pattern Recognit., 2025

2024
The devil in the details: simple and effective optical flow synthetic data generation.
Vis. Comput., December, 2024

Factorized Multi-Resolution HashGrid for Efficient Neural Radiance Fields: Execution on Edge-Devices.
IEEE Robotics Autom. Lett., November, 2024

An Iterative Method for Unsupervised Robust Anomaly Detection Under Data Contamination.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation.
IEEE Robotics Autom. Lett., July, 2024

Multi-stage adaptive rank statistic pruning for lightweight human 3D mesh recovery model.
Vis. Comput., February, 2024

ENInst: Enhancing weakly-supervised low-shot instance segmentation.
Pattern Recognit., January, 2024

AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models.
CoRR, 2024

VLM's Eye Examination: Instruct and Inspect Visual Competency of Vision Language Models.
CoRR, 2024

MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation.
CoRR, 2024

MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models.
CoRR, 2024

Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment.
CoRR, 2024

Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert.
CoRR, 2024

MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset.
CoRR, 2024

Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild.
CoRR, 2024

Revisiting Learning-based Video Motion Magnification for Real-time Processing.
CoRR, 2024

Overcoming Client Data Deficiency in Federated Learning by Exploiting Unlabeled Data on the Server.
IEEE Access, 2024

Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data.
IEEE Access, 2024

LaughTalk: Expressive 3D Talking Head Generation with Laughter.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

An Efficient and Effective Sea Turtle Detection Using Positioning Enhancement Module.
Proceedings of the International Workshop on Intelligent Systems, 2024

CAS: A Probability-Based Approach for Universal Condition Alignment Score.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning-based Axial Video Motion Magnification.
Proceedings of the Computer Vision - ECCV 2024, 2024

Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Joint Video Super-Resolution and Frame Interpolation via Permutation Invariance.
Sensors, March, 2023

Learning-based Axial Motion Magnification.
CoRR, 2023

A Large-Scale 3D Face Mesh Video Dataset via Neural Re-parameterized Optimization.
CoRR, 2023

The Devil in the Details: Simple and Effective Optical Flow Synthetic Data Generation.
CoRR, 2023

Exploiting Synthetic Data for Data Imbalance Problems: Baselines from a Data Perspective.
CoRR, 2023

Computational Discovery of Microstructured Composites with Optimal Strength-Toughness Trade-Offs.
CoRR, 2023

Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Learning Few-shot Segmentation from Bounding Box Annotations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Mask-KLT: Sub-pixel Accurate Directional Motion Estimation by Stripe Masking.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

DFlow: Learning to Synthesize Better Optical Flow Datasets via a Differentiable Pipeline.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Enhancing Classification Accuracy on Limited Data via Unconditional GAN.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Sound Source Localization is All about Cross-Modal Alignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scratching Visual Transformer's Back with Uniform Attention.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Pre-Training for Data-Efficient Text-to-Speech on Low Resource Languages.
Proceedings of the IEEE International Conference on Acoustics, 2023

Prefix Tuning for Automated Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2023

FPGA-Based Accelerator for Rank-Enhanced and Highly-Pruned Block-Circulant Neural Networks.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Lightweight Speaker Recognition in Poincaré Spaces.
IEEE Signal Process. Lett., 2022

Dense Relational Image Captioning via Multi-Task Triple-Stream Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Robust and Efficient Estimation of Relative Pose for Cameras on Selfie Sticks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes.
CoRR, 2022

Audio-Visual Fusion Layers for Event Type Aware Video Recognition.
CoRR, 2022

FedPara: Low-rank Hadamard Product for Communication-Efficient Federated Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes.
Proceedings of the Computer Vision - ECCV 2022, 2022

HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields.
Proceedings of the Computer Vision - ECCV 2022, 2022

Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

FICGAN: Facial Identity Controllable GAN for De-identification.
CoRR, 2021

FedPara: Low-rank Hadamard Product Parameterization for Efficient Federated Learning.
CoRR, 2021

Supervoxel Attention Graphs for Long-Range Video Modeling.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

CDS: Cross-Domain Self-supervised Pre-training.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Distilling Global and Local Logits with Densely Connected Relations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MDARTS: Multi-objective Differentiable Neural Architecture Search.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Monocular Reconstruction of Neural Face Reflectance Fields.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unified 3D Mesh Recovery of Humans and Animals by Learning Animal Exercise.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Globally Optimal Inlier Set Maximization for Atlanta World Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Cross-domain Self-supervised Learning for Domain Adaptation with Few Source Labels.
CoRR, 2020

Linear RGB-D SLAM for Atlanta World.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Globally Optimal Relative Pose Estimation for Camera on a Selfie Stick.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Listen to Look: Action Recognition by Previewing Audio.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Gradient-Based Camera Exposure Control for Outdoor Mobile Platforms.
IEEE Trans. Circuits Syst. Video Technol., 2019

High-Fidelity Depth Upsampling Using the Self-Learning Framework.
Sensors, 2019

Robust and Globally Optimal Manhattan Frame Estimation in Near Real Time.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Neural Inverse Knitting: From Images to Manufacturing Instructions.
CoRR, 2019

Neural Inverse Knitting: From Images to Manufacturing Instructions.
Proceedings of the 36th International Conference on Machine Learning, 2019

Noise-tolerant Audio-visual Online Person Verification Using an Attention-based Neural Network Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2019

Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Speech2Face: Learning the Face Behind a Voice.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Variational Prototyping-Encoder: One-Shot Learning With Prototypical Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Visuomotor Understanding for Representation Learning of Driving Scenes.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Semantic soft segmentation.
ACM Trans. Graph., 2018

A Closed-Form Solution to Rotation Estimation for Structure from Small Motion.
IEEE Signal Process. Lett., 2018

Fast Randomized Singular Value Thresholding for Low-Rank Optimization.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Contextually Customized Video Summaries Via Natural Language.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Disjoint Multi-task Learning Between Heterogeneous Human-Centric Tasks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Learning-Based Video Motion Magnification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Part-Based Player Identification Using Deep Convolutional Representation and Multi-Scale Pooling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

On Learning Association of Sound Source and Visual Scenes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Learning to Localize Sound Source in Visual Scenes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Globally Optimal Inlier Set Maximization for Atlanta Frame Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

On Learning Associations of Faces and Voices.
Proceedings of the Computer Vision - ACCV 2018, 2018

Co-Domain Embedding Using Deep Quadruplet Networks for Unseen Traffic Sign Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Textually Customized Video Summaries.
CoRR, 2017

Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Partial Sum Minimization of Singular Values in Robust PCA: Algorithm and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Real-time Robust Manhattan Frame Estimation: Global Optimality and Applications.
CoRR, 2016

Human Attention Estimation for Natural Images: An Automatic Gaze Refinement Approach.
CoRR, 2016

Human body part classification from optical flow.
Proceedings of the 13th International Conference on Ubiquitous Robots and Ambient Intelligence, 2016

A Pseudo-Bayesian Algorithm for Robust PCA.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Globally Optimal Manhattan Frame Estimation in Real-Time.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Video-Story Composition via Plot Analysis.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
An Autonomous Driving System for Unknown Environments Using a Unified Map.
IEEE Trans. Intell. Transp. Syst., 2015

Robust High Dynamic Range Imaging by Rank Minimization.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

New Design Criteria for Robust PCA and a Compliant Bayesian-Inspired Algorithm.
CoRR, 2015

Line assisted vision applications in structured environments.
Proceedings of the 12th International Conference on Ubiquitous Robots and Ambient Intelligence, 2015

Line meets as-projective-as-possible image stitching with moving DLT.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Fast randomized Singular Value Thresholding for Nuclear Norm Minimization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

A Multi-view Structured-Light System for Highly Accurate 3D Modeling.
Proceedings of the 2015 International Conference on 3D Vision, 2015

2014
A simple and real-time moving object detection invariant to cast shadow.
Proceedings of the 11th International Conference on Ubiquitous Robots and Ambient Intelligence, 2014

A fusion approach for robust visual object tracking in crowd scenes.
Proceedings of the 11th International Conference on Ubiquitous Robots and Ambient Intelligence, 2014

Cost-aware depth map estimation for Lytro camera.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Balanced optical flow refinement by bidirectional constraint.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

A Two Phase Approach for Pedestrian Detection.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Patch-based robust L1 tracker to dynamic appearance change.
Proceedings of the 10th International Conference on Ubiquitous Robots and Ambient Intelligence, 2013

L1-based photometric stereo via augmented lagrange multiplier method.
Proceedings of the 10th International Conference on Ubiquitous Robots and Ambient Intelligence, 2013

High dynamic range imaging by a rank-1 constraint.
Proceedings of the IEEE International Conference on Image Processing, 2013

Hierarchical 3D line restoration based on angular proximity in structured environments.
Proceedings of the IEEE International Conference on Image Processing, 2013

Partial Sum Minimization of Singular Values in RPCA for Low-Level Vision.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Autonomous homing based on laser-camera fusion system.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Real-time motion detection based on Discrete Cosine Transform.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

A Tensor Voting Approach for Multi-view 3D Scene Flow Estimation and Refinement.
Proceedings of the Computer Vision - ECCV 2012, 2012


  Loading...