Min Sun

Orcid: 0000-0001-9598-8178

Affiliations:
  • National Tsing Hua University, Department of Electrical Engineering, Joint Research Center for AI Technology and All Vista Healthcare, Hsinchu, Taiwan
  • University of Washington, Seattle, WA, USA
  • University of Michigan, Ann Arbor, MI, USA (PhD)


According to our database, Min Sun authored at least 105 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Self-training Room Layout Estimation via Geometry-Aware Ray-Casting.
Proceedings of the Computer Vision - ECCV 2024, 2024

GenRC: Generative 3D Room Completion from Sparse Image Collections.
Proceedings of the Computer Vision - ECCV 2024, 2024

No More Ambiguity in 360° Room Layout via Bi-Layout Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VMCML: Video and Music Matching via Cross-Modality Lifting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
BiFuse++: Self-Supervised and Efficient Bi-Projection Fusion for 360° Depth Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Monocular Quasi-Dense 3D Object Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views.
CoRR, 2023

DreaMo: Articulated 3D Reconstruction From A Single Casual Video.
CoRR, 2023

Dense Prediction with Attentive Feature Aggregation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Shared Embedding of X-ray & Enose Networks for Lung Cancer Classification.
Proceedings of the 2023 8th International Conference on Biomedical Imaging, 2023

Sparse and Privacy-enhanced Representation for Human Pose Estimation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

PanoMixSwap - Panorama Mixing via Structural Swapping for Indoor Scene Understanding.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

MixFairFace: Towards Ultimate Fairness via MixFair Adapter in Face Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation.
IEEE Robotics Autom. Lett., 2022

Improved Direct Voxel Grid Optimization for Radiance Fields Reconstruction.
CoRR, 2022

360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CLA-NeRF: Category-Level Articulated Neural Radiance Field.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Data Efficient 3D Learner via Knowledge Transferred from 2D Model.
Proceedings of the Computer Vision - ECCV 2022, 2022

Autoregressive 3D Shape Generation via Canonical Mapping.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semiconductor Defect Detection by Hybrid Classical-Quantum Deep Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion.
Proceedings of the Conference on Robot Learning, 2022

2021
Meta-CPR: Generalize to Unseen Large Number of Agents with Communication Pattern Recognition Module.
CoRR, 2021

Leveraging Sequence Embedding and Convolutional Neural Network for Protein Function Prediction.
CoRR, 2021

LED2-Net: Monocular 360 Layout Estimation via Differentiable Depth Rendering.
CoRR, 2021

Learning 3D Dense Correspondence via Canonical Point Autoencoder.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Robust 360-8PA: Redesigning The Normalized 8-point Algorithm for 360-FoV Images.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LED2-Net: Monocular 360° Layout Estimation via Differentiable Depth Rendering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HoHoNet: 360 Indoor Holistic Understanding With Latent Horizontal Features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Indoor Panorama Planar 3D Reconstruction via Divide and Conquer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Toward Robust Long Range Policy Transfer.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Interactive Radiotherapy Target Delineation with 3D-Fused Context Propagation.
CoRR, 2020

LayoutMP3D: Layout Annotation of Matterport3D.
CoRR, 2020

360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Visual Question Answering on 360° Images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Mitigating Forgetting in Online Continual Learning via Instance-Aware Parameterization.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Lymph Node Gross Tumor Volume Detection in Oncology Imaging via Relationship Learning Using Graph Neural Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

360SD-Net: 360° Stereo Depth Estimation with Learnable Cost Volume.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Controllable Image Synthesis via SegVAE.
Proceedings of the Computer Vision - ECCV 2020, 2020

BiFuse: Monocular 360 Depth Estimation via Bi-Projection Fusion.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

InstaNAS: Instance-Aware Neural Architecture Search.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Flat2Layout: Flat Representation for Estimating Layout of General Room Types.
CoRR, 2019

Radiotherapy Target Contouring with Convolutional Gated Graph Neural Network.
CoRR, 2019

3D LiDAR and Stereo Fusion using Stereo Matching Network with Conditional Cost Volume Normalization.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Plug-and-Play: Improve Depth Prediction via Sparse Data Propagation.
Proceedings of the International Conference on Robotics and Automation, 2019

Point-to-Point Video Generation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Joint Monocular 3D Vehicle Detection and Tracking.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DuLa-Net: A Dual-Projection Network for Estimating Room Layouts From a Single RGB Panorama.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

HorizonNet: Learning Room Layout With 1D Representation and Pano Stretch Data Augmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Unsupervised Stylish Image Description Generation via Domain Layer Norm.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Self-Supervised Learning of Depth and Camera Motion from 360° Videos.
CoRR, 2018

DLWV2: A Deep Learning-Based Wearable Vision-System with Vibrotactile-Feedback for Visually Impaired People to Reach Objects.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Omnidirectional CNN for Visual Place Recognition and Navigation.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

PPP-Net: Platform-aware Progressive Search for Pareto-optimal Neural Architectures.
Proceedings of the 6th International Conference on Learning Representations, 2018

Searching toward pareto-optimal device-aware neural architectures.
Proceedings of the International Conference on Computer-Aided Design, 2018

Liquid Pouring Monitoring via Rich Sensory Inputs.
Proceedings of the Computer Vision - ECCV 2018, 2018

Efficient Uncertainty Estimation for Semantic Segmentation in Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

DPP-Net: Device-Aware Progressive Search for Pareto-Optimal Neural Architectures.
Proceedings of the Computer Vision - ECCV 2018, 2018

Leveraging Motion Priors in Videos for Improving Human Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Cube Padding for Weakly-Supervised Saliency Prediction in 360° Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Self-supervised Learning of Depth and Camera Motion from 360° Videos.
Proceedings of the Computer Vision - ACCV 2018, 2018

Self-View Grounding Given a Narrated 360° Video.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Semantic Highlight Retrieval and Term Prediction.
IEEE Trans. Image Process., 2017

Summarizing Unconstrained Videos Using Salient Montages.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight.
CoRR, 2017

Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Video.
CoRR, 2017

Learning to Compose with Professional Photographs on the Web.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Tactics of Adversarial Attack on Deep Reinforcement Learning Agents.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Visual Forecasting by Imitating Dynamics in Natural Sequences.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Tell Me Where to Look: Investigating Ways for Assisting Focus in 360° Video.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

Leveraging Video Descriptions to Learn Video Question Answering.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Ranking Highlights in Personal Videos by Analyzing Edited Videos.
IEEE Trans. Image Process., 2016

Semantic highlight retrieval.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Title Generation for User Generated Videos.
Proceedings of the Computer Vision - ECCV 2016, 2016

Extracting Driving Behavior: Global Metric Localization from Dashcam Videos in the Wild.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Recognition from Hand Cameras: A Revisit with Deep Learning.
Proceedings of the Computer Vision - ECCV 2016, 2016

Proactive Sensing for Improving Hand Pose Estimation.
Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 2016

Video Captioning via Sentence Augmentation and Spatio-Temporal Attention.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

The World Is Changing: Finding Changes on the Street.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

Anticipating Accidents in Dashcam Videos.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
Recognition from Hand Cameras.
CoRR, 2015

2014
Model-Based Object Recognition.
Computer Vision, A Reference Guide, 2014

Relating Things and Stuff via Object Property Interactions.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Salient Montages from Unconstrained Videos.
Proceedings of the Computer Vision - ECCV 2014, 2014

Ranking Domain-Specific Highlights by Analyzing Edited Videos.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
Object detection, shape recovery, and 3D modelling by depth-encoded hough voting.
Comput. Vis. Image Underst., 2013

Find the Best Path: An Efficient and Accurate Classifier for Image Hierarchies.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Efficient and Exact MAP-MRF Inference using Branch and Bound.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Object Detection using Geometrical Context Feedback.
Int. J. Comput. Vis., 2012

Relating Things and Stuff by High-Order Potential Modeling.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

An efficient branch-and-bound algorithm for optimal human pose estimation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Conditional regression forests for human pose estimation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Mobile object detection through client-server based vote transfer.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Toward coherent object detection and scene layout understanding.
Image Vis. Comput., 2011

Articulated part-based model for joint object detection and pose estimation.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Toward Automatic 3D Generic Object Modeling from One Single Image.
Proceedings of the International Conference on 3D Imaging, 2011

2010
Depth-Encoded Hough Voting for Joint Object Detection and Shape Recovery.
Proceedings of the Computer Vision - ECCV 2010, 2010

Object Detection with Geometrical Context Feedback Loop.
Proceedings of the British Machine Vision Conference, 2010

2009
Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

A multi-view probabilistic model for 3D object classes.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Unsupervised Object Pose Classification from Short Video Sequences.
Proceedings of the British Machine Vision Conference, 2009

