Oncel Tuzel

According to our database1, Oncel Tuzel authored at least 105 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement.
Trans. Mach. Learn. Res., 2024

Synth4Seg - Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization.
CoRR, 2024

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models.
CoRR, 2024

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions.
CoRR, 2024

Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum.
CoRR, 2024

CLIP with Quality Captions: A Strong Pretraining for Vision Tasks.
CoRR, 2024

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data.
CoRR, 2024

Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

TiC-CLIP: Continual Training of CLIP Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Corpus Synthesis for Zero-Shot ASR Domain Adaptation Using Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

MUSCLE: A Model Update Strategy for Compatible LLM Evolution.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HUGS: Human Gaussian Splats.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Weight subcloning: direct initialization of transformers using larger pretrained ones.
CoRR, 2023

Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models.
CoRR, 2023

Novel-View Acoustic Synthesis from 3D Reconstructed Rooms.
CoRR, 2023

VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON.
CoRR, 2023

Token Pooling in Vision Transformers for Image Classification.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

FastFill: Efficient Compatible Model Update.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text is all You Need: Personalizing ASR Models Using Controllable Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

I See What You Hear: A Vision-Inspired Method to Localize Words.
Proceedings of the IEEE International Conference on Acoustics, 2023

MobileOne: An Improved One millisecond Mobile Backbone.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FaceLit: Neural 3D Relightable Faces.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Pointersect: Neural Rendering with Cloud-Ray Intersection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
RangeAugment: Efficient Online Augmentation with Range Learning.
CoRR, 2022

I see what you hear: a vision-inspired method to localize words.
CoRR, 2022

APE: Aligning Pretrained Encoders to Quickly Learn Aligned Multimodal Representations.
CoRR, 2022

An Improved One millisecond Mobile Backbone.
CoRR, 2022

Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models.
Proceedings of the International Conference on Machine Learning, 2022

SYNT++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Data Incubation - Synthesizing Missing Data for Handwriting Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

NeuMan: Neural Human Radiance Field from a Single Video.
Proceedings of the Computer Vision - ECCV 2022, 2022

Forward Compatible Training for Large-Scale Embedding Retrieval Systems.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Forward Compatible Training for Representation Learning.
CoRR, 2021

Token Pooling in Vision Transformers.
CoRR, 2021

Instance-Level Task Parameters: A Robust Multi-task Weighting Framework.
CoRR, 2021

Optimize What Matters: Training DNN-Hmm Keyword Spotting Model Using End Metric.
Proceedings of the IEEE International Conference on Acoustics, 2021

SapAugment: Learning A Sample Adaptive Policy for Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Implicit vs. Explicit Style Transfer? A Comparison of GAN Architectures for Continuous Path Keyboard Input Modeling.
Proceedings of the 29th European Signal Processing Conference, 2021

Extracurricular Learning: Knowledge Transfer Beyond Empirical Distribution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Subject-Aware Contrastive Learning for Biosignals.
CoRR, 2020

Extracurricular Learning: Knowledge Transfer Beyond Empirical Distribution.
CoRR, 2020

Least squares binary quantization of neural networks.
CoRR, 2020

Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Least squares binary quantization of neural networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Data Parameters: A New Family of Parameters for Learning a Differentiable Curriculum.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

MVX-Net: Multimodal VoxelNet for 3D Object Detection.
Proceedings of the International Conference on Robotics and Automation, 2019

Learning Conditional Error Model for Simulated Time-Series Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Nonlinear Conjugate Gradients For Scaling Synchronous Distributed DNN Training.
CoRR, 2018

Divide, Denoise, and Defend against Adversarial Attacks.
CoRR, 2018

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Attentional Network for Visual Object Detection.
CoRR, 2017

Learning from Simulated and Unsupervised Images through Adversarial Training.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Global-Local Face Upsampling Network.
CoRR, 2016

Unsupervised network pretraining via encoding human design.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Coupled Generative Adversarial Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

High-accuracy user identification using EEG biometrics.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

Robust Face Alignment Using a Mixture of Invariant Experts.
Proceedings of the Computer Vision - ECCV 2016, 2016

Gaussian Conditional Random Field Network for Semantic Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Deep Gaussian Conditional Random Field Network: A Model-Based Deep Network for Discriminative Denoising.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A Multi-stream Bi-directional Recurrent Neural Network for Fine-Grained Action Detection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

R-CNN for Small Object Detection.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
Unsupervised Deep Network Pretraining via Human Design.
CoRR, 2015

Efficient Upsampling of Natural Images.
CoRR, 2015

Layered Interpretation of Street View Images.
Proceedings of the Robotics: Science and Systems XI, Sapienza University of Rome, 2015

Deep hierarchical parsing for semantic segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Semi-Supervised Kernel Mean Shift Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Entropy-Rate Clustering: Cluster Analysis via Maximizing a Submodular Function Subject to a Matroid Constraint.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Detecting 3D geometric boundaries of indoor scenes under varying lighting.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Recursive Context Propagation Network for Semantic Scene Labeling.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Learning to Rank 3D Features.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
Joint Geodesic Upsampling of Depth Images.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Fast object localization and pose estimation in heavy clutter for robotic bin picking.
Int. J. Robotics Res., 2012

Voting-based pose estimation for robotic assembly using a 3D sensor.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Variable focus video: Reconstructing depth and video for dynamic scenes.
Proceedings of the 2012 IEEE International Conference on Computational Photography, 2012

Motion-Aware Structured Light Using Spatio-Temporal Decodable Patterns.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Compressed Inference for Probabilistic Sequential Models.
Proceedings of the UAI 2011, 2011

Finding a needle in a specular haystack.
Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Entropy rate superpixel segmentation.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Pose estimation in heavy clutter using a multi-flash camera.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Image Invariants for Smooth Reflective Surfaces.
Proceedings of the Computer Vision, 2010

<i>P</i>2Pi: A Minimal Solution for Registration of 3D Points to 3D Planes.
Proceedings of the Computer Vision - ECCV 2010, 2010

Specular surface reconstruction from sparse reflection correspondences.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Fast directional chamfer matching.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
PathMiner: A Web-Based Tool for Computer-Assisted Diagnostics in Pathology.
IEEE Trans. Inf. Technol. Biomed., 2009

A caGrid-Enabled, Learning Based Image Segmentation Method for Histopathology Specimens.
Proceedings of the 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, Boston, MA, USA, June 28, 2009

Kernel methods for weakly supervised mean shift clustering.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

2008
Pedestrian Detection via Classification on Riemannian Manifolds.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Automatic Image Analysis of Histopathology Specimens Using Concave Vertex Graph.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2008

Learning on lie groups for invariant detection and tracking.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Classification of hematologic malignancies using texton signatures.
Pattern Anal. Appl., 2007

Human Detection via Classification on Riemannian Manifolds.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Fast Construction of Covariance Matrices for Arbitrary Size Image Windows.
Proceedings of the International Conference on Image Processing, 2006

Region Covariance: A Fast Descriptor for Detection and Classification.
Proceedings of the Computer Vision, 2006

Covariance Tracking using Model Update Based on Lie Algebra.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Bayesian background modeling for foreground detection.
Proceedings of the Third ACM International Workshop on Video Surveillance & Sensor Networks, 2005

Multi-Kernel Object Tracking.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Simultaneous Multiple 3D Motion Estimation via Mode Finding on Lie Groups.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Object tracking in low-frame-rate video.
Proceedings of the Electronic Imaging: Image and Video Communications and Processing 2005, 2005

A Bayesian Approach to Background Modeling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2005


  Loading...