Sergey Tulyakov

Orcid: 0000-0003-3465-1592

According to our database1, Sergey Tulyakov authored at least 160 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Promptable Game Models: Text-guided Game Simulation via Masked Diffusion Models.
ACM Trans. Graph., April, 2024

DELTA: Dense Efficient Long-range 3D Tracking for any video.
CoRR, 2024

Scalable Ranked Preference Optimization for Text-to-Image Generation.
CoRR, 2024

ControlMM: Controllable Masked Motion Generation.
CoRR, 2024

Pixel-Aligned Multi-View Generation with Depth Guided Decoder.
CoRR, 2024

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control.
CoRR, 2024

Efficient Training with Denoised Neural Weights.
CoRR, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.
CoRR, 2024

Lightweight Predictive 3D Gaussian Splats.
CoRR, 2024

Taming Data and Transformers for Audio Generation.
CoRR, 2024

VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing.
CoRR, 2024

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models.
CoRR, 2024

GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement.
CoRR, 2024

BitsFusion: 1.99 bits Weight Quantization of Diffusion Model.
CoRR, 2024

SF-V: Single Forward Video Generation Model.
CoRR, 2024

Visual Concept-driven Image Generation with Text-to-Image Diffusion Model.
CoRR, 2024

SPAD : Spatially Aware Multiview Diffusers.
CoRR, 2024

AToM: Amortized Text-to-Mesh using 2D Diffusion.
CoRR, 2024

E<sup>2</sup>GAN: Efficient Training of Efficient GANs for Image-to-Image Translation.
CoRR, 2024

Diffusion Priors for Dynamic View Synthesis from Monocular Videos.
CoRR, 2024

MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

E2GAN: Efficient Training of Efficient GANs for Image-to-Image Translation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

UpFusion: Novel View Diffusion from Unposed Sparse View Observations.
Proceedings of the Computer Vision - ECCV 2024, 2024

Efficient Training with Denoised Neural Weights.
Proceedings of the Computer Vision - ECCV 2024, 2024

TC4D: Trajectory-Conditioned Text-to-4D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

MyVLM: Personalizing VLMs for User-Specific Queries.
Proceedings of the Computer Vision - ECCV 2024, 2024

Towards Text-guided 3D Scene Composition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hierarchical Patch Diffusion Models for High-Resolution Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TextCraftor: Your Text Encoder can be Image Quality Controller.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SPAD: Spatially Aware Multi-View Diffusers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Evaluating Very Long-Term Conversational Memory of LLM Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Text-Guided Synthesis of Eulerian Cinemagraphs.
ACM Trans. Graph., December, 2023

Virtual Pets: Animatable Animal Generation in 3D Scenes.
CoRR, 2023

SceneWiz3D: Towards Text-guided 3D Scene Composition.
CoRR, 2023

iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis.
CoRR, 2023

Synthesizing Artistic Cinemagraphs from Text.
CoRR, 2023

Plotting Behind the Scenes: Towards Learnable Game Engines.
CoRR, 2023

Control-NeRF: Editable Feature Volumes for Scene Rendering and Manipulation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Repurposing Diffusion Inpainters for Novel View Synthesis.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Autodecoding Latent 3D Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LightSpeed: Light and Fast Neural Light Fields on Mobile Devices.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

3D generation on ImageNet.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

InfiniCity: Infinite-Scale City Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Vision Transformers for MobileNet Size and Speed.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text2Tex: Text-driven Texture Synthesis via Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Volumetric Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Make-A-Story: Visual Memory Conditioned Consistent Story Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Invertible Neural Skinning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Real-Time Neural Light Field on Mobile Devices.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Affection: Learning Affective Explanations for Real-World Visual Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3DAvatarGAN: Bridging Domains for Personalized Editable Avatars.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
NeROIC: neural rendering of objects from online image collections.
ACM Trans. Graph., 2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation.
CoRR, 2022

EfficientFormer: Vision Transformers at MobileNet Speed.
CoRR, 2022

Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EpiGRAF: Rethinking training of 3D GANs.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EfficientFormer: Vision Transformers at MobileNet Speed.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

InfinityGAN: Towards Infinite-Pixel Image Synthesis.
Proceedings of the Tenth International Conference on Learning Representations, 2022

F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

LADIS: Language Disentanglement for 3D Shape Editing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Quantized GAN for Complex Music Generation from Dance Videos.
Proceedings of the Computer Vision - ECCV 2022, 2022

R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis.
Proceedings of the Computer Vision - ECCV 2022, 2022

Cross-modal 3D Shape Generation and Manipulation.
Proceedings of the Computer Vision - ECCV 2022, 2022

StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Playable Environments: Video Manipulation in Space and Time.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

InOut: Diverse Image Outpainting via GAN Inversion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
InfinityGAN: Towards Infinite-Resolution Image Synthesis.
CoRR, 2021

In&Out : Diverse Image Outpainting via GAN Inversion.
CoRR, 2021

Task-Assisted Domain Adaptation with Anchor Tasks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Representations for Content Creation, Manipulation and Animation.
Proceedings of the ADGD '21: Proceedings of the 1st Workshop on Synthetic Multimedia, 2021

A Good Image Generator Is What You Need for High-Resolution Video Synthesis.
Proceedings of the 9th International Conference on Learning Representations, 2021

TADPool: Target Adaptive Pooling for Set Based Face Recognition.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

Multistage Fusion of Face Matchers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Motion Representations for Articulated Animation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Flow Guided Transformable Bottleneck Networks for Motion Retargeting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Playable Video Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Teachers Do More Than Teach: Compressing Image-to-Image Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SMIL: Multimodal Learning with Severely Missing Modality.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Interactive video stylization using few-shot patch-based training.
ACM Trans. Graph., 2020

MichiGAN: multi-input-conditioned hair image generation for portrait editing.
ACM Trans. Graph., 2020

Towards Photo-Realistic Facial Expression Manipulation.
Int. J. Comput. Vis., 2020

Motion-supervised Co-Part Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Human Motion Transfer from Poses in the Wild.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Neural Hair Rendering.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Anchor Tasks: Inexpensive, Shared, and Aligned Tasks for Domain Adaptation.
CoRR, 2019

First Order Motion Model for Image Animation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Utilizing Template Diversity for Fusion Of Face Recognizers.
Proceedings of the 5th IEEE International Conference on Identity, 2019

Laplace Landmark Localization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Transformable Bottleneck Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Real-Time Patch-Based Stylization of Portraits Using Generative Adversarial Network.
Proceedings of the 8th ACM/Eurographics Expressive Symposium, 2019

Animating Arbitrary Objects via Deep Motion Transfer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Significant Feature Based Representation for Template Protection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

3D Guided Fine-Grained Face Manipulation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Recurrent Convolutional Shape Regression.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Viewpoint-Consistent 3D Face Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Metadata-Based Feature Aggregation Network for Face Recognition.
Proceedings of the 2018 International Conference on Biometrics, 2018

Knowledge Transfer Using Neural Network Based Approach for Handwritten Text Recognition.
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

MoCoGAN: Decomposing Motion and Content for Video Generation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Hybrid VAE: Improving Deep Generative Models using Partial Observations.
CoRR, 2017

Score normalization in stratified biometric systems.
Proceedings of the 2017 IEEE International Joint Conference on Biometrics, 2017

2016
The First 3D Face Alignment in the Wild (3DFAW) Challenge.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recurrent Convolutional Face Alignment.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
FaceCept3D: Real Time 3D Face Tracking and Analysis.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Regressing a 3D Face Shape from a Single Image.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Facial expression recognition under a wide range of head poses.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

A multiple server scheme for fingerprint fuzzy vaults.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

2014
Handprinted Character and Word Recognition.
Proceedings of the Handbook of Document Image Processing and Recognition, 2014

Robust Real-Time Extreme Head Pose Estimation.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Improved Local Correlation Method for Fingerprint Matching.
Proceedings of the Second International Symposium on Computing and Networking, 2014

Secure Fingerprint Matching with Generic Local Structures.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
A feature information based approach for enhancing score-level fusion in multi-sample biometric systems.
Proceedings of the Fourth National Conference on Computer Vision, 2013

Towards fingerprints as strings: Secure indexing for fingerprint matching.
Proceedings of the International Conference on Biometrics, 2013

Minutiae-Based Matching State Model for Combinations in Fingerprint Matching System.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Face Recognition System for Machine Readable Travel Documents.
Int. J. Comput., 2012

Neural network-based digital map extraction approach.
Proceedings of the 2012 Proceedings of the 35th International Convention, 2012

Etalon-based integrated microchip inspection system.
Proceedings of the 2012 Proceedings of the 35th International Convention, 2012

Facial behavior as a soft biometric.
Proceedings of the 5th IAPR International Conference on Biometrics, 2012

Utilization of matching score vector similarity measures in biometric systems.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011
The intellectual system for face recognition: Algorithms and results.
Proceedings of the MIPRO, 2011

Combination of multiple samples utilizing identification model in biometric systems.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011

Combination of user- and enrollee-specific statistical information in verification systems.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

Multiple-sample fusion of matching scores in biometric systems.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
A Framework for Efficient Fingerprint Identification Using a Minutiae Tree.
IEEE Syst. J., 2010

On the Difference between Optimal Combination Functions for Verification and Identification Systems.
Int. J. Pattern Recognit. Artif. Intell., 2010

Combination of Symmetric Hash Functions for Secure Fingerprint Matching.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
Neural Network Optimization for Combinations in Identification Systems.
Proceedings of the Multiple Classifier Systems, 8th International Workshop, 2009

Combining Facial Skin Mark and Eigenfaces for Face Recognition.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

2008
Review of Classifier Combination Methods.
Proceedings of the Machine Learning in Document Analysis and Recognition, 2008

Learning Matching Score Dependencies for Classifier Combination.
Proceedings of the Machine Learning in Document Analysis and Recognition, 2008

Use of Identification Trial Statistics for the Combination of Biometric Matchers.
IEEE Trans. Inf. Forensics Secur., 2008

Integrating minutiae based fingerprint matching with local mutual information.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Comparison of combination methods utilizing T-normalization and second best score model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

2007
Symmetric hash functions for secure fingerprint biometric systems.
Pattern Recognit. Lett., 2007

Optimal Classifier Combination Rules for Verification and Identification Systems.
Proceedings of the Multiple Classifier Systems, 7th International Workshop, 2007

Robust Point-Based Feature Fingerprint Segmentation Algorithm.
Proceedings of the Advances in Biometrics, International Conference, 2007

Real-time Automatic Deceit Detection from Involuntary Facial Expressions.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Facial Expression Biometrics Using Tracker Displacement Features.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Image Quality Measures for Fingerprint Image Enhancement.
Proceedings of the Multimedia Content Representation, 2006

Utilizing Independence of Multimodal Biometric Matchers.
Proceedings of the Multimedia Content Representation, 2006

Classifier Combination Types for Biometric Applications.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

2005
Using Independence Assumption to Improve Multimodal Biometric Fusion.
Proceedings of the Multiple Classifier Systems, 6th International Workshop, 2005

Combining Matching Scores in Identification Model.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

Symmetric Hash Functions for Fingerprint Minutiae.
Proceedings of the Pattern Recognition and Image Analysis, 2005

2003
Postal address block location by contour clustering.
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2003

2001
Probabilistic Model for Segmentation Based Word Recognition with Lexicon.
Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR 2001), 2001


  Loading...