Peter Vajda

Orcid: 0000-0002-2031-4678

According to our database, Peter Vajda authored at least 77 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Observation and Local Prediction of the Vertical Gravity Gradient: Review Paper.
IEEE Instrum. Meas. Mag., September, 2024

An Investigation on Hardware-Aware Vision Transformer Scaling.
ACM Trans. Embed. Comput. Syst., May, 2024

GROWTH-23: An integrated code for inversion of complete Bouguer gravity anomaly or temporal gravity changes.
Comput. Geosci., January, 2024

Movie Gen: A Cast of Media Foundation Models.
CoRR, 2024

Imagine yourself: Tuning-Free Personalized Image Generation.
CoRR, 2024

Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation.
CoRR, 2024

Animated Stickers: Bringing Stickers to Life with Video Diffusion.
CoRR, 2024

AVID: Any-Length Video Inpainting with Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cache Me if You Can: Accelerating Diffusion Models through Block Caching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ControlRoom3D: Room Generation Using Semantic Proxy Rooms.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MixRT: Mixed Neural Representations For Real-Time NeRF Rendering.
Proceedings of the International Conference on 3D Vision, 2024

2023
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack.
CoRR, 2023

Pruning Compact ConvNets for Efficient Inference.
CoRR, 2023

XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Practical Stereo Depth System for Smart Glasses.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference.
CoRR, 2022

3D-Aware Encoding for Style-based Neural Radiance Fields.
CoRR, 2022

Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models.
Proceedings of the Computer Vision - ECCV 2022, 2022

Open-Set Semi-Supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Cross-Domain Adaptive Teacher for Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Data Efficient Language-supervised Zero-shot Recognition with Optimal Transport Distillation.
CoRR, 2021

Cross-Domain Object Detection via Adaptive Self-Training.
CoRR, 2021

FBNetV5: Neural Architecture Search for Multiple Tasks in One Run.
CoRR, 2021

Image2Point: 3D Point-Cloud Understanding with Pretrained 2D ConvNets.
CoRR, 2021

You Only Group Once: Efficient Point-Cloud Processing with Token Representation and Relation Inference Module.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Unbiased Teacher for Semi-Supervised Object Detection.
Proceedings of the 9th International Conference on Learning Representations, 2021

Visual Transformers: Where Do Transformers Really Belong in Vision Models?
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking the Self-Attention in Vision Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Data-Efficient Language-Supervised Zero-Shot Learning With Self-Distillation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
One shot 3D photography.
ACM Trans. Graph., 2020

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge.
CoRR, 2020

Visual Transformers: Token-based Image Representation and Processing for Computer Vision.
CoRR, 2020

FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function.
CoRR, 2020

Learning the Loss Functions in a Discriminative Space for Video Restoration.
CoRR, 2020

SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Generate Grounded Visual Captions Without Localization Supervision.
Proceedings of the Computer Vision - ECCV 2020, 2020

Deep Space-Time Video Upsampling Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning to Generate Grounded Image Captions without Localization Supervision.
CoRR, 2019

Efficient Segmentation: Learning Downsampling Near Semantic Boundaries.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Machine Learning at Facebook: Understanding Inference at the Edge.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Precision Highway for Ultra Low-Precision Quantization.
CoRR, 2018

Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search.
CoRR, 2018

Value-Aware Quantization for Training and Inference of Neural Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
DSD: Dense-Sparse-Dense Training for Deep Neural Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

2014
Real-time query-by-image video search system.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

2013
Geotag Propagation with User Trust Modeling.
Proceedings of the Social Media Retrieval, 2013

Comparative Study of Trust Modeling for Automatic Landmark Tagging.
IEEE Trans. Inf. Forensics Secur., 2013

EigenNews: a personalized news video delivery platform.
Proceedings of the ACM Multimedia Conference, 2013

Eigennews: Generating and delivering personalized news video.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Analysis of visual similarity in news videos with robust and memory-efficient image retrieval.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

2012
In Tags We Trust: Trust modeling in social tagging of multimedia content.
IEEE Signal Process. Mag., 2012

Geotag propagation in social networks based on user trust model.
Multim. Tools Appl., 2012

2011
Object Duplicate Detection.
PhD thesis, 2011

Epitomize Your Photos.
Int. J. Comput. Games Technol., 2011

Social game epitome versus automatic visual analysis.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

2010
Robust Duplicate Detection of 2D and 3D Objects.
Int. J. Multim. Data Eng. Manag., 2010

3D object duplicate detection for video retrieval.
Proceedings of the 11th International Workshop on Image Analysis for Multimedia Interactive Services, 2010

Object-based tag propagation for semi-automatic annotation of images.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

2009
Graph-based approach for 3D object duplicate detection.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Analysis of the Limits of Graph-Based Object Duplicate Detection.
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009

2008
Parameter Control Methods for Selection Operators in Genetic Algorithms.
Proceedings of the Parallel Problem Solving from Nature, 2008

Towards Fully Automatic Image Segmentation Evaluation.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2008

2007
Hungarian WordNet and representation of verbal event structure.
Acta Cybern., 2007

2006
Morphdb.hu: Hungarian lexical database and morphological grammar.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

