Yiyang Ma

ORCID: 0000-0001-7210-4018

According to our database, Yiyang Ma authored at least 14 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Prompt-Based Modality Bridging for Unified Text-to-Face Generation and Manipulation.
ACM Trans. Multim. Comput. Commun. Appl., December, 2024

Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery.
IEEE Trans. Geosci. Remote. Sens., 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding.
CoRR, 2024

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation.
CoRR, 2024

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation.
CoRR, 2024

Consistency Guided Diffusion Model with Neural Syntax for Perceptual Image Compression.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024

Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion.
Proceedings of Computer Vision - ECCV 2024, 2024

2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation.
CoRR, 2023

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation.
Proceedings of MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MetroGAN: Simulating Urban Morphology with Generative Adversarial Network.
Proceedings of KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Meta-Interpolation: Time-Arbitrary Frame Interpolation via Dual Meta-Learning.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022
