Xiang Li

Orcid: 0000-0002-3282-1159

Affiliations:
  • Carnegie Mellon University, School of Computer Science, Department of Electrical and Computer Engineering, Pittsburgh, PA, USA


According to our database1, Xiang Li authored at least 29 papers between 2022 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer.
CoRR, 2024

XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation.
CoRR, 2024

On the Diversity of Synthetic Data and its Impact on Training Large Language Models.
CoRR, 2024

ImageFolder: Autoregressive Image Generation with Folded Tokens.
CoRR, 2024

Efficient Autoregressive Audio Modeling via Next-Scale Prediction.
CoRR, 2024

From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking.
CoRR, 2024

ControlVAR: Exploring Controllable Visual Autoregressive Modeling.
CoRR, 2024

Slight Corruption in Pre-training Data Makes Better Diffusion Models.
CoRR, 2024

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition.
CoRR, 2024

Evaluating and Improving Continual Learning in Spoken Language Understanding.
CoRR, 2024

Customizable Perturbation Synthesis for Robust SLAM Benchmarking.
CoRR, 2024

Completing Visual Objects via Bridging Generation and Segmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A General Framework for Learning from Weak Supervision.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

R<sup>2</sup>-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations.
Proceedings of the Computer Vision - ECCV 2024, 2024

Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Video Instance Segmentation by Instance Flow Assembly.
IEEE Trans. Multim., 2023

Completing Visual Objects via Bridging Generation and Segmentation.
CoRR, 2023

Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition.
CoRR, 2023

PaintSeg: Training-free Segmentation via Painting.
CoRR, 2023

PaintSeg: Painting Pixels for Training-free Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rethinking Voice-Face Correlation: A Geometry View.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Robust Referring Video Object Segmentation with Cyclic Structural Consensus.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Online Video Instance Segmentation via Robust Context Fusion.
CoRR, 2022

R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency.
CoRR, 2022

Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution.
CoRR, 2022

Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022


  Loading...