Xiang Li

Orcid: 0000-0002-3282-1159

Affiliations:

Carnegie Mellon University, School of Computer Science, Department of Electrical and Computer Engineering, Pittsburgh, PA, USA

According to our database¹, Xiang Li authored at least 31 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video.

[BibT_eX]

[DOI]

Matthew Johnson-Roberson

Sebastian Scherer

Xiaonan Huang

CoRR, January, 2025

2024

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer.

[BibT_eX]

[DOI]

CoRR, 2024

XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation.

[BibT_eX]

[DOI]

CoRR, 2024

On the Diversity of Synthetic Data and its Impact on Training Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

ImageFolder: Autoregressive Image Generation with Folded Tokens.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient Autoregressive Audio Modeling via Next-Scale Prediction.

[BibT_eX]

[DOI]

CoRR, 2024

From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking.

[BibT_eX]

[DOI]

Matthew Johnson-Roberson

Xiaonan Huang

CoRR, 2024

ControlVAR: Exploring Controllable Visual Autoregressive Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition.

[BibT_eX]

[DOI]

CoRR, 2024

Evaluating and Improving Continual Learning in Spoken Language Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

Customizable Perturbation Synthesis for Robust SLAM Benchmarking.

[BibT_eX]

[DOI]

Matthew Johnson-Roberson

Xiaonan Huang

CoRR, 2024

Slight Corruption in Pre-training Data Makes Better Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Completing Visual Objects via Bridging Generation and Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

A General Framework for Learning from Weak Supervision.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

R<sup>2</sup>-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Video Instance Segmentation by Instance Flow Assembly.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Completing Visual Objects via Bridging Generation and Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition.

[BibT_eX]

[DOI]

CoRR, 2023

PaintSeg: Training-free Segmentation via Painting.

[BibT_eX]

[DOI]

CoRR, 2023

PaintSeg: Painting Pixels for Training-free Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rethinking Voice-Face Correlation: A Geometry View.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Robust Referring Video Object Segmentation with Cyclic Structural Consensus.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

Online Video Instance Segmentation via Robust Context Fusion.

[BibT_eX]

[DOI]

CoRR, 2022

R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency.

[BibT_eX]

[DOI]

CoRR, 2022

Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution.

[BibT_eX]

[DOI]

CoRR, 2022

Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Xiang Li

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...