Siteng Huang

Orcid: 0000-0002-9735-1186

According to our database¹, Siteng Huang authored at least 27 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning.

CoRR, 2024

Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation.

CoRR, 2024

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction.

CoRR, 2024

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration.

CoRR, 2024

Accelerating Diffusion Transformers with Token-wise Feature Caching.

CoRR, 2024

Focus-Consistent Multi-Level Aggregation for Compositional Zero-Shot Learning.

CoRR, 2024

M²IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension.

CoRR, 2024

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference.

CoRR, 2024

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference.

CoRR, 2024

ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DARA: Domain- and Relation-Aware Adapters Make Parameter-Efficient Tuning for Visual Grounding.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

VGDIFFZERO: Text-To-Image Diffusion Models Can Be Zero-Shot Visual Grounders.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

PiTe: Pixel-Temporal Alignment for Large Video-Language Model.

Proceedings of the Computer Vision - ECCV 2024, 2024

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.

Proceedings of the Computer Vision - ECCV 2024, 2024

Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Prompt-Based Distribution Alignment for Unsupervised Domain Adaptation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning.

CoRR, 2023

Reference-Limited Compositional Zero-Shot Learning.

Siteng Huang

Qiyao Wei

Donglin Wang

Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Domain Generalized Few-Shot Image Classification via Meta Regularization Network.

Min Zhang

Siteng Huang

Donglin Wang

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation.

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network.

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Pareto Self-Supervised Training for Few-Shot Learning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2019

DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting.

Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019
