Shiyu Huang

Orcid: 0000-0003-0500-0141

Affiliations:

Zhipu AI, Beijing, China
4Paradigm Inc., Beijing, China (2022 - 2024)
Tsinghua University, Department of Computer Science and Technology, China (PhD 2022)

According to our database¹, Shiyu Huang authored at least 28 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

DreamPolish: Domain Score Distillation With Progressive Geometry Generation.

[BibT_eX]

[DOI]

CoRR, 2024

CogVLM2: Visual Language Models for Image and Video Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization.

[BibT_eX]

[DOI]

Wentse Chen

Shiyu Huang

Jeff Schneider

CoRR, 2024

LVBench: An Extreme Long Video Understanding Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment.

[BibT_eX]

[DOI]

CoRR, 2024

AutoSAT: Automatically Optimize SAT Solvers via Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning.

[BibT_eX]

[DOI]

Nat. Mac. Intell., July, 2023

OpenRL: A Unified Reinforcement Learning Framework.

[BibT_eX]

[DOI]

CoRR, 2023

Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

Diverse Policies Converge in Reward-free Markov Decision Processe.

[BibT_eX]

[DOI]

Fanqi Lin

Shiyu Huang

Weiwei Tu

CoRR, 2023

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.

[BibT_eX]

[DOI]

Prithviraj Ammanabrolu

Yejin Choi

Xiang Ren

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022

Deep reinforcement learning with credit assignment for combinatorial optimization.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Diverse Policies Converge in Reward-Free Markov Decision Processes.

[BibT_eX]

[DOI]

Fanqi Lin

Shiyu Huang

Wei-Wei Tu

Proceedings of the PRICAI 2023: Trends in Artificial Intelligence, 2022

VMAPD: Generate Diverse Solutions for Multi-Agent Games with Recurrent Trajectory Discriminators.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

2021

TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations.

[BibT_eX]

[DOI]

CoRR, 2021

Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization.

[BibT_eX]

[DOI]

CoRR, 2021

Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

2020

SVQN: Sequential Variational Soft Q-Learning Networks.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Combo-Action: Training Agent For FPS Game with Auxiliary Tasks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2017

Recognition in-the-Tail: Training Detectors for Unusual Pedestrians with Synthetic Imposters.

[BibT_eX]

[DOI]

Shiyu Huang

Deva Ramanan

CoRR, 2017

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters.

[BibT_eX]

[DOI]

Shiyu Huang

Deva Ramanan

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Shiyu Huang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...