Shiyu Huang

Orcid: 0000-0003-0500-0141

Affiliations:
  • Zhipu AI, Beijing, China
  • 4Paradigm Inc., Beijing, China (2022 - 2024)
  • Tsinghua University, Department of Computer Science and Technology, China (PhD 2022)


According to our database1, Shiyu Huang authored at least 28 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DreamPolish: Domain Score Distillation With Progressive Geometry Generation.
CoRR, 2024

CogVLM2: Visual Language Models for Image and Video Understanding.
CoRR, 2024

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer.
CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.
CoRR, 2024

Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization.
CoRR, 2024

LVBench: An Extreme Long Video Understanding Benchmark.
CoRR, 2024

MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment.
CoRR, 2024

AutoSAT: Automatically Optimize SAT Solvers via Large Language Models.
CoRR, 2024

LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning.
Nat. Mac. Intell., July, 2023

OpenRL: A Unified Reinforcement Learning Framework.
CoRR, 2023

Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models.
CoRR, 2023

Diverse Policies Converge in Reward-free Markov Decision Processe.
CoRR, 2023

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Deep reinforcement learning with credit assignment for combinatorial optimization.
Pattern Recognit., 2022

Diverse Policies Converge in Reward-Free Markov Decision Processes.
Proceedings of the PRICAI 2023: Trends in Artificial Intelligence, 2022

VMAPD: Generate Diverse Solutions for Multi-Agent Games with Recurrent Trajectory Discriminators.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

2021
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations.
CoRR, 2021

Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization.
CoRR, 2021

Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

2020
SVQN: Sequential Variational Soft Q-Learning Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Combo-Action: Training Agent For FPS Game with Auxiliary Tasks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2017
Recognition in-the-Tail: Training Detectors for Unusual Pedestrians with Synthetic Imposters.
CoRR, 2017

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017


  Loading...