Yuheng Huang

ORCID: 0000-0003-3666-4020

Affiliations:

University of Alberta, Department of Electrical and Computer Engineering, Canada

According to our database, Yuheng Huang authored at least 23 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Look Before You Leap: An Exploratory Study of Uncertainty Analysis for Large Language Models.

IEEE Trans. Software Eng., February, 2025

Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing.

CoRR, January, 2025

2024

LUNA: A Model-Based Universal Analysis Framework for Large Language Models.

IEEE Trans. Software Eng., July, 2024

Generation-based Differential Fuzzing for Deep Learning Libraries.

ACM Trans. Softw. Eng. Methodol., February, 2024

Towards Understanding Retrieval Accuracy and Prompt Quality in RAG Systems.

CoRR, 2024

LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation.

CoRR, 2024

Towards Testing and Evaluating Vision-Language-Action Models for Robotic Manipulation: An Empirical Study.

CoRR, 2024

LeCov: Multi-level Testing Criteria for Large Language Models.

CoRR, 2024

Active Testing of Large Language Model via Multi-Stage Sampling.

CoRR, 2024

Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture.

CoRR, 2024

Vortex under Ripplet: An Empirical Study of RAG-enabled Applications.

CoRR, 2024

Where Do Large Language Models Fail When Generating Code?

CoRR, 2024

TESTEVAL: Benchmarking Large Language Models for Test Case Generation.

CoRR, 2024

Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence Smoothing.

CoRR, 2024

Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward.

CoRR, 2024

GMP-TL: Gender-Augmented Multi-Scale Pseudo-Label Enhanced Transfer Learning For Speech Emotion Recognition.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement.

Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023

PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing.

Yuheng Huang, Lei Ma, Yuanchun Li

ACM Trans. Softw. Eng. Methodol., November, 2023

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models.

CoRR, 2023

When Simulator Meets Natural Deviation: A Study on Deviations in Simulation-based ADS Testing.

Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, ISSRE 2023, 2023

DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction.

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

DeepLens: Interactive Out-of-distribution Data Detection in NLP Models.

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022

An Exploratory Study of AI System Risk Assessment from the Lens of Data Distribution and Uncertainty.

CoRR, 2022
