Yuheng Huang

ORCID: 0000-0003-3666-4020

Affiliations:
  • University of Alberta, Department of Electrical and Computer Engineering, Canada


According to our database, Yuheng Huang authored at least 18 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
LUNA: A Model-Based Universal Analysis Framework for Large Language Models.
IEEE Trans. Software Eng., July, 2024

Generation-based Differential Fuzzing for Deep Learning Libraries.
ACM Trans. Softw. Eng. Methodol., February, 2024

Towards Testing and Evaluating Vision-Language-Action Models for Robotic Manipulation: An Empirical Study.
CoRR, 2024

LeCov: Multi-level Testing Criteria for Large Language Models.
CoRR, 2024

Active Testing of Large Language Model via Multi-Stage Sampling.
CoRR, 2024

Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture.
CoRR, 2024

Vortex under Ripplet: An Empirical Study of RAG-enabled Applications.
CoRR, 2024

Where Do Large Language Models Fail When Generating Code?
CoRR, 2024

TESTEVAL: Benchmarking Large Language Models for Test Case Generation.
CoRR, 2024

Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence Smoothing.
CoRR, 2024

Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward.
CoRR, 2024

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023
PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing.
ACM Trans. Softw. Eng. Methodol., November, 2023

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models.
CoRR, 2023

When Simulator Meets Natural Deviation: A Study on Deviations in Simulation-based ADS Testing.
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, ISSRE 2023, 2023

DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

DeepLens: Interactive Out-of-distribution Data Detection in NLP Models.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
An Exploratory Study of AI System Risk Assessment from the Lens of Data Distribution and Uncertainty.
CoRR, 2022
