Jianyu Huang

Orcid: 0000-0001-7595-5539

According to our database1, Jianyu Huang authored at least 29 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Context Parallelism for Scalable Million-Token Inference.
CoRR, 2024

The Llama 3 Herd of Models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
et al.
CoRR, 2024

PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability Against Parameter Corruptions.
CoRR, 2024

An Adaptive Path Planning Method for Curved Roads via Three-Dimensional Space-Aware Profiling Maps.
IEEE Access, 2024

Faster Trajectory Planning for Lane Change Scenarios with Dynamic Environment.
Proceedings of the 8th International Conference on Robotics, Control and Automation, 2024

2023
Trajectory Planning in Frenet Frame via Multi-Objective Optimization.
IEEE Access, 2023

AdaEmbed: Adaptive Embedding for Large-Scale Recommendation Models.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

2022

A denoising method based on CNN through multi-layer separation.
Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering, 2022

Efficient Soft-Error Detection for Low-precision Deep Learning Recommendation Models.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale.
IEEE Micro, 2021

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models.
CoRR, 2021

FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference.
CoRR, 2021

2020
Strassen's Algorithm Reloaded on GPUs.
ACM Trans. Math. Softw., 2020

Mixed-Precision Embedding Using a Cache.
CoRR, 2020

Multi UAV Cluster Control Method Based on Virtual Core in Improved Artificial Potential Field.
IEEE Access, 2020

2019
Deep Learning Recommendation Model for Personalization and Recommendation Systems.
CoRR, 2019

A Study of BFLOAT16 for Deep Learning Training.
CoRR, 2019

2018
Strassen's Algorithm for Tensor Contraction.
SIAM J. Sci. Comput., 2018

Rapid 3D Reconstruction for Image Sequence Acquired from UAV Camera.
Sensors, 2018

Implementing Strassen's Algorithm with CUTLASS on NVIDIA Volta GPUs.
CoRR, 2018

Learning from Optimizing Matrix-Matrix Multiplication.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

2017
Generating Families of Practical Fast Matrix Multiplication Algorithms.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
Implementing Strassen's Algorithm with BLIS.
CoRR, 2016

BLISlab: A Sandbox for Optimizing GEMM.
CoRR, 2016

Strassen's algorithm reloaded.
Proceedings of the International Conference for High Performance Computing, 2016

2015
Performance optimization for the k-nearest neighbors kernel on x86 architectures.
Proceedings of the International Conference for High Performance Computing, 2015

2013
High Performance Super-Resolution Reconstruction of Multiple Images Based on Fast Registration and Edge Enhancement.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

2011
A Space Target Recognition Method Based on Compressive Sensing.
Proceedings of the Sixth International Conference on Image and Graphics, 2011


  Loading...