Jieru Zhao

Orcid: 0000-0001-8211-2812

According to our database1, Jieru Zhao authored at least 44 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
FiberFlex: Real-time FPGA-based Intelligent and Distributed Fiber Sensor System for Pedestrian Recognition.
ACM Trans. Reconfigurable Technol. Syst., December, 2024

Automatic Mapping of Heterogeneous DNN Models on Adaptive Multiaccelerator Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., December, 2024

HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes.
IEEE Robotics Autom. Lett., November, 2024

FADO: Floorplan-Aware Directive Optimization Based on Synthesis and Analytical Models for High-Level Synthesis Designs on Multi-Die FPGAs.
ACM Trans. Reconfigurable Technol. Syst., September, 2024

Hardware-Software Co-Design Enabling Static and Dynamic Sparse Attention Mechanisms.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., September, 2024

AMF-Placer 2.0: Open-Source Timing-Driven Analytical Mixed-Size Placer for Large-Scale Heterogeneous FPGA.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., September, 2024

SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity.
CoRR, 2024

Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU.
CoRR, 2024

Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture.
CoRR, 2024

AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs.
CoRR, 2024

The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving.
CoRR, 2024

O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation.
CoRR, 2024

HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes.
CoRR, 2024

PAS: Towards Accurate and Efficient Federated Learning with Parameter-Adaptive Synchronization.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024

A Tale of Two Domains: Exploring Efficient Architecture Design for Truly Autonomous Things.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

An Optimizing Framework on MLIR for Efficient FPGA-based Accelerator Generation.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024

O <sub>2</sub>V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

Swift-Mapping: Online Neural Implicit Dense Mapping in Urban Scenes.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HL-Pow: Learning-Assisted Pre-RTL Power Modeling and Optimization for FPGA HLS.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2023

FPGA sharing in the cloud: a comprehensive analysis.
Frontiers Comput. Sci., October, 2023

PAC: Preference-Aware Co-location Scheduling on Heterogeneous NUMA Architectures To Improve Resource Utilization.
Proceedings of the 37th International Conference on Supercomputing, 2023

FlowMap: Path Generation for Automated Vehicles in Open Space Using Traffic Flow.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers.
Proceedings of the 19th Workshop on Hot Topics in Operating Systems, 2023

SSiMD: Supporting Six Signed Multiplications in a DSP Block for Low-Precision CNN on FPGAs.
Proceedings of the International Conference on Field Programmable Technology, 2023

SpMMPlu: A Compiler Plug-in with Sparse IR for Efficient Sparse Matrix Multiplication.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

MARS: Exploiting Multi-Level Parallelism for DNN Workloads on Adaptive Multi-Accelerator Systems.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
CSC: Collaborative System Configuration for I/O-Intensive Applications in Multi-Tenant Clouds.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

PowerGear: Early-Stage Power Estimation in FPGA HLS via Heterogeneous Edge-Centric GNNs.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

SALO: an efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Characterizing and orchestrating VM reservation in geo-distributed clouds to improve the resource efficiency.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

2021
Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.
Proceedings of the International Conference for High Performance Computing, 2021

Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks.
Proceedings of the 39th IEEE International Conference on Computer Design, 2021

AMF-Placer: High-Performance Analytical Mixed-size Placer for FPGA.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

2020
Performance Modeling and Directives Optimization for High-Level Synthesis on FPGA.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications.
Proceedings of the 30th International Conference on Field-Programmable Logic and Applications, 2020

HL-Pow: A Learning-Based Power Modeling Framework for High-Level Synthesis.
Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

2019
Hi-ClockFlow: Multi-Clock Dataflow Automation and Throughput Optimization in High-Level Synthesis.
Proceedings of the International Conference on Computer-Aided Design, 2019

A Hybrid Data-Consistent Framework for Link-Aware AccessManagement in Emerging CPU-FPGA Platforms.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Machine Learning Based Routing Congestion Prediction in FPGA High-Level Synthesis.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

LAMA: Link-Aware Hybrid Management for Memory Accesses in Emerging CPU-FPGA Platforms.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

2018
Hi-DMM: High-Performance Dynamic Memory Management in High-Level Synthesis.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

2017
COMBA: A comprehensive model-based analysis framework for high level synthesis of real applications.
Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017


  Loading...