Mingyu Chen
Orcid: 0000-0003-4469-1037Affiliations:
- Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China
According to our database1,
Mingyu Chen
authored at least 135 papers
between 2004 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on dl.acm.org
On csauthors.net:
Bibliography
2024
Asynchronous Memory Access Unit: Exploiting Massive Parallelism for Far Memory Access.
ACM Trans. Archit. Code Optim., September, 2024
IEEE Trans. Parallel Distributed Syst., May, 2024
DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects.
CoRR, 2024
Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2024
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024
Proceedings of the 29th Asia and South Pacific Design Automation Conference, 2024
2023
CoRR, 2023
A Data-Driven Framework for TCP to Achieve Flexible QoS Control in Mobile Data Networks.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023
Morpheus: An Adaptive DRAM Cache with Online Granularity Adjustment for Disaggregated Memory.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023
REMU: Enabling Cost-Effective Checkpointing and Deterministic Replay in FPGA-based Emulation.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023
Ah-Q: Quantifying and Handling the Interference within a Datacenter from a System Perspective.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023
MARB: Bridge the Semantic Gap between Operating System and Application Memory Access Behavior.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023
Rethinking Design Paradigm of Graph Processing System with a CXL-like Memory Semantic Fabric.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023
2022
High fusion computers: The IoTs, edges, data centers, and humans-in-the-loop as a computer.
CoRR, 2022
QStack: Re-architecting User-space Network Stack to Optimize CPU Efficiency and Service Quality.
CoRR, 2022
Concurr. Comput. Pract. Exp., 2022
GraFF: A Multi-FPGA System with Memory Semantic Fabric for Scalable Graph Processing.
Proceedings of the International Conference on Field-Programmable Technology, 2022
Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022
Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022
Proceedings of the Benchmarking, Measuring, and Optimizing, 2022
2021
Proceedings of the 2021 IEEE International Conference on Engineering, 2021
EdUCAS: An In-house CI/CD Platform with Cloud FPGAs for Agilely Conducting Computer Systems Course Projects.
Proceedings of the ITiCSE '21: Proceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education V.2, Virtual Event, Germany, June 26, 2021
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021
2020
IEEE Trans. Mob. Comput., 2020
IMPULP: A Hardware Approach for In-Process Memory Protection via User-Level Partitioning.
J. Comput. Sci. Technol., 2020
Labeled Network Stack: A High-Concurrency and Low-Tail Latency Cloud Server Framework for Massive IoT Devices.
J. Comput. Sci. Technol., 2020
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020
2019
ACM Trans. Archit. Code Optim., 2019
Proceedings of the 50th ACM Technical Symposium on Computer Science Education, 2019
HCMA: Supporting High Concurrency of Memory Accesses with Scratchpad Memory in FPGAs.
Proceedings of the 2019 IEEE International Conference on Networking, 2019
Proceedings of the 25th IEEE International Conference on Parallel and Distributed Systems, 2019
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019
ZyCube: An In-House Mini-Cluster for Agilely Developing and Conducting Computer Systems Course Projects.
Proceedings of the ACM Conference on Global Computing Education, 2019
Proceedings of the Benchmarking, Measuring, and Optimizing, 2019
2018
ACM Trans. Embed. Comput. Syst., 2018
CoRR, 2018
ZyForce: An FPGA-based Cloud Platform for Experimental Curriculum of Computer System in University of Chinese Academy of Sciences (Abstract Only).
Proceedings of the 49th ACM Technical Symposium on Computer Science Education, 2018
Labeled Network Stack: A Co-designed Stack for Low Tail-Latency and High Concurrency in Datacenter Services.
Proceedings of the Network and Parallel Computing, 2018
Proceedings of the Advanced Computer Architecture - 12th Conference, 2018
2017
ACM Trans. Archit. Code Optim., 2017
Proceedings of the 14th Annual IEEE International Conference on Sensing, 2017
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017
Efficient Regional Congestion Awareness (ERCA) for Load Balance with Aggregated Congestion Information.
Proceedings of the 25th Euromicro International Conference on Parallel, 2017
Stem: A Table-Based Congestion Control Framework for Virtualized Data Center Networks.
Proceedings of the Network and Parallel Computing, 2017
Proceedings of the 2017 IEEE International Symposium on Performance Analysis of Systems and Software, 2017
Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017
Proceedings of the 2017 IEEE International Conference on Computer Design, 2017
Proceedings of the International Conference on Field Programmable Technology, 2017
2016
Titian2: a scalable system-level emulator with all programmability for datacenter servers in cloud computing.
Proceedings of the 9th International Conference on Utility and Cloud Computing, 2016
Proceedings of the Second International Symposium on Memory Systems, 2016
Twin-Load: Bridging the Gap between Conventional Direct-Attached and Buffer-on-Board Memory Systems.
Proceedings of the Second International Symposium on Memory Systems, 2016
Proceedings of the 24th IEEE/ACM International Symposium on Quality of Service, 2016
Proceedings of the IEEE Symposium on Computers and Communication, 2016
Proceedings of the 34th IEEE International Conference on Computer Design, 2016
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016
sAXI: A High-Efficient Hardware Inter-Node Link in ARM Server for Remote Memory Access.
Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016
2015
OpenFlow网络数据流路径建立开销的量化分析 (Quantitative Analysis of Flow-setup Cost in OpenFlow Network).
计算机科学, 2015
Detection of soft errors in LU decomposition with partial pivoting using algorithm-based fault tolerance.
Int. J. High Perform. Comput. Appl., 2015
CoRR, 2015
Proceedings of the 12th Annual IEEE International Conference on Sensing, 2015
An Effective Correlation-Aware VM Placement Scheme for SLA Violation Reduction in Data Centers.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015
Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Improving Memory Access Performance of In-Memory Key-Value Store Using Data Prefetching Techniques.
Proceedings of the Advanced Parallel Processing Technologies, 2015
Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015
2014
BPM/BPM+: Software-based dynamic memory partitioning mechanisms for mitigating DRAM bank-/channel-level interferences in multicore systems.
ACM Trans. Archit. Code Optim., 2014
HMTT: A hybrid hardware/software tracing system for bridging the DRAM access trace's semantic gap.
ACM Trans. Archit. Code Optim., 2014
J. Comput. Sci. Technol., 2014
A High-Performance and Cost-Efficient Interconnection Network for High-Density Servers.
J. Comput. Sci. Technol., 2014
Proceedings of the Big Data Benchmarks, Performance Optimization, and Emerging Hardware, 2014
Proceedings of the 10th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2014
Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014
Proceedings of the International Symposium on Low Power Electronics and Design, 2014
Proceedings of the ACM/IEEE 41st International Symposium on Computer Architecture, 2014
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
Proceedings of the 2014 International Conference on Supercomputing, 2014
Proceedings of the 2014 International Conference on Supercomputing, 2014
Dandelion: A locally-high-performance and globally-high-scalability hierarchical data center network.
Proceedings of the 23rd International Conference on Computer Communication and Networks, 2014
Achieving efficient packet-based memory system by exploiting correlation of memory requests.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014
A Swap-based Cache Set Index Scheme to Leverage both Superpage and Page Coloring Optimizations.
Proceedings of the 51st Annual Design Automation Conference 2014, 2014
Reducing Communication in Parallel Breadth-First Search on Distributed Memory Systems.
Proceedings of the 17th IEEE International Conference on Computational Science and Engineering, 2014
2013
Comput. Sci. Res. Dev., 2013
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2013
Scattered superpage: A case for bridging the gap between superpage and page coloring.
Proceedings of the 2013 IEEE 31st International Conference on Computer Design, 2013
ParaInsight: An Assistant for Quantitatively Analyzing Multi-granularity Parallel Region.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013
Proceedings of the 16th IEEE International Conference on Computational Science and Engineering, 2013
2012
Compression and Sieve: Reducing Communication in Parallel Breadth First Search on Distributed Memory Systems
CoRR, 2012
Proceedings of the 2012 ACM SIGPLAN workshop on Memory Systems Performance and Correctness: held in conjunction with PLDI '12, 2012
A lightweight hybrid hardware/software approach for object-relative memory profiling.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2012
A Case Study of Designing Efficient Algorithm-based Fault Tolerant Application for Exascale Parallelism.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
Proceedings of the 2012 IEEE International Symposium on Workload Characterization, 2012
Proceedings of the International Conference on Supercomputing, 2012
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012
A software memory partition approach for eliminating bank-level interference in multicore systems.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
2011
Inf. Process. Lett., 2011
A New and Efficient Algorithm-Based Fault Tolerance Scheme for A Million Way Parallelism
CoRR, 2011
HMTT: A Hybrid Hardware/Software Tracing System for Bridging Memory Trace's Semantic Gap
CoRR, 2011
On the random access performance of Cell Broadband Engine with graph analysis application
CoRR, 2011
Poster: revisiting virtual channel memory for performance and fairness on multi-core architecture.
Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011
Experience of parallelizing cryo-EM 3D reconstruction on a CPU-GPU heterogeneous system.
Proceedings of the 20th ACM International Symposium on High Performance Distributed Computing, 2011
Proceedings of the 18th International Conference on High Performance Computing, 2011
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011
2010
Proceedings of the 11th ACIS International Conference on Software Engineering, 2010
P-GAS: Parallelizing a Cycle-Accurate Event-Driven Many-Core Processor Simulator Using Parallel Discrete Event Simulation.
Proceedings of the 24th ACM/IEEE/SCS Workshop on Principles of Advanced and Distributed Simulation, 2010
Proceedings of the Fifth International Conference on Networking, Architecture, and Storage, 2010
Proceedings of the 15th IEEE Symposium on Computers and Communications, 2010
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
Proceedings of the 16th IEEE International Conference on Parallel and Distributed Systems, 2010
DMA cache: Using on-chip storage to architecturally separate I/O data from CPU data for improving I/O performance.
Proceedings of the 16th International Conference on High-Performance Computer Architecture (HPCA-16 2010), 2010
2009
SIGMETRICS Perform. Evaluation Rev., 2009
Proceedings of the 2009 Spring Simulation Multiconference, SpringSim 2009, 2009
A Scalability Analysis of the Symmetric Multiprocessing Architecture in Multi-Core System.
Proceedings of the International Conference on Networking, Architecture, and Storage, 2009
Proceedings of the 23rd international conference on Supercomputing, 2009
2008
Proceedings of the 2008 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2008
Proceedings of The 2008 IEEE International Conference on Networking, 2008
Proceedings of the 9th International Conference for Young Computer Scientists, 2008
Proceedings of the Seventh International Conference on Grid and Cooperative Computing, 2008
2005
Proceedings of the Sixth International Conference on Parallel and Distributed Computing, 2005
Proceedings of the Computational Intelligence and Security, International Conference, 2005
2004
Proceedings of the Parallel and Distributed Processing and Applications, 2004