Xiaobing Feng
ORCID: 0000-0003-2909-7750
Affiliations:
- Chinese Academy of Sciences, Institute of Computing Technology, State Key Lab of Computer Architecture, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
According to our database, Xiaobing Feng authored at least 99 papers between 2004 and 2024.
Bibliography
2024
Fast Convolution Meets Low Precision: Exploring Efficient Quantized Winograd Convolution on Modern CPUs.
ACM Trans. Archit. Code Optim., March, 2024
A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
Optimizing Dynamic-Shape Neural Networks on Accelerators via On-the-Fly Micro-Kernel Polymerization.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
J. Comput. Sci. Technol., December, 2023
Dataset, October, 2023
J. Comput. Sci. Technol., September, 2023
Facilitating hardware-aware neural architecture search with learning-based predictive models.
J. Syst. Archit., April, 2023
Portable and Scalable All-Electron Quantum Perturbation Simulations on Exascale Supercomputers.
Proceedings of the International Conference for High Performance Computing, 2023
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
OPTango: Multi-central Representation Learning against Innumerable Compiler Optimization for Binary Diffing.
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
IEEE Trans. Software Eng., 2022
IEEE Trans. Parallel Distributed Syst., 2022
ACM Trans. Archit. Code Optim., 2022
Optimizing deep neural networks on intelligent edge accelerators via flexible-rate filter pruning.
J. Syst. Archit., 2022
2021
Unified Holistic Memory Management Supporting Multiple Big Data Processing Frameworks over Hybrid Memories.
ACM Trans. Comput. Syst., 2021
Int. J. Parallel Program., 2021
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021
Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30, 2021
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021
2020
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020
ACM Trans. Archit. Code Optim., 2020
Proceedings of the 27th IEEE International Conference on Software Analysis, 2020
Proceedings of the Network and Parallel Computing, 2020
Characterizing the I/O Pipeline in the Deployment of CNNs on Commercial Accelerators.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020
Lance: efficient low-precision quantized Winograd convolution for neural networks based on graphics processing units.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the Euro-Par 2020: Parallel Processing, 2020
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020
2019
J. Comput. Sci. Technol., 2019
Int. J. Parallel Program., 2019
Proceedings of the 26th IEEE International Conference on Software Analysis, 2019
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 2019
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019
PPOpenCL: a performance-portable OpenCL compiler with host and kernel thread code fusion.
Proceedings of the 28th International Conference on Compiler Construction, 2019
Proceedings of the Benchmarking, Measuring, and Optimizing, 2019
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019
2018
NVM Streaker: a fast and reconfigurable performance simulator for non-volatile memory-based memory architecture.
J. Supercomput., 2018
RARE: An Efficient Static Fault Detection Framework for Definition-Use Faults in Large Programs.
IEEE Access, 2018
Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2018
Lazygraph: lazy data coherency for replicas in distributed graph-parallel computation.
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018
Proceedings of the Network and Parallel Computing, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 IEEE International Symposium on Workload Characterization, 2018
Proceedings of the 32nd International Conference on Supercomputing, 2018
Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018
Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018
2017
IEEE Trans. Software Eng., 2017
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017
J. Comput. Sci. Technol., 2017
Int. J. Parallel Program., 2017
2016
Predicting Cross-Core Performance Interference on Multicore Processors with Regression Analysis.
IEEE Trans. Parallel Distributed Syst., 2016
J. Comput. Sci. Technol., 2016
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016
Proceedings of the Network and Parallel Computing, 2016
2015
WiseThrottling: a new asynchronous task scheduler for mitigating I/O bottleneck in large-scale datacenter servers.
J. Supercomput., 2015
ACM Trans. Archit. Code Optim., 2015
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015
Proceedings of the 37th IEEE/ACM International Conference on Software Engineering, 2015
Hadoop+: Modeling and Evaluating the Heterogeneity for MapReduce Applications in Heterogeneous Clusters.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015
Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, 2015
2014
Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip Multiprocessor Systems of Cluster Platforms.
J. Comput. Sci. Technol., 2014
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014
Proceedings of the ACM/IEEE International Conference on Automated Software Engineering, 2014
A collaborative divide-and-conquer K-means clustering algorithm for processing large data.
Proceedings of the Computing Frontiers Conference, CF'14, 2014
2013
ACM Trans. Archit. Code Optim., 2013
Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, 2013
An empirical model for predicting cross-core performance interference on multicore processors.
Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, 2013
2012
ACM Trans. Archit. Code Optim., 2012
J. Comput. Sci. Technol., 2012
Proceedings of the 13th International Conference on Parallel and Distributed Computing, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
2011
Dependence-based multi-level tracing and replay for wireless sensor networks debugging.
Proceedings of the ACM SIGPLAN/SIGBED 2011 conference on Languages, 2011
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Proceedings of the Seventh International Conference on Natural Computation, 2011
2010
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2010
Level by level: making flow- and context-sensitive pointer analysis scalable for millions of lines of code.
Proceedings of the CGO 2010, 2010
Proceedings of the CGO 2010, 2010
2009
J. Comput. Sci. Technol., 2009
Detecting and Eliminating Potential Violations of Sequential Consistency for Concurrent C/C++ Programs.
Proceedings of the CGO 2009, 2009
2008
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008
Global Tiling for Communication Minimal Parallelization on Distributed Memory Systems.
Proceedings of the Euro-Par 2008, 2008
2006
J. Comput. Res. Dev., 2006
2005
J. Comput. Sci. Technol., 2005
2004
Proceedings of the Languages and Compilers for High Performance Computing, 2004