Yanqi Zhou

According to our database, Yanqi Zhou authored at least 53 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention.
CoRR, 2024

Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
FLuRKA: Fast fused Low-Rank & Kernel Attention.
CoRR, 2023

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts.
CoRR, 2023

Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Large Graph Property Prediction via Graph Segment Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Brainformers: Trading Simplicity for Efficiency.
Proceedings of the International Conference on Machine Learning, 2023

Lifelong Language Pretraining with Distribution-Specialized Experts.
Proceedings of the International Conference on Machine Learning, 2023

TripLe: Revisiting Pretrained Model Reuse and Progressive Learning for Efficient Vision Transformer Scaling and Searching.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
An Image Compression Encryption Algorithm Based on Chaos and ZUC Stream Cipher.
Entropy, 2022

LaMDA: Language Models for Dialog Applications.
CoRR, 2022

Mixture-of-Experts with Expert Choice Routing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards the Co-design of Neural Networks and Accelerators.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

Searching for Efficient Neural Architectures for On-Device ML on Edge TPUs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Visual Secure Image Encryption Scheme Based on Compressed Sensing and Regional Energy.
Entropy, 2021

Rethinking Co-design of Neural Architectures and Hardware Accelerators.
CoRR, 2021

Apollo: Transferable Architecture Exploration.
CoRR, 2021

Quantum Particle Swarm Optimization Extraction Algorithm Based on Quantum Chaos Encryption.
Complex., 2021

An Efficient Convolutional Blind Source Separation Algorithm for Speech Signals under Chaotic Masking.
Algorithms, 2021

A Learned Performance Model for Tensor Processing Units.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

An Block Image Encryption Algorithm Based on Reversible Cellular Automata.
Proceedings of the 21st International Conference on Communication Technology, 2021

Do Transformer Modifications Transfer Across Implementations and Applications?
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Flexible Approach to Autotuning Multi-Pass Machine Learning Compilers.
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020
A Single-Shot Generalized Device Placement for Large Dataflow Graphs.
IEEE Micro, 2020

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer.
J. Mach. Learn. Res., 2020

Reinforcement-Learning-Empowered MLaaS Scheduling for Serving Intelligent Internet of Things.
IEEE Internet Things J., 2020

A Learned Performance Model for the Tensor Processing Unit.
CoRR, 2020

ODE-CNN: Omnidirectional Depth Extension Networks.
CoRR, 2020

Parallel Encryption of Noisy Images Based on Sequence Generator and Chaotic Measurement Matrix.
Complex., 2020

Transferable Graph Optimizers for ML Compilers.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Omnidirectional Depth Extension Networks.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

2019
GDP: Generalized Device Placement for Dataflow Graphs.
CoRR, 2019

EPNAS: Efficient Progressive Neural Architecture Search.
CoRR, 2019

OpenPiton: an open source hardware platform for your research.
Commun. ACM, 2019

Swift machine learning model serving scheduling: a region based reinforcement learning approach.
Proceedings of the International Conference for High Performance Computing, 2019

EPNAS: Efficient Progressive Neural Architecture Search.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Resource-Efficient Neural Architect.
CoRR, 2018

Neural Voice Cloning with a Few Samples.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Power and Energy Characterization of an Open Source 25-Core Manycore Processor.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

2017
Piton: A Manycore Processor for Multitenant Clouds.
IEEE Micro, 2017

Deep Learning Scaling is Predictable, Empirically.
CoRR, 2017

Deep Voice 2: Multi-Speaker Neural Text-to-Speech.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Camouflage: Memory Traffic Shaping to Mitigate Timing Attacks.
Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

Atomic In-place Updates for Non-volatile Main Memories with Kamino-Tx.
Proceedings of the Twelfth European Conference on Computer Systems, 2017

2016
MITTS: Memory Inter-arrival Time Traffic Shaping.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

CASH: Supporting IaaS Customers with a Sub-core Configurable Architecture.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Piton: A 25-core academic manycore research processor.
Proceedings of the 2016 IEEE Hot Chips 28 Symposium (HCS), 2016

OpenPiton: An Open Source Manycore Research Framework.
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016

2014
The sharing architecture: sub-core configurability for IaaS clouds.
Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2014

