Ting Cao

Orcid: 0000-0002-9107-013X

According to our database1, Ting Cao authored at least 118 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A BiFPN-SECA detection network for foreign objects on top of railway freight vehicles.
Signal Image Video Process., December, 2024

Adaptive and flexible ℓ <sub>1</sub>-norm graph embedding for unsupervised feature selection.
Appl. Intell., November, 2024

VSPIM: SRAM Processing-in-Memory DNN Acceleration via Vector-Scalar Operations.
IEEE Trans. Computers, October, 2024

Accelerating predictions of electronic transport and superconductivity.
Nat. Comput. Sci., August, 2024

HiMoDepth: Efficient Training-Free High-Resolution On-Device Depth Perception.
IEEE Trans. Mob. Comput., May, 2024

Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management.
CoRR, 2024

Making Every Frame Matter: Continuous Video Understanding for Large Models via Adaptive State Modeling.
CoRR, 2024

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs.
CoRR, 2024

LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration.
CoRR, 2024

Advancing Multi-Modal Sensing Through Expandable Modality Alignment.
CoRR, 2024

T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge.
CoRR, 2024

Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance.
CoRR, 2024

ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Poster: Design of Elastic Deep Neural Network Candidate Spaces for Inference on Diverse Devices.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024

Empowering In-Browser Deep Learning Inference on Edge Through Just-In-Time Kernel Optimization.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024

FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Cross-Domain Activity Recognition Based on Stacked Transfer Network.
Proceedings of the International Joint Conference on Neural Networks, 2024

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Enhancing Shape Perception and Segmentation Consistency for Industrial Image Inspection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

A Fine-Grained Tri-Modal Interaction Model for Multimodal Sentiment Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2024

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hybrid SLM and LLM for Edge-Cloud Collaborative Inference.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

AFPQ: Asymmetric Floating Point Quantization for LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Generated Pseudo-Labels Guided by Background Skeletons for Overcoming Under-Segmentation in Overlapping Particle Objects.
IEEE Trans. Circuits Syst. Video Technol., June, 2023

Pavement Crack Detection Based on 3D Edge Representation and Data Communication With Digital Twins.
IEEE Trans. Intell. Transp. Syst., 2023

Multi-focus image fusion based on gradient tensor HOSVD.
J. Electronic Imaging, 2023

Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations.
CoRR, 2023

Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference.
CoRR, 2023

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.
CoRR, 2023

Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing.
CoRR, 2023

LUT-NN: Towards Unified Neural Network Inference by Table Lookup.
CoRR, 2023

Boosting DNN Cold Inference on Edge Devices.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

Efficient GPU Kernels for N: M-Sparse Weights in Deep Learning.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Accurate and Structured Pruning for Efficient Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-Scale DNN Training.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

2022
A Fractional Integral and Fractal Dimension-Based Deep Learning Approach for Pavement Crack Detection in Transportation Service Management.
IEEE Trans. Netw. Serv. Manag., December, 2022

A Novel, Efficient, Green and Real-Time Load Balancing Algorithm for 5G Network Measurement Report Collecting Clusters.
J. Circuits Syst. Comput., December, 2022

A Coke Detection Method Based on Reweighting a Composite Feature for Mixed Material Recognition and Quantification.
IEEE Trans. Instrum. Meas., 2022

A frequency-aware management strategy for virtual machines in DVFS-enabled clouds.
Sustain. Comput. Informatics Syst., 2022

Enhanced Edge Detection for 3D Crack Segmentation and Depth Measurement with Laser Data.
Int. J. Pattern Recognit. Artif. Intell., 2022

Understanding and Optimizing Deep Learning Cold-Start Latency on Edge Devices.
CoRR, 2022

Performance modeling for I/O-intensive applications on virtual machines.
Concurr. Comput. Pract. Exp., 2022

Exploiting Renewable Energy and UPS Systems to Reduce Power Consumption in Data Centers.
Big Data Res., 2022

Non-intrusive load monitoring method with inception structured CNN.
Appl. Intell., 2022

Hyperion: A Generic and Distributed Mobile Offloading Framework on OpenCL.
Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, 2022

Turbo: Opportunistic Enhancement for Edge Video Analytics.
Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, 2022

Energy-efficient Management of Data Centers using a Renewable-aware Scheduler.
Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2022

Accelerating the Energy Efficient Design of Traditional Data Centers Through Modeling<sup>*</sup>.
Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2022

DDF-GAN: A Generative Adversarial Network with Dual-Discriminator for Multi-Focus Image Fusion.
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022

CoDL: efficient CPU-GPU co-execution for deep learning inference on mobile devices.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

MobiDepth: real-time depth estimation using on-device dual cameras.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

Romou: rapidly generate high-performance tensor kernels for mobile GPUs.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Locating Identities in Time: An Examination of the Formation and Impact of Temporality on Presentations of the Self through Location-Based Social Networks.
ACM Trans. Soc. Comput., 2021

Unified Holistic Memory Management Supporting Multiple Big Data Processing Frameworks over Hybrid Memories.
ACM Trans. Comput. Syst., 2021

nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices.
GetMobile Mob. Comput. Commun., 2021

Depth Image Vibration Filtering and Shadow Detection Based on Fusion and Fractional Differential.
Int. J. Pattern Recognit. Artif. Intell., 2021

The Method of Dance Movement Segmentation and Labanotation Generation Based on Rhythm.
IEEE Access, 2021

Energy-Aware Privacy Controls for Clouds.
Proceedings of the 3rd IEEE International Conference on Trust, 2021

DDoS Detection Systems for Cloud Data Storage.
Proceedings of the 3rd IEEE International Conference on Trust, 2021

Towards Energy-Efficient and Real-Time Cloud Computing.
Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2021

nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices.
Proceedings of the MobiSys '21: The 19th Annual International Conference on Mobile Systems, Applications, and Services, Virtual Event, Wisconsin, USA, 24 June, 2021

AsyMo: scalable and efficient deep-learning inference on asymmetric mobile CPUs.
Proceedings of the ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking, 2021

To Bridge Neural Network Design and Real-World Performance: A Behaviour Study for Neural Networks.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

Insights and Lessons Learned from the Design, Development and Deployment of Pervasive Location-Based Mobile Systems "in the Wild".
Proceedings of the Design, User Experience, and Usability: Design for Contemporary Technological Environments, 2021

Smartphones-Based Non-contact Children's Posture Evaluation.
Proceedings of the Wireless Sensor Networks - 15th China Conference, 2021

Signal Processing Analysis in the Context of Big Data.
Proceedings of the Cyber Security Intelligence and Analytics, 2021

A Condition Prediction Method of Blast Furnace Based on Flame Morphology Information.
Proceedings of the CAA Symposium on Fault Detection, 2021

2020
Performing the Digital Self: Understanding Location-Based Social Networking, Territory, Space, and Identity in the City.
ACM Trans. Comput. Hum. Interact., 2020

Wearable Sensor-Based Human Activity Recognition Using Hybrid Deep Learning Techniques.
Secur. Commun. Networks, 2020

A popularity-aware reconstruction technique in erasure-coded storage systems.
J. Parallel Distributed Comput., 2020

Differential Evolution Optimized a Second-Order Divided Difference Particle Filter.
J. Electr. Comput. Eng., 2020

webTDat: A Web-Based, Real-Time, 3D Visualization Framework for Mesoscopic Whole-Brain Images.
Frontiers Neuroinformatics, 2020

Future Directions of the Cyberinfrastructure for Sustained Scientific Innovation (CSSI) Program.
CoRR, 2020

Dance Emotion Recognition Based on Laban Motion Analysis Using Convolutional Neural Network and Long Short-Term Memory.
IEEE Access, 2020

Electric Vehicle Charging Warning and Path Planning Method Based on Spark.
IEEE Access, 2020

Security-Aware Energy Management in Clouds.
Proceedings of the Second IEEE International Conference on Trust, 2020

Data Security and Malware Detection in Cloud Storage Services.
Proceedings of the Second IEEE International Conference on Trust, 2020

Modeling Energy Consumption of Virtual Machines in DVFS-Enabled Cloud Data Centers.
Proceedings of the 39th IEEE International Performance Computing and Communications Conference, 2020

Profiling and optimizing deep learning inference on mobile GPUs.
Proceedings of the APSys '20: 11th ACM SIGOPS Asia-Pacific Workshop on Systems, 2020

2019
基于àtrous-NSCT变换和区域特性的图像融合方法 (Image Fusion Method Based on àtrous-NSCT Transform and Region Characteristic).
计算机科学, 2019

Crack image detection based on fractional differential and fractal dimension.
IET Comput. Vis., 2019

Symptom-based network classification identifies distinct clinical subgroups of liver diseases with common molecular pathways.
Comput. Methods Programs Biomed., 2019

Device to Device Networks With Cache-Enabled and Self-Sustained Mobile Helpers.
IEEE Access, 2019

Wireless Content Caching in Sliced Cellular Networks with Multicast Beamforming.
Proceedings of the 11th International Conference on Wireless Communications and Signal Processing, 2019

Panthera: holistic memory management for big data processing over hybrid memories.
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019

2018
Depth Image Enhancement and Detection on NSCT and Fractional Differential.
Wirel. Pers. Commun., 2018

Taking advantage of motif matrix inference for rotated image indexing and retrieval.
EURASIP J. Adv. Signal Process., 2018

2017
Conquering the City: Understanding perceptions of Mobility and Human Territoriality in Location-based Mobile Games.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2017

Automatic fracture detection based on Terrestrial Laser Scanning data: A new method and case study.
Comput. Geosci., 2017

Unfolding the interplay of self-identity and expressions of territoriality in location-based social networks.
Proceedings of the Adjunct Proceedings of the 2017 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2017 ACM International Symposium on Wearable Computers, 2017

An energy-based weight selection algorithm of monitor node in MANETs.
Proceedings of the International Conference on Computer, 2017

2016
Parallel Processing Systems for Big Data: A Survey.
Proc. IEEE, 2016

Efficient Management for Hybrid Memory in Managed Language Runtime.
Proceedings of the Network and Parallel Computing, 2016

Portable performance on asymmetric multicore processors.
Proceedings of the 2016 International Symposium on Code Generation and Optimization, 2016

2015
Optimizing Image Sharpening Algorithm on GPU.
Proceedings of the 44th International Conference on Parallel Processing, 2015

Road Detection Based on Image Boundary Prior.
Proceedings of the Image and Graphics - 8th International Conference, 2015

2014
iCHAT: Inter-cache Hardware-Assistant Data Transfer for Heterogeneous Chip Multiprocessors.
Proceedings of the 9th IEEE International Conference on Networking, 2014

2013
WADE: Writeback-aware dynamic cache management for NVM-based main memory system.
ACM Trans. Archit. Code Optim., 2013

Traffic Aware Routing in urban vehicular networks.
Proceedings of the 2013 IEEE Wireless Communications and Networking Conference (WCNC), 2013

2012
TL-plane-based multi-core energy-efficient real-time scheduling algorithm for sporadic tasks.
ACM Trans. Archit. Code Optim., 2012

What is Happening to Power, Performance, and Software?
IEEE Micro, 2012

Looking back and looking forward: power, performance, and upheaval.
Commun. ACM, 2012

The Yin and Yang of power and performance for asymmetric hardware and managed software.
Proceedings of the 39th International Symposium on Computer Architecture (ISCA 2012), 2012

2011
Looking back on the language and hardware revolutions: measured power, performance, and scaling.
Proceedings of the 16th International Conference on Architectural Support for Programming Languages and Operating Systems, 2011

2010
A Novel Bus Lane Enforcement System with Vehicular Sensor Networks.
Proceedings of the 2010 IEEE Wireless Communications and Networking Conference, 2010

DSS: Applying asynchronous techniques to architectures exploiting ILP at compile time.
Proceedings of the 28th International Conference on Computer Design, 2010


  Loading...