Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
GPU-Accelerated Wfst Beam Search Decoder for CTC-Based Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
A Data-Centric Approach for Training Deep Neural Networks with Less Data.
CoRR, 2021
Gpu-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Optimizing the efficiency of deep learning through accelerator virtualization.
IBM J. Res. Dev., 2017
Massively-Parallel Lossless Data Decompression.
Proceedings of the 45th International Conference on Parallel Processing, 2016
Mercury: bringing efficiency to key-value stores.
Proceedings of the 6th Annual International Systems and Storage Conference, 2013
WOW: what the world of (data) warehousing can learn from the World of Warcraft.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013
Clydesdale: structured data processing on hadoop.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012
Clydesdale: structured data processing on MapReduce.
Proceedings of the 15th International Conference on Extending Database Technology, 2012
GPU join processing revisited.
Proceedings of the Eighth International Workshop on Data Management on New Hardware, 2012
Designing fast architecture-sensitive tree search on modern multicore/many-core processors.
ACM Trans. Database Syst., 2011
FAST: fast architecture sensitive tree search on modern CPUs and GPUs.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010
Programming Video Cards for Database Applications.
login Usenix Mag., 2009
Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs.
Proc. VLDB Endow., 2009
Virtualizing Disk Performance.
Proceedings of the 14th IEEE Real-Time and Embedded Technology and Applications Symposium, 2008
Efficient guaranteed disk request scheduling with fahrrad.
Proceedings of the 2008 EuroSys Conference, Glasgow, Scotland, UK, April 1-4, 2008, 2008
End-to-end performance management for scalable distributed storage.
Proceedings of the 2nd International Petascale Data Storage Workshop (PDSW '07), 2007
Diverse Soft Real-Time Processing in an Integrated System.
Proceedings of the 27th IEEE Real-Time Systems Symposium (RTSS 2006), 2006
Proactive Hot Spot Avoidance for Web Server Dependability.
Proceedings of the 23rd International Symposium on Reliable Distributed Systems (SRDS 2004), 2004