Se Jung Kwon
Orcid: 0000-0003-3456-9038
According to our database1,
Se Jung Kwon
authored at least 32 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices.
CoRR, 2024
To FP8 and Back Again: Quantifying the Effects of Reducing Precision on LLM Training Stability.
CoRR, 2024
No Token Left Behind: Reliable KV Cache Compression via Importance-Aware Mixed Precision Quantization.
CoRR, 2024
LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Sparsity-Aware Memory Interface Architecture using Stacked XORNet Compression for Accelerating Pruned-DNN Models.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization.
Proceedings of the International Conference on Machine Learning, 2023
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models.
Proceedings of the International Conference on Machine Learning, 2023
Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
TF-MVP: Novel Sparsity-Aware Transformer Accelerator with Mixed-Length Vector Pruning.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023
2022
nuQmm: Quantized MatMul for Efficient Inference of Large-Scale Generative Language Models.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
2021
CoRR, 2021
Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity.
CoRR, 2021
2020
Adaptive Discrete Event Simulation Systems to Embrace Changes of Requirements Using Event Control Models.
IEEE Trans. Syst. Man Cybern. Syst., 2020
BiQGEMM: matrix multiplication with lookup table for binary-coding-based quantized DNNs.
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Structured Compression by Weight Encryption for Unstructured Pruning and Quantization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
CoRR, 2019
2018
Simulation-Based Optimization on the System-of-Systems Model via Model Transformation and Genetic Algorithm: A Case Study of Network-Centric Warfare.
Complex., 2018
2013
Integrated hybrid systems modeling and simulation methodology based on HDEVS formalism.
Proceedings of the 2013 Summer Simulation Multiconference, 2013
2012
Design and implementation of event-based DEVS execution environment for faster execution of iterative simulation.
Proceedings of the 2012 Spring Simulation Multiconference, 2012