2025
A Controllable Examination for Long-Context Language Models.
CoRR, June, 2025
Leveraging Large Reconfigurable Intelligent Surfaces as Anchors for Near-Field Positioning.
CoRR, May, 2025
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, May, 2025
Purification of Contaminated Convolutional Neural Networks via Robust Recovery: An Approach With Theoretical Guarantee in One-Hidden-Layer Case.
IEEE J. Sel. Top. Signal Process., April, 2025
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models.
CoRR, January, 2025
Spike-driven incepformer: A hierarchical spiking transformer with inception-inspired feature learning.
Neurocomputing, 2025
Multi-class Agent Trajectory Prediction with Selective State Spaces for autonomous driving.
Eng. Appl. Artif. Intell., 2025
A Closer Look into Mixture-of-Experts in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Layerwise Recurrent Router for Mixture-of-Experts.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Post-hoc Reward Calibration: A Case Study on Length Bias.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
InsightBridge: Enhancing Empathizing with Users through Real-Time Information Synthesis and Visual Communication.
Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 2025
2024
Spatial and Surface Correspondence Field for Interaction Transfer.
ACM Trans. Graph., July, 2024
A wideband filtering power divider with improved out-of-band suppression.
IEICE Electron. Express, 2024
DINA: Deformable INteraction Analogy.
Graph. Model., 2024
PFME: A Modular Approach for Fine-grained Hallucination Detection and Editing of Large Language Models.
CoRR, 2024
DiscipLink: Unfolding Interdisciplinary Information Seeking Process via Human-AI Co-Exploration.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024
SelfGauge: An Intelligent Tool to Support Student Self-assessment in GenAI-enhanced Project-based Learning.
Proceedings of the Adjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024
A Hierarchical Heterogeneous IoT Time Series Data Index for NVM.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Unlocking Emergent Modularity in Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Removing Stray-Light for Wild-Field Fundus Image Fusion Based on Large Generative Models.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024
A Dual-Stream Network for Langerhans' Cells Segmentation in CCM Images.
Proceedings of the Ophthalmic Medical Image Analysis - 11th International Workshop, 2024
DNZ-LSM-Tree for Hybrid Storage Systems.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2024
Be There, Be Together, Be Streamed! AR Scenic Live-Streaming for an Interactive and Collective Experience.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality Adjunct, 2024
Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024
Unlocking Continual Learning Abilities in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
FRI-Net: Floorplan Reconstruction via Room-Wise Implicit Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Sharing Frissons among Online Video Viewers: Exploring the Design of Affective Communication for Aesthetic Chills.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024
Proceedings of the 35th IEEE International Conference on Application-specific Systems, 2024
2023
AlGaN/GaN HEMTs with a magnetron-sputtered AlN buffer layer.
Microelectron. J., October, 2023
Token Relation Aware Chinese Named Entity Recognition.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2023
Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?
CoRR, 2023
Enhancing Healthcare Model Trustworthiness Through Theoretically Guaranteed One-Hidden-Layer CNN Purification.
Proceedings of the Trustworthy Machine Learning for Healthcare, 2023
Single Frequency vs. Broadband: Factors to Consider in Wideband RIS Configuration.
Proceedings of the 19th International Symposium on Wireless Communication Systems, 2023
Subgraph-Oriented Heterogeneous Drug-Target Interaction Identification.
Proceedings of the International Joint Conference on Neural Networks, 2023
Solving Math Word Problems Following Logically Consistent Template.
Proceedings of the International Joint Conference on Neural Networks, 2023
NIFT: Neural Interaction Field and Template for Object Manipulation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Transformer-Patcher: One Mistake Worth One Neuron.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
A Novel Cache and Consistency Mechanism for IoT Time Series Data.
Proceedings of the IEEE International Conference on High Performance Computing & Communications, 2023
Misfocus-Reduction in RIS-Assisted Ultra-Wideband Wireless Communication.
Proceedings of the 2023 Joint European Conference on Networks and Communications & 6G Summit, 2023
ARO-Net: Learning Implicit Fields from Anchored Radial Observations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
CoArgue : Fostering Lurkers' Contribution to Collective Arguments in Community-based QA Platforms.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
2022
PlanHelper: Supporting Activity Plan Construction with Answer Posts in Community-based QA Platforms.
Proc. ACM Hum. Comput. Interact., 2022
ARO-Net: Learning Neural Fields from Anchored Radial Observations.
CoRR, 2022
An efficient and low power deep learning framework for image recognition on mobile devices.
CCF Trans. Pervasive Comput. Interact., 2022
Identification of RIS-Assisted Paths for Wireless Integrated Sensing and Communication.
Proceedings of the 2022 IEEE 33rd Annual International Symposium on Personal, 2022
Mixture of Attention Heads: Selecting Attention Heads Per Token.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Geometry-Based MPC Tracking and Modeling Algorithm for Time-Varying UAV Channels.
IEEE Trans. Wirel. Commun., 2021
Empirical Dynamic Modeling for Low-Altitude UAV Propagation Channels.
IEEE Trans. Wirel. Commun., 2021
TAERT: Triple-Attentional Explainable Recommendation with Temporal Convolutional Network.
Inf. Sci., 2021
MmWave Fronthaul-to-Backhaul Interference in 5G NR Networks.
Proceedings of the 32nd IEEE Annual International Symposium on Personal, 2021
MetaMap: Supporting Visual Metaphor Ideation through Multi-dimensional Example-based Exploration.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021
2020
Graph2Plan: learning floorplan generation from layout graphs.
ACM Trans. Graph., 2020
Adaptive Weighted Finite Mixture Model: Identifying the Feature-Influence of Real Estate.
Trans. Data Sci., 2020
Measurement-based Characterization of 73GHz Propagation Channels in Scatterer-rich Environments.
Proceedings of the 2020 IEEE Wireless Communications and Networking Conference, 2020
Transductive Multi-Object Tracking in Complex Events by Interactive Self-Training.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Entity Extraction with Knowledge from Web Scale Corpora.
Proceedings of the Databases Theory and Applications, 2020
2019
Designing efficient accelerator of depthwise separable convolutional neural network on FPGA.
J. Syst. Archit., 2019
Finding the Key Influences on the House Price by Finite Mixture Model Based on the Real Estate Data in Changchun.
Proceedings of the Database Systems for Advanced Applications, 2019