Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion.
CoRR, 2024
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection.
CoRR, 2024
Data-induced multiscale losses and efficient multirate gradient descent schemes.
CoRR, 2024
Alignment at Pre-training! Towards Native Alignment for Arabic LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
AceGPT, Localizing Large Language Models in Arabic.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Deeper or Wider: A Perspective from Optimal Generalization Error with Sobolev Loss.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
MgNO: Efficient Parameterization of Linear Operators via Multigrid.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
An interpretive constrained linear model for ResNet and MgNet.
Neural Networks, May, 2023
FV-MgNet: Fully connected V-cycle MgNet for interpretable time series forecasting.
J. Comput. Sci., May, 2023
Fast Area Optimization Approach for XNOR/OR-based Fixed Polarity Reed-Muller Logic Circuits based on Multi-strategy Wolf Pack Algorithm.
ACM Trans. Design Autom. Electr. Syst., 2023
Expressivity and Approximation Properties of Deep Neural Networks with ReLU^k Activation.
CoRR, 2023
Deep Neural Networks and Finite Elements of Any Order on Arbitrary Dimensions.
CoRR, 2023
On the Optimal Expressive Power of ReLU DNNs and Its Application in Approximation with Kolmogorov Superposition Theorem.
CoRR, 2023
An Enhanced V-cycle MgNet Model for Operator Learning in Numerical Partial Differential Equations.
CoRR, 2023
Linear Regression on Manifold Structured Data: the Impact of Extrinsic Geometry on Solutions.
Proceedings of the Topological, 2023
Power series expansion neural network.
J. Comput. Sci., 2022
Side-effects of Learning from Low Dimensional Data Embedded in an Euclidean Space.
CoRR, 2022
ReLU deep neural networks from the hierarchical basis perspective.
Comput. Math. Appl., 2022
A weight initialization based on the linear product structure for neural networks.
Appl. Math. Comput., 2022
Approximation Properties of Deep ReLU CNNs.
CoRR, 2021
Make ℓ₁ regularization effective in training sparse CNN.
Comput. Optim. Appl., 2020
Generalized Gaffney inequality and discrete compactness for discrete differential forms.
Numerische Mathematik, 2019
Constrained Linear Data-feature Mapping for Image Classification.
CoRR, 2019
MgNet: A Unified Framework of Multigrid and Convolutional Neural Network.
CoRR, 2019
Modified Regularized Dual Averaging Method for Training Sparse Convolutional Neural Networks.
CoRR, 2018