Nhat Ho

According to our database1, Nhat Ho authored at least 131 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Global Optimality of the EM Algorithm for Mixtures of Two-Component Linear Regressions.
IEEE Trans. Inf. Theory, September, 2024

Statistical and Computational Complexities of BFGS Quasi-Newton Method for Generalized Linear Models.
Trans. Mach. Learn. Res., 2024

A Diffusion Process Perspective on Posterior Contraction Rates for Parameters.
SIAM J. Math. Data Sci., 2024

On the Computational and Statistical Complexity of Over-parameterized Matrix Sensing.
J. Mach. Learn. Res., 2024

Leveraging Hierarchical Taxonomies in Prompt-based Continual Learning.
CoRR, 2024

Improving Generalization with Flat Hilbert Bayesian Inference.
CoRR, 2024

LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model.
CoRR, 2024

Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts.
CoRR, 2024

Backdoor Attack in Prompt-Based Continual Learning.
CoRR, 2024

A Primal-Dual Framework for Transformers and Neural Networks.
CoRR, 2024

Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts.
CoRR, 2024

Mixture of Experts Meets Prompt-Based Continual Learning.
CoRR, 2024

Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts.
CoRR, 2024

Borrowing Strength in Distributionally Robust Optimization via Hierarchical Dirichlet Processes.
CoRR, 2024

Marginal Fairness Sliced Wasserstein Barycenter.
CoRR, 2024

Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions.
CoRR, 2024

FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion.
CoRR, 2024

On Least Squares Estimation in Softmax Gating Mixture of Experts.
CoRR, 2024

CompeteSMoE - Effective Training of Sparse Mixture of Experts via Competition.
CoRR, 2024

Bayesian Nonparametrics Meets Data-Driven Robust Optimization.
CoRR, 2024

Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Feature Model.
CoRR, 2024

Sliced Wasserstein with Random-Path Projecting Directions.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

On Least Square Estimation in Softmax Gating Mixture of Experts.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Computational Complexity in Statistical Models with Local Curvature Information.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Features Model.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sliced Wasserstein Estimation with Control Variates.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Quasi-Monte Carlo for 3D Sliced Wasserstein.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical Surface Reconstruction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Beyond Vanilla Variational Autoencoders: Detecting Posterior Collapse in Conditional and Hierarchical Variational Autoencoders.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Fast Approximation of the Generalized Sliced-Wasserstein Distance.
Proceedings of the IEEE International Conference on Acoustics, 2024

Integrating Efficient Optimal Transport and Functional Maps for Unsupervised Shape Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Convergence Rates for Parameter Estimation in Gaussian-gated Mixture of Experts.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

On Parameter Estimation in Deviated Gaussian Mixture of Experts.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation.
CoRR, 2023

Privacy Preserving Bayesian Federated Learning in Heterogeneous Settings.
CoRR, 2023

Posterior Collapse in Linear Conditional and Hierarchical Variational Autoencoders.
CoRR, 2023

Diffeomorphic Deformation via Sliced Wasserstein Distance Optimization for Cortical Surface Reconstruction.
CoRR, 2023

Demystifying Softmax Gating in Gaussian Mixture of Experts.
CoRR, 2023

Control Variate Sliced Wasserstein Estimators.
CoRR, 2023

Markovian Sliced Wasserstein Distances: Beyond Independent Projections.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Demystifying Softmax Gating Function in Gaussian Mixture of Experts.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Energy-Based Sliced Wasserstein Distance.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Designing Robust Transformers using Robust Kernel Density Estimation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Minimax Optimal Rate for Parameter Estimation in Multivariate Deviated Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud Reconstruction.
Proceedings of the International Conference on Machine Learning, 2023

Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature.
Proceedings of the International Conference on Machine Learning, 2023

On Excess Mass Behavior in Gaussian Mixture Models with Orlicz-Wasserstein Distances.
Proceedings of the International Conference on Machine Learning, 2023

Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data.
Proceedings of the International Conference on Machine Learning, 2023

Hierarchical Sliced Wasserstein Distance.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Primal-Dual Framework for Transformers and Neural Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

On Cross-Layer Alignment for Model Fusion of Heterogeneous Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Probabilistic Framework for Pruning Transformers Via a Finite Admixture of Keys.
Proceedings of the IEEE International Conference on Acoustics, 2023

Global-Local Regularization Via Distributional Robustness.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Joint Self-Supervised Image-Volume Representation Learning with Intra-inter Contrastive Clustering.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
On the Efficiency of Entropic Regularized Algorithms for Optimal Transport.
J. Mach. Learn. Res., 2022

On the Complexity of Approximating Multimarginal Optimal Transport.
J. Mach. Learn. Res., 2022

Convergence Rates for Gaussian Mixtures of Experts.
J. Mach. Learn. Res., 2022

Revisiting Over-smoothing and Over-squashing using Ollivier's Ricci Curvature.
CoRR, 2022

Improving Multi-task Learning via Seeking Task-based Flat Regions.
CoRR, 2022

Robustify Transformers with Robust Kernel Density Estimation.
CoRR, 2022

Improving Generative Flow Networks with Path Regularization.
CoRR, 2022

Hierarchical Sliced Wasserstein Distance.
CoRR, 2022

Transformer with Fourier Integral Attentions.
CoRR, 2022

Efficient Forecasting of Large Scale Hierarchical Time Series via Multilevel Clustering.
CoRR, 2022

Federated Self-supervised Learning for Heterogeneous Clients.
CoRR, 2022

Beyond EM Algorithm on Over-specified Two-Component Location-Scale Gaussian Mixtures.
CoRR, 2022

An Exponentially Increasing Step-size for Parameter Estimation in Statistical Models.
CoRR, 2022

Global-Local Regularization Via Distributional Robustness.
CoRR, 2022

Improving Computational Complexity in Statistical Models with Second-Order Information.
CoRR, 2022

Stochastic Multiple Target Sampling Gradient Descent.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Transformer with an Admixture of Attention Heads.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Amortized Projection Optimization for Sliced Wasserstein Generative Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Revisiting Sliced Wasserstein on Images: From Vectorization to Convolution.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FourierFormer: Transformer Meets Generalized Fourier Integral Theorem.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Beyond black box densities: Parameter learning for the deviated components.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Mini-batch Optimal Transport via Partial Transportation.
Proceedings of the International Conference on Machine Learning, 2022

On Transportation of Mini-batches: A Hierarchical Approach.
Proceedings of the International Conference on Machine Learning, 2022

Improving Transformers with Probabilistic Attention Keys.
Proceedings of the International Conference on Machine Learning, 2022

Refined Convergence Rates for Maximum Likelihood Estimation under Finite Mixture Models.
Proceedings of the International Conference on Machine Learning, 2022

Architecture Agnostic Federated Learning for Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

Entropic Gromov-Wasserstein between Gaussian Distributions.
Proceedings of the International Conference on Machine Learning, 2022

Towards Statistical and Computational Complexities of Polyak Step Size Gradient Descent.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

On Multimarginal Partial Optimal Transport: Equivalent Forms and Computational Complexity.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

On Structured Filtering-Clustering: Global Error Bound and Optimal First-Order Algorithms.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Weak Separation in Mixture Models and Implications for Principal Stratification.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
On efficient multilevel Clustering via Wasserstein distances.
J. Mach. Learn. Res., 2021

Model Fusion of Heterogeneous Neural Networks via Cross-Layer Alignment.
CoRR, 2021

On Label Shift in Domain Adaptation via Wasserstein Distance.
CoRR, 2021

Transformer with a Mixture of Gaussian Keys.
CoRR, 2021

Entropic Gromov-Wasserstein between Gaussian Distributions.
CoRR, 2021

An Efficient Mini-batch Method via Partial Transportation.
CoRR, 2021

On Multimarginal Partial Optimal Transport: Equivalent Forms and Computational Complexity.
CoRR, 2021

Statistical Analysis from the Fourier Integral Theorem.
CoRR, 2021

Improving Bayesian Inference in Deep Neural Networks with Variational Structured Dropout.
CoRR, 2021

On Robust Optimal Transport: Computational Complexity, Low-rank Approximation, and Barycenter Computation.
CoRR, 2021

BoMb-OT: On Batch of Mini-batches Optimal Transport.
CoRR, 2021

Structured Dropout Variational Inference for Bayesian Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On Robust Optimal Transport: Computational Complexity and Barycenter Computation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

LAMDA: Label Matching Deep Domain Adaptation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Improving Relational Regularized Autoencoders with Spherical Sliced Fused Gromov Wasserstein.
Proceedings of the 9th International Conference on Learning Representations, 2021

Distributional Sliced-Wasserstein and Applications to Generative Modeling.
Proceedings of the 9th International Conference on Learning Representations, 2021

Point-set Distances for Learning Representations of 3D Point Clouds.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Flow-based Alignment Approaches for Probability Measures in Different Spaces.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

On the Minimax Optimality of the EM Algorithm for Learning Two-Component Mixed Linear Regression.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Instability, Computational Efficiency and Statistical Accuracy.
CoRR, 2020

Revisiting Fixed Support Wasserstein Barycenter: Computational Hardness and Efficient Algorithms.
CoRR, 2020

Fixed-Support Wasserstein Barycenters: Computational Hardness and Fast Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Projection Robust Wasserstein Distance and Riemannian Optimization.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On Unbalanced Optimal Transport: An Analysis of Sinkhorn Algorithm.
Proceedings of the 37th International Conference on Machine Learning, 2020

Fast Algorithms for Computational Optimal Transport and Wasserstein Barycenter.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Singularity Structures and Impacts on Parameter Estimation in Finite Mixtures of Distributions.
SIAM J. Math. Data Sci., 2019

Sampling for Bayesian Mixture Models: MCMC with Polynomial-Time Mixing.
CoRR, 2019

On Scalable Variant of Wasserstein Barycenter.
CoRR, 2019

Computationally Efficient Tree Variants of Gromov-Wasserstein.
CoRR, 2019

On the Acceleration of the Sinkhorn and Greenkhorn Algorithms for Optimal Transport.
CoRR, 2019

Posterior Distribution for the Number of Clusters in Dirichlet Process Mixture Models.
CoRR, 2019

Accelerated Primal-Dual Coordinate Descent for Computational Optimal Transport.
CoRR, 2019

Global Error Bounds and Linear Convergence for Gradient-Based Algorithms for Trend Filtering and 𝓁<sub>1</sub>-Convex Clustering.
CoRR, 2019

Challenges with EM in application to weakly identifiable mixture models.
CoRR, 2019

On Efficient Optimal Transport: An Analysis of Greedy and Accelerated Mirror Descent Algorithms.
Proceedings of the 36th International Conference on Machine Learning, 2019

Probabilistic Multilevel Clustering via Composite Transportation Distance.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Neural Rendering Model: Joint Generation and Prediction for Semi-Supervised Learning.
CoRR, 2018

Theoretical guarantees for EM under misspecified Gaussian mixture models.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Multilevel Clustering via Wasserstein Means.
Proceedings of the 34th International Conference on Machine Learning, 2017


  Loading...