Tuo Zhao

Hua Wang

CoRR, 2020

Deep Reinforcement Learning with Smooth Policy.

[BibT_eX]

[DOI]

CoRR, 2020

Differentiable Top-k Operator with Optimal Transport.

[BibT_eX]

[DOI]

CoRR, 2020

Statistical Guarantees of Generative Adversarial Networks for Distribution Estimation.

[BibT_eX]

[DOI]

CoRR, 2020

Spatial Resolution Enhancement of Remote Sensing Hyperspectral Images With Localized Spatial-Spectral Dictionary Pair.

[BibT_eX]

[DOI]

IEEE Access, 2020

The Role of Mobile Social Application in Stimulating Learning Stickiness in Blended Learning.

[BibT_eX]

[DOI]

Proceedings of the 24th Pacific Asia Conference on Information Systems, 2020

Differentiable Top-k with Optimal Transport.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? - A Neural Tangent Kernel Perspective.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Transformer Hawkes Process.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Deep Reinforcement Learning with Robust and Smooth Policy.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Implicit Bias of Gradient Descent based Adversarial Training on Separable Data.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

On Computation and Generalization of Generative Adversarial Imitation Learning.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

On Generalization Bounds of a Family of Recurrent Neural Networks.

[BibT_eX]

[DOI]

Minshuo Chen

Xingguo Li

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Multi-Domain Neural Machine Translation with Word-Level Adaptive Layer-wise Domain Mixing.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Symmetry, Saddle Points, and Global Optimization Landscape of Nonconvex Matrix Factorization.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2019

Misspecified nonconvex statistical optimization for sparse phase retrieval.

[BibT_eX]

[DOI]

Math. Program., 2019

Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2019

Towards Understanding the Importance of Noise in Training Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Inductive Bias of Gradient Descent based Adversarial Training on Separable Data.

[BibT_eX]

[DOI]

CoRR, 2019

Review wearable sensing system for gait recognition.

[BibT_eX]

[DOI]

Clust. Comput., 2019

Online Factorization and Partition of Complex Networks by Random Walk.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

On Fast Convergence of Proximal Algorithms for SQRT-Lasso Optimization: Don't Worry About its Nonsmooth Loss Function.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Meta Learning with Relational Information for Short Sequences.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Understanding the Importance of Shortcut Connections in Residual Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Efficient Approximation of Deep ReLU Networks for Functions on Low Dimensional Manifolds.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Toward Understanding the Importance of Noise in Training Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

On Scalable and Efficient Computation of Large Scale Optimal Transport.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

On Computation and Generalization of Generative Adversarial Networks under Spectrum Control.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Learning to Defense by Learning to Attack.

[BibT_eX]

[DOI]

Proceedings of the Deep Generative Models for Highly Structured Data, 2019

On Constrained Nonconvex Stochastic Optimization: A Case Study for Generalized Eigenvalue Decomposition.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018

On Computation and Generalization of GANs with Spectrum Control.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to Defense by Learning to Attack.

[BibT_eX]

[DOI]

CoRR, 2018

On Tighter Generalization Bound for Deep Neural Networks: CNNs, ResNets, and Beyond.

[BibT_eX]

[DOI]

CoRR, 2018

On Landscape of Lagrangian Functions and Stochastic Search for Constrained Nonconvex Optimization.

[BibT_eX]

[DOI]

CoRR, 2018

Detecting Nonlinear Causality in Multivariate Time Series with Sparse Additive Models.

[BibT_eX]

[DOI]

CoRR, 2018

Toward Deeper Understanding of Nonconvex Stochastic Optimization with Momentum using Diffusion Approximations.

[BibT_eX]

[DOI]

CoRR, 2018

Provable Gaussian Embedding with One Observation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

The Physical Systems Behind Optimization Algorithms.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Towards Understanding Acceleration Tradeoff between Momentum and Asynchrony in Nonconvex Stochastic Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Dimensionality Reduction for Stationary Time Series via Stochastic Nonconvex Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017

An enhance excavation equipments classification algorithm based on acoustic spectrum dynamic feature.

[BibT_eX]

[DOI]

Multidimens. Syst. Signal Process., 2017

On Faster Convergence of Cyclic Block Coordinate Descent-type Methods for Strongly Convex Minimization.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2017

Excavation equipment classification based on improved MFCC features and ELM.

[BibT_eX]

[DOI]

Neurocomputing, 2017

Misspecified Nonconvex Statistical Optimization for Phase Retrieval.

[BibT_eX]

[DOI]

CoRR, 2017

Deep Hyperspherical Learning.

[BibT_eX]

[DOI]

CoRR, 2017

Dynamic Factorization and Partition of Complex Networks.

[BibT_eX]

[DOI]

CoRR, 2017

Homotopy Parametric Simplex Method for Sparse Learning.

[BibT_eX]

[DOI]

CoRR, 2017

On Quadratic Convergence of DC Proximal Newton Algorithm for Nonconvex Sparse Learning in High Dimensions.

[BibT_eX]

[DOI]

CoRR, 2017

Online Multiview Representation Learning: Dropping Convexity for Better Efficiency.

[BibT_eX]

[DOI]

CoRR, 2017

Parametric Simplex Method for Sparse Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Deep Hyperspherical Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

On Quadratic Convergence of DC Proximal Newton Algorithm in Nonconvex Sparse Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Opensesame NIST 2016 Speaker Recognition Evaluation System.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Online Partial Least Square Optimization: Dropping Convexity for Better Efficiency and Scalability.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Hyperspectral and multispectral image fusion using local spatial-spectral dictionary pair.

[BibT_eX]

[DOI]

Yifan Zhang

Mingyi He

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Ensemble Acoustic Modeling for CD-DNN-HMM Using Random Forests of Phonetic Decision Trees.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2016

Symmetry, Saddle Points, and Global Geometry of Nonconvex Matrix Factorization.

[BibT_eX]

[DOI]

CoRR, 2016

A First Order Free Lunch for SQRT-Lasso.

[BibT_eX]

[DOI]

CoRR, 2016

NESTT: A Nonconvex Primal-Dual Splitting Method for Distributed and Stochastic Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Hyperspectral and multispectral image fusion using collaborative representation with local adaptive dictionary pair.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium, 2016

Subpixel mapping of hyperspectral images based on collaborative representation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium, 2016

Stochastic Variance Reduced Optimization for Nonconvex Sparse Learning.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

An Improved Convergence Analysis of Cyclic Block Coordinate Descent-type Methods for Strongly Convex Minimization.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015

Calibrated multivariate regression with application to neural semantic basis discovery.

[BibT_eX]

[DOI]

Lie Wang

J. Mach. Learn. Res., 2015

The flare package for high dimensional linear regression and precision matrix estimation in R.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2015

A Nonconvex Optimization Framework for Low Rank Matrix Estimation.

[BibT_eX]

[DOI]

Zhaoran Wang

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Time-frequency kernel-based CNN for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Calibrated Precision Matrix Estimation for High-Dimensional Elliptical Distributions.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2014

Pathwise Coordinate Optimization for Sparse Learning: Algorithm and Theory.

[BibT_eX]

[DOI]

Tong Zhang

CoRR, 2014

Accelerated Mini-batch Randomized Block Coordinate Descent Method.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Multivariate Regression with Calibration.

[BibT_eX]

[DOI]

Lie Wang

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Building an ensemble of CD-DNN-HMM acoustic model using random forests of phonetic decision trees.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Multilevel sampling and aggregation for discriminative training.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

2013

CODA: high dimensional copula discriminant analysis.

[BibT_eX]

[DOI]

Fang Han

J. Mach. Learn. Res., 2013

Sparse Inverse Covariance Estimation with Calibration.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012

The huge Package for High-dimensional Undirected Graph Estimation in R.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2012

Sparse Additive Machine.

[BibT_eX]

[DOI]

Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Smooth-projected Neighborhood Pursuit for High-dimensional Nonparanormal Graph Estimation.

[BibT_eX]

[DOI]

Kathryn Roeder

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2010

Projected gradient method for kernel discriminant nonnegative matrix factorization and the applications.

[BibT_eX]

[DOI]

Zhizheng Liang

Youfu Li