Tong Zhang

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Inconsistency, Instability, and Generalization Gap of Deep Neural Network Training.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes.

Han Zhong

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes.

Proceedings of the International Conference on Machine Learning, 2023

What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?

Proceedings of the International Conference on Machine Learning, 2023

Generalized Polyak Step Size for First Order Optimization with Momentum.

Xiaoyu Wang

Mikael Johansson

Proceedings of the International Conference on Machine Learning, 2023

Beyond Uniform Lipschitz Condition in Differentially Private Optimization.

Proceedings of the International Conference on Machine Learning, 2023

On the Convergence of Federated Averaging with Cyclic Client Participation.

Proceedings of the International Conference on Machine Learning, 2023

Learning in POMDPs is Sample-Efficient with Hindsight Observability.

Proceedings of the International Conference on Machine Learning, 2023

Particle-based Variational Inference with Preconditioned Functional Gradient Flow.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data.

Kashun Shum

Shizhe Diao

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DetGPT: Detect What You Need via Reasoning.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Doolittle: Benchmarks and Corpora for Academic Writing Formalization.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency.

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation.

Yujia Jin

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Catalyst Acceleration of Error Compensated Methods Leads to Better Communication Complexity.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memories.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Covariate-Shift Generalization via Random Sample Weighting.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning.

SIAM J. Math. Data Sci., June, 2022

Convex Formulation of Overparameterized Deep Neural Networks.

IEEE Trans. Inf. Theory, 2022

A stochastic extra-step quasi-Newton method for nonsmooth nonconvex optimization.

Math. Program., 2022

When is the Convergence Time of Langevin Algorithms Dimension Independent? A Composite Optimization Viewpoint.

Yoav Freund

Yi-An Ma

J. Mach. Learn. Res., 2022

Weakly Supervised Disentangled Generative Causal Representation Learning.

J. Mach. Learn. Res., 2022

ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT.

CoRR, 2022

Normalizing Flow with Variational Latent Representation.

CoRR, 2022

GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond.

CoRR, 2022

Asymptotic Statistical Analysis of f-divergence GAN.

Xinwei Shen

Kani Chen

Teodor Vanislavov Marinov

CoRR, 2022

Dimension Independent Generalization of DP-SGD for Overparameterized Smooth Convex Optimization.

Yi-An Ma

CoRR, 2022

Black-box Prompt Learning for Pre-trained Language Models.

CoRR, 2022

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Probabilistic Bilevel Coreset Selection.

Proceedings of the International Conference on Machine Learning, 2022

Sparse Invariant Risk Minimization.

Proceedings of the International Conference on Machine Learning, 2022

Model Agnostic Sample Reweighting for Out-of-Distribution Learning.

Proceedings of the International Conference on Machine Learning, 2022

Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets.

Proceedings of the International Conference on Machine Learning, 2022

A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization.

Proceedings of the International Conference on Machine Learning, 2022

A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games.

Proceedings of the International Conference on Machine Learning, 2022

Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint.

Proceedings of the International Conference on Machine Learning, 2022

Achieving Minimax Rates in Pool-Based Batch Active Learning.

Claudio Gentile

Zhilei Wang

Proceedings of the International Conference on Machine Learning, 2022

Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums.

Rui Pan

Proceedings of the Tenth International Conference on Learning Representations, 2022

HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning.

Proceedings of the Tenth International Conference on Learning Representations, 2022

MICO: A Multi-alternative Contrastive Learning Framework for Commonsense Knowledge Representation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Semi-supervised Monocular 3D Object Detection by Multi-view Consistency.

Proceedings of the Computer Vision - ECCV 2022, 2022

Bayesian Invariant Risk Minimization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Exploring Geometric Consistency for Monocular 3D Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling.

Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK, 2022

Minimax Regret Optimization for Robust Machine Learning under Distribution Shift.

Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK, 2022

Multilingual Word Sense Disambiguation with Unified Sense Representation.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Exploiting Hybrid Semantics of Relation Paths for Multi-hop Question Answering over Knowledge Graphs.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Rare and Zero-shot Word Sense Disambiguation using Z-Reweighting.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Frequency-Aware Contrastive Learning for Neural Machine Translation.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Local-Global Memory Neural Network for Medication Prediction.

IEEE Trans. Neural Networks Learn. Syst., 2021

Finite-Sample Analysis for Decentralized Batch Multiagent Reinforcement Learning With Networked Agents.

IEEE Trans. Autom. Control., 2021

Mathematical Models of Overparameterized Neural Networks.

Hanze Dong

Proc. IEEE, 2021

A Framework of Composite Functional Gradient Methods for Generative Adversarial Models.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

DeEPCA: Decentralized Exact PCA with Linear Convergence Rate.

J. Mach. Learn. Res., 2021

Why Stable Learning Works? A Theory of Covariate Shift Generalization.

CoRR, 2021

A Field Guide to Federated Optimization.

CoRR, 2021

Near Optimal Stochastic Algorithms for Finite-Sum Unbalanced Convex-Concave Minimax Optimization.

CoRR, 2021

Adder Neural Networks.

CoRR, 2021

ZEN 2.0: Continue Training and Adaption for N-gram Enhanced Text Encoders.

CoRR, 2021

Geometry-aware data augmentation for monocular 3D object detection.

CoRR, 2021

Efficient Neural Network Training via Forward and Backward Propagation Sparsification.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Error Compensated Distributed SGD Can Be Accelerated.

Xun Qian

Peter Richtárik

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-Hop Transformer for Document-Level Machine Translation.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

DiffMG: Differentiable Meta Graph Search for Heterogeneous Graph Neural Networks.

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving Event Detection by Exploiting Label Hierarchy.

Proceedings of the IEEE International Conference on Acoustics, 2021

Effective Sparsification of Neural Networks With Global Sparsity Constraint.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Involution: Inverting the Inherence of Convolution for Visual Recognition.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Modeling from Features: a Mean-field Framework for Over-parameterized Deep Neural Networks.

Proceedings of the Conference on Learning Theory, 2021

Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Proximal Gradient Method for Nonsmooth Optimization over the Stiefel Manifold.

SIAM J. Optim., 2020

End-to-End Active Object Tracking and Its Real-World Deployment via Reinforcement Learning.

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Accelerated dual-averaging primal-dual method for composite convex minimization.

Optim. Methods Softw., 2020

MAP Inference Via ℓ₂-Sphere Linear Program Reformulation.

Int. J. Comput. Vis., 2020

PMGT-VR: A decentralized proximal-gradient algorithmic framework with variance reduction.

Wei Xiong

CoRR, 2020

Multi-modal AsynDGAN: Learn From Distributed Medical Image Data without Sharing Private Information.

CoRR, 2020

VEGA: Towards an End-to-End Configurable AutoML Pipeline.

CoRR, 2020

Propagation Model Search for Graph Neural Networks.

Yuhui Ding

Quanming Yao

CoRR, 2020

Disentangled Generative Causal Representation Learning.

CoRR, 2020

CorrAttack: Black-box Adversarial Attack with Structured Search.

Zhichao Huang

Yaowei Huang

CoRR, 2020

Bidirectional Generative Modeling Using Adversarial Gradient Estimation.

Xinwei Shen

Kani Chen

CoRR, 2020

Mean-Field Analysis of Two-Layer Neural Networks: Non-Asymptotic Rates and Generalization Bounds.

CoRR, 2020

Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems.

Luo Luo

CoRR, 2020

Decentralized Accelerated Proximal Gradient Descent.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

How to Characterize The Landscape of Overparameterized Convolutional Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Generalized Neural Tangent Kernel Analysis for Two-layer Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stable Learning via Differentiated Variable Decorrelation.

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Guided Learning of Nonconvex Models through Successive Functional Gradient Optimization.

Proceedings of the 37th International Conference on Machine Learning, 2020

Black-Box Adversarial Attack with Transferable Model-based Embedding.

Zhichao Huang

Proceedings of the 8th International Conference on Learning Representations, 2020

Improving Constituency Parsing with Span Attention.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

CATCH: Context-Based Meta Reinforcement Learning for Transferrable Architecture Search.

Proceedings of the Computer Vision - ECCV 2020, 2020

Leveraging Human Prior Knowledge to Learn Sense Representations.

Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, 2020

MiLeNAS: Efficient Neural Architecture Search via Mixed-Level Reformulation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Improving Chinese Word Segmentation with Wordhood Memory Networks.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-way Attentions of Auto-analyzed Knowledge.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Stable Learning via Sample Reweighting.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Utilizing Second Order Information in Minibatch Stochastic Variance Reduced Proximal Iterations.

Jialei Wang

J. Mach. Learn. Res., 2019

Layer-Wise Learning Strategy for Nonparametric Tensor Product Smoothing Spline Regression and Graphical Models.

J. Mach. Learn. Res., 2019

Robust Frequent Directions with Application in Online Learning.

J. Mach. Learn. Res., 2019

Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python.

J. Mach. Learn. Res., 2019

Fast Generalized Matrix Regression with Applications in Machine Learning.

CoRR, 2019

Multi-objective Neural Architecture Search via Predictive Network Performance Optimization.

CoRR, 2019

Over Parameterized Two-level Neural Networks Can Learn Near Optimal Feature Representations.

Hanze Dong

CoRR, 2019

Mirror Natural Evolution Strategies.

CoRR, 2019

StacNAS: Towards stable and consistent optimization for differentiable Neural Architecture Search.

CoRR, 2019

DeepSqueeze: Parallel Stochastic Gradient Descent with Double-Pass Error-Compensated Compression.

CoRR, 2019

DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-Pass Error-Compensated Compression.

CoRR, 2019

MAP Inference via L2-Sphere Linear Program Reformulation.

CoRR, 2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.

CoRR, 2019

Graph-guided multi-task sparse learning model: a method for identifying antigenic variants of influenza A(H3N2) virus.

Bioinform., 2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.

IEEE Access, 2019

Divergence-Augmented Policy Optimization.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Hybrid Character Representation for Chinese Event Detection.

Proceedings of the International Joint Conference on Neural Networks, 2019

DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-pass Error-Compensated Compression.

Proceedings of the 36th International Conference on Machine Learning, 2019

NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks.

Proceedings of the 36th International Conference on Machine Learning, 2019

Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI.

Proceedings of the 36th International Conference on Machine Learning, 2019

DHER: Hindsight Experience Replay for Dynamic Goals.

Proceedings of the 7th International Conference on Learning Representations, 2019

Efficient Decision-Based Black-Box Adversarial Attacks on Face Recognition.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Sharp Analysis for Nonconvex SGD Escaping from Saddle Points.

Zhouchen Lin

Proceedings of the Conference on Learning Theory, 2019

Sentiment Analysis Using Autoregressive Language Modeling and Broad Learning System.

Xin-Rong Gong

Jian-Xiu Jin

Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

Reinforced Training Data Selection for Domain Adaptation.

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Neural Machine Translation with Adequacy-Oriented Learning.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Bayesian Model Averaging With Exponentiated Least Squares Loss.

IEEE Trans. Inf. Theory, 2018

Learning to Remember Translation History with a Continuous Cache.

Trans. Assoc. Comput. Linguistics, 2018

Near-optimal stochastic approximation for online principal component estimation.

Math. Program., 2018

An Ensemble Approach for Detecting Anomalous User Behaviors.

Int. J. Softw. Eng. Knowl. Eng., 2018

Hessian-Aware Zeroth-Order Optimization for Black-Box Adversarial Attack.

CoRR, 2018

Finite-Sample Analyses for Fully Decentralized Multi-Agent Reinforcement Learning.

CoRR, 2018

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space.

CoRR, 2018

Fully Implicit Online Learning.

CoRR, 2018

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game.

CoRR, 2018

A convex formulation for high-dimensional sparse sliced inverse regression.

CoRR, 2018

Diffusion Approximations for Online Principal Component Estimation and Global Convergence.

CoRR, 2018

Incorporating Pseudo-Parallel Data for Quantifiable Sequence Editing.

CoRR, 2018

Decentralization Meets Quantization.

CoRR, 2018

Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset.

Proceedings of the Companion of The Web Conference 2018, 2018

Gradient Sparsification for Communication-Efficient Distributed Optimization.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Exponentially Weighted Imitation Learning for Batched Historical Data.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Communication Compression for Decentralized Training.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Stochastic Primal-Dual Method for Empirical Risk Minimization with O(1) Per-Iteration Complexity.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path-Integrated Differential Estimator.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Stochastic Expectation Maximization with Variance Reduction.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Sketched Follow-The-Regularized-Leader for Online Factorization Machine.

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents.

Proceedings of the 35th International Conference on Machine Learning, 2018

Safe Element Screening for Submodular Function Minimization.

Proceedings of the 35th International Conference on Machine Learning, 2018

Error Compensated Quantized SGD and its Applications to Large-scale Distributed Optimization.

Proceedings of the 35th International Conference on Machine Learning, 2018

Graphical Nonconvex Optimization via an Adaptive Convex Relaxation.

Proceedings of the 35th International Conference on Machine Learning, 2018

An Algorithmic Framework of Variable Metric Over-Relaxed Hybrid Proximal Extra-Gradient Method.

Proceedings of the 35th International Conference on Machine Learning, 2018

End-to-end Active Object Tracking via Reinforcement Learning.

Proceedings of the 35th International Conference on Machine Learning, 2018

Composite Functional Gradient Learning of Generative Adversarial Models.

Proceedings of the 35th International Conference on Machine Learning, 2018

Candidates vs. Noises Estimation for Large Multi-Class Classification Problem.

Yiheng Huang

Proceedings of the 35th International Conference on Machine Learning, 2018

Modeling Localness for Self-Attention Networks.

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

QuaSE: Sequence Editing under Quantifiable Guidance.

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Multi-Head Attention with Disagreement Regularization.

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Exploiting Deep Representations for Neural Machine Translation.

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Super-Identity Convolutional Neural Network for Face Hallucination.

Proceedings of the Computer Vision - ECCV 2018, 2018

Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition.

Proceedings of the Computer Vision - ECCV 2018, 2018

Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial Odometry.

Proceedings of the Computer Vision - ECCV 2018, 2018

Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks.

Proceedings of the Computer Vision - ECCV 2018, 2018

Recurrent Fusion Network for Image Captioning.

Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Stereoscopic Image Style Transfer.

Proceedings of the Computer Vision - ECCV 2018, 2018

Video Re-localization.

Proceedings of the Computer Vision - ECCV 2018, 2018

Translating Pro-Drop Languages With Reconstruction Models.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Sparseness Analysis in the Pretraining of Deep Neural Networks.

IEEE Trans. Neural Networks Learn. Syst., 2017

Hierarchical Contextual Attention Recurrent Neural Network for Map Query Suggestion.

Zhongfei (Mark) Zhang

Wenwu Zhu

IEEE Trans. Knowl. Data Eng., 2017

A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization.

J. Mach. Learn. Res., 2017

Gradient Hard Thresholding Pursuit.

J. Mach. Learn. Res., 2017

Candidates v.s. Noises Estimation for Large Multi-Class Classification Problem.

CoRR, 2017

Improved Optimization of Finite Sums with Minibatch Stochastic Variance Reduced Proximal Iterations.

Jialei Wang

CoRR, 2017

On Quadratic Convergence of DC Proximal Newton Algorithm for Nonconvex Sparse Learning in High Dimensions.

CoRR, 2017

On Quadratic Convergence of DC Proximal Newton Algorithm in Nonconvex Sparse Learning.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Diffusion Approximations for Online Principal Component Estimation and Global Convergence.

Chris Junchi Li

Mengdi Wang

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Projection-free Distributed Online Learning in Networks.

Proceedings of the 34th International Conference on Machine Learning, 2017

Efficient Distributed Learning with Sparsity.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Deep Pyramid Convolutional Neural Networks for Text Categorization.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

Accelerated proximal stochastic dual coordinate ascent for regularized loss minimization.

[BibT_eX]

[DOI]

Math. Program., 2016

Towards More Efficient SPSD Matrix Approximation and CUR Matrix Decomposition.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2016

A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization.

[BibT_eX]

[DOI]

CoRR, 2016

Convolutional Neural Networks for Text Categorization: Shallow Word-level vs. Deep Character-level.

[BibT_eX]

[DOI]

CoRR, 2016

Supervised and Semi-Supervised Text Categorization using One-Hot LSTM for Region Embeddings.

[BibT_eX]

[DOI]

CoRR, 2016

Local Uncertainty Sampling for Large-Scale Multi-Class Logistic Regression.

[BibT_eX]

[DOI]

Ting Yang

CoRR, 2016

Learning Additive Exponential Family Graphical Models via $\ell_{2,1}$-norm Regularized M-Estimation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Exact Recovery of Hard Thresholding Pursuit.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Fast Component Pursuit for Large-Scale Inverse Covariance Estimation.

[BibT_eX]

[DOI]

Yu Zhang

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data.

[BibT_eX]

[DOI]

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity.

[BibT_eX]

[DOI]

Proceedings of the 33rd International Conference on Machine Learning, 2016

Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 33rd International Conference on Machine Learning, 2016

2015

Fundamentals of Predictive Text Mining, Second Edition

[BibT_eX]

[DOI]

Sholom M. Weiss

Nitin Indurkhya

Texts in Computer Science, Springer, ISBN: 978-1-4471-6750-1, 2015

Learning sparse low-threshold linear classifiers.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2015

Sparse Nonlinear Regression: Parameter Estimation and Asymptotic Inference.

[BibT_eX]

[DOI]

CoRR, 2015

Improved Analyses of the Randomized Power Method and Block Lanczos Method.

[BibT_eX]

[DOI]

CoRR, 2015

Towards More Efficient Nystrom Approximation and CUR Matrix Decomposition.

[BibT_eX]

[DOI]

CoRR, 2015

Semi-Supervised Learning with Multi-View Embedding: Theory and Application with Convolutional Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2015

Crowd Fraud Detection in Internet Advertising.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on World Wide Web, 2015

Local Smoothness in Variance Reduced Optimization.

[BibT_eX]

[DOI]

Daniel Vainsencher

Han Liu

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Quartz: Randomized Dual Coordinate Ascent with Arbitrary Sampling.

[BibT_eX]

[DOI]

Zheng Qu

Peter Richtárik

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Effective Use of Word Order for Text Categorization with Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Stochastic Optimization with Importance Sampling for Regularized Loss Minimization.

[BibT_eX]

[DOI]

Peilin Zhao

Proceedings of the 32nd International Conference on Machine Learning, 2015

Adaptive Stochastic Alternating Direction Method of Multipliers.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

2014

Partial Gaussian Graphical Model Estimation.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2014

A Proximal Stochastic Gradient Method with Progressive Variance Reduction.

[BibT_eX]

[DOI]

Lin Xiao

SIAM J. Optim., 2014

Learning Nonlinear Functions Using Regularized Greedy Forest.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2014

Random Design Analysis of Ridge Regression.

[BibT_eX]

[DOI]

Found. Comput. Math., 2014

Pathwise Coordinate Optimization for Sparse Learning: Algorithm and Theory.

[BibT_eX]

[DOI]

Tuo Zhao

Han Liu

CoRR, 2014

Accelerating Minibatch Stochastic Gradient Descent using Stratified Sampling.

[BibT_eX]

[DOI]

Peilin Zhao

CoRR, 2014

Stochastic Optimization with Importance Sampling.

[BibT_eX]

[DOI]

Peilin Zhao

CoRR, 2014

Adjusting Leverage Scores by Row Weighting: A Practical Approach to Coherent Matrix Completion.

[BibT_eX]

[DOI]

CoRR, 2014

Randomized Dual Coordinate Ascent with Arbitrary Sampling.

[BibT_eX]

[DOI]

Zheng Qu

Peter Richtárik

CoRR, 2014

Sparse Recovery with Very Sparse Compressed Counting.

[BibT_eX]

[DOI]

Cun-Hui Zhang

CoRR, 2014

Batch-Mode Active Learning via Error Bound Minimization.

[BibT_eX]

[DOI]

Quanquan Gu

Jiawei Han

Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Gradient boosting factorization machines.

[BibT_eX]

[DOI]

Proceedings of the Eighth ACM Conference on Recommender Systems, 2014

Efficient mini-batch training for stochastic optimization.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization.

[BibT_eX]

[DOI]

Xiaotong Yuan

Proceedings of the 31st International Conference on Machine Learning, 2014

A Convergence Rate Analysis for LogitBoost, MART and Their Variant.

[BibT_eX]

[DOI]

Peng Sun

Jie Zhou

Proceedings of the 31st International Conference on Machine Learning, 2014

Communication-Efficient Distributed Optimization using an Approximate Newton-type Method.

[BibT_eX]

[DOI]

Ohad Shamir

Nathan Srebro

Proceedings of the 31st International Conference on Machine Learning, 2014

Compressed Counting Meets Compressed Sensing.

[BibT_eX]

[DOI]

Cun-Hui Zhang

Proceedings of The 27th Conference on Learning Theory, 2014

2013

A Proximal-Gradient Homotopy Method for the Sparse Least-Squares Problem.

[BibT_eX]

[DOI]

Lin Xiao

SIAM J. Optim., 2013

Truncated power method for sparse eigenvalue problems.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2013

Stochastic dual coordinate ascent methods for regularized loss minimization.

[BibT_eX]

[DOI]

Krishnakumar Balasubramanian

J. Mach. Learn. Res., 2013

Accelerating Stochastic Alternating Direction Method of Multipliers with Adaptive Subgradient.

[BibT_eX]

[DOI]

CoRR, 2013

Aggregation of Affine Estimators.

[BibT_eX]

[DOI]

CoRR, 2013

High-dimensional Joint Sparsity Random Effects Model for Multi-task Learning.

[BibT_eX]

[DOI]

Kai Yu

Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Accelerating Stochastic Gradient Descent using Predictive Variance Reduction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes.

[BibT_eX]

[DOI]

Ohad Shamir

Proceedings of the 30th International Conference on Machine Learning, 2013

2012

A spectral algorithm for learning Hidden Markov Models.

[BibT_eX]

[DOI]

J. Comput. Syst. Sci., 2012

Analysis of a randomized approximation scheme for matrix multiplication

[BibT_eX]

[DOI]

CoRR, 2012

Proximal Stochastic Dual Coordinate Ascent

[BibT_eX]

[DOI]

CoRR, 2012

Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization

[BibT_eX]

[DOI]

CoRR, 2012

Deviation Optimal Learning using Greedy Q-aggregation

[BibT_eX]

[DOI]

Dong Dai

Philippe Rigollet

CoRR, 2012

AntigenMap 3D: an online antigenic cartography resource.

[BibT_eX]

[DOI]

Bioinform., 2012

Selective Labeling via Error Bound Minimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A Proximal-Gradient Homotopy Method for the L1-Regularized Least-Squares Problem.

[BibT_eX]

[DOI]

Lin Xiao

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Sparse Recovery With Orthogonal Matching Pursuit Under RIP.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2011

Adaptive Forward-Backward Greedy Algorithm for Learning Sparse Representations.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2011

Robust Matrix Decomposition With Sparse Corruptions.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2011

Integrative Analysis of Many Weighted Co-Expression Networks Using Tensor Computation.

[BibT_eX]

[DOI]

Xianghong Jasmine Zhou

PLoS Comput. Biol., 2011

Learning with Structured Sparsity.

[BibT_eX]

[DOI]

Junzhou Huang

Dimitris N. Metaxas

J. Mach. Learn. Res., 2011

A tail inequality for quadratic forms of subgaussian random vectors

[BibT_eX]

[DOI]

CoRR, 2011

An Analysis of Random Design Linear Regression

[BibT_eX]

[DOI]

CoRR, 2011

Dimension-free tail inequalities for sums of random matrices.

[BibT_eX]

[DOI]

CoRR, 2011

Efficient Optimal Learning for Contextual Bandits.

[BibT_eX]

[DOI]

Proceedings of the UAI 2011, 2011

Learning to Search Efficiently in High Dimensions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Greedy Model Averaging.

[BibT_eX]

[DOI]

Dong Dai

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Spectral Methods for Learning Multivariate Latent Tree Structure.

[BibT_eX]

[DOI]

Animashree Anandkumar

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

2010

Fundamentals of Predictive Text Mining.

[BibT_eX]

[DOI]

Sholom M. Weiss

Nitin Indurkhya

Texts in Computer Science 41, Springer, ISBN: 978-1-84996-226-1, 2010

Fundamental Statistical Techniques.

[BibT_eX]

[DOI]

Proceedings of the Handbook of Natural Language Processing, Second Edition, 2010

Trading Accuracy for Sparsity in Optimization Problems with Sparsity Constraints.

[BibT_eX]

[DOI]

Nathan Srebro

SIAM J. Optim., 2010

A Computational Framework for Influenza Antigenic Cartography.

[BibT_eX]

[DOI]

Zhipeng Cai

Xiu-Feng Wan

PLoS Comput. Biol., 2010

Analysis of Multi-stage Convex Relaxation for Sparse Regularization.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2010

Robust Matrix Decomposition with Outliers

[BibT_eX]

[DOI]

CoRR, 2010

Deep Coding Network.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Agnostic Active Learning Without Constraints.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Improved Local Coordinate Coding using Local Tangents.

[BibT_eX]

[DOI]

Kai Yu

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Image Classification Using Super-Vector Coding of Local Image Descriptors.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2010, 2010

2009

Classifying search queries using the Web as a source of knowledge.

[BibT_eX]

[DOI]

ACM Trans. Web, 2009

On the Consistency of Feature Selection using Greedy Least Squares Regression.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2009

Sparse Online Learning via Truncated Gradient.

[BibT_eX]

[DOI]

John Langford

Lihong Li

J. Mach. Learn. Res., 2009

Nonlinear Learning using Local Coordinate Coding.

[BibT_eX]

[DOI]

Kai Yu

Yihong Gong

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Multi-Label Prediction via Compressed Sensing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Learning nonlinear dynamic models.

[BibT_eX]

[DOI]

John Langford

Ruslan Salakhutdinov

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008

Graph-Based Semi-Supervised Learning and Spectral Kernel Design.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2008

Statistical Analysis of Bayes Optimal Subset Ranking.

[BibT_eX]

[DOI]

David Cossock

IEEE Trans. Inf. Theory, 2008

An Online Relevant Set Algorithm for Statistical Machine Translation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Multi-stage Convex Relaxation for Learning with Sparse Regularization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Adaptive Forward-Backward Greedy Algorithm for Sparse Learning with Linear Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

2007

A block bigram prediction model for statistical machine translation.

[BibT_eX]

[DOI]

ACM Trans. Speech Lang. Process., 2007

On the Effectiveness of Laplacian Normalization for Graph Semi-supervised Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2007

Robust classification of rare queries using web knowledge.

[BibT_eX]

[DOI]

Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

A General Boosting Method and its Application to Learning Ranking Functions for Web Search.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information.

[BibT_eX]

[DOI]

John Langford

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Two-view feature generation model for semi-supervised learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2007

Margin Based Active Learning.

[BibT_eX]

[DOI]

Maria-Florina Balcan

Andrei Z. Broder

Proceedings of the Learning Theory, 20th Annual Conference on Learning Theory, 2007

2006

Information-theoretic upper and lower bounds for statistical estimation.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2006

Learning on Graph with Laplacian Regularization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Linear prediction models with graph regularization for web-page categorization.

[BibT_eX]

[DOI]

Alexandrin Popescul

Byron Dom

Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Subset Ranking Using Regression.

[BibT_eX]

[DOI]

David Cossock

Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006

Effectiveness of Meeting Outcomes in Virtual vs. Face-to-Face Teams: A Comparison Study in China.

[BibT_eX]

[DOI]

Proceedings of the Connecting the Americas. 12th Americas Conference on Information Systems, 2006

A Discriminative Global Training Algorithm for Statistical MT.

[BibT_eX]

[DOI]

Proceedings of the ACL 2006, 2006

2005

Learning Bounds for Kernel Regression Using Effective Data Dimensionality.

[BibT_eX]

[DOI]

Neural Comput., 2005

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2005

TREC 2005 Genomics Track Experiments at IBM Watson.

[BibT_eX]

[DOI]

Mark Dredze

Proceedings of the Fourteenth Text REtrieval Conference, 2005

Analysis of Spectral Kernel Design based Semi-supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 18, 2005

Localized Upper and Lower Bounds for Some Estimation Problems.

[BibT_eX]

[DOI]

Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

Data Dependent Concentration Bounds for Sequential Prediction Algorithms.

[BibT_eX]

[DOI]

Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

A Localized Prediction Model for Statistical Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the ACL 2005, 2005

A High-Performance Semi-Supervised Learning Method for Text Chunking.

[BibT_eX]

[DOI]

Proceedings of the ACL 2005, 2005

2004

Statistical Analysis of Some Multi-Category Large Margin Classification Methods.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2004

Text categorization for a comprehensive time-dependent benchmark.

[BibT_eX]

[DOI]

Inf. Process. Manag., 2004

Focused named entity recognition using machine learning.

[BibT_eX]

[DOI]

Li Zhang

Yue Pan

Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

Class-size Independent Generalization Analysis of Some Discriminative Multi-Category Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 17, 2004

Support Vector Classification with Input Data Uncertainty.

[BibT_eX]

[DOI]

Jinbo Bi

Proceedings of the Advances in Neural Information Processing Systems 17, 2004

Column-generation boosting methods for mixture of kernels.

[BibT_eX]

[DOI]

Jinbo Bi

Kristin P. Bennett

Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Chinese Named Entity Recognition Based on Multilevel Linguistic Features.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing, 2004

Solving large scale linear prediction problems using stochastic gradient descent algorithms.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2004

On the Convergence of MDL Density Estimation.

[BibT_eX]

[DOI]

Proceedings of the Learning Theory, 17th Annual Conference on Learning Theory, 2004

2003

Sequential greedy approximation for certain convex optimization problems.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2003

Leave-One-Out Bounds for Kernel Methods.

[BibT_eX]

[DOI]

Neural Comput., 2003

Generalization Error Bounds for Bayesian Mixture Algorithms.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2003

Greedy Algorithms for Classification -- Consistency, Convergence Rates, and Adaptivity.

[BibT_eX]

[DOI]

Shie Mannor

J. Mach. Learn. Res., 2003

Learning Bounds for a Generalized Family of Bayesian Posterior Distributions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 16, 2003

An Infinity-sample Theory for Multi-category Large Margin Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 16, 2003

On the Convergence of Boosting Procedures.

[BibT_eX]

[DOI]

Bin Yu

Proceedings of the Machine Learning, 2003

How to get a Chinese Name (Entity): Segmentation and Combination Issues.

[BibT_eX]

[DOI]

Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2003

Named Entity Recognition through Classifier Combination.

[BibT_eX]

[DOI]

Proceedings of the Seventh Conference on Natural Language Learning, 2003

A Robust Risk Minimization based Named Entity Recognition System.

[BibT_eX]

[DOI]

Proceedings of the Seventh Conference on Natural Language Learning, 2003

Updating an NLP system to fit new domains: an empirical study on the sentence segmentation problem.

[BibT_eX]

[DOI]

Fred Damerau

Proceedings of the Seventh Conference on Natural Language Learning, 2003

2002

Two-Sided Arnoldi and Nonsymmetric Lanczos Algorithms.

[BibT_eX]

[DOI]

Jane Cullum

SIAM J. Matrix Anal. Appl., 2002

Approximation Bounds for Some Sparse Kernel Regression Algorithms.

[BibT_eX]

[DOI]

Neural Comput., 2002

On the Dual Formulation of Regularized Linear Systems with Convex Risks.

[BibT_eX]

[DOI]

Mach. Learn., 2002

Recommender Systems Using Linear Classifier.

[BibT_eX]

[DOI]

Vijay S. Iyengar

J. Mach. Learn. Res., 2002

Text Chunking based on a Generalization of Winnow.

[BibT_eX]

[DOI]

Fred Damerau

J. Mach. Learn. Res., 2002

Covering Number Bounds of Certain Regularized Linear Function Classes.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2002

On the Consistency of Instantaneous Rigid Motion Estimation.

[BibT_eX]

[DOI]

Carlo Tomasi

Int. J. Comput. Vis., 2002

A decision-tree-based symbolic rule induction system for text categorization.

[BibT_eX]

[DOI]

IBM Syst. J., 2002

Experiments in high-dimensional text categorization.

[BibT_eX]

[DOI]

Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Effective Dimension and Generalization of Kernel Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 15, 2002

Data-Dependent Bounds for Bayesian Mixture Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 15, 2002

Statistical Behavior and Consistency of Support Vector Machines, Boosting, and Beyond.

[BibT_eX]

Proceedings of the Machine Learning, 2002

The Consistency of Greedy Algorithms for Classification.

[BibT_eX]

[DOI]

Shie Mannor

Proceedings of the Computational Learning Theory, 2002

2001

Rank-One Approximation to High Order Tensors.

[BibT_eX]

[DOI]

Gene H. Golub

SIAM J. Matrix Anal. Appl., 2001

Text Categorization Based on Regularized Linear Classification Methods.

[BibT_eX]

[DOI]

Frank J. Oles

Inf. Retr., 2001

An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods.

[BibT_eX]

[DOI]

AI Mag., 2001

Empirical Study of Recommender Systems Using Linear Classifiers.

[BibT_eX]

[DOI]

Vijay S. Iyengar

Proceedings of the Knowledge Discovery and Data Mining, 2001

A General Greedy Approximation Algorithm with Applications.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic], 2001

Generalization Performance of Some Learning Problems in Hilbert Functional Spaces.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic], 2001

Some Sparse Approximation Bounds for Regression Problems.

[BibT_eX]

Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

A Leave-One-out Cross Validation Bound for Kernel Methods with Applications in Learning.

[BibT_eX]

[DOI]

Proceedings of the Computational Learning Theory, 2001

A Sequential Approximation Bound for Some Sample-Dependent Convex Optimization Problems with Applications in Learning.

[BibT_eX]

[DOI]

Proceedings of the Computational Learning Theory, 2001

Text Chunking using Regularized Winnow.

[BibT_eX]

[DOI]

Fred Damerau

Proceedings of the Association for Computational Linguistics, 2001

2000

Regularized Winnow Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Convergence of Large Margin Separable Linear Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Active learning using adaptive resampling.

[BibT_eX]

[DOI]

Vijay S. Iyengar

Chidanand Apté

Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2000

1999

Some Theoretical Results Concerning the Convergence of Compositions of Regularized Linear Functions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999

Fast, Robust, and Consistent Camera Motion Estimation.

[BibT_eX]

[DOI]

Carlo Tomasi

Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

Theoretical Analysis of a Class of Randomized Regularization Methods.

[BibT_eX]

[DOI]

Proceedings of the Twelfth Annual Conference on Computational Learning Theory, 1999

1998

Methods for computational and statistical estimation with applications.

[BibT_eX]

[DOI]

PhD thesis, 1998

On the Homotopy Method for Perturbed Symmetric Generalized Eigenvalue Problems.

[BibT_eX]

[DOI]

Kincho H. Law

Gene H. Golub

SIAM J. Sci. Comput., 1998

A Linear Algorithm for Optimal Context Clustering with Application to Bi-level Image Coding.

[BibT_eX]

[DOI]

Daniel H. Greene

F. Frances Yao

Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Compression by Model Combination.

[BibT_eX]

[DOI]

Proceedings of the Data Compression Conference, 1998

1997

A progressive Ziv-Lempel algorithm for image compression.

[BibT_eX]

[DOI]

Proceedings of the Compression and Complexity of SEQUENCES 1997, 1997

1996

Optimal Surface Smoothing as Filter Design.

[BibT_eX]

[DOI]

Gabriel Taubin

Gene H. Golub

Proceedings of the Computer Vision, 1996

1995

Densities of Self-Similar Measures on the Line.

[BibT_eX]

[DOI]

Robert S. Strichartz

Arthur Taylor