Ping Li

ORCID: 0000-0002-7985-8477

Affiliations:
  • LinkedIn Ads, Bellevue, WA, USA
  • Baidu Research USA, Bellevue, WA, USA
  • Rutgers University, Department of Statistics, Piscataway, NJ, USA (former)
  • Cornell University, Department of Statistical Science, Ithaca, NY, USA (former)
  • Stanford University, Department of Statistics, CA, USA (PhD)


According to our database, Ping Li authored at least 415 papers between 2005 and 2024.

Bibliography

2024
Cluster-Adaptive Network A/B Testing: From Randomization to Estimation.
J. Mach. Learn. Res., 2024

Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality.
CoRR, 2024

GUITAR: Gradient Pruning toward Fast Neural Ranking.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Word Embedding with Neural Probabilistic Prior.
Proceedings of the 2024 SIAM International Conference on Data Mining, 2024

Pb-Hash: Partitioned b-bit Hashing.
Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval, 2024

Visual Transformer with Differentiable Channel Selection: An Information Bottleneck Inspired Approach.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Stochastic Controlled Averaging for Federated Learning with Communication Compression.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generative AI and Retrieval-Augmented Generation (RAG) Systems for Enterprise.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
An Energy-Based Prior for Generative Saliency.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Extreme Bandits Using Robust Statistics.
IEEE Trans. Inf. Theory, March, 2023

Sharper Analysis for Minibatch Stochastic Proximal Point Methods: Stability, Smoothness, and Deviation.
J. Mach. Learn. Res., 2023

Recovering Linear Causal Models with Latent Variables via Cholesky Factorization of Covariance Matrix.
CoRR, 2023

STANLEY: Stochastic Gradient Anisotropic Langevin Dynamics for Learning Energy-Based Models.
CoRR, 2023

Blockwise Feature Interaction in Recommendation Systems.
CoRR, 2023

Differentially Private One Permutation Hashing and Bin-wise Consistent Weighted Sampling.
CoRR, 2023

Practice with Graph-based ANN Algorithms on Sparse Data: Chi-square Two-tower model, HNSW, Sign Cauchy Projections.
CoRR, 2023

Differential Privacy with Random Projections and Sign Random Projections.
CoRR, 2023

Learning Latent Structural Relations with Message Passing Prior.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Dynamic Re-weighting for Long-tailed Semi-supervised Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Greed is good: correspondence recovery for unlabeled linear regression.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Copula for Instance-wise Feature Selection and Rank.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Fed-LAMB: Layer-wise and Dimension-wise Locally Adaptive Federated Learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Building K-Anonymous User Cohorts with Consecutive Consistent Weighted Sampling (CCWS).
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Power Norm Based Lifelong Learning for Paraphrase Generations.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Asymmetric Hashing for Fast Ranking via Neural Network Measures.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

$L_2$-Uniform Stability of Randomized Learning Algorithms: Sharper Generalization Bounds and Confidence Boosting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Overlooked Structure of Stochastic Gradients.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Smooth Flipping Probability for Differential Private Sign Random Projection Methods.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

k-Median Clustering via Metric Embedding: Towards Better Initialization with Differential Privacy.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

OPORP: One Permutation + One Random Projection.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

One-Step Estimator for Permuted Sparse Recovery.
Proceedings of the International Conference on Machine Learning, 2023

A/B Testing in Network Data with Covariate-Adaptive Randomization.
Proceedings of the International Conference on Machine Learning, 2023

Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Fast Convergence and Partial Participation.
Proceedings of the International Conference on Machine Learning, 2023

Regression with Label Permutation in Generalized Linear Model.
Proceedings of the International Conference on Machine Learning, 2023

LSDS++ : Dual Sampling for Accelerated k-means++.
Proceedings of the International Conference on Machine Learning, 2023

Exponential Generalization Bounds with Near-Optimal Rates for $L_q$-Stable Algorithms.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Projective Proximal Gradient Descent for Nonconvex Nonsmooth Optimization: Fast Convergence Without Kurdyka-Lojasiewicz (KL) Property.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Improved Convergence of Differential Private SGD with Gradient Clipping.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dataset Pruning: Reducing Training Data by Examining Generalization Influence.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Universal Bias Reduction in Estimation of Smooth Additive Function in High Dimensions.
Proceedings of the International Conference on Algorithmic Learning Theory, 2023

Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Improved Bound on Generalization Error of Compressed KNN Estimator.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Piecewise Stationary Bandits under Risk Criteria.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Denoising Enhanced Distantly Supervised Ultrafine Entity Typing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Semi-Autoregressive Graph Generative Model for Dependency Graph Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

CoopInit: Initializing Generative Adversarial Networks via Cooperative Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

A Tale of Two Latent Flows: Learning Latent Space Normalizing Flow with Short-Run Langevin Flow for Approximate Inference.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Defending Backdoor Attacks on Vision Transformer via Patch Processing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
The Benefits of Diversity: Permutation Recovery in Unlabeled Sensing From Multiple Measurement Vectors.
IEEE Trans. Inf. Theory, 2022

Stability and Risk Bounds of Iterative Hard Thresholding.
IEEE Trans. Inf. Theory, 2022

Rethinking the Structure of Stochastic Gradients: Empirical and Statistical Evidence.
CoRR, 2022

Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression.
CoRR, 2022

Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations.
CoRR, 2022

R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition.
CoRR, 2022

Constrained Approximate Similarity Search on Proximity Graph.
CoRR, 2022

Prompting through Prototype: A Prototype-based Prompt Learning on Pretrained Vision-Language Models.
CoRR, 2022

CoopHash: Cooperative Learning of Multipurpose Descriptor and Contrastive Pair Generator via Variational MCMC Teaching for Supervised Image Hashing.
CoRR, 2022

Boost CTR Prediction for New Advertisements via Modeling Visual Content.
CoRR, 2022

Catoni-style Confidence Sequences under Infinite Variance.
CoRR, 2022

Package for Fast ABC-Boost.
CoRR, 2022

pGMM Kernel Regression and Comparisons with Boosted Trees.
CoRR, 2022

Noisy $\ell^0$-Sparse Subspace Clustering on Dimensionality Reduced Data.
CoRR, 2022

Proximity Graph Maintenance for Fast Online Nearest Neighbor Search.
CoRR, 2022

Planning through Workspace Constraint Satisfaction and Optimization.
CoRR, 2022

Planning and Formulations in Pursuit-Evasion: Keep-away Games and Their Strategies.
CoRR, 2022

Finding Control Synthesis for Kinematic Shortest Paths.
CoRR, 2022

Towards Constructing Finer then Homotopy Path Classes.
CoRR, 2022

Boosting the Confidence of Generalization for $L_2$-Stable Randomized Learning Algorithms.
CoRR, 2022

Fast ABC-Boost: A Unified Framework for Selecting the Base Class in Multi-Class Classification.
CoRR, 2022

Breaking the Linear Error Barrier in Differentially Private Graph Distance Release.
CoRR, 2022

A Class of Two-Timescale Stochastic EM Algorithms for Nonconvex Latent Variable Models.
CoRR, 2022

Label-Smoothed Backdoor Attack.
CoRR, 2022

On the Power-Law Spectrum in Deep Learning: A Bridge to Protein Science.
CoRR, 2022

Communication-Efficient TeraByte-Scale Model Training Framework for Online Advertising.
CoRR, 2022

GCWSNet: Generalized Consistent Weighted Sampling for Scalable and Accurate Training of Neural Networks.
CoRR, 2022

$S^2$-MLP: Spatial-Shift MLP Architecture for Vision.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Calibrating CNNs for Few-Shot Meta Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Noisy L0-sparse subspace clustering on dimensionality reduced data.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Offline change detection under contamination.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

End-to-end Distantly Supervised Information Extraction with Retrieval Augmentation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Cross-Probe BERT for Fast Cross-Modal Search.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Explainable Concept Graph Completion by Bridging Open-Domain Relations and Concepts.
Proceedings of the 2022 SIAM International Conference on Data Mining, 2022

Sensitivity-aware Distance Measurement for Boosting Metric Learning.
Proceedings of the 2022 SIAM International Conference on Data Mining, 2022

On Convergence of FedProx: Local Dissimilarity Invariant Bounds, Non-smoothness and Beyond.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SignRFF: Sign Random Fourier Features.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Private Graph All-Pairwise-Shortest-Path Distance Release with Improved Error Rate.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Marksman Backdoor: Backdoor Attacks with Arbitrary Target Class.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PromptGen: Automatically Generate Prompts using Generative Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Multi-Hop Open-Domain Question Answering over Structured and Unstructured Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

EGM: Enhanced Graph-based Model for Large-scale Video Advertisement Search.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Variational Flow Graphical Model.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Integrity Authentication in Tree Models.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Metric Nearness with Minimum Distortion: Optimal and Approximation.
Proceedings of the IEEE Information Theory Workshop, 2022

Regret Analysis for RL using Renewal Bandit Feedback.
Proceedings of the IEEE Information Theory Workshop, 2022

Distances Release with Differential Privacy in Tree and Grid Graph.
Proceedings of the IEEE International Symposium on Information Theory, 2022

Sensitivity of Under-Determined Linear System.
Proceedings of the IEEE International Symposium on Information Theory, 2022

Learning Cluster Causal Diagrams: An Information-Theoretic Approach.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Towards Deeper Understanding of Variational Auto-encoders for Binary Collaborative Filtering.
Proceedings of the ICTIR '22: The 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Madrid, Spain, July 11, 2022

U-BERT for Fast and Scalable Text-Image Retrieval.
Proceedings of the ICTIR '22: The 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Madrid, Spain, July 11, 2022

C-MinHash: Improving Minwise Hashing with Circulant Permutation.
Proceedings of the International Conference on Machine Learning, 2022

Nearly Optimal Catoni's M-estimator for Infinite Variance.
Proceedings of the International Conference on Machine Learning, 2022

Minimax M-estimation under Adversarial Contamination.
Proceedings of the International Conference on Machine Learning, 2022

Constructing Orthogonal Convolutions in an Explicit Manner.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Discriminative Similarity for Data Clustering.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Tale of Two Flows: Cooperative Learning of Langevin Flow and Normalizing Flow Toward Energy-Based Model.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Optimal Transport for Long-Tailed Recognition with Learnable Cost Matrix.
Proceedings of the Tenth International Conference on Learning Representations, 2022

On Distributed Adaptive Optimization with Gradient Compression.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Causal Effect Prediction with Flow-based Inference.
Proceedings of the IEEE International Conference on Data Mining, 2022

Identification for Deep Neural Network: Simply Adjusting Few Weights!
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

SpaceE: Knowledge Graph Embedding by Relational Linear Transformation in the Entity Space.
Proceedings of the HT '22: 33rd ACM Conference on Hypertext and Social Media, 2022

One Loss for Quantization: Deep Hashing with Discrete Wasserstein Distributional Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Continual Learning for Natural Language Generations with Transformer Calibration.
Proceedings of the 26th Conference on Computational Natural Language Learning, 2022

NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language.
Proceedings of the 10th IEEE Conference on Communications and Network Security, 2022

Multi-scale Multi-modal Dictionary BERT For Effective Text-image Retrieval in Multimedia Advertising.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Texture BERT for Cross-modal Texture Image Retrieval.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Latent Coreset Sampling based Data-Free Continual Learning.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Flow-based Perturbation for Cause-effect Inference.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

GCWSNet: Generalized Consistent Weighted Sampling for Scalable and Accurate Training of Neural Networks.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

℘-MinHash Algorithm for Continuous Probability Measures: Theory and Application to Machine Learning.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

CGAR: Critic Guided Action Redistribution in Reinforcement Leaning.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

BOAT: Bilateral Local Attention Vision Transformer.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Feature Fusion Network for Personalized Online Advertising Systems.
Proceedings of the IEEE International Conference on Big Data, 2022

FeatureBox: Feature Engineering on GPUs for Massive-Scale Ads Systems.
Proceedings of the IEEE International Conference on Big Data, 2022

PaddleBox: Communication-Efficient TeraByte-Scale Model Training Framework for Online Advertising.
Proceedings of the IEEE International Conference on Big Data, 2022

Decomposing User-APP Graph into Subgraphs for Effective APP and User Embedding Learning.
Proceedings of the IEEE International Conference on Big Data, 2022

Tree-based Text-Vision BERT for Video Search in Baidu Video Advertising.
Proceedings of the IEEE International Conference on Big Data, 2022

Boost CTR Prediction for New Advertisements via Modeling Visual Content.
Proceedings of the IEEE International Conference on Big Data, 2022

Enhanced Video BERT for Fast Video Advertisement Retrieval.
Proceedings of the IEEE International Conference on Big Data, 2022

Faster Noisy Power Method.
Proceedings of the International Conference on Algorithmic Learning Theory, 29 March, 2022

Minimization by Incremental Stochastic Surrogate Optimization for Large Scale Nonconvex Problems.
Proceedings of the International Conference on Algorithmic Learning Theory, 29 March, 2022

Adaptive A/B Test on Networks with Cluster Structures.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Noisy Riemannian Gradient Descent for Eigenvalue Computation with Application to Inexact Stochastic Recursive Gradient Algorithm.
Proceedings of the Asian Conference on Machine Learning, 2022

On the Convergence of Decentralized Adaptive Gradient Methods.
Proceedings of the Asian Conference on Machine Learning, 2022

OIE@OIA: an Adaptable and Efficient Open Information Extraction Framework.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Efficient Compact Bilinear Pooling via Kronecker Product.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

DeepAuth: A DNN Authentication Framework by Model-Unique and Fragile Signature Embedding.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Closed-Form, Provable, and Robust PCA via Leverage Statistics and Innovation Search.
IEEE Trans. Signal Process., 2021

Fast Neural Ranking on Bipartite Graph Indices.
Proc. VLDB Endow., 2021

On the Riemannian Search for Eigenvector Computation.
J. Mach. Learn. Res., 2021

C-OPH: Improving the Accuracy of One Permutation Hashing (OPH) with Circulant Permutations.
CoRR, 2021

Fed-LAMB: Layerwise and Dimensionwise Locally Adaptive Optimization Algorithm.
CoRR, 2021

C-MinHash: Practically Reducing Two Permutations to Just One.
CoRR, 2021

C-MinHash: Rigorously Reducing K Permutations to Two.
CoRR, 2021

$S^2$-MLPv2: Improved Spatial-Shift MLP Architecture for Vision.
CoRR, 2021

Cross-lingual Language Model Pretraining for Retrieval.
Proceedings of the WWW '21: The Web Conference 2021, 2021

MQuadE: a Unified Model for Knowledge Fact Embedding.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Consistent Sampling Through Extremal Process.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Efficient greedy coordinate descent via variable partitioning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Agile and Accurate CTR Prediction Model Training for Massive-Scale Online Advertising Systems.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

ReadsRE: Retrieval-Augmented Distantly Supervised Relation Extraction.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Heterogeneous Attention Network for Effective and Efficient Cross-modal Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Multi-Type Textual Reasoning for Product-Aware Answer Generation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

GemNN: Gating-enhanced Multi-task Neural Networks with Feature Interaction Learning for CTR Prediction.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Towards finding the shortest-paths for 3D rigid bodies.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021, 2021

Rate-Optimal Subspace Estimation on Random Graphs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mitigating Forgetting in Online Continual Learning with Neuron Calibration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Comprehensively Tight Analysis of Gradient Descent for PCA.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Backdoor Attack with Imperceptible Input and Latent Modification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Note on Sparse Generalized Eigenvalue Problem.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Cross-lingual Cross-modal Pretraining for Multimodal Retrieval.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Norm Adjusted Proximity Graph for Fast Inner Product Retrieval.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Sparse Recovery with Shuffled Labels: Statistical Limits and Practical Estimators.
Proceedings of the IEEE International Symposium on Information Theory, 2021

FROS: Fast Regularized Optimization by Sketching.
Proceedings of the IEEE International Symposium on Information Theory, 2021

Two-Timescale Stochastic EM Algorithms.
Proceedings of the IEEE International Symposium on Information Theory, 2021

Textual Analysis and Timely Detection of Suspended Social Media Accounts.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

Optimal Estimation of High Dimensional Smooth Additive Function Based on Noisy Observations.
Proceedings of the 38th International Conference on Machine Learning, 2021

Quantization Algorithms for Random Fourier Features.
Proceedings of the 38th International Conference on Machine Learning, 2021

On Variational Inference in Biclustering Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

On Estimation in Latent Variable Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Energy-Based Generative Models via Coarse-to-Fine Expanding and Sampling.
Proceedings of the 9th International Conference on Learning Representations, 2021

Graph-based Adversarial Online Kernel Learning with Adaptive Embedding.
Proceedings of the IEEE International Conference on Data Mining, 2021

Causal Discovery with Flow-based Conditional Density Estimation.
Proceedings of the IEEE International Conference on Data Mining, 2021

TIRA in Baidu Image Advertising.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Robust Watermarking for Deep Neural Networks via Bi-level Optimization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LIRA: Learnable, Imperceptible and Robust Backdoor Attacks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Fast and Provable Robust PCA via Normalized Coherence Pursuit.
Proceedings of the IEEE International Conference on Acoustics, 2021

Inflate and Shrink: Enriching and Reducing Interactions for Fast Text-Image Retrieval.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Deep Decomposable Model for Disentangling Syntax and Semantics in Sentence Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Patchwise Generative ConvNet: Training Energy-Based Models From a Single Natural Image for Internal Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Deep Latent Variable Models by Short-Run MCMC Inference With Optimal Transport Correction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multi-modal Dictionary BERT for Cross-modal Video Search in Baidu Advertising.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Assorted Attention Network for Cross-Lingual Language-to-Vision Retrieval.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

MixBERT for Image-Ad Relevance Scoring in Advertising.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Multiple Exemplars Learning for Fast Image Retrieval.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Adversarial Kernel Sampling on Class-imbalanced Data Streams.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Efficient Learning to Learn a Robust CTR Model for Web-scale Online Sponsored Search Advertising.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Rethinking Token-Mixing MLP for MLP-based Vision Backbone.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

MVT: Multi-view Vision Transformer for 3D Object Recognition.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Multi-Task and Multi-Scene Unified Ranking Model for Online Advertising.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Principal Subspace Estimation Under Information Diffusion.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

On the Faster Alternating Least-Squares for CCA.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

One-Sketch-for-All: Non-linear Random Features from Compressed Linear Measurements.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Identification of Matrix Joint Block Diagonalization.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Non-Local Feature Aggregation on Graphs via Latent Fixed Data Structures.
Proceedings of the 55th Asilomar Conference on Signals, Systems, and Computers, 2021

Provable Data Clustering via Innovation Search.
Proceedings of the 55th Asilomar Conference on Signals, Systems, and Computers, 2021

An Optimistic Acceleration of AMSGrad for Nonconvex Optimization.
Proceedings of the Asian Conference on Machine Learning, 2021

Fast and Compact Bilinear Pooling by Shifted Random Maclaurin.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Learning Energy-Based Model with Variational Auto-Encoder as Amortized Sampler.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Rejection Sampling for Weighted Jaccard Similarity Revisited.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

A Blind Block Term Decomposition of High Order Tensors.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
On Convergence of Distributed Approximate Newton Methods: Globalization, Sharper Bounds and Beyond.
J. Mach. Learn. Res., 2020

Two-Stage Approach to Multivariate Linear Regression with Sparsely Mismatched Data.
J. Mach. Learn. Res., 2020

Tensor Completion via Tensor Networks with a Tucker Wrapper.
CoRR, 2020

FedSKETCH: Communication-Efficient and Private Federated Learning via Sketching.
CoRR, 2020

Efficient Online Multi-Task Learning via Adaptive Kernel Selection.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Improved Touch-screen Inputting Using Sequence-level Prediction Generation.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Estimate the Implicit Likelihoods of GANs with Application to Anomaly Detection.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Extracting Knowledge from Web Text with Monte Carlo Tree Search.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Fast Item Ranking under Neural Network based Measures.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

A Practical Riemannian Algorithm for Computing Dominant Generalized Eigenspace.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Video Recommendation with Multi-gate Mixture of Experts Soft Actor Critic.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

An Advantage Actor-Critic Algorithm with Confidence Exploration for Open Information Extraction.
Proceedings of the 2020 SIAM International Conference on Data Mining, 2020

Towards Better Generalization of Adaptive Gradient Methods.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Thunder: a Fast Coordinate Selection Solver for Sparse Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Ratio Trace Formulation of Wasserstein Discriminant Analysis.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Optimal Prediction of the Number of Unseen Species with Multiplicity.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Distributed Hierarchical GPU Parameter Server for Massive Scale Deep Learning Ads Systems.
Proceedings of the Third Conference on Machine Learning and Systems, 2020

Combo-Attention Network for Baidu Video Advertising.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Optimal Estimator for Unlabeled Linear Regression.
Proceedings of the 37th International Conference on Machine Learning, 2020

The Necessity of Geometrical Representation for Deep Graph Analysis.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

SONG: Approximate Nearest Neighbor Search on GPU.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Classification Acceleration via Merging Decision Trees.
Proceedings of the FODS '20: ACM-IMS Foundations of Data Science Conference, 2020

Toward Communication Efficient Adaptive Gradient Method.
Proceedings of the FODS '20: ACM-IMS Foundations of Data Science Conference, 2020

A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Toward Faster and Simpler Matrix Normalization via Rank-1 Update.
Proceedings of the Computer Vision - ECCV 2020, 2020

Randomized Kernel Multi-View Discriminant Analysis.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

Nearly Non-Expansive Bounds for Mahalanobis Hard Thresholding.
Proceedings of the Conference on Learning Theory, 2020

Bessel Smoothing and Multi-Distribution Property Estimation.
Proceedings of the Conference on Learning Theory, 2020

A Reinforced Semi-supervised Neural Network for Helpful Review Identification.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Sample Optimization For Display Advertising.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Understanding and Detecting Convergence for Stochastic Gradient Descent with Momentum.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Adaptive Online Kernel Sampling for Vertex Classification.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Solving the Robust Matrix Completion Problem via a System of Nonlinear Equations.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

An Inverse-free Truncated Rayleigh-Ritz Method for Sparse Generalized Eigenvalue Problem.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Learning Interpretable Relationships between Entities, Relations and Concepts via Bayesian Structure Learning on Open Domain Facts.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Meta-CoTGAN: A Meta Cooperative Training Paradigm for Improving Adversarial Text Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Distributed Primal-Dual Optimization for Online Multi-Task Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

IVFS: Simple and Efficient Feature Selection for High Dimensional Topology Preservation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Outlier Detection and Data Clustering via Innovation Search.
CoRR, 2019

A Fourier Analytical Approach to Estimation of Smooth Functions in Gaussian Shift Model.
CoRR, 2019

Graph Analysis and Graph Pooling in the Spatial Domain.
CoRR, 2019

Optimistic Adaptive Acceleration for Optimization.
CoRR, 2019

Large Scale Semantic Indexing with Deep Level-wise Extreme Multi-label Learning.
Proceedings of the World Wide Web Conference, 2019

Product-Aware Helpfulness Prediction of Online Reviews.
Proceedings of the World Wide Web Conference, 2019

Knowledge Graph Embedding Based Question Answering.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

A Sparse Representation-Based Approach to Linear Regression with Partially Shuffled Labels.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Deep Skip-Gram Networks for Text Classification.
Proceedings of the 2019 SIAM International Conference on Data Mining, 2019

TMSA: A Mutual Learning Model for Topic Discovery and Word Embedding.
Proceedings of the 2019 SIAM International Conference on Data Mining, 2019

Reading Customer Reviews to Answer Product-related Questions.
Proceedings of the 2019 SIAM International Conference on Data Mining, 2019

Möbius Transformation for Fast Inner Product Search on Graph.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Practical Alternating Least-Squares for CCA.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Outlier Detection and Robust PCA Using a Convex Measure of Innovation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Re-randomized Densification for One Permutation Hashing and Bin-wise Consistent Weighted Sampling.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Generalization Error Analysis of Quantized Compressive Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Random Projections with Asymmetric Quantization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Integration of Knowledge Graph Embedding Into Topic Modeling with Hierarchical Dirichlet Process.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Hierarchical Multi-Task Word Embedding Learning for Synonym Prediction.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

MOBIUS: Towards the Next Generation of Query-Ad Matching in Baidu's Sponsored Search.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Permutation Recovery from Multiple Measurement Vectors in Unlabeled Sensing.
Proceedings of the IEEE International Symposium on Information Theory, 2019

Coreference Aware Representation Learning for Neural Named Entity Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

On Random Deep Weight-Tied Autoencoders: Exact Asymptotic Analysis, Phase Transitions, and Implications to Training.
Proceedings of the 7th International Conference on Learning Representations, 2019

On Efficient Retrieval of Top Similarity Vectors.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Reinforced Product Metadata Selection for Helpfulness Assessment of Customer Reviews.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AIBox: CTR Prediction Model Training on a Single Node.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

MONOPOLY: Learning to Price Public Facilities for Revaluing Private Properties with Large-Scale Urban Data.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Large Margin Prototypical Network for Few-shot Relation Classification with Fine-grained Features.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Graph to Graph: a Topology Aware Approach for Graph Structures Learning and Generation.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Robust Matrix Completion from Quantized Observations.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

End-to-end Deep Reinforcement Learning Based Coreference Resolution.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Sign-Full Random Projections.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Multi-Agent Discussion Mechanism for Natural Language Generation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
On the Trade-Off Between Bit Depth and Number of Samples for a Basic Approach to Structured Signal Recovery From b-Bit Quantized Linear Measurements.
IEEE Trans. Inf. Theory, 2018

Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Collaborative Filtering via Additive Ordinal Regression.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

R²SDH: Robust Rotated Supervised Discrete Hashing.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Provable Variable Selection for Streaming Features.
Proceedings of the 35th International Conference on Machine Learning, 2018

Multi-view Feature Selection for Heterogeneous Face Recognition.
Proceedings of the IEEE International Conference on Data Mining, 2018

Logician and Orator: Learning from the Duality between Language and Knowledge in Open Domain.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

FastInput: Improving Input Efficiency on Mobile Devices.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Collaborative Multi-objective Ranking.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

A Globalization-Semantic Matching Neural Network for Paraphrase Identification.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Representation Learning for Question Classification via Topic Sparse Autoencoder and Entity Embedding.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Multi-Task Neural Learning Architecture for End-to-End Identification of Helpful Reviews.
Proceedings of the IEEE/ACM 2018 International Conference on Advances in Social Networks Analysis and Mining, 2018

2017
Recovery of Sparse Signals Using Multiple Orthogonal Least Squares.
IEEE Trans. Signal Process., 2017

Object proposal with kernelized partial ranking.
Pattern Recognit., 2017

Blessing of Dimensionality: Recovering Mixture Data via Dictionary Pursuit.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Online optimization for max-norm regularization.
Mach. Learn., 2017

Gradient Hard Thresholding Pursuit.
J. Mach. Learn. Res., 2017

A Tight Bound of Hard Thresholding.
J. Mach. Learn. Res., 2017

Tunable GMM Kernels.
CoRR, 2017

Decoupled Collaborative Ranking.
Proceedings of the 26th International Conference on World Wide Web, 2017

Theory of the GMM Kernel.
Proceedings of the 26th International Conference on World Wide Web, 2017

Online Matrix Completion for Signed Link Prediction.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Partial Hard Thresholding: Towards A Principled Analysis of Support Recovery.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Simple strategies for recovering inner products from coarsely quantized random projections.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Linearized GMM Kernels and Normalized Random Fourier Features.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

On the Iteration Complexity of Support Recovery via Hard Thresholding Pursuit.
Proceedings of the 34th International Conference on Machine Learning, 2017

Online city-scale hyper-local event detection via analysis of social media and human mobility.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Binary and Multi-Bit Coding for Stable Random Projections.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Learning the second-moment matrix of a smooth function from point samples.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016
Recovery of Sparse Signals via Generalized Orthogonal Matching Pursuit: A New Analysis.
IEEE Trans. Signal Process., 2016

Low-Rank Matrix Completion in the Presence of High Coherence.
IEEE Trans. Signal Process., 2016

Linear signal recovery from b-bit-quantized linear measurements: precise analysis of the trade-off between bit depth and number of measurements.
CoRR, 2016

Methods for Sparse and Low-Rank Recovery under Simplex Constraints.
CoRR, 2016

Generalized Intersection Kernel.
CoRR, 2016

Nystrom Method for Approximating the GMM Kernel.
CoRR, 2016

Generalized Min-Max Kernel and Generalized Consistent Weighted Sampling.
CoRR, 2016

A Comparison Study of Nonlinear Kernels.
CoRR, 2016

Learning the Differential Correlation Matrix of a Smooth Function From Point Samples.
CoRR, 2016

2-Bit Random Projections, NonLinear Estimators, and Approximate Near Neighbor Search.
CoRR, 2016

Learning Additive Exponential Family Graphical Models via ℓ_{2,1}-norm Regularized M-Estimation.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Exact Recovery of Hard Thresholding Pursuit.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Quantized Random Projections and Non-Linear Estimation of Cosine Similarity.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Online Low-Rank Subspace Clustering by Basis Dictionary Pursuit.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Improved and Scalable Bradley-Terry Model for Collaborative Ranking.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Learning Structured Low-Rank Representation via Matrix Factorization.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

One Scan 1-Bit Compressed Sensing.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
Cross-Domain Person Reidentification Using Domain Adaptation Ranking SVMs.
IEEE Trans. Image Process., 2015

Sign Stable Random Projections for Large-Scale Learning.
CoRR, 2015

Min-Max Kernels.
CoRR, 2015

Asymmetric Minwise Hashing for Indexing Binary Inner Products and Set Containment.
Proceedings of the 24th International Conference on World Wide Web, 2015

Improved Asymmetric Locality Sensitive Hashing (ALSH) for Maximum Inner Product Search (MIPS).
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Regularization-Free Estimation in Trace Regression with Symmetric Positive Semidefinite Matrices.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

b-bit Marginal Regression.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

0-Bit Consistent Weighted Sampling.
Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Adaptive Stochastic Alternating Direction Method of Multipliers.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Compressed Sensing with Very Sparse Gaussian Random Projections.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Asymmetric Minwise Hashing.
CoRR, 2014

An Improved Scheme for Asymmetric LSH.
CoRR, 2014

Graph Kernels via Functional Embedding.
CoRR, 2014

Advancing Matrix Completion by Modeling Extra Structures beyond Low-Rankness.
CoRR, 2014

Sparse Recovery with Very Sparse Compressed Counting.
CoRR, 2014

Coding for Random Projections and Approximate Near Neighbor Search.
CoRR, 2014

Improved Densification of One Permutation Hashing.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

CoRE Kernels.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product Search (MIPS).
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Recovery of Coherent Data via Low-Rank Dictionary Pursuit.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization.
Proceedings of the 31st International Conference on Machine Learning, 2014

Densifying One Permutation Hashing via Rotation for Fast Near Neighbor Search.
Proceedings of the 31st International Conference on Machine Learning, 2014

Coding for Random Projections.
Proceedings of the 31st International Conference on Machine Learning, 2014

Sparse Additive Subspace Clustering.
Proceedings of the Computer Vision - ECCV 2014, 2014

Compressed Counting Meets Compressed Sensing.
Proceedings of The 27th Conference on Learning Theory, 2014

A new space for comparing graphs.
Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2014

In Defense of Minhash over Simhash.
Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014

Query Based Adaptive Re-ranking for Person Re-identification.
Proceedings of the Computer Vision - ACCV 2014, 2014

Semi-Supervised Ranking for Re-identification with Few Labeled Image Pairs.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Accelerating Stochastic Alternating Direction Method of Multipliers with Adaptive Subgradient.
CoRR, 2013

Sign Stable Projections, Sign Cauchy Projections and Chi-Square Kernels.
CoRR, 2013

Beyond Pairwise: Provably Fast Algorithms for Approximate k-Way Similarity Search.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Sign Cauchy Projections and Chi-Square Kernel.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Exact sparse recovery with L0 projections.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

b-bit minwise hashing in practice.
Proceedings of the 5th Asia-Pacific Symposium on Internetware, 2013

2012
Improving clustering by learning a bi-stochastic data similarity matrix.
Knowl. Inf. Syst., 2012

Testing for Membership to the IFRA and the NBU Classes of Distributions.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

One Permutation Hashing for Efficient Search and Learning.
CoRR, 2012

b-Bit Minwise Hashing in Practice: Large-Scale Batch and Online Learning and Using GPUs for Fast Preprocessing with Simple Hash Functions.
CoRR, 2012

Query spelling correction using multi-task learning.
Proceedings of the 21st World Wide Web Conference, 2012

GPU-based minwise hashing.
Proceedings of the 21st World Wide Web Conference, 2012

Fast Near Neighbor Search in High-Dimensional Binary Data.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Entropy Estimations Using Correlated Symmetric Stable Random Projections.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

One Permutation Hashing.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Fast multi-task learning for query spelling correction.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
A New Algorithm for Compressed Counting with Applications in Shannon Entropy Estimation in Dynamic Data.
Proceedings of the COLT 2011, 2011

Training Logistic Regression and SVM on 200GB Data Using b-Bit Minwise Hashing and Comparisons with Vowpal Wabbit (VW).
CoRR, 2011

Accurate Estimators for Improving Minwise Hashing and b-Bit Minwise Hashing.
CoRR, 2011

b-Bit Minwise Hashing for Large-Scale Linear SVM.
CoRR, 2011

Theory and applications of b-bit minwise hashing.
Commun. ACM, 2011

Efficient Document Clustering via Online Nonnegative Matrix Factorizations.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Towards a Universal Sketch for Origin-Destination Network Measurements.
Proceedings of the Network and Parallel Computing - 8th IFIP International Conference, 2011

Hashing Algorithms for Large-Scale Learning.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

User-level sentiment analysis incorporating social networks.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

A New Multi-task Learning Method for Personalized Activity Recognition.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

2010
Guest editorial: special issue on learning theory.
Mach. Learn., 2010

Fast ABC-Boost for Multi-Class Classification.
CoRR, 2010

On Practical Algorithms for Entropy Estimation and the Improved Sample Complexity of Compressed Counting.
CoRR, 2010

An Empirical Evaluation of Four Algorithms for Multi-Class Classification: Mart, ABC-Mart, Robust LogitBoost, and ABC-LogitBoost.
CoRR, 2010

b-Bit minwise hashing.
Proceedings of the 19th International Conference on World Wide Web, 2010

Approximating Higher-Order Distances Using Random Projections.
Proceedings of the UAI 2010, 2010

Robust LogitBoost and Adaptive Base Class (ABC) LogitBoost.
Proceedings of the UAI 2010, 2010

Efficient Nonnegative Matrix Factorization with Random Projections.
Proceedings of the SIAM International Conference on Data Mining, 2010

b-Bit Minwise Hashing for Estimating Three-Way Similarities.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Learning a Bi-Stochastic Data Similarity Matrix.
Proceedings of the ICDM 2010, 2010

Compressed Nonnegative Sparse Coding.
Proceedings of the ICDM 2010, 2010

2009
Stable Distribution.
Proceedings of the Encyclopedia of Database Systems, 2009

Estimating Entropy of Data Streams Using Compressed Counting.
CoRR, 2009

On the Sample Complexity of Compressed Counting.
CoRR, 2009

ABC-LogitBoost for Multi-class Classification.
CoRR, 2009

Improving Compressed Counting.
Proceedings of the UAI 2009, 2009

Compressed counting.
Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, 2009

ABC-boost: adaptive base class boost for multi-class classification.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008
Adaptive Base Class Boost for Multi-class Classification.
CoRR, 2008

A Very Efficient Scheme for Estimating Entropy of Data Streams Using Compressed Counting.
CoRR, 2008

The Optimal Quantile Estimator for Compressed Counting.
CoRR, 2008

On Approximating the Lp Distances for p>2.
CoRR, 2008

Computationally Efficient Estimators for Dimension Reductions Using Stable Random Projections.
CoRR, 2008

On Approximating Frequency Moments of Data Streams with Skewed Projections.
CoRR, 2008

Estimators and tail bounds for dimension reduction in l_α (0 < α ≤ 2) using stable random projections.
Proceedings of the Nineteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2008

One sketch for all: Theory and Application of Conditional Random Sampling.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Computationally Efficient Estimators for Dimension Reductions Using Stable Random Projections.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

2007
Nonlinear Estimators and Tail Bounds for Dimension Reduction in l_1 Using Cauchy Random Projections.
J. Mach. Learn. Res., 2007

A Sketch Algorithm for Estimating Two-Way and Multi-Way Associations.
Comput. Linguistics, 2007

A Unified Near-Optimal Estimator For Dimension Reduction in l_α (0 < α ≤ 2) Using Stable Random Projections.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

McRank: Learning to Rank Using Multiple Classification and Gradient Boosting.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Very sparse stable random projections for dimension reduction in l_α (0 < α ≤ 2) norm.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

2006
On the distribution of SINR for the MMSE MIMO receiver and performance analysis.
IEEE Trans. Inf. Theory, 2006

Very Sparse Stable Random Projections, Estimators and Tail Bounds for Stable Random Projections.
CoRR, 2006

Nonlinear Estimators and Tail Bounds for Dimension Reduction in l_1 Using Cauchy Random Projections.
CoRR, 2006

Conditional Random Sampling: A Sketch-based Sampling Technique for Sparse Data.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Very sparse random projections.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Improving Random Projections Using Marginal Information.
Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006

2005
Using Sketches to Estimate Associations.
Proceedings of the HLT/EMNLP 2005, 2005

