Weijie Zhao

Orcid: 0000-0003-0967-1436

Affiliations:
  • Rochester Institute of Technology, Rochester, NY, USA
  • LinkedIn Ads
  • Baidu Research, Bellevue, WA, USA (former)
  • University of California Merced, CA, USA (former)


According to our database1, Weijie Zhao authored at least 46 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality.
CoRR, 2024

GUITAR: Gradient Pruning toward Fast Neural Ranking.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Pb-Hash: Partitioned b-bit Hashing.
Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval, 2024

Token-wise Influential Training Data Retrieval for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Blockwise Feature Interaction in Recommendation Systems.
CoRR, 2023

Practice with Graph-based ANN Algorithms on Sparse Data: Chi-square Two-tower model, HNSW, Sign Cauchy Projections.
CoRR, 2023

Building K-Anonymous User Cohorts with Consecutive Consistent Weighted Sampling (CCWS).
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Asymmetric Hashing for Fast Ranking via Neural Network Measures.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Machine Unlearning in Gradient Boosting Decision Trees.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

2022
Constrained Approximate Similarity Search on Proximity Graph.
CoRR, 2022

Package for Fast ABC-Boost.
CoRR, 2022

pGMM Kernel Regression and Comparisons with Boosted Trees.
CoRR, 2022

Proximity Graph Maintenance for Fast Online Nearest Neighbor Search.
CoRR, 2022

Fast ABC-Boost: A Unified Framework for Selecting the Base Class in Multi-Class Classification.
CoRR, 2022

Communication-Efficient TeraByte-Scale Model Training Framework for Online Advertising.
CoRR, 2022

GCWSNet: Generalized Consistent Weighted Sampling for Scalable and Accurate Training of Neural Networks.
CoRR, 2022

Integrity Authentication in Tree Models.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Identification for Deep Neural Network: Simply Adjusting Few Weights!
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Feature Fusion Network for Personalized Online Advertising Systems.
Proceedings of the IEEE International Conference on Big Data, 2022

FeatureBox: Feature Engineering on GPUs for Massive-Scale Ads Systems.
Proceedings of the IEEE International Conference on Big Data, 2022

PaddleBox: Communication-Efficient TeraByte-Scale Model Training Framework for Online Advertising.
Proceedings of the IEEE International Conference on Big Data, 2022

On the Convergence of Decentralized Adaptive Gradient Methods.
Proceedings of the Asian Conference on Machine Learning, 2022

DeepAuth: A DNN Authentication Framework by Model-Unique and Fragile Signature Embedding.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Fast Neural Ranking on Bipartite Graph Indices.
Proc. VLDB Endow., 2021

Consistent Sampling Through Extremal Process.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Agile and Accurate CTR Prediction Model Training for Massive-Scale Online Advertising Systems.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Multi-Type Textual Reasoning for Product-Aware Answer Generation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Norm Adjusted Proximity Graph for Fast Inner Product Retrieval.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

TIRA in Baidu Image Advertising.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

LIRA: Learnable, Imperceptible and Robust Backdoor Attacks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Task and Multi-Scene Unified Ranking Model for Online Advertising.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
Thunder: a Fast Coordinate Selection Solver for Sparse Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Distributed Hierarchical GPU Parameter Server for Massive Scale Deep Learning Ads Systems.
Proceedings of the Third Conference on Machine Learning and Systems, 2020

SONG: Approximate Nearest Neighbor Search on GPU.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

2019
AIBox: CTR Prediction Model Training on a Single Node.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Advanced Database Techniques for Processing Scientific Multi-Dimensional Data.
PhD thesis, 2018

Automatic identification and classification of Palomar Transient Factory astrophysical objects in GLADE.
Int. J. Comput. Sci. Eng., 2018

Distributed Caching for Complex Querying of Raw Arrays.
CoRR, 2018

Distributed caching for processing raw arrays.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018

2017
OLA-RAW: Scalable Exploration over Raw Data.
CoRR, 2017

Bi-Level Online Aggregation on Raw Data.
Proceedings of the 29th International Conference on Scientific and Statistical Database Management, 2017

Incremental View Maintenance over Array Data.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

ArrayUDF: User-Defined Scientific Data Analysis on Arrays.
Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing, 2017

2016
Similarity Join over Array Data.
Proceedings of the 2016 International Conference on Management of Data, 2016

2015
Workload-Driven Vertical Partitioning for Effective Query Processing over Raw Data.
CoRR, 2015

Vertical partitioning for query processing over raw data.
Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 2015


  Loading...