Yuxiao Dong

Orcid: 0000-0002-6092-2002

According to our database1, Yuxiao Dong authored at least 165 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Does Negative Sampling Matter? a Review With Insights Into its Theory and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

VisScience: An Extensive Benchmark for Evaluating K12 Educational Multi-modal Scientific Reasoning.
CoRR, 2024

MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model.
CoRR, 2024

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA.
CoRR, 2024

CogVLM2: Visual Language Models for Image and Video Understanding.
CoRR, 2024

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models.
CoRR, 2024

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs.
CoRR, 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.
CoRR, 2024

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer.
CoRR, 2024

Multi-turn Response Selection with Commonsense-enhanced Language Models.
CoRR, 2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools.
CoRR, 2024

AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models.
CoRR, 2024

LGB: Language Model and Graph Neural Network-Driven Social Bot Detection.
CoRR, 2024

LVBench: An Extreme Long Video Understanding Benchmark.
CoRR, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search.
CoRR, 2024

GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment.
CoRR, 2024

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts.
CoRR, 2024

Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer.
CoRR, 2024

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent.
CoRR, 2024

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback.
CoRR, 2024

Extensive Self-Contrast Enables Feedback-Free Language Model Alignment.
CoRR, 2024

Understanding Emergent Abilities of Language Models from the Loss Perspective.
CoRR, 2024

AutoRE: Document-Level Relation Extraction with Large Language Models.
CoRR, 2024

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations.
CoRR, 2024

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning.
CoRR, 2024

APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding.
CoRR, 2024

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein.
CoRR, 2024

RecDCL: Dual Contrastive Learning for Recommendation.
Proceedings of the ACM on Web Conference 2024, 2024

Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

AutoWebGLM: A Large Language Model-based Web Navigating Agent.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Generative AI Day.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

AgentBench: Evaluating LLMs as Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Open-World Semi-Supervised Learning for Node Classification.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LongAlign: A Recipe for Long Context Alignment of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion.
Proceedings of the Computer Vision - ECCV 2024, 2024

CogAgent: A Visual Language Model for GUI Agents.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

AgentTuning: Enabling Generalized Agent Abilities for LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

AlignBench: Benchmarking Chinese Alignment of Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TriSampler: A Better Negative Sampling Principle for Dense Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
SketchNE: Embedding Billion-Scale Networks Accurately in One Hour.
IEEE Trans. Knowl. Data Eng., October, 2023

OAG: Linking Entities Across Large-Scale Heterogeneous Knowledge Graphs.
IEEE Trans. Knowl. Data Eng., September, 2023

GCCAD: Graph Contrastive Coding for Anomaly Detection.
IEEE Trans. Knowl. Data Eng., August, 2023

Automated Unsupervised Graph Representation Learning.
IEEE Trans. Knowl. Data Eng., March, 2023

OAG$_{\mathrm {know}}$ know : Self-Supervised Learning for Linking Knowledge Graphs.
IEEE Trans. Knowl. Data Eng., 2023

CogAgent: A Visual Language Model for GUI Agents.
CoRR, 2023

AlignBench: Benchmarking Chinese Alignment of Large Language Models.
CoRR, 2023

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation.
CoRR, 2023

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models.
CoRR, 2023

CogVLM: Visual Expert for Pretrained Language Models.
CoRR, 2023

CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X.
CoRR, 2023

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation.
CoRR, 2023

ApeGNN: Node-Wise Adaptive Aggregation in GNNs for Recommendation.
Proceedings of the ACM Web Conference 2023, 2023

GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner.
Proceedings of the ACM Web Conference 2023, 2023

CogDL: A Comprehensive Library for Graph Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

International Workshop on Learning with Knowledge Graphs: Construction, Embedding, and Reasoning.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Semi-Supervised Social Bot Detection with Initial Residual Relation Attention Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track, 2023

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

WinGNN: Dynamic Graph Neural Networks with Random Gradient Aggregation Window.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

BatchSampler: Sampling Mini-Batches for Contrastive Learning in Vision, Language, and Graphs.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Web-Scale Academic Name Disambiguation: The WhoIsWho Benchmark, Leaderboard, and Toolkit.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

The 3rd Workshop on Graph Learning Benchmarks (GLB 2023).
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GLM-130B: An Open Bilingual Pre-trained Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
On the distribution alignment of propagation in graph neural networks.
AI Open, January, 2022

Understanding WeChat User Preferences and "Wow" Diffusion.
IEEE Trans. Knowl. Data Eng., 2022

GLM-130B: An Open Bilingual Pre-trained Model.
CoRR, 2022

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers.
CoRR, 2022

ClusterSCL: Cluster-Aware Supervised Contrastive Learning on Graphs.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

SelfKG: Self-Supervised Entity Alignment in Knowledge Graphs.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

GRAND+: Scalable Graph Random Neural Networks.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

EvoKG: Jointly Modeling Event Time and Network Structure for Reasoning over Temporal Knowledge Graphs.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

IDPG: An Instance-Dependent Prompt Generation Method.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

OAG-BERT: Towards a Unified Backbone Language Model for Academic Knowledge Services.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

GraphMAE: Self-Supervised Masked Graph Autoencoders.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021
Mining Fraudsters and Fraudulent Strategies in Large-Scale Mobile Social Networks.
IEEE Trans. Knowl. Data Eng., 2021

Guest Editorial: AI for COVID-19.
IEEE Trans. Big Data, 2021

Science as a Public Good: Public Use and Funding of Science.
CoRR, 2021

MATCH: Metadata-Aware Text Classification in A Large Hierarchy.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Adaptive Diffusion in Graph Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

OGB-LSC: A Large-Scale Challenge for Machine Learning on Graphs.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

A Large-Scale Database for Graph Representation Learning.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

TDGIA: Effective Injection Attacks on Graph Neural Networks.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Are we really making much progress?: Revisiting, benchmarking and refining heterogeneous graph neural networks.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

MixGCF: An Improved Training Method for Graph Neural Network-based Recommender Systems.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

The International Workshop on Pretraining: Algorithms, Architectures, and Applications ([email protected] 2021).
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

P-INT: A Path-based Interaction Model for Few-shot Knowledge Graph Completion.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
Microsoft Academic Graph: When experts are not enough.
Quant. Sci. Stud., 2020

Mitigating Biases in CORD-19 for Analyzing COVID-19 Literature.
Frontiers Res. Metrics Anal., 2020

Graph Random Neural Network.
CoRR, 2020

Heterogeneous Graph Transformer.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Open Graph Benchmark: Datasets for Machine Learning on Graphs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Graph Random Neural Networks for Semi-Supervised Learning on Graphs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

GPT-GNN: Generative Pre-Training of Graph Neural Networks.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Heterogeneous Network Representation Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019
A Review of Microsoft Academic Services for Science of Science Studies.
Frontiers Big Data, 2019

NetSMF: Large-Scale Network Embedding as Sparse Matrix Factorization.
Proceedings of the World Wide Web Conference, 2019

Representation Learning on Networks: Theories, Algorithms, and Applications.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Neural Tensor Factorization for Temporal Interaction Learning.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

OAG: Toward Linking Large-scale Heterogeneous Entity Graphs.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Learning From Networks: Algorithms, Theory, and Applications.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

ProNE: Fast and Scalable Network Representation Learning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
Will Triadic Closure Strengthen Ties in Social Networks?
ACM Trans. Knowl. Discov. Data, 2018

Collaboration Diversity and Scientific Impact.
CoRR, 2018

Neural Tensor Factorization.
CoRR, 2018

BigNet 2018 Chairs' Welcome & Organization.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Who will Attend This Event Together? Event Attendance Prediction via Deep LSTM Networks.
Proceedings of the 2018 SIAM International Conference on Data Mining, 2018

DeepInf: Social Influence Prediction with Deep Learning.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

RESTFul: Resolution-Aware Forecasting of Behavioral Time Series Data.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
User Modeling on Demographic Attributes in Big Mobile Social Networks.
ACM Trans. Inf. Syst., 2017

UAPD: Predicting Urban Anomalies from Spatial-Temporal Data.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

metapath2vec: Scalable Representation Learning for Heterogeneous Networks.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Reliable fake review detection via modeling temporal and behavioral patterns.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016
Can Scientific Impact Be Predicted?
IEEE Trans. Big Data, 2016

Gender Differences in Communication Behaviors, Spatial Proximity Patterns, and Mobility Habits.
CoRR, 2016

Influence Activation Model: A New Perspective in Social Influence Analysis and Social Network Evolution.
CoRR, 2016

Do the Young Live in a "Smaller World" Than the Old? Age-Specific Degrees of Separation in a Large-Scale Mobile Communication Network.
CoRR, 2016

Structural Diversity and Homophily: A Study Across More than One Hundred Large-Scale Networks.
CoRR, 2016

User Modeling in Large Social Networks.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Deep learning for network analysis: Problems, approaches and challenges.
Proceedings of the 2016 IEEE Military Communications Conference, 2016

Analysis of link formation, persistence and dissolution in NetSense data.
Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2016

2015
Modeling the Interplay Between Individual Behavior and Network Distributions.
CoRR, 2015

Will This Paper Increase Your <i>h</i>-index?: Scientific Impact Prediction.
Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, 2015

Inferring Unusual Crowd Events from Mobile Phone Call Detail Records.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Will This Paper Increase Your h-index?
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

The Evolution of Social Relationships and Strategies Across the Lifespan.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

CoupledLP: Link Prediction in Coupled Networks.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Collaboration Signatures Reveal Scientific Impact.
Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2015

2014
Predicting Node Degree Centrality with the Node Prominence Profile.
CoRR, 2014

Inferring social status and rich club effects in enterprise communication networks.
CoRR, 2014

Will This Paper Increase Your h-index? Scientific Impact Prediction.
CoRR, 2014

Inferring user demographics and social strategies in mobile social networks.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

2013
A link clustering based overlapping community detection algorithm.
Data Knowl. Eng., 2013

Microscopic Evolution of Social Networks by Triad Position Profile.
CoRR, 2013

How Long Will She Call Me? Distribution, Social Theory and Duration Prediction.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

2012
Link Prediction and Recommendation across Heterogeneous Social Networks.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

2011
A parallel computing model for large-graph mining with MapReduce.
Proceedings of the Seventh International Conference on Natural Computation, 2011

Random Walk Based Resource Allocation: Predicting and Recommending Links in Cross-Operator Mobile Communication Networks.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Approximating algorithm for RNA structure prediction including pseudoknots.
Proceedings of the IEEE International Conference on Automation and Logistics, 2011

Degree and similarity based search in networks.
Proceedings of the Eighth International Conference on Fuzzy Systems and Knowledge Discovery, 2011

SAKU: A distributed system for data analysis in large-scale dataset based on cloud computing.
Proceedings of the Eighth International Conference on Fuzzy Systems and Knowledge Discovery, 2011

Predicting missing links via local feature of common neighbors.
Proceedings of the Eighth International Conference on Fuzzy Systems and Knowledge Discovery, 2011

KANGAROO: A Distributed System for SNA - Social Network Analysis in Huge-scale Networks.
Proceedings of the CLOSER 2011, 2011

Saurida: Cloud Computing based - Data Mining System in Telecommunication Industry.
Proceedings of the CLOSER 2011, 2011

Efficient Search in Networks Using Conductance.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

Link Prediction Based on Local Information.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

A Novel Genetic Algorithm for Overlapping Community Detection.
Proceedings of the Advanced Data Mining and Applications - 7th International Conference, 2011

2010
A implementation of an automatic examination paper generation system.
Math. Comput. Model., 2010


  Loading...