Yu Zhang

Orcid: 0000-0003-0540-6758

Affiliations:
  • University of Illinois at Urbana-Champaign, Department of Computer Science, IL, USA


According to our database, Yu Zhang authored at least 75 papers between 2007 and 2024.

Bibliography

2024
Understanding the social construction of juvenile delinquency: insights from semantic analysis of big-data historical newspaper collections.
J. Comput. Soc. Sci., October, 2024

Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs.
CoRR, 2024

A Unified Taxonomy-Guided Instruction Tuning Framework for Entity Set Expansion and Taxonomy Expansion.
CoRR, 2024

Bridging Text Data and Graph Data: Towards Semantics and Structure-aware Knowledge Discovery.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Ontology Enrichment for Effective Fine-grained Entity Typing.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
The MAPLE Benchmark for Graph Mining.
Dataset, February, 2023

The MAPLE Benchmark for Scientific Literature Tagging.
Dataset, February, 2023

"Why Should I Review This Paper?" Unifying Semantic, Topic, and Citation Factors for Paper-Reviewer Matching.
CoRR, 2023

Learning Multiplex Embeddings on Text-rich Networks with One Text Encoder.
CoRR, 2023

PromptClass: Weakly-Supervised Text Classification with Prompting Enhanced Noise-Robust Self-Training.
CoRR, 2023

Chain-of-Skills: A Configurable Model for Open-domain Question Answering.
CoRR, 2023

The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study.
Proceedings of the ACM Web Conference 2023, 2023


Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation.
Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

Weakly Supervised Multi-Label Classification of Full-Text Scientific Papers.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Pretrained Language Representations for Text Understanding: A Weakly-Supervised Perspective.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Heterformer: Transformer-based Deep Node Representation Learning on Heterogeneous Text-Rich Networks.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning.
Proceedings of the International Conference on Machine Learning, 2023

Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Mining Structures from Massive Texts by Exploring the Power of Pre-trained Language Models.
Proceedings of the 26th International Conference on Extending Database Technology, 2023

Chain-of-Skills: A Configurable Model for Open-Domain Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Patton: Language Model Pretraining on Text-Rich Networks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Heterogeneous Network Representation Learning: A Unified Framework With Survey and Benchmark.
IEEE Trans. Knowl. Data Eng., 2022

Heterformer: A Transformer Architecture for Node Representation Learning on Heterogeneous Text-Rich Networks.
CoRR, 2022

Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

MotifClass: Weakly Supervised Text Classification with Higher-order Metadata Information.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Toward a big data analysis system for historical newspaper collections research.
Proceedings of the PASC '22: Platform for Advanced Scientific Computing Conference, Basel, Switzerland, June 27, 2022

Generating Training Data with Language Models: Towards Zero-Shot Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Adapting Pretrained Representations for Text Mining.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Entity Set Co-Expansion in StackOverflow.
Proceedings of the IEEE International Conference on Big Data, 2022

REACTCLASS: Cross-Modal Supervision for Subword-Guided Reactant Entity Classification.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

2021
MATCH: Metadata-Aware Text Classification in A Large Hierarchy.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Hierarchical Metadata-Aware Document Categorization under Weak Supervision.
Proceedings of the WSDM '21, 2021


On the Power of Pre-Trained Text Representations: Models and Applications in Text Mining.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Partially-Typed NER Datasets Integration: Connecting Practice to Theory.
CoRR, 2020

Heterogeneous Network Representation Learning: Survey, Benchmark, Evaluation, and Beyond.
CoRR, 2020

Multiscale online media simulation with SocialCube.
Comput. Math. Organ. Theory, 2020

Discriminative Topic Mining via Category-Name Guided Text Embedding.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Minimally Supervised Categorization of Text with Metadata.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Textual Evidence Mining via Spherical Heterogeneous Information Network Embedding.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Pattern-enhanced Named Entity Recognition with Distant Supervision.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019
Cross-type biomedical named entity recognition with deep multi-task learning.
Bioinform., 2019

Integrating Local Context and Global Cohesiveness for Open Information Extraction.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

A Historical Big Data Analysis to Understand the Social Construction of Juvenile Delinquency in the United States.
Proceedings of the 15th International Conference on eScience, 2019

Distantly Supervised Biomedical Named Entity Recognition with Dictionary Expansion.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

Diversifying seeds and audience in social influence maximization.
Proceedings of the ASONAM '19: International Conference on Advances in Social Networks Analysis and Mining, 2019

2018
Integrating Local Context and Global Cohesiveness for Open Information Extraction.
CoRR, 2018

Open Information Extraction with Global Structure Constraints.
Proceedings of the Companion of The Web Conference 2018, 2018

Weakly-supervised Relation Extraction by Pattern-enhanced Embedding Learning.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Pattern Discovery for Wide-Window Open Information Extraction in Biomedical Literature.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

PENNER: Pattern-enhanced Nested Named Entity Recognition in Biomedical Literature.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Open Information Extraction with Meta-pattern Discovery in Biomedical Literature.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

2017
Overcoming Limited Supervision in Relation Extraction: A Pattern-enhanced Distributional Representation Approach.
CoRR, 2017

Top-K Influential Nodes in Social Networks: A Game Perspective.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation.
Proceedings of the 2017 ACM Conference on Information and Knowledge Management, 2017

2009
Self-Learning Disk Scheduling.
IEEE Trans. Knowl. Data Eng., 2009

An Agent-based Simulation Study for Exploring Organizational Adaptation.
Simul., 2009

Collaborative attacks in WiMAX networks.
Secur. Commun. Networks, 2009

The Effects of Threading, Infection Time, and Multiple-Attacker Collaboration on Malware Propagation.
Proceedings of the 28th IEEE Symposium on Reliable Distributed Systems (SRDS 2009), 2009

Addressing Collaborative Attacks and Defense in Ad Hoc Wireless Networks.
Proceedings of the 29th IEEE International Conference on Distributed Computing Systems Workshops (ICDCS 2009 Workshops), 2009

2008
Multi-hop Cross-Layer Design in Wireless Sensor Networks: A Case Study.
Proceedings of the IEEE International Conference on Wireless and Mobile Computing, 2008

2007
Indexing Noncrashing Failures: A Dynamic Program Slicing-Based Approach.
Proceedings of the 23rd IEEE International Conference on Software Maintenance (ICSM 2007), 2007

