Peitian Zhang

Orcid: 0009-0007-1926-7433

According to our database1, Peitian Zhang authored at least 25 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding.
CoRR, 2024

MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery.
CoRR, 2024

Are Long-LLMs A Necessity For Long-Context Tasks?
CoRR, 2024

Extending Llama-3's Context Ten-Fold Overnight.
CoRR, 2024

From Matching to Generation: A Survey on Generative Information Retrieval.
CoRR, 2024

Extensible Embedding: A Flexible Multipler For LLM's Context Length.
CoRR, 2024

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation.
CoRR, 2024

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization.
CoRR, 2024

Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon.
CoRR, 2024

Generative Retrieval via Term Set Generation.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

C-Pack: Packed Resources For General Chinese Embeddings.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

A Multi-Task Embedder For Retrieval Augmented LLMs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LM-Cocktail: Resilient Tuning of Language Models via Model Merging.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

An Element is Worth a Thousand Words: Enhancing Legal Case Retrieval by Incorporating Legal Elements.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
LM-Cocktail: Resilient Tuning of Language Models via Model Merging.
CoRR, 2023

Retrieve Anything To Augment Large Language Models.
CoRR, 2023

C-Pack: Packaged Resources To Advance General Chinese Embedding.
CoRR, 2023

Term-Sets Can Be Strong Document Identifiers For Auto-Regressive Search Engines.
CoRR, 2023

Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Bi-Phase Enhanced IVFPQ for Time-Efficient Ad-hoc Retrieval.
CoRR, 2022

Ultron: An Ultimate Retriever on Corpus with a Model-based Indexer.
CoRR, 2022

GateFormer: Speeding Up News Feed Recommendation with Input Gated Transformers.
CoRR, 2022

2021
Learning to Select Historical News Articles for Interaction based Neural News Recommendation.
CoRR, 2021


  Loading...