Junyang Lin

Orcid: 0000-0001-9931-383X

According to our database1, Junyang Lin authored at least 66 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation.
CoRR, 2024

Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference.
CoRR, 2024

Analyzing and Mitigating Inconsistency in Discrete Audio Tokens for Neural Codec Language Models.
CoRR, 2024

SongTrans: An unified song transcription and alignment method for lyrics and notes.
CoRR, 2024

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution.
CoRR, 2024

Qwen2.5-Coder Technical Report.
CoRR, 2024

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement.
CoRR, 2024

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents.
CoRR, 2024

Qwen2-Audio Technical Report.
CoRR, 2024

Qwen2 Technical Report.
CoRR, 2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback.
CoRR, 2024

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Synthesizing Text-to-SQL Data from Weak and Strong LLMs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning.
CoRR, 2023

Qwen Technical Report.
CoRR, 2023

TouchStone: Evaluating Vision-Language Models by Language Models.
CoRR, 2023

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities.
CoRR, 2023

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts.
CoRR, 2023

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities.
CoRR, 2023

Prompt Tuning for Unified Multimodal Pretrained Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Transferring General Multimodal Pretrained Models to Text Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models.
CoRR, 2022

Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese.
CoRR, 2022

Prompt Tuning for Generative Multimodal Pretrained Models.
CoRR, 2022

Instance-wise Prompt Tuning for Pretrained Language Models.
CoRR, 2022

Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework.
CoRR, 2022

BaGuaLu: targeting brain scale pretrained models with over 37 million cores.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework.
Proceedings of the International Conference on Machine Learning, 2022

Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably).
Proceedings of the International Conference on Machine Learning, 2022

2021
M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining.
CoRR, 2021

Exploring Sparse Expert Models and Beyond.
CoRR, 2021

M6: A Chinese Multimodal Pretrainer.
CoRR, 2021

CogView: Mastering Text-to-Image Generation via Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

M6: Multi-Modality-to-Multi-Modality Multitask Mega-transformer for Unified Pretraining.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

KNAS: Green Neural Architecture Search.
Proceedings of the 38th International Conference on Machine Learning, 2021

Connecting Language and Vision for Natural Language-Based Vehicle Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Learning Relation Alignment for Calibrated Cross-modal Retrieval.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Graph-based Multi-hop Reasoning for Long Text Generation.
CoRR, 2020

InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining.
CoRR, 2020

A Gesture Air-Writing Tracking Method that Uses 24 GHz SIMO Radar SoC.
IEEE Access, 2020

2019
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection.
CoRR, 2019

Understanding and Improving Layer Normalization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Knowledge-Based Personalized Product Description Generation in E-commerce.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Specificity-Driven Cascading Approach for Unsupervised Sentiment Modification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Towards Knowledge-Based Recommender Dialog System.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Imitation Learning for Non-Autoregressive Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
A Deep Reinforced Sequence-to-Set Model for Multi-Label Text Classification.
CoRR, 2018

Future-Prediction-Based Model for Neural Machine Translation.
CoRR, 2018

Decoding-History-Based Adaptive Control of Attention for Neural Machine Translation.
CoRR, 2018

DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text.
CoRR, 2018

A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

3D Articulated Model Retrieval Using Depth Image Input.
Proceedings of the Computer Vision, Imaging and Computer Graphics Theory and Applications, 2018

Retrieving 3D Objects with Articulated Limbs by Depth Image Input.
Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018), 2018

Diversity-Promoting GAN: A Cross-Entropy Based Generative Adversarial Network for Diversified Text Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Learning When to Concentrate or Divert Attention: Self-Adaptive Attention Temperature for Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Deconvolution-Based Global Decoding for Neural Machine Translation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Bag-of-Words as Target for Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Global Encoding for Abstractive Summarization.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018


  Loading...