2024
Exploring Human-Like Translation Strategy with Large Language Models.
Trans. Assoc. Comput. Linguistics, 2024
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs.
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding.
CoRR, 2024
ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation.
CoRR, 2024
GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding.
CoRR, 2024
Evaluating Knowledge-based Cross-lingual Inconsistency in Large Language Models.
CoRR, 2024
Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning.
CoRR, 2024
Multiple-Choice Questions are Efficient and Robust LLM Evaluators.
CoRR, 2024
Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents.
CoRR, 2024
Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality.
CoRR, 2024
Final Submission of SJTULoveFiction to Literary Task.
Proceedings of the Ninth Conference on Machine Translation, 2024
SJTU System Description for the WMT24 Low-Resource Languages of Spain Task.
Proceedings of the Ninth Conference on Machine Translation, 2024
Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Improving Open-Ended Text Generation via Adaptive Decoding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
MELA: Multilingual Evaluation of Linguistic Acceptability.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Unsupervised Sign Language Translation and Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Universal Multimodal Representation for Language Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models.
CoRR, 2023
A Survey on Language Models for Code.
CoRR, 2023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models.
CoRR, 2023
Meta-Reasoning: Semantics-Symbol Deconstruction For Large Language Models.
CoRR, 2023
Revisiting Acceptability Judgements.
CoRR, 2023
Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer.
CoRR, 2023
Imitation Attacks Can Steal More Than You Think from Machine Translation Systems.
Proceedings of the Natural Language Processing and Chinese Computing, 2023
Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Rethinking Word-Level Auto-Completion in Computer-Aided Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Rethinking Translation Memory Augmented Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
TeCS: A Dataset and Benchmark for Tense Consistency of Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
2022
Integrating Prior Translation Knowledge Into Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Tri-training for Dependency Parsing Domain Adaptation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022
SG-Net: Syntax Guided Transformer for Language Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Text Compression-Aided Transformer Encoding.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
The AISP-SJTU Translation System for WMT 2022.
Proceedings of the Seventh Conference on Machine Translation, 2022
Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022
The AISP-SJTU Simultaneous Translation System for IWSLT 2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Effective Graph Context Representation for Document-level Machine Translation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Synchronous Refinement for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
Detecting Source Contextual Barriers for Understanding Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Modeling Future Cost for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Unsupervised Neural Machine Translation for Similar and Distant Language Pairs: An Empirical Study.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021
Context-aware positional representation for self-attention networks.
Neurocomputing, 2021
Document-Level Neural Machine Translation with Associated Memory Network.
IEICE Trans. Inf. Syst., 2021
Cross-lingual Transferring of Pre-trained Contextualized Language Models.
CoRR, 2021
To Understand Representation of Layer-aware Sequence Encoders as Multi-order-graph.
CoRR, 2021
Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Advances and Challenges in Unsupervised Neural Machine Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts, 2021
2020
A Novel Sentence-Level Agreement Architecture for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Unsupervised Neural Machine Translation With Cross-Lingual Language Representation Agreement.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Memory Network for Linguistic Structure Parsing.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Towards More Diverse Input Representation for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
A Survey of Domain Adaptation for Machine Translation.
J. Inf. Process., 2020
Neural Machine Translation with Target-Attention Model.
IEICE Trans. Inf. Syst., 2020
Accurate Word Representations with Universal Visual Guidance.
CoRR, 2020
Graph-to-Sequence Neural Machine Translation.
CoRR, 2020
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond.
CoRR, 2020
Syntax-aware Data Augmentation for Neural Machine Translation.
CoRR, 2020
Explicit Reordering for Neural Machine Translation.
CoRR, 2020
Robust Unsupervised Neural Machine Translation with Adversarial Training.
CoRR, 2020
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task.
Proceedings of the Fifth Conference on Machine Translation, 2020
Data-dependent Gaussian Prior Objective for Language Generation.
Proceedings of the 8th International Conference on Learning Representations, 2020
Neural Machine Translation with Universal Visual Representation.
Proceedings of the 8th International Conference on Learning Representations, 2020
Reference Language based Unsupervised Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
High-order Semantic Role Labeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Robust Unsupervised Neural Machine Translation with Adversarial Denoising Training.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Regularized Context Gates on Transformer for Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Content Word Aware Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Explicit Sentence Compression for Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
SG-Net: Syntax-Guided Machine Reading Comprehension.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Neural Machine Translation With Sentence-Level Topic Context.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Probing Contextualized Sentence Representations with Visual Awareness.
CoRR, 2019
Dependency and Span, Cross-Style Semantic Role Labeling on PropBank and NomBank.
CoRR, 2019
Document-level Neural Machine Translation with Inter-Sentence Attention.
CoRR, 2019
An Empirical Study of Domain Adaptation for Unsupervised Neural Machine Translation.
CoRR, 2019
Dual Skew Divergence Loss for Neural Machine Translation.
CoRR, 2019
NICT's Unsupervised Neural and Statistical Machine Translation Systems for the WMT19 News Translation Task.
Proceedings of the Fourth Conference on Machine Translation, 2019
NICT's Supervised Neural Machine Translation Systems for the WMT19 News Translation Task.
Proceedings of the Fourth Conference on Machine Translation, 2019
Cross-Domain Transfer Learning for Dependency Parsing.
Proceedings of the Natural Language Processing and Chinese Computing, 2019
Syntax-aware Transformer Encoder for Neural Machine Translation.
Proceedings of the International Conference on Asian Language Processing, 2019
Recurrent Positional Embedding for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
SJTU-NICT at MRP 2019: Multi-Task Learning for End-to-End Uniform Semantic Graph Parsing.
Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning, 2019
NICT's Machine Translation Systems for CCMT-2019 Translation Task.
Proceedings of the Machine Translation - 15th China Conference, 2019
English-Myanmar Supervised and Unsupervised NMT: NICT's Machine Translation Systems at WAT-2019.
Proceedings of the 6th Workshop on Asian Translation, 2019
Sentence-Level Agreement for Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Lattice-Based Transformer Encoder for Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Unsupervised Bilingual Word Embedding Agreement for Unsupervised Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Neural Machine Translation with Reordering Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
Sentence Selection and Weighting for Neural Machine Translation Domain Adaptation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
A Neural Approach to Source Dependence Based Context Model for Statistical Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Graph-Based Bilingual Word Embedding for Statistical Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2018
NICT's Corpus Filtering Systems for the WMT18 Parallel Corpus Filtering Task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018
NICT's Neural and Statistical Machine Translation Systems for the WMT18 News Translation Task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018
Exploring Recombination for Efficient Decoding of Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
A Survey of Domain Adaptation for Neural Machine Translation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018
English-Myanmar NMT and SMT with Pre-ordering: NICT's Machine Translation Systems at WAT-2018.
Proceedings of the 32nd Pacific Asia Conference on Language, 2018
Dynamic Sentence Sampling for Efficient Training of Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
Syntax-Directed Attention for Neural Machine Translation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Retinal optic disc localization using convergence tracking of blood vessels.
Multim. Tools Appl., 2017
Context-Aware Smoothing for Neural Machine Translation.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Instance Weighting for Neural Machine Translation Domain Adaptation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Neural Machine Translation with Source Dependency Representation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Sentence Embedding for Neural Machine Translation Domain Adaptation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Converting Continuous-Space Language Models into <i>N</i>-gram Language Models with Efficient Bilingual Pruning for Statistical Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016
A Novel Bilingual Word Embedding Method for Lexical Translation Using Bilingual Sense Clique.
CoRR, 2016
Real-time video stylization using spatial-temporal gabor filtering.
Proceedings of the 15th ACM SIGGRAPH Conference on Virtual-Reality Continuum and Its Applications in Industry, 2016
A Bilingual Graph-Based Semantic Model for Statistical Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Connecting Phrase based Statistical Machine Translation Adaptation.
Proceedings of the COLING 2016, 2016
2015
Bilingual Continuous-Space Language Model Growing for Statistical Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
English to Chinese Translation: How Chinese Character Matters.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015
A Machine Learning Method to Distinguish Machine Translation from Human Translation.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015
Neural Network Language Model for Chinese Pinyin Input Method Engine.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015
A novel word reordering method for statistical machine translation.
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015
2014
Neural Network Based Bilingual Language Model Growing for Statistical Machine Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
2013
Converting Continuous-Space Language Models into N-Gram Language Models for Statistical Machine Translation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013