Qin Jin
Orcid: 0000-0001-6486-6020
According to our database1,
Qin Jin
authored at least 227 papers
between 1998 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE Trans. Multim., 2024
CoRR, 2024
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech.
CoRR, 2024
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding.
CoRR, 2024
What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation.
CoRR, 2024
CoRR, 2024
SingOMD: Singing Oriented Multi-resolution Discrete Representation Construction from Speech Models.
CoRR, 2024
EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions?
CoRR, 2024
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024
ECR-Chain: Advancing Generative Language Models to Better Emotion-Cause Reasoners through Reasoning Chains.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Adaptive Temporal Motion Guided Graph Convolution Network for Micro-expression Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Respond in my Language: Mitigating Language Inconsistency in Response Generation based on Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Mach. Intell. Res., April, 2023
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.
CoRR, 2023
No-frills Temporal Video Grounding: Multi-Scale Neighboring Attention and Zoom-in Boundary Detection.
CoRR, 2023
CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge.
Proceedings of the ACM Web Conference 2023, 2023
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Proceedings of the Natural Language Processing and Chinese Computing, 2023
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
POV: Prompt-Oriented View-Agnostic Learning for Egocentric Hand-Object Interaction in the Multi-view World.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Phoneix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation With Phoneme Distribution Predictor.
Proceedings of the IEEE International Conference on Acoustics, 2023
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Token Mixing: Parameter-Efficient Transfer Learning from Image-Language to Video-Language.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE Trans. Multim., 2022
CoRR, 2022
CoRR, 2022
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Memobert: Pre-Training Model with Prompt-Based Learning for Multimodal Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 55th Hawaii International Conference on System Sciences, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Valence and Arousal Estimation based on Multimodal Temporal-Aware Features for Videos in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
CoRR, 2021
CoRR, 2021
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion Recognition in Conversation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning.
CoRR, 2020
YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos.
CoRR, 2020
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020
VideoIC: A Video Interactive Comments Dataset and Multimodal Multitask Learning for Comments Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Context-Aware Goodness of Pronunciation for Computer-Assisted Pronunciation Training.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Integrating Temporal and Spatial Attentions for VATEX Video Captioning Challenge 2019.
CoRR, 2019
CoRR, 2019
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Adversarial Domain Adaption for Multi-Cultural Dimensional Emotion Recognition in Dyadic Interactions.
Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
RUC at MediaEval 2019: Video Memorability Prediction Based on Visual Textual and Concept Related Features.
Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018
Multimodal Dimensional and Continuous Emotion Recognition in Dyadic Video Interactions.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018
Multi-modal Multi-cultural Dimensional Continues Emotion Recognition in Dyadic Interactions.
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018
RUC at MediaEval 2018: Visual and Textual Features Exploration for Predicting Media Memorability.
Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018
2017
Int. J. Inf. Decis. Sci., 2017
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017
2016
ACM Trans. Inf. Syst., 2016
The Study of the Entrepreneurial Leadership Style of Real Estate Industry in China: Based on the Content Analysis of Microblog.
Int. J. Knowl. Based Organ., 2016
Int. J. Inf. Syst. Supply Chain Manag., 2016
A hybrid approach based on stochastic competitive Hopfield neural network and efficient genetic algorithm for frequency assignment problem.
Appl. Soft Comput., 2016
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
RUC at MediaEval 2016 Emotional Impact of Movies Task: Fusion of Multimodal Features.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
2015
IEEE Trans. Multim., 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015
RUCMM at MediaEval 2015 Affective Impact of Movies Task: Fusion of Audio and Visual Cues.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
RUC-Tencent at ImageCLEF 2015: Concept Detection, Localization and Sentence Generation.
Proceedings of the Working Notes of CLEF 2015, 2015
Improving emotion classification on Chinese microblog texts with auxiliary cross-domain data.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Future Gener. Comput. Syst., 2014
A guided Hopfield evolutionary algorithm with local search for maximum clique problem.
Proceedings of the 2014 IEEE International Conference on Systems, Man, and Cybernetics, 2014
Does product recommendation meet its waterloo in unexplored categories?: no, price comes to help.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014
Emotion Classification of Chinese Microblog Text via Fusion of BoW and eVector Feature Representations.
Proceedings of the Natural Language Processing and Chinese Computing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2014, 2014
Proceedings of the Working Notes for CLEF 2014 Conference, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the Working Notes for CLEF 2013 Conference , 2013
2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2010
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007
Proceedings of the Multimodal Technologies for Perception of Humans, 2007
2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the Multimodal Technologies for Perception of Humans, 2006
2005
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005
2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
The SuperSID project: exploiting high-level information for high-accuracy speaker recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems@ACL 2002, 2002
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998