Ruihua Song

Orcid: 0000-0001-6036-9035

Affiliations:
  • Renmin University of China, Beijing, China
  • Microsoft Research, Bejing, China (former)


According to our database1, Ruihua Song authored at least 122 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Show Me a Video: A Large-Scale Narrated Video Dataset for Coherent Story Illustration.
IEEE Trans. Multim., 2024

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech.
CoRR, 2024

LoVA: Long-form Video-to-Audio Generation.
CoRR, 2024

Robust Audiovisual Speech Recognition Models with Mixture-of-Experts.
CoRR, 2024

Towards Effective and Efficient Continual Pre-training of Large Language Models.
CoRR, 2024

YuLan: An Open-source Large Language Model.
CoRR, 2024

Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion.
CoRR, 2024

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition.
CoRR, 2024

Understanding Human Preferences: Towards More Personalized Video to Text Generation.
Proceedings of the ACM on Web Conference 2024, 2024

What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

TiVA: Time-Aligned Video-to-Audio Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

See or Guess: Counterfactually Regularized Image Captioning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Intelligent Agents with LLM-based Process Automation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Persuading across Diverse Domains: a Dataset and Persuasion Large Language Model.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Intelligent Virtual Assistants with LLM-based Process Automation.
CoRR, 2023

What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning.
CoRR, 2023

Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions.
CoRR, 2023

ViCo: Engaging Video Comment Generation with Human Preference Rewards.
CoRR, 2023

Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots.
CoRR, 2023

RecAgent: A Novel Simulation Paradigm for Recommender Systems.
CoRR, 2023

AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation.
CoRR, 2023

TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat.
CoRR, 2023

Translating Text Synopses to Video Storyboards.
CoRR, 2023

Evaluation of Sustainable Design Method for Three-Lane Entrance Ramps on Expressways in Urban Areas: A Case Study of Xi'an, China.
IEEE Access, 2023

Expanding the Horizons: Exploring Further Steps in Open-Vocabulary Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Going Beyond Closed Sets: A Multimodal Perspective for Video Emotion Analysis.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

TeViS: Translating Text Synopses to Video Storyboards.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Joint Semantic and Strategy Matching for Persuasive Dialogue.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Leveraging Narrative to Generate Movie Script.
ACM Trans. Inf. Syst., 2022

Class-Aware Sounding Objects Localization via Audiovisual Correspondence.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment.
CoRR, 2022

Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-Modal Experience Inspired AI Creation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Text2Poster: Laying Out Stylized Texts on Retrieved Images.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Multi-Modal Knowledge Graph for Classical Chinese Poetry.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model.
CoRR, 2021

Stylistic Retrieval-based Dialogue System with Unparallel Training Data.
CoRR, 2021

Pre-Trained Models: Past, Present and Future.
CoRR, 2021

WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training.
CoRR, 2021

Pre-trained models: Past, present and future.
AI Open, 2021

WenLan: Efficient Large-Scale Multi-Modal Pre-Training on Real World Data.
Proceedings of the MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, 2021

2020
What If Bots Feel Moods?
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Knowledge Enhanced Opinion Generation from an Attitude.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

ScriptWriter: Narrative-Guided Script Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Personalized Reason Generation for Explainable Song Recommendation.
ACM Trans. Intell. Syst. Technol., 2019

Attitude Detection for One-Round Conversation: Jointly Extracting Target-Polarity Pairs.
J. Inf. Process., 2019

From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Evaluating Image-Inspired Poetry Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Neural Response Generation with Relevant Emotions for Short Text Conversation.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

"Love Is as Complex as Math": Metaphor Generation System for Social Chatbot.
Proceedings of the Chinese Lexical Semantics - 20th Workshop, 2019

2018
Personalized Web Search.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Image Inspired Poetry Generation in XiaoIce.
CoRR, 2018

Why You Should Listen to This Song: Reason Generation for Explainable Recommendation.
Proceedings of the 2018 IEEE International Conference on Data Mining Workshops, 2018

2017
Search by Screenshots for Universal Article Clipping in Mobile Apps.
ACM Trans. Inf. Syst., 2017

Microsoft Research Asia at the NTCIR-13 STC-2 Task.
Proceedings of the 13th NTCIR Conference, 2017

Understanding People Lifestyles: Construction of Urban Movement Knowledge Graph from GPS Trajectory.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

A World of Difference: Divergent Word Interpretations Among People.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

2016
Automatically Mining Facets for Queries from Their Search Results.
IEEE Trans. Knowl. Data Eng., 2016

Enhancing web search with queries of equivalent intents.
Inf. Retr. J., 2016

UniClip: Leveraging Web Search for Universal Clipping of Articles on Mobile.
Data Sci. Eng., 2016

Microsoft Research Asia at NTCIR-12 STC Task.
Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016

Mining Shopping Patterns for Divergent Urban Regions by Incorporating Mobility Data.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
Mobile Query Recommendation via Tensor Function Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

2014
Preface.
Proceedings of the Sixth International Workshop on Evaluating Information Access, 2014

Overview of the NTCIR-11 IMine Task.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

2013
Mining subtopics from text fragments for a web query.
Inf. Retr., 2013

Diversified search evaluation: lessons from the NTCIR-9 INTENT task.
Inf. Retr., 2013

Summary of the NTCIR-10 INTENT-2 task: subtopic mining and search result diversification.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Preface.
Proceedings of the 5th International Workshop on Evaluating Information Access, 2013

Overview of the NTCIR-10 INTENT-2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

2012
New assessment criteria for query suggestion.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Adaptive query suggestion for difficult queries.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

The Reusability of a Diversified Search Test Collection.
Proceedings of the Information Retrieval Technology, 2012

2011
Select-the-Best-Ones: A new way to judge relative relevance.
Inf. Process. Manag., 2011

Multi-dimensional search result diversification.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

Evaluating diversified search results using per-intent graded relevance.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Overview of the NTCIR-9 INTENT Task.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Microsoft Research Asia at the NTCIR-9 Intent Task.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Finding dimensions for queries.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Learning Query Ambiguity Models by Using Search Logs.
J. Comput. Sci. Technol., 2010

Constructing a Test Collection with Multi-Intent Queries.
Proceedings of the 3rd International Workshop on Evaluating Information Access, 2010

Overview of NTCIR-8 ACLIA IR4QA.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

Simple Evaluation Metrics for Diversified Search Results.
Proceedings of the 3rd International Workshop on Evaluating Information Access, 2010

Overview of the NTCIR-8 ACLIA Tasks: Advanced Cross-Lingual Information Access.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

2009
Personalized Web Search.
Proceedings of the Encyclopedia of Database Systems, 2009

Evaluating the Effectiveness of Personalized Web Search.
IEEE Trans. Knowl. Data Eng., 2009

Identification of ambiguous queries in web search.
Inf. Process. Manag., 2009

Ranking the NTCIR ACLIA IR4QA Systems without Relevance Assessments.
Inf. Media Technol., 2009

Microsoft Research Asia at the Web Track of TREC 2009.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

Using anchor texts with their hyperlink structure for web search.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Efficient record-level wrapper induction.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Clustering queries for better document ranking.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
NTCIR-7 ACLIA IR4QA Results based on Qrels Version 2.
Proceedings of the 7th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2008

Overview of the NTCIR-7 ACLIA Tasks: Advanced Cross-Lingual Information Access.
Proceedings of the 7th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2008

Pictor: an interactive system for importing data from a website.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Viewing Term Proximity from a Different Perspective.
Proceedings of the Advances in Information Retrieval , 2008

Are click-through data adequate for learning web search rankings?
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
Web page title extraction and its application.
Inf. Process. Manag., 2007

Identifying ambiguous queries in web search.
Proceedings of the 16th International Conference on World Wide Web, 2007

A large-scale evaluation and analysis of personalized search strategies.
Proceedings of the 16th International Conference on World Wide Web, 2007

Joint optimization of wrapper generation and template detection.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Template-Independent News Extraction Based on Visual Consistency.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Exploring URL Hit Priors for Web Search.
Proceedings of the Advances in Information Retrieval, 2006

2005
Gravitation-based model for information retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Title extraction from bodies of HTML documents and its application to web page retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Efficient Browsing of Web Search Results on Mobile Devices Based on Block Importance Model.
Proceedings of the 3rd IEEE International Conference on Pervasive Computing and Communications (PerCom 2005), 2005

2004
Learning important models for web page blocks based on layout and content analysis.
SIGKDD Explor., 2004

Learning block importance models for web pages.
Proceedings of the 13th international conference on World Wide Web, 2004

Microsoft Research Asia at Web Track and Terabyte Track of TREC 2004.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

A Query-Dependent Duplicate Detection Approach for Large Scale Search Engines.
Proceedings of the Advanced Web Technologies and Applications, 2004

2003
DF or IDF? On the Use of HTML Primary Feature Fields for Web IR.
Proceedings of the Twelfth International World Wide Web Conference - Posters, 2003

Microsoft Research Asia at the Web Track of TREC 2003.
Proceedings of The Twelfth Text REtrieval Conference, 2003

2002
THU TREC 2002: Novelty Track Experiments.
Proceedings of The Eleventh Text REtrieval Conference, 2002


  Loading...