2025
DS-VTON: High-Quality Virtual Try-on via Disentangled Dual-Scale Generation.
CoRR, June, 2025
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM.
CoRR, May, 2025
Robustness in AI-Generated Detection: Enhancing Resistance to Adversarial Attacks.
CoRR, May, 2025
Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting.
CoRR, May, 2025
COCO-Inpaint: A Benchmark for Image Inpainting Detection and Manipulation Localization.
CoRR, April, 2025
Towards Explainable Fake Image Detection with Multi-Modal Large Language Models.
CoRR, April, 2025
InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, April, 2025
InsightVision: A Comprehensive, Multi-Level Chinese-based Benchmark for Evaluating Implicit Visual Semantics in Large Vision Language Models.
CoRR, February, 2025
Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models.
CoRR, January, 2025
Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Aligning Retrieval with Reader Needs: Reader-Centered Passage Selection for Open-Domain Question Answering.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
WildFake: A Large-Scale and Hierarchical Dataset for AI-Generated Images Detection.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Maintain Plasticity in Long-timescale Continual Test-time Adaptation.
CoRR, 2024
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training.
CoRR, 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Conditional Prototype Rectification Prompt Learning.
CoRR, 2024
DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Chain-of-Rewrite: Aligning Question and Documents for Open-Domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
COIN-Matting: Confounder Intervention for Image Matting.
Proceedings of the Computer Vision - ECCV 2024, 2024
ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion.
Proceedings of the Computer Vision - ECCV 2024, 2024
Supervised Contrastive Learning for Snapshot Spectral Imaging Face Anti-Spoofing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Probe Then Retrieve and Reason: Distilling Probing and Reasoning Capabilities into Smaller Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
XMC-Agent : Dynamic Navigation over Scalable Hierarchical Index for Incremental Extreme Multi-label Classification.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Debiasing In-Context Learning by Instructing LLMs How to Follow Demonstrations.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Boosting Audio-visual Zero-shot Learning with Large Language Models.
CoRR, 2023
ControlCom: Controllable Image Composition using Diffusion Model.
CoRR, 2023
DiffUTE: Universal Text Editing Diffusion Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
LayoutGCN: A Lightweight Architecture for Visually Rich Document Understanding.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Acoustics-Text Dual-Modal Joint Representation Learning for Cover Song Identification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Ant Multilingual Recognition System for OLR 2021 Challenge.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
TransAdv: A Translation-based Adversarial Learning Framework for Zero-Resource Cross-Lingual Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
A Multi-Task Dual-Tree Network for Aspect Sentiment Triplet Extraction.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
2021
AntVoice Neural Speaker Embedding System for FFSVC 2020.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2016
Semantic Documents Relatedness using Concept Graph Representation.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016
2013
Automatic Information Extraction for Computerized Clinical Guideline.
Proceedings of the MEDINFO 2013, 2013
2012
CliniQA : Highly Reliable Clinical Question Answering System.
Proceedings of the Quality of Life through Quality of Information, 2012
2011
Domain customization for aspect-oriented opinion analysis with multi-level latent sentiment clues.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011
2010
EagleEye: Entity-centric business intelligence for smarter decisions.
IBM J. Res. Dev., 2010
CasJoin: a cascade chain for text similarity joins.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010
OpinionIt: a text mining system for cross-lingual opinion analysis.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010
2009
Domain Adaptation with Latent Semantic Association for Named Entity Recognition.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009
Address standardization with latent semantic association.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009
Product feature categorization with multilevel latent semantic association.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009
2006
Dependency Parsing Based on Dynamic Local Optimization.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006