Zhicheng Dou

窦志成的中文主页

Lab Members

Lab Github

My Flyer

Short Bio

Zhicheng Dou is currently a professor at Renmin University of China and vice dean for Gaoling School of Artificial Intelligence. He received his Ph.D. and B.S. degrees in computer science and technology from the Nankai University in 2008 and 2003, respectively. He worked at Microsoft Research Asia from July 2008 to September 2014. And since 2014, he has been a faculty member at Renmin University of China. His research interests are Agent, Information Retrieval, and Large Language Models. Recently, he is especially interested in information agents, like deep search and deep research agents. He received the Paper Award Nominations of WWW 2023 (spotlight), the Best Paper Runner-Up Award from SIGIR 2013, and the Best Paper Award from AIRS 2012. He served as the program co-chair of SIGIR 2019 (short), AIRS 2017, and NTCIR-16/17. Zhicheng Dou is not a pure research guy - besides writing papers, he also enjoys writing codes to convert cool ideas into real systems.

Contacts: Zhicheng Dou's email address

I am recruiting highly-motivated students. Please drop an email to dou at ruc.edu.cn if you want to join my team.

Research Interests

Agent/智能体 Agent Memory/智能体记忆 Deep Search/深度搜索 Deep Research/深度研究 Retrieval Augmented Generation (RAG)/大模型检索增强生成 Multi-modal/多模态 Generative Retrieval/生成式检索 Large Language Model + IR/大模型与检索融合 Conversational Search/对话式搜索 Large Language Model/大语言模型 Pretraining Models/预训练大模型 Legal AI/司法智能 Personalized Search, Recommendation, and Dialogue/个性化 Diversified Search/多样化搜索 AI Governance/人工智能治理 More...

Academic Homepages

Publications

Publication Filtering:

*: Corresponding author; ____ indicates the author is/was my student/Postdoc when the work was done;

2026

Preprints

Jincheng Feng, Wenhan Liu, Zhicheng Dou. SumRank: Aligning Summarization Models for Long-Document Listwise Reranking. arXiv
Haobo Zhang, Yutao Zhu, Kelong Mao, Tianhao Li, Zhicheng Dou. RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation. arXiv
Yiruo Cheng, Kelong Mao, Tianhao Li, Jiejun Tan, Ji-Rong Wen, Zhicheng Dou. ChatShopBuddy: Towards Reliable Conversational Shopping Agents via Reinforcement Learning arXiv
Jiejun Tan, Zhicheng Dou, Liancheng Zhang, Yuyang Hu, Yiruo Cheng, Ji-Rong Wen. MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning. arXiv
Zhao Wang, Ziliang Zhao, and Zhicheng Dou. 2026. ProRAG: Process-Supervised Reinforcement Learning for Retrieval-Augmented Generation. arXiv
Yutao Zhu, Xingshuo Zhang, Maosen Zhang, Jiajie Jin, Liancheng Zhang, Xiaoshuai Song, Kangzhi Zhao, Wencong Zeng, Ruiming Tang, Han Li, Ji-Rong Wen, Zhicheng Dou. GISA: A Benchmark for General Information-Seeking Assistant. (arXiv | Github )
Xiaoxi Li, Wenxiang Jiao, Jiarui Jin, Shijian Wang, Guanting Dong, Jiajie Jin, Hao Wang, Yinuo Wang, Ji-Rong Wen, Yuan Lu, Zhicheng Dou. OmniGAIA: Towards Native Omni-Modal AI Agents. (arXiv | Github )
Xinyu Yang, Chenlong Deng, Tongyu Wen, Binyu Xie, Zhicheng Dou. LawThinker: A Deep Research Legal Agent in Dynamic Environments. (arXiv | Github )
Shuting Wang, Qiaolin Xia, Vich Wang, Herberttli, Bobsimons, and Zhicheng Dou. 2025. Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register. arXiv
Yuyang Hu, Shichun Liu, Yanwei Yue, Guibin Zhang, Boyang Liu, Fangyi Zhu, Jiahang Lin, Honglin Guo, Shihan Dou, Zhiheng Xi, Senjie Jin, Jiejun Tan, Yanbin Yin, Jiongnan Liu, Zeyu Zhang, Zhongxiang Sun, Yutao Zhu, Hao Sun, Boci Peng, Zhenrong Cheng, Xuanbo Fan, Jiaxin Guo, Xinlei Yu, Zhenhong Zhou, Zewen Hu, Jiahao Huo, Junhao Wang, Yuwei Niu, Yu Wang, Zhenfei Yin, Xiaobin Hu, Yue Liao, Qiankun Li, Kun Wang, Wangchunshu Zhou, Yixin Liu, Dawei Cheng, Qi Zhang, Tao Gui, Shirui Pan, Yan Zhang, Philip Torr, Zhicheng Dou, Ji-Rong Wen, Xuanjing Huang, Yu-Gang Jiang, and Shuicheng Yan. 2026. Memory in the Age of AI Agents. arXiv
Yifei Chen, Guanting Dong, Yutao Zhu, and Zhicheng Dou. 2025. Revisiting RAG Ensemble: A Theoretical and Mechanistic Analysis of Multi-RAG System Collaboration. arXiv
Xinyu Zhang, Yuanquan Hu, Fangchao Liu, and Zhicheng Dou. 2025. P3: Prompts Promote Prompting. arXiv
Huaying Yuan, Zheng Liu, Junjie Zhou, Hongjin Qian, Yan Shu, Nicu Sebe, Ji-Rong Wen, and Zhicheng Dou. 2025. VideoExplorer: Think With Videos For Agentic Long-Video Understanding. arXiv

July

Chenlong Deng, Mengjie Deng, Junjie Wu, Dun Zeng, Teng Wang, Qingsong Xie, Jiadeng Huang, Shengjie Ma, Changwang Zhang, Zhaoxiang Wang, Jun Wang, Yutao Zhu, and Zhicheng Dou. DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories. In Proceedings of the Forty-third International Conference on Machine Learning. (ICML 2026) (CCF A). ( arXiv | Github )
Jing Yao, Xiaoyuan Yi, Jindong Wang, Zhicheng Dou, and Xing Xie. CAReDiO: Enhancing Cultural Alignment of LLM via Representativeness and Distinctiveness Guided Data Optimization In Proceedings of the Forty-third International Conference on Machine Learning. (ICML 2026) (CCF A). arXiv
Liancheng Zhang, Xiaoxi Li, and Zhicheng Dou. TimelineReasoner: Advancing Timeline Summarization with Large Reasoning Models. In Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR 2026) (CCF A).
Tongyu Wen, Guanting Dong, and Zhicheng Dou. SmartSearch: Process Reward-Guided Query Refinement for Search Agents. In Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR 2026) (CCF A). arXiv
Chenghao Zhang, Guanting Dong, Xinyu Yang, and Zhicheng Dou. Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation. In Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR 2026) (CCF A). arXiv
Jiajie Jin, Yanzhao Zhang, Mingxin Li, Dingkun Long, Pengjun Xie, Yutao Zhu, and Zhicheng Dou. Internalizing Explicit Reasoning into Latent Space for Dense Retrieval. In Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR 2026) (CCF A). arXiv
Jiajie Jin, Xiaoxi Li, Yuyao Zhang, Guanting Dong, Zhao Yang, Yutao Zhu, and Zhicheng Dou. HiRA: Decoupling Planning and Execution with Hierarchical Reasoning in Deep Search. In Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR 2026) (CCF A). ( arXiv | Github)
Guanting Dong, Yifei Chen, Xiaoxi Li, Jiajie Jin, Hongjin Qian, Yutao Zhu, Hangyu Mao, Guorui Zhou, Zhicheng Dou, and Ji-Rong Wen. Tool-Star: Empowering Multi-Tool Collaborative Web Agent via Reinforcement Learning. In Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR 2026) (CCF A). (arXiv | Github) .
Haonan Chen, Hong Liu, Yuping Luo, Liang Wang, Nan Yang, Furu Wei, Zhicheng Dou. MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A). arXiv
Mengjie Deng, Guanting Dong, Zhicheng Dou. ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) (CCF A). arXiv
Jiajie Jin, Yuyao Zhang, Yimeng Xu, Hongjin Qian, Yutao Zhu, Zhicheng Dou. FinSight: Towards Real-World Financial Deep Research. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A). (arXiv | Github)
Yifei Chen, Guanting Dong, Zhicheng Dou. ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A). arXiv
Lei Xiong, Huaying Yuan, Zheng Liu, Zhao Cao, Zhicheng Dou. PaperScope: A Multi-Modal Multi-Document Benchmark for Agentic Deep Research Across Massive Scientific Papers. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) . arXiv
Xiaoshuai Song, Haofei Chang, Guanting Dong, Yutao Zhu, Zhicheng Dou, Ji-Rong Wen. EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) . arXiv
Xinyu Yang, Chenlong Deng, Zhicheng Dou. GLARE: Agentic Reasoning for Legal Judgment Prediction. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A). arXiv
Tong Zhao, Yutao Zhu, Yucheng Tian, Zhicheng Dou. R^3AG: Retriever Routing for Retrieval-Augmented Generation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A). arXiv
Wenhan Liu, Xinyu Ma, Weiwei Sun, Yutao Zhu, Yuchen Li, Dawei Yin, Zhicheng Dou. ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A). arXiv
Tong Zhao, Chenghao Zhang, Yutao Zhu, Zhicheng Dou. ATIR: Towards Audio-Text Interleaved Contextual Retrieval. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A). arXiv
Wenhan Liu, Xinyu Ma, Yutao Zhu, Yuchen Li, Daiting Shi, Dawei Yin, Zhicheng Dou. Agentic-R: Learning to Retrieve for Agentic Search. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) . arXiv
Haonan Chen, Sicheng Gao, Radu Timofte, Tetsuya Sakai, Zhicheng Dou. e5-omni: Explicit Cross-modal Alignment for Omni-modal Embeddings. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) . arXiv
Zhao Wang, Max Xiong, Jianxun Lian, Zhicheng Dou. Reasoning-Aware AIGC Detection via Alignment and Reinforcement. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) . arXiv
Yuyang Hu, Jiongnan Liu, Jiejun Tan, Yutao Zhu, Zhicheng Dou. Memory Matters More: Event-Centric Memory as a Logic Map for Agent Searching and Reasoning. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) . arXiv
Zhaoheng Huang, Yutao Zhu, Ji-Rong Wen, Zhicheng Dou. LLM-Generated Text May Harm Your Retrieval! A Robust Detection Strategy for Retrieval-Augmented Generation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A).
Zhaoheng Huang, Dacheng Wen, Yutao Zhu, Xiaoying Lian, Yushi Liang, Zhicheng Dou, Ji-Rong Wen, Liangjie Zhang, Qi Zhang, Kai Hao, Nan Li, Fangzhao Wu. RLSeek: Evidence-Grounded Reasoning for RAG Hallucination Detection. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics. (ACL 2026) (CCF A).
Yuyao Zhang, Hongyu Lu, Jiajie Jin, Hongjin Qian, Shiyu Li, Zhao Yang, Yutao Zhu, Ji-Rong Wen, Zhicheng Dou. Web Sitemap Knowledge Can Enhance Autonomous Browsing. In Findings of the Association for Computational Linguistics: ACL 2026. (ACL 2026 Findings) .

April

Yutong Bai, Yujia Zhou, Zhicheng Dou and Ji-Rong Wen. Hierarchical Document-Aware Interest Profiling in Personalized Search. IEEE Transactions on Knowledge & Data Engineering, 38(4): 2289-2300 (2026). Apr. 2026. (TKDE 2026) (CCF A). (Download | DOI )
Guanting Dong, Hangyu Mao, Kai Ma, Licheng Bao, Yifei Chen, Zhongyuan Wang, Zhongxia Chen, Jiazhen Du, Huiyang Wang, Fuzheng Zhang, Guorui Zhou, Yutao Zhu, Ji-Rong Wen, and Zhicheng Dou. 2026. Agentic Reinforced Policy Optimization. In Proceedings of ICLR 2026. (ICLR 2026) (CCF A). (Download | arXiv | Github ) Daily Paper #1 Weekly Paper #1
Yifei Chen, Guanting Dong, and Zhicheng Dou. 2026. Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning. In Proceedings of ICLR 2026. (ICLR 2026) (CCF A). (Download | arXiv| OpenReview | Github)
Jing Yao, Shitong Duan, Xiaoyuan Yi, Dongkuan Xu, Peng Zhang, Tun Lu, Ning Gu, Zhicheng Dou, and Xing Xie. 2026. AdAEM: An Adaptively and Automated Extensible Evaluation Method of LLMs' Value Difference. In Proceedings of ICLR 2026. (ICLR 2026) (CCF A). (Download | arXiv OpenReview)
Xiaoxi Li, Wenxiang Jiao, Jiarui Jin, Guanting Dong, Jiajie Jin, Yinuo Wang, Hao Wang, Yutao Zhu, Ji-Rong Wen, Yuan Lu, and Zhicheng Dou. 2026. DeepAgent: A General Reasoning Agent with Scalable Toolsets. In Proceedings of the ACM Web Conference 2026 (WWW ’26), April 13–17, 2026, Dubai, United Arab Emirates. ACM, New York, NY, USA, 12 pages. (WWW 2026) (CCF A). (Download | DOI | arXiv | Github) Daily Paper #1
Guanting Dong, Licheng Bao, Zhongyuan Wang, Kangzhi Zhao, Xiaoxi Li, Jiajie Jin, Jinghan Yang, Hangyu Mao, Fuzheng Zhang, Kun Gai, Guorui Zhou, Yutao Zhu, Ji-Rong Wen, and Zhicheng Dou. 2026. Toward Generalized Web Agent Training: A Deep Dive into Entropy-Balanced Reinforcement Learning. In Proceedings of the ACM Web Conference 2026 (WWW ’26), April 13–17, 2026, Dubai, United Arab Emirates. ACM, New York, NY, USA, 12 pages. (WWW 2026) (CCF A). ( Download | DOI | arXiv | Github) Daily Paper #3

March

Wenhan Liu, Yutao Zhu, Zhicheng Dou*, and Yujia Zhou. 2026. DemoRank: Selecting Effective Demonstrations for Large Language Models in Ranking Task. ACM Trans. Inf. Syst. 44, 3, Article 61 (March 2026), 25 pages. Published: 02 March 2026. (TOIS 2026) (CCF A) (Download | DOI )

January

Jiejun Tan, Zhicheng Dou, Yan Yu, Jiehan Cheng, Lifeng Liu, Jian Xie, and Jirong Wen. 2026. HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches. Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 40(23), 19380-19388. January 20–27, 2026, Singapore. https://doi.org/10.1609/aaai.v40i23.39015 (AAAI 2026) (CCF A) (Download | DOI )
Zhaoheng Huang, Yutao Zhu, Jirong Wen, Zhicheng Dou. 2026. Evaluating the Factuality of Large Language Models Using Multiple Plug-and-Play Fact Sources. Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 40(48), 41607-41609. January 20–27, 2026, Singapore. Published: 2026-03-14. (AAAI 2026 Demonstration) (Download | DOI )

2025

December

Xiaoxi Li, Jiajie Jin, Guanting Dong, Hongjin Qian, Yutao Zhu, Yongkang Wu, Ji-Rong Wen, and Zhicheng Dou. 2025. WebThinker: Empowering Large Reasoning Models with Deep Research Capability. In Advances in Neural Information Processing Systems (NeurIPS 2025). San Diego, CA. December 2nd – 7th, 2025. OpenReview. https://openreview.net/forum?id=7LKKHBAMzH. (NeurIPS 2025) (CCF A) (Download | OpenReview Pdf | OpenReview Forum | Github ) Daily Paper #1
Liang Wang, Haonan Chen, Nan Yang, Xiaolong Huang, Zhicheng Dou, and Furu Wei. 2025. Chain-of-Retrieval Augmented Generation. In Advances in Neural Information Processing Systems (The 39th Conference on Neural Information Processing System, NeurIPS 2025). San Diego, CA. December 2nd – 7th, 2025. OpenReview. https://openreview.net/forum?id=gUPGGCM4WH. (NeurIPS 2025) (CCF A) (Download | OpenReview Pdf | OpenReview Forum )
Chenlong Deng, Zhisong Zhang, Kelong Mao, Shuaiyi Li, Tianqing Fang, Hongming Zhang, Haitao Mi, Dong Yu, and Zhicheng Dou. UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression. In Advances in Neural Information Processing Systems (The 39th Conference on Neural Information Processing System, NeurIPS 2025). San Diego, CA. December 2nd – 7th, 2025. OpenReview. https://openreview.net/forum?id=1C4mXyh31p. (NeurIPS 2025) (CCF A) (Download | OpenReview Pdf | OpenReview Forum )
Huaying Yuan, Nijian, Zheng Liu, Yueze Wang, Junjie Zhou, Zhengyang Liang, Bo Zhao, Zhao Cao, Ji-Rong Wen, and Zhicheng Dou. 2025. MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval. In Advances in Neural Information Processing Systems (The 39th Conference on Neural Information Processing System, NeurIPS 2025). San Diego, CA. December 2nd – 7th, 2025. OpenReview.https://openreview.net/forum?id=gKkA9Oc13m. (NeurIPS 2025 DB). (Download | OpenReview Pdf | OpenReview Forum )
Hongjin Qian, Zheng Liu, Chao Gao, Yankai Wang, Defu Lian, Zhicheng Dou. 2025. HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks. In Advances in Neural Information Processing Systems (The 39th Conference on Neural Information Processing System, NeurIPS 2025). San Diego, CA. December 2nd – 7th, 2025. OpenReview.https://openreview.net/forum?id=VbTDOlnZ6m. (NeurIPS 2025 DB) (Download | OpenReview Pdf | OpenReview Forum )

November

Yutao Zhu, Huaying Yuan, Shuting Wang, Jiongnan Liu, Wenhan Liu, Chenlong Deng, Haonan Chen, Zheng Liu, Zhicheng Dou*, Ji-Rong Wen. 2025. Large Language Models for Information Retrieval: A Survey. ACM Trans. Inf. Syst. 44(1), Article 12 (January 2026), 54 pages. Published: 14 November 2025. (aXiv | DOI | Download | TOIS Copy | Github )
Zhaoheng Huang, Yutao Zhu*, Ji-Rong Wen, and Zhicheng Dou*. 2025. Enhancing LLM Text Detection with Retrieved Contexts and Logits Distribution Consistency. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 9933–9945. Suzhou, China. November 4–9, 2025. Association for Computational Linguistics. (EMNLP 2025) (CCF B) (Download | DOI )
Shuting Wang, Jiejun Tan, Zhicheng Dou*, and Ji-Rong Wen. 2025. OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 5737–5762, Suzhou, China. November 4–9, 2025. Association for Computational Linguistics. (EMNLP 2025) (CCF B) (Download | DOI )
Xiaoxi Li, Guanting Dong, Jiajie Jin, Yuyao Zhang, Yujia Zhou, Yutao Zhu, Peitian Zhang, and Zhicheng Dou*. 2025. Search-o1: Agentic Search-Enhanced Large Reasoning Models. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 4837–4856, Suzhou, China. November 4–9, 2025. Association for Computational Linguistics. (EMNLP 2025) (CCF B) (Download | DOI | Github )
Yutao Zhu, Jiajie Jin, Hongjin Qian, Zheng Liu, Zhicheng Dou* and Ji-Rong Wen. 2025. Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 4837–4856, Suzhou, China. November 4–9, 2025. Association for Computational Linguistics. (EMNLP 2025) (CCF B) (Download | DOI )
Tongyu Wen, Chenglong Wang, Xiyuan Yang, Haoyu Tang, Yueqi Xie, Lingjuan Lyu*, Zhicheng Dou*, Fangzhao Wu*. 2025. Defending against Indirect Prompt Injection by Instruction Detection. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 19472–19487, Suzhou, China. November 4–9, 2025. Association for Computational Linguistics. (EMNLP Findings 2025) (Download | DOI )
Wenhan Liu, Xinyu Ma, Yutao Zhu, Lixin Su, Shuaiqiang Wang, Dawei Yin and Zhicheng Dou*. 2025. CoRanking: Collaborative Ranking with Small and Large Ranking Agents. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 5098–5110, Suzhou, China. November 4–9, 2025. Association for Computational Linguistics. (EMNLP Findings 2025) (Download | DOI )
Zhirui Deng, Jingfen Qiao, Zhicheng Dou*, Ji-Rong Wen and Maarten de Rijke. 2025. DIVAgent: A Diversified Search Agent that Mimics the Human Search Process. In Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM '25). Association for Computing Machinery, New York, NY, USA, 564–574. Seoul Republic of Korea. November 10 - 14, 2025. (CIKM 2025) (CCF B) (Download | DOI )
Ziliang Zhao, Haonan Chen, Shiren Song, Jian Xie and Zhicheng Dou*. 2025. ClariLM: Enhancing Open-domain Clarification Ability for Large Language Models. In Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM '25). Association for Computing Machinery, New York, NY, USA, 4401 - 4411. Seoul Republic of Korea. November 10 - 14, 2025. (CIKM 2025) (CCF B) (Download | DOI )
Yiruo Cheng, Hongjin Qian, Fengran Mo, Yongkang Wu, Zhonghua Li, Qi Ye, Ji-Rong Wen and Zhicheng Dou*. 2025. Evolving Graph-Based Context Modeling for Multi-Turn Conversational Retrieval-Augmented Generation. In Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM '25). Association for Computing Machinery, New York, NY, USA, 436 - 447. Seoul Republic of Korea. November 10 - 14, 2025. (CIKM 2025) (CCF B) (Download | DOI )
Ziliang Zhao, Shiren Song and Zhicheng Dou*. 2025. FollowGPT: A Framework of Follow-up Question Generation for Large Language Models via Conversation Log Mining. In Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM '25). Association for Computing Machinery, New York, NY, USA, 4412 - 4422. Seoul Republic of Korea. November 10 - 14, 2025. (CIKM 2025) (CCF B) (Download | DOI )
Zhao Wang, Ziliang Zhao and Zhicheng Dou*. 2025. TimeRAG: Enhancing Complex Temporal Reasoning with Search Engine Augmentation. In Proceedings of the 34th ACM International Conference on Information and Knowledge Management (CIKM '25). Association for Computing Machinery, New York, NY, USA, 3230 - 3239. Seoul Republic of Korea. November 10 - 14, 2025. (CIKM 2025) (CCF B) (Download | DOI )

October

Johanne R. Trippas, J. Shane Culpepper, Mohammad Aliannejadi, James Allan, Enrique Amigó, Jaime Arguello, Leif Azzopardi, Peter Bailey, Jamie Callan, Rob Capra, Nick Craswell, Bruce Croft, Jeff Dalton, Gianluca Demartini, Laura Dietz, Zhicheng Dou, Carsten Eickhoff, Michael Ekstrand, Nicola Ferro, Norbert Fuhr, Dorota Glowacka, Faegheh Hasibi, Danula Hettiachchi, Rosie Jones, Jaap Kamps, Noriko Kando, Sarvnaz Karimi, Makoto P. Kato, Bevan Koopman, Yiqun Liu, Chenglong Ma, Joel Mackenzie, Maria Maistro, Jiaxin Mao, Dana McKay, Bhaskar Mitra, Stefano Mizzaro, Alistair Moffat, Josiane Mothe, Iadh Ounis, Lida Rashidi, Yongli Ren, Mark Sanderson, Rodrygo Santos, Falk Scholer, Chirag Shah, Laurianne Sitbon, Ian Soboroff, Damiano Spina, Paul Thomas, Julián Urbano, Arjen de Vries, Ryen White, Abby Yuan, Hamed Zamani, Oleg Zendel, Min Zhang, Shengyao Zhuang, Justin Zobel, and Guido Zuccon. 2025. Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025). SIGIR Forum 59(1) (June 2025), 1–68. https://doi.org/10.1145/3769733.3769739 (DOI | ACM)
Zhirui Deng, Zhicheng Dou*, Yutao Zhu, and Ji-Rong Wen. 2025. A Model-agnostic Pre-training Framework for Search Result Diversification. ACM Trans. Inf. Syst. 44(1), Article 10 (January 2026):10:1 - 10:23. Published: 14 October 2025. (TOIS 2025) (CCF A) (DOI | ACM | Download )
Zhirui Deng, Zhicheng Dou, Yutao Zhu, and Ji-Rong Wen* 2025. Social Cognitive Theory Enhanced Diversified Recommendation. ACM Trans. Inf. Syst. 44(1), Article 11 (January 2026):11:1 - 11:24. Published: 14 October 2025. (TOIS 2025) (CCF A) (DOI | ACM | Download )

September

Fengran Mo, Kelong Mao, Ziliang Zhao, Hongjin Qian, Haonan Chen, Yiruo Cheng, Xiaoxi Li, Yutao Zhu, Zhicheng Dou, and Jian-Yun Nie. 2025. A Survey of Conversational Search. ACM Trans. Inf. Syst. 43, 6, Article 167 (November 2025), 50 pages. 167:1-167:50. Pages 1 - 5. Published: 18 September 2025. (TOIS 2025) (CCF A) (DOI | ACM | Download )

August

Ziliang Zhao, Changle Qu, Zhicheng Dou*, Haonan Chen, and Jiajie Jin. 2025. Retrieving Intent-covering Demonstrations for Clarification Generation in Conversational Search Systems. In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2 (KDD '25). Association for Computing Machinery, New York, NY, USA, pages 3992–4001. August 3–7, 2025, Toronto, ON, Canada (KDD 2025) (CCF A) (Download | DOI )
Shuting Wang, Yutao Zhu, and Zhicheng Dou*. 2025. Embedding Prior Task-specific Knowledge into Language Models for Context-aware Document Ranking. In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1 (KDD '25). Association for Computing Machinery, New York, NY, USA, pags 1504–1514. Toronto, ON, Canada. Sunday, August 3, 2025 – Thursday, August 7, 2025. (KDD 2025) (CCF A) (Download | DOI | arXiv )
Hongjin Qian, Zheng Liu*, Peitian Zhang, Kelong Mao, Yujia Zhou, Xu Chen, and Zhicheng Dou. 2025. Tackling the Length Barrier: Dynamic Context Browsing for Knowledge-intensive Task. In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1 (KDD '25). Association for Computing Machinery, New York, NY, USA, pages 1150–1160. Toronto, ON, Canada. Sunday, August 3, 2025 – Thursday, August 7, 2025. (KDD 2025) (CCF A) (Download | DOI | arXiv )

July

Wenjie Wang, Zheng Liu, Fuli Feng, Zhicheng Dou, Qingyao Ai, Grace Hui Yang, Defu Lian, Lu Hou, Aixin Sun, Hamed Zamani, Donald Metzler, and Maarten de Rijke. 2025. Pre-Trained Models for Search and Recommendation: Introduction to the Special Issue—Part 2. ACM Trans. Inf. Syst. 43, 5, Article 111 (March 2025), 5 pages. Pages 1 - 5 https://dl.acm.org/doi/full/10.1145/3736540
Wenhan Liu, Xinyu Ma, Yutao Zhu, Ziliang Zhao, Shuaiqiang Wang, Dawei Yin, and Zhicheng Dou*. 2025. Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 162–176, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Jiajie Jin, Xiaoxi Li, Guanting Dong, Yuyao Zhang, Yutao Zhu*, Yongkang Wu, Zhonghua Li, Ye Qi, and Zhicheng Dou*. 2025. Hierarchical Document Refinement for Long-context Retrieval-augmented Generation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3502–3520, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Guanting Dong, Jiajie Jin, Xiaoxi Li, Yutao Zhu, Zhicheng Dou*, and Ji-Rong Wen. 2025. RAG-Critic: Leveraging Automated Critic-Guided Agentic Workflow for Retrieval Augmented Generation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3551–3578, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Guanting Dong, Chenghao Zhang, Mengjie Deng, Yutao Zhu, Zhicheng Dou*, and Ji-Rong Wen. 2025. Progressive Multimodal Reasoning via Active Retrieval. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3579–3602, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Chenlong Deng, Zhisong Zhang, Kelong Mao, Shuaiyi Li, Xinting Huang, Dong Yu, and Zhicheng Dou*. 2025. A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4861–4879, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Jie Chen, Zhipeng Chen, Jiapeng Wang, Kun Zhou, Yutao Zhu, Jinhao Jiang, Yingqian Min, Xin Zhao, Zhicheng Dou, Jiaxin Mao, Yankai Lin, Ruihua Song, Jun Xu, Xu Chen, Rui Yan, Zhewei Wei, Di Hu, Wenbing Huang, and Ji-Rong Wen. 2025. Towards Effective and Efficient Continual Pre-training of Large Language Models. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5779–5795, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI | (arXiv) )
Haonan Chen, Liang Wang, Nan Yang, Yutao Zhu, Ziliang Zhao, Furu Wei, and Zhicheng Dou*. 2025. mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data. In Findings of the Association for Computational Linguistics: ACL 2025, pages 8254–8275, Vienna, Austria. Association for Computational Linguistics. (ACL 2025 Findings) (Download | DOI )
Jiongnan Liu, Yutao Zhu*, Shuting Wang, Xiaochi Wei, Erxue Min, Yu Lu, Shuaiqiang Wang, Dawei Yin, and Zhicheng Dou*. 2025. LLMs + Persona-Plug = Personalized LLMs. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9373–9385, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Hongjin Qian, Zheng Liu*, Peitian Zhang, Zhicheng Dou, and Defu Lian. 2025. Boosting Long-Context Information Seeking via Query-Guided Activation Refilling. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9453–9464, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Xinyu Zhang, Yuanquan Hu, Fangchao Liu, and Zhicheng Dou. 2025. P3: Prompts Promote Prompting. In Findings of the Association for Computational Linguistics: ACL 2025, pages 11948–11965, Vienna, Austria. Association for Computational Linguistics. (ACL 2025 Findings) (Download | DOI )
Yuyao Zhang, Zhicheng Dou*, Xiaoxi Li, Jiajie Jin, Yongkang Wu, Zhonghua Li, Ye Qi, and Ji-Rong Wen. 2025. Neuro-Symbolic Query Compiler. In Findings of the Association for Computational Linguistics: ACL 2025, pages 12138–12155, Vienna, Austria. Association for Computational Linguistics. (ACL 2025 Findings) (Download | DOI )
Jing Yao, Xiaoyuan Yi, Shitong Duan, Jindong Wang, Yuzhuo Bai, Muhua Huang, Yang Ou, Scarlett Li, Peng Zhang, Tun Lu, Zhicheng Dou, Maosong Sun, James Evans, and Xing Xie. 2025. Value Compass Benchmarks: A Comprehensive, Generative and Self-Evolving Platform for LLMs’ Value Evaluation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 666–678, Vienna, Austria. Association for Computational Linguistics. (ACL 2025 Demo) (Download | DOI )
Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yongkang Wu, Zhonghua Li, Ye Qi, and Zhicheng Dou*. 2025. RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 16754–16779, Vienna, Austria. Association for Computational Linguistics. (ACL 2025) (CCF A) (Download | DOI )
Chen Xu, Zhirui Deng, Clara Rus, Xiaopeng Ye, Yuanna Liu, Jun Xu, Zhicheng Dou, Ji-Rong Wen, and Maarten de Rijke. 2025. FairDiverse: A Comprehensive Toolkit for Fairness- and Diversity-aware Information Retrieval. In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '25). Association for Computing Machinery, New York, NY, USA, 3540–3550. (SIGIR 2025) (Download | DOI )

May

Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yuyao Zhang, Peitian Zhang, Yutao Zhu, and Zhicheng Dou*. 2025. From Matching to Generation: A Survey on Generative Information Retrieval. ACM Trans. Inf. Syst. 43, 3, Article 83 (May 2025), 62 pages. (TOIS 2025) (CCF A) (Download | DOI | aXiv)
Shuting Wang, Zhicheng Dou*, Kexiang Wang, Dehong Ma, Jun Fan, Daiting Shi, Zhicong Cheng, Simiu Gu, Dawei Yin and Ji-Rong Wen. PRADA: Pre-train Ranking Models with Diverse Relevance Signals Mined from Search Logs. IEEE Transactions on Knowledge and Data Engineering, vol. 37, no. 5, pp. 2861-2873, May 2025. (TKDE 2025) (CCF A) (Download | DOI | Online)
Jiongnan Liu, Zhicheng Dou*, Jian-Yun Nie, Zhenlin Chen, Guoyu Tang, Sulong Xu, and Ji-Rong Wen. 2025. Enhancing Sequential Personalized Product Search with External Out-of-sequence Knowledge. ACM Trans. Inf. Syst. 43, 4, Article 84 (July 2025), 25 pages. https://doi.org/10.1145/3726864 (TOIS 2025) (CCF A) (Download | DOI | Online)

April

Haonan Chen, Liang Wang, Nan Yang, Yutao Zhu, Ziliang Zhao, Furu Wei, and Zhicheng Dou*. Little Giants: Synthesizing High-Quality Embedding Data at Scale. NAACL 2025. Albuquerque, New Mexico. April 29–May 4, 2025. (NAACL 2025) (CCF B) (Download | DOI | arXiv)
Yiruo Cheng, Kelong Mao, Ziliang Zhao, Guanting Dong, Hongjin Qian, Yongkang Wu, Tetsuya Sakai, Ji-Rong Wen, and Zhicheng Dou*. CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation. NAACL 2025 Findings. Albuquerque, New Mexico. April 29–May 4, 2025. pages 1308–1330. (NAACL 2025 Findings) (Download | DOI | arXiv)
Huimin Zeng, Xiaojie Wang, Anoop Jain, Zhicheng Dou, and Dong Wang. A Non-Contrastive Learning Framework for Sequential Recommendation with Preference-Preserving Profile Generation. ICLR 2025. Singapore EXPO. Thu Apr 24 – Mon Apr 28th, 2025. (ICLR 2025) (Download | DOI | OpenReview )
Peitian Zhang, Zheng Liu*, Shitao Xiao, Ninglu Shao, Qiwei Ye, Zhicheng Dou. Long Context Compression with Activation Beacon. ICLR 2025. Singapore EXPO. Thu Apr 24 – Mon Apr 28th, 2025. (ICLR 2025) (Download | arXiv )
Xinyu Zhang, Ran Dou, Enrui Hu, Minjun Zhao, Yangkai Ding, and Zhicheng Dou*. 2025. Collaborative Optimization Approach for Workflow Agents in User Behavior Modeling. In Companion Proceedings of the ACM on Web Conference 2025 (WWW '25). Association for Computing Machinery, New York, NY, USA. Sydney. 28 April - 2 May 2025. pages 2988–2992. (WWW 2025 Companion) (Download | DOI )
Hongjin Qian, Zheng Liu*, Peitian Zhang, Kelong Mao, Defu Lian, Zhicheng Dou, and Tiejun Huang. MemoRAG: Boosting Long Context Processing with Global Memory-Enhanced Retrieval Augmentation. In Proceedings of the ACM on Web Conference 2025 (WWW '25). Association for Computing Machinery, New York, NY, USA. Sydney. 28 April - 2 May 2025. pages 2366–2377. (WWW 2025) (CCF A) (Download | DOI )
Jiejun Tan, Zhicheng Dou*, Wen Wang, Mang Wang, Weipeng Chen, and Ji-Rong Wen. HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems. In Proceedings of the ACM Web Conference 2025 (WWW '25) . Sydney. 28 April - 2 May 2025. Association for Computing Machinery, New York, NY, USA, Pages 1733–1746 (WWW 2025) (CCF A) (Download | DOI | arXiv )
Guanting Dong, Yutao Zhu, Chenghao Zhang, Zechen Wang, Ji-Rong Wen, Zhicheng Dou*. Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation. In Proceedings of the ACM Web Conference 2025 (WWW '25) . Sydney. 28 April - 2 May 2025. Pages 4206–4225 (WWW 2025) (CCF A) (Download | DOI )
Jiajie Jin, Yutao Zhu*, Zhicheng Dou*, Guanting Dong, Xinyu Yang, Chenghao Zhang, Tong Zhao, Zhao Yang and Ji-Rong Wen FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research. In WWW Companion ’25. Sydney. 28 April - 2 May 2025. (WWW 2025 Resource) ( Download | DOI | Github ) Github #3 Repository Of The Day

March

Haobo Zhang, Qiannan Zhu*, and Zhicheng Dou*. 2025. A Unified Prompt-aware Framework for Personalized Search and Explanation Generation. ACM Trans. Inf. Syst. 43, 3, Article 71 (May 2025), 26 pages. https://doi.org/10.1145/3716131 (TOIS 2025) (CCF A) ( Download | DOI | arXiv )

February

Wenjie Wang, Zheng Liu, Fuli Feng, Zhicheng Dou, Qingyao Ai, Grace Hui Yang, Defu Lian, Lu Hou, Aixin Sun, Hamed Zamani, Donald Metzler, and Maarten de Rijke. 2025. Pre-Trained Models for Search and Recommendation: Introduction to the Special Issue—Part 1. ACM Trans. Inf. Syst. 43, 2, Article 27 (March 2025), 6 pages. https://doi.org/10.1145/3709134
Yutao Zhu, Zhaoheng Huang, Zhicheng Dou*, Ji-Rong Wen. One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models. AAAI 2025. Philadelphia, Pennsylvania, USA. February 25 – March 4, 2025. 26166-26174. (AAAI 2025) (CCF A) (Download | DOI | arXiv )
Jiehan Cheng, Zhicheng Dou*, Yutao Zhu, Xiaoxi Li. Descriptive and Discriminative Document Identifiers for Generative Retrieval. AAAI 2025. Philadelphia, Pennsylvania, USA. February 25 – March 4, 2025. 11518-11526. (AAAI 2025) (CCF A) (Download | DOI )
Guanting Dong, Xiaoshuai Song, Yutao Zhu, Runqi Qiao, Zhicheng Dou*, Ji-Rong Wen. Toward Verifiable Instruction-Following Alignment for Retrieval Augmented Generation. AAAI 2025. Philadelphia, Pennsylvania, USA. February 25 – March 4, 2025. 23796-23804. (AAAI 2025) (CCF A) (Download | DOI )

January

Zhaoheng Huang, Yutao Zhu*, Zhicheng Dou* and Ji-Rong Wen. CAGS: Context-Aware Document Ranking With Contrastive Graph Sampling. In IEEE Transactions on Knowledge and Data Engineering, vol. 37, no. 1, pp. 89-101, Jan. 2025, (TKDE 2025) (CCF A) ( Download | DOI )
Lei Wang, Jingsen Zhang, Hao Yang, Zhi-Yuan Chen, Jiakai Tang, Zeyu Zhang, Xu Chen, Yankai Lin, Hao Sun, Ruihua Song, Xin Zhao, Jun Xu, Zhicheng Dou, Jun Wang, and Ji-Rong Wen. 2025. User Behavior Simulation with Large Language Model-based Agents. ACM Trans. Inf. Syst. 43, 2, Article 55 (March 2025), 37 pages, Pages 1 - 37. (TOIS 2025) (CCF A) ( DOI )
Yujia Zhou, Zheng Liu, and Zhicheng Dou. 2025. How Credible Is an Answer From Retrieval-Augmented LLMs? Investigation and Evaluation With Multi-Hop QA. In Proceedings of the 31st International Conference on Computational Linguistics, 4232–4242, Abu Dhabi, UAE. Association for Computational Linguistics. (COLING 2025). January 19–24, 2025. (COLING 2025) (CCF B) ( Download| Online )
Shuting Wang, Xin Yu, Mang Wang, Weipeng Chen, Yutao Zhu and Zhicheng Dou*. RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generationg. In Proceedings of the 31st International Conference on Computational Linguistics, pages 11317–11333, Abu Dhabi, UAE. (COLING 2025). January 19–24, 2025. Association for Computational Linguistics. (COLING 2025) (CCF B) ( Download| Online )
Haobo Zhang, Qiannan Zhu*, and Zhicheng Dou*. 2025. Enhancing Reranking for Recommendation with LLMs through User Preference Retrieval. In Proceedings of the 31st International Conference on Computational Linguistics, pages 658–671, Abu Dhabi, UAE. January 19–24, 2025. Association for Computational Linguistics. (COLING 2025) (CCF B) ( Download| Online )
Huaying Yuan, Ziliang Zhao, Shuting Wang, Shitao Xiao, Minheng Ni, Zheng Liu* and Zhicheng Dou*. 2025. FineRAG: Fine-grained Retrieval-Augmented Text-to-Image Generation. In Proceedings of the 31st International Conference on Computational Linguistics, pages 658–671, Abu Dhabi, UAE. January 19–24, 2025. Association for Computational Linguistics. (COLING 2025) (CCF B) ( Download| Online )

2024