Zhicheng Dou (窦志成)
Where there is a will, there is a way (有志者事竟成)
Haidian District Beijing, 100872, P.R. China
Email:
Mobile: +86 ********
Homepages:
Dr. Zhicheng Dou is currently a professor in Renmin University of China.
He worked at Microsoft Research between July 2008 and September 2014,
after he received his Ph.D degree in Computer Science
from Nankai University in 2008.
He received his Bachelor of Engineering degree in Computer Science and Technology in 2003,
also from Nankai University.
His research interests include Information Retrieval, Natural Language Processing, Web Search, Data Mining, Information Extraction, and Big Data Analysis.
During the past years, he has published over 20 referred papers in high-quality international conferences and journals,
such as SIGIR, WWW, CIKM, WSDM, EMNLP, and IEEE TKDE.
He received the Best Paper Runner-Up Award from SIGIR 2013, and the Best Paper Award from AIRS 2012.
He has been serving several international conferences as a program committee member, such as SIGIR, CIKM, WSDM, WWW, and KDD.
He is the co-organizer of the NTCIR-10 Intent-2 task and NTCIR-11 IMINE task.
He is now a Steering Committee (SC) member of Asia Information Retrieval Society (AIRS), served as the mainland China representative.
Besides research, Zhicheng Dou also enjoys implementing cool ideas into real systems.
He participated in several group-level projects when he worked for Microsoft,
like WebStudio,
ProjectQ,
and WebSensor.
Some of them are selected for Microsoft Disruptive Technology Review,
and have made big impact on Microsoft products (like Bing).
He has received three technical transfer awards from Microsoft and six granted/filed patents.
News: (September 2014): after working at Microsoft for almost 9 years (3 years as an full time intern, and more than 6 years as an full time employee), I decided to leave Microsoft and join Renmin University of China, working together with my formal manager Ji-Rong Wen
. You are warmly welcome to visit us next time when you come to Beijing.
Areas: Information Retrieval, Natural Language Processing, Web Search, Data Mining, Information Extraction, and Big Data Analysis
Topics: Personalized Web Search, Anchor Text Mining, Web Search Ranking, Information Retrieval Evaluation,
Query Log Mining, User Intent Mining, Search Result Diversification, Information Extraction, Query Facet Extraction, et al.
- 2014/9-now Professor, Renmin University of China
- 2008/7-2014/9 Researcher, Microsoft Research Asia
- 2005-2008 Research Intern, Microsoft Research Asia
-
WebSensor
With the rapid growth of the web, there are grand challenges when making sense of BIG web data:
big volume, high velocity, high variety, and unknown veracity.
In the physical world, a sensor is a converter that measures a physical quantity and converts
it into a signal that can be read by an observer or by an instrument—today, mostly electronic.
The WebSensor project targets to providing a platform for both developers
and end users to easily build "web sensors" which are programmable "focused crawlers" that
continuously discover, extract, and aggregate structured information around a topic.
We developped a WebSensor platform based on Windows PowerShell and the .NET Framework.
It enables developers to easily create WebSensors that continuously extract information from the web
and generate time-series stream data.
The websensor platform has many built-in capabilities to extract and collect time-sequenced
data embedded in web sites. These built-in capabilities include:
- Convenient wrapper generation on webpages (just by a few clicks)
- Automatically wrapper adaption to page layout change
- Easy to configure and run
- Easy to extend using simple script language
- Easy to manage and retrieve the data collected
Websensors can connect to form a sensor network for more complex analysis tasks
that involve multiple time-sequenced data.
We has used the WebSensor platform to gather huge amounts of data related to "United States presidential election, 2012."
We analyzed the sentiment trends for "who will be the next president of United States,"
and sucessfully predict the election results.
The platform has also been used in Microsoft product teams,
such as Bing News team and Bing Relevance Measurement team.
Keywords: Big Data; Wrapper; Information Extraction; Time Series
-
Project Q:Query-Centric Search
Search has a long document-centric tradition, where "searching information" is equivalent
to "searching document". In Project Q, we explore a new query-centric
search paradigm, which treats query as object and shifts search from "searching document"
to “searching query”. We have developed several effective query mining technologies and
proved that, when deeply mining queries "without" time constraint, we can greatly improve
search relevance and user experiences. The concept of "query-centric search" has been
widely accepted by Microsoft Bing search ranking team. It brings them new opportunities
for improving ranking quality.
-
User Intent Understanding, Personalized Search, and Search Diversification
Studies show that the vast majority of queries to search engines are short and vague
in specifying a user’s intent.
Different users may have completely different information needs and goals when using precisely the same query.
For example, User A is finding information about Apply Company by issuing a query "apple,",
while User B is finding information related to fruit apple using the same query.
When such a query is issued, search engines will return a list of documents that mix different topics.
It takes time for a user to choose which information he/she wants.
We studied two effective solutions to solve this problem:
- Personalied Search: it provides different search results based upon the preferences of users
- Search Result Diversification: it provides a list of results that cover as many aspects as possible,
so that most users can be satisfied by the top results
-
Deep Web Site and Web Page Understanding
Web pages are the main sources of information on the Web.
Without deep understanding of web pages, we cannot extract rich information from the Web.
We take advantage of text, HTML DOM structure and associated visual features,
such as font size, width and height of a DOM element,
to understand the information contained in web pages.
We explored several methods problems related to this, such as:
- Website template generation and wrapper conduction
- Repeat information (in text pattern or layout structure) detection and extraction
- Change pattern summarization and classification by analyzing multiple versions of a page
- Main content extraction from news pages
-
WebStudio: Experimental Search Platform
WebStudio is an end-to-end experimental search system for facilitating search experiments
on specific web data collections.
In WebStudio, some default components are implemented.
Users can customize major operations (including document parsing, page classification,
index building, index serving, and front-end processing) in the E2E search engine,
by adding their own experimental logic for testing ideas.
-
Web Search Ranking
Web Search ranking quality is one of the most important element for the success of a search engine.
We closely collaborated with Bing ranking team on developing
effective technologies for improving ranking quality of Bing search engine.
*: Corresponding author; ____ indicates the author is/was my student/Postdoc when the work was done;
2024
-
Yutao Zhu, Peitian Zhang, Chenghao Zhang, Yifei Chen, Binyu Xie, Zhicheng Dou, Zheng Liu, Ji-Rong Wen:
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning. CoRR abs/2401.06532 (2024)
-
Peitian Zhang, Zheng Liu, Shitao Xiao, Ninglu Shao, Qiwei Ye, Zhicheng Dou:
Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon. CoRR abs/2401.03462 (2024)
-
Wenhan Liu, Ziliang Zhao, Yutao Zhu, and Zhicheng Dou*.
Mining Exploratory Queries for Conversational Search.
In Proceedings of the ACM Web Conference 2024 (WWW 2024), May 13 - 17, 2024, Singapore.
(TheWebConf 2024) (CCF A)
(Download |
DOI)
-
Ziliang Zhao and Zhicheng Dou*.
Generating Multi-turn Clarification for Web Information Seeking.
In Proceedings of the ACM Web Conference 2024 (WWW 2024), May 13 - 17, 2024, Singapore.
(TheWebConf 2024) (CCF A)
(Download |
DOI)
-
Yujia Zhou Zheng Liu*, Jiajie Jin, Jian-Yun Nie, and Zhicheng Dou*.
Metacognitive Retrieval-Augmented Large Language Models.
In Proceedings of the ACM Web Conference 2024 (WWW 2024), May 13 - 17, 2024, Singapore.
(TheWebConf 2024) (CCF A)
(Download |
DOI)
-
Yujia Zhou, Qiannan Zhu, Jiajie Jin, and Zhicheng Dou*.
Cognitive Personalized Search Integrating Large Language Models with an Efficient Memory Mechanism.
In Proceedings of the ACM Web Conference 2024 (WWW 2024), May 13 - 17, 2024, Singapore.
(TheWebConf 2024) (CCF A)
(Download |
DOI)
-
Jiongnan Liu, Zhicheng Dou*, Jian-Yun Nie, Ji-Rong Wen:
Integrated Personalized and Diversified Search Based on Search Logs. IEEE Trans. Knowl. Data Eng. 36(2): 694-707 (2024)
(TKDE 2024) (CCF A)
(Download |
DOI)
-
Shuting Wang, Zhicheng Dou*, Jiongnan Liu, Qiannan Zhu, and Ji-Rong Wen. 2024.
Personalized and Diversified: Ranking Search Results in an Integrated Way.
ACM Trans. Inf. Syst. 42, 3, Article 81 (May 2024), 25 pages. https://doi.org/10.1145/3631989
(TOIS 2024) (CCF A)
(Download |
DOI)
-
Yutong Bai, Yujia Zhou, Zhicheng Dou, and Ji-Rong Wen. 2024.
Intent-oriented Dynamic Interest Modeling for Personalized Web Search.
ACM Trans. Inf. Syst. Just Accepted (January 2024). https://doi.org/10.1145/3639817
(TOIS 2024) (CCF A)
(Download |
DOI)
-
Xiaoxi Li, Yujia Zhou, Zhicheng Dou*.
UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models.
In Proceedings of the 38th Annual AAAI Conference on Artificial Intelligence.
(AAAI 2024) (CCF A)
(Download |
DOI)
-
Zhirui Deng, Zhicheng Dou*, Yutao Zhu, and Ji-Rong Wen. 2024
CL4DIV: A Contrastive Learning Framework for Search Result Diversification.
In Proceedings of the 17th ACM International Conference on Web Search and Data Mining.
(WSDM 2024) (CCF B)
(Download |
DOI)
2023
-
Qingyao Ai, Ting Bai, Zhao Cao, Yi Chang, Jiawei Chen, Zhumin Chen, Zhiyong Cheng, Shoubin Dong, Zhicheng Dou, Fuli Feng, Shen Gao, Jiafeng Guo, Xiangnan He, Yanyan Lan, Chenliang Li, Yiqun Liu, Ziyu Lyu, Weizhi Ma, Jun Ma, Zhaochun Ren, Pengjie Ren, Zhiqiang Wang, Mingwen Wang, Ji-Rong Wen, Le Wu, Xin Xin, Jun Xu, Dawei Yin, Peng Zhang, Fan Zhang, Weinan Zhang, Min Zhang, Xiaofei Zhu:
Information Retrieval meets Large Language Models: A strategic report from Chinese IR community. AI Open 4: 80-90 (2023)
(DOI |
Download )
-
Yutao Zhu, Huaying Yuan, Shuting Wang, Jiongnan Liu, Wenhan Liu, Chenlong Deng, Haonan Chen, Zhicheng Dou, Ji-Rong Wen:
Large Language Models for Information Retrieval: A Survey. CoRR abs/2308.07107
(aXiv |
Download )
(2023)
-
Peitian Zhang, Shitao Xiao, Zheng Liu, Zhicheng Dou, Jian-Yun Nie:
Retrieve Anything To Augment Large Language Models. CoRR abs/2310.07554 (2023)
(aXiv |
Download )
-
Hongjin Qian, Zhicheng Dou, Jiejun Tan, Haonan Chen, Haoqi Gu, Ruofei Lai, Xinyu Zhang, Zhao Cao, Ji-Rong Wen:
Optimizing Factual Accuracy in Text Generation through Dynamic Knowledge Selection. CoRR abs/2308.15711 (2023)
(aXiv |
Download )
-
Zhirui Deng, Zhicheng Dou*, Ji-Rong Wen:
DeepQFM: a deep learning based query facets mining method.
Information Retrieval Journal. Volume 26, article number 9, (2023)
(Online |
Download )
-
Yujia Zhou, Jing Yao, Ledell Wu, Zhicheng Dou* and Ji-Rong Wen. 2023.
WebUltron: An Ultimate Retriever on Webpages Under the Model-Centric Paradigm.
IEEE Transactions on Knowledge and Data Engineering, doi: 10.1109/TKDE.2023.3332858.
(TKDE 2023) (CCF A)
(
Online
|
Download
)
-
Yujia Zhou, Zhicheng Dou*, and Ji-Rong Wen. 2023.
Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback.
In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.
(EMNLP 2023) (CCF B)
Online
Download
-
Peitian Zhang, Zheng Liu, Shitao Xiao, Zhicheng Dou, and Jing Yao. 2023.
Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval
In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.
(EMNLP 2023) (CCF B)
Online
Download
-
Kelong Mao, Zhicheng Dou*, Fengran Mo, Jiewen Hou, Haonan Chen, and Hongjin Qian. 2023.
Large Language Models Know Your Contextual Search Intent: A Prompting Framework for Conversational Search.
In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing.
(EMNLP 2023 Findings)
Online
Download
-
Zihan Wang, Yujia Zhou, Yiteng Tu, and Zhicheng Dou*. 2023.
NOVO: Learnable and Interpretable Document Identifiers for Model-Based IR.
In Proceedings of the 32nd ACM International Conference on Information and Knowledge
Management (CIKM ’23), October 21–25, 2023, Birmingham, United Kingdom.
ACM, New York, NY, USA, 10 pages.
(CIKM 2023) (CCF B)
(
Download
| DOI
)
-
Huaying Yuan, Zhicheng Dou*, Yujia Zhou, Yu Guo, and Ji-Rong Wen. 2023.
VILE: Block-Aware Visual Enhanced Document Retrieval. In Proceedings
of the 32nd ACM International Conference on Information and Knowledge
Management (CIKM ’23), October 21–25, 2023, Birmingham, United Kingdom.
ACM, New York, NY, USA, 10 pages.
(CIKM 2023) (CCF B)
(
Download
| DOI
)
-
Zhan Su, Zhicheng Dou*,
Yujia Zhou, Ziyuan Zhao, and Ji-Rong Wen. 2023.
PSLOG: Pretraining with Search Logs for Document Ranking. In Proceedings
of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data
Mining (KDD ’23), August 6–10, 2023, Long Beach, CA, USA. ACM, New York,
NY, USA, 11 pages.
(KDD 2023) (CCF A)
(
Download
| DOI
)
-
Shitong Dai, Jiongnan Liu, Zhicheng Dou*,
Haonan Wang, Lin Liu, Bo Long, and Ji-Rong Wen. 2023.
Contrastive Learning for User Sequence Representation in Personalized Product Search. In Proceedings of the 29th ACM
SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’23),
August 6–10, 2023, Long Beach, CA, USA. ACM, New York, NY, USA, 11 pages.
(KDD 2023) (CCF A)
(
Download
| DOI
)
-
Ziliang Zhao, Zhicheng Dou*, Yu Guo, Zhao Cao, and Xiaohua Cheng. 2023.
Improving Search Clarification with Structured Information Extracted from Search Results.
In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’23), August 6–10, 2023, Long Beach,
CA, USA. ACM, New York, NY, USA, 10 pages.
(KDD 2023) (CCF A)
(
Download
| DOI
)
-
Search-oriented Conversational Query Editing.
Kelong Mao, Zhicheng Dou*, Bang Liu, Hongjin Qian,
Fengran Mo, Xiangli Wu, Xiaohua Cheng and Zhao Cao.
In Findings of the Association for Computational Linguistics: ACL (Findings) 2023.
(ACL 2023 - Findings)
(
Download
| DOI
)
-
Hence, Socrates is mortal: A Benchmark for Natural Language Syllogistic Reasoning.
Yongkang Wu, Meng Han, Yutao Zhu, Lei Li, Xinyu Zhang, Ruofei Lai, Xiaoguang Li,
Yuanhang Ren, Zhicheng Dou, and Zhao Cao.
In Findings of the Association for Computational Linguistics: ACL (Findings) 2023.
(ACL 2023 - Findings)
(
Download
| DOI
)
-
Jiongnan Liu, Zhicheng Dou*, Guoyu Tang, and Sulong Xu. 2023.
JDsearch: A
Personalized Product Search Dataset with Real Queries and Full Interactions.
In Proceedings of the 46th International ACM SIGIR Conference on Research
and Development in Information Retrieval (SIGIR ’23), July 23–27, 2023, Taipei,
Taiwan. ACM, New York, NY, USA, 8 pages.
(SIGIR 2023)
(
Download
| DOI
)
- Kelong Mao, Hongjin Qian, Fengran Mo, Zhicheng Dou*, Bang Liu, Xiaohua
Cheng, and Zhao Cao. 2023.
Learning Denoised and Interpretable Session
Representation for Conversational Search.
In Proceedings of the ACM Web
Conference 2023 (WWW ’23) (Spotlight paper!), May 1–5, 2023, Austin, TX, USA. ACM, New
York, NY, USA, 11 pages.
(WWW 2023) (CCF A)
(
Download
| DOI
)
-
Shuting Wang, Zhicheng Dou, Jing Yao, Yujia Zhou, and Ji-Rong Wen. 2023.
Incorporating Explicit Subtopics in Personalized Search. In Proceedings of the
ACM Web Conference 2023 (WWW ’23), May 1–5, 2023, Austin, TX, USA. ACM,
New York, NY, USA, 11 pages.
(WWW 2023) (CCF A)
(
Download
| DOI
)
- Yujia Zhou, Jing Yao, Zhicheng Dou*, Ledell Yu Wu and Ji-rong Wen.
DynamicRetriever: A Pre-trained Model-based IR System Without an Explicit Index.
Machine Intelligence Research. 20(2): 276-288 (2023).
(
Download
| DOI
)
- Han Zhang, Zhicheng Dou*, Yutao Zhu and Ji-rong Wen.
Contrastive Learning for Legal Judgment Prediction. ACM Transactions on Information Systems. 41(4): 113:1-113:25 (2023)
(TOIS) (CCF A)
(
Download
| DOI
)
- Qingyu Bing, Qiannan Zhu, and Zhicheng Dou*. 2023.
Cognition-aware
Knowledge Graph Reasoning for Explainable Recommendation.
In Proceedings of the Sixteenth ACM International Conference on Web Search and Data
Mining (WSDM ’23), February 27-March 3, 2023, Singapore, Singapore. ACM,
New York, NY, USA, 9 pages.
(WSDM 2023) (CCF B)
(
Download
| DOI
)
- Shuting Wang, Zhicheng Dou*, and Yutao Zhu. 2023.
Heterogeneous Graph based Context-aware Document Ranking. In Proceedings of the Sixteenth
ACM International Conference on Web Search and Data Mining (WSDM ’23),
February 27-March 3, 2023, Singapore, Singapore. ACM, New York, NY, USA,
9 pages.
(WSDM 2023) (CCF B)
(
Download
| DOI
)
-
Yuhang Ye, Zhonghua Li, Zhicheng Dou, Yutao Zhu,
Changwang Zhang, Shangquan Wu, Zhao Cao.
Learning from the Wisdom of Crowds: Exploiting Similar Sessions for Session Search. In Proceedings of AAAI 2023: 4818-4826
(AAAI 2023) (CCF A)
(
Download
| DOI
)
-
Zihan Wang, Hongjin Qian, Zhicheng Dou.
Learning on Structured Documents for Conditional Question Answering. CCL 2023: 37-57
-
Han Zhang, Zhicheng Dou.
Case Retrieval for Legal Judgment Prediction in Legal Artificial Intelligence. CCL 2023: 434-448
-
Hongjin Qian and Zhicheng Dou*. 2023. Topic-Enhanced Personalized Retrieval-Based Chatbot.
In Advances in Information Retrieval: 45th European Conference on Information Retrieval,
ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part II. Springer-Verlag,
Berlin, Heidelberg, 79–93.
(ECIR 2023)
(
Download
| DOI
)
-
Kelong Mao, Zhicheng Dou, Haonan Chen, Fengran Mo, Hongjin Qian:
Large Language Models Know Your Contextual Search Intent:
A Prompting Framework for Conversational Search. https://arxiv.org/abs/2303.06573 (2023)
(DOI)
-
Hongjing Qian, Yutao Zhu, Zhicheng Dou, Haoqi Gu, Xinyu Zhang, Zheng Liu,
Ruofei Lai, Zhao Cao, Jian-Yun Nie, Ji-Rong Wen:
WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus.
https://arxiv.org/abs/2304.04358 (2023)
(DOI)
- Peitian Zhang, Zheng Liu, Yujia Zhou, Zhicheng Dou, Zhao Cao:
Term-Sets Can Be Strong Document Identifiers For Auto-Regressive Search Engines.
https://arxiv.org/abs/2305.13859 (2023)
(DOI)
- Jiongnan Liu, Jiajie Jin, Zihan Wang, Jiehan Cheng, Zhicheng Dou, Ji-Rong Wen:
RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit.
https://arxiv.org/abs/2306.05212 (2023)
(DOI)
- Jing Yao, Zheng Liu, Junhan Yang, Zhicheng Dou, Xing Xie and Ji-rong Wen.
CDSM: Cascaded Deep Semantic Matching on Textual Graphs Leveraging Ad-hoc Neighbor Selection.
ACM Transactions on Intelligent Systems and Technology, 14(2): 32:1-32:24 (2023)
(Download
| DOI)
-
[J] Yujia Zhou, Zhicheng Dou* and Ji-Rong Wen,
"Enhancing Potential Re-finding in Personalized Search with Hierarchical Memory Networks,"
in IEEE Transactions on Knowledge and Data Engineering. 35(4): 3846-3857 (2023)
(TKDE 2023) (CCF A)
(Download |
DOI)
- Haonan Chen, Zhicheng Dou*, Qiannan Zhu, Xiaochen Zuo, and Ji-Rong Wen.
Integrating Representation and Interaction for Context-aware Document Ranking.
ACM Trans. Inf. Syst. 41(1): 21:1-21:23 (2023)
(TOIS 2023) (CCF A)
(Download |
DOI)
-
Xubo Qin, Zhicheng Dou, Yutao Zhu, and Ji-Rong Wen. 2022.
GDESA: Greedy Diversity Encoder with Self-Attention for Search Results Diversification.
ACM Trans. Inf. Syst. 41(2): 34:1-34:36 (2023)
(TOIS 2023) (CCF A)
(Download |
DOI)
2022
- Kelong Mao, Zhicheng Dou*, Hongjin Qian, Fengran Mo, Xiaohua Cheng, and Zhao Cao. 2022. ConvTrans: Transforming Web Search Sessions for Conversational Dense Retrieval. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 2935–2946, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics
(EMNLP 2022) (CCF B)
(Download | URL)
-
Hongjin Qian and Zhicheng Dou*. 2022. Explicit Query Rewriting for Conversational Dense Retrieval. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 4725–4737, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics
(EMNLP 2022) (CCF B)
(Download | URL)
-
Zhaoye Fei, Yu Tian, Yongkang Wu, Xinyu Zhang, Yutao Zhu, Zheng Liu, Jiawen Wu, Dejiang Kong, Ruofei Lai, Zhao Cao, Zhicheng Dou, and Xipeng Qiu. 2022. Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding. In Proceedings of the 29th International Conference on Computational Linguistics, pages 4952–4964, Gyeongju, Republic of Korea. International Committee on Computational Linguistics
(COLING 2022) (CCF B)
(Download | URL)
-
Zhaoheng Huang, Zhicheng Dou*, Yutao Zhu, and Zhengyi Ma. 2022.
MCP: Self-supervised Pre-training for Personalized Chatbots with Multi-level Contrastive Sampling.
In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 1030–1042, Abu Dhabi, United Arab Emirates.
Association for Computational Linguistics.
(EMNLP 2022 Findings)
(CCF B)
(Download | URL)
-
Zhan Su, Zhicheng Dou*, Yutao Zhu, and Ji-Rong Wen.
2022. Knowledge Enhanced Search Result Diversification.
In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22).
Association for Computing Machinery, New York, NY, USA, 1687–1695.
(KDD 2022) (CCF A)
(
Download |
DOI)
-
Yutao Zhu, Jian-Yun Nie, Yixuan Su, Haonan Chen, Xinyu Zhang, and
Zhicheng Dou*. 2022. From Easy to Hard: A Dual Curriculum Learning
Framework for Context-Aware Document Ranking. In Proceedings of the 31st
ACM International Conference on Information and Knowledge Management
(CIKM ’22), October 17–21, 2022, Atlanta, GA, USA. ACM, New York, NY,
USA, 11 pages.
(CIKM 2022) (CCF B)
(
Download |
DOI)
-
Haonan Chen, Zhicheng Dou*, Yutao Zhu, Zhao Cao, Xiaohua Cheng, and Ji-Rong Wen.
2022. Enhancing User Behavior Sequence Modeling by Generative Tasks for Session Search.
In Proceedings of the 31st ACM International
Conference on Information and Knowledge Management (CIKM ’22), October 17–21, 2022, Atlanta, GA, USA. ACM, New York, NY, USA, 11 pages.
(CIKM 2022) (CCF B)
(
Download |
DOI)
-
Hanxun Zhong, Zhicheng Dou*, Yutao Zhu, Hongjin Qian, and Ji-Rong Wen. 2022.
Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation.
In Proceedings of the 2022 Conference of the North American Chapter of the Association
for Computational Linguistics: Human Language Technologies, pages 5808–5820, Seattle,
United States. Association for Computational Linguistics.
(NAACL 2022) (CCF B)
(
Download |
DOI
)
- Kelong Mao, Zhicheng Dou*, and Hongjin Qian.
Curriculum Contrastive Context Denoising for Few-shot Conversational Dense Retrieval.
In Proceedings of SIGIR 2022.
(SIGIR 2022) (CCF A)
(Download |
DOI)
- Yu Guo, Zhengyi Ma, Jiaxin Mao, Hongjin Qian, Xinyu Zhang, Hao Jiang, Zhao Cao,
and Zhicheng Dou*.
Webformer: Pre-training with Web Pages for Information Retrieval.
In Proceedings of SIGIR 2022.
(SIGIR 2022) (CCF A)
(Download |
DOI)
- Ziliang Zhao, Zhicheng Dou*, Jiaxin Mao and Ji-Rong Wen.
Generating Clarifying Questions with Web Search Results.
In Proceedings of SIGIR 2022.
(SIGIR 2022) (CCF A)
(Download |
DOI)
- Yujia Zhou, Zhicheng Dou*, Huaying Yuan and Zhengyi Ma.
Socialformer: Social Network Inspired Long Document Modeling for Document Ranking.
In Proceedings of the ACM Web Conference 2022 (WWW ’22), April 25–29, 2022, Virtual Event, Lyon, France. ACM, New York, NY, USA.
(TheWebConf 2022) (CCF A)
(Download |
DOI)
- Jiongnan Liu, Zhicheng Dou*, Qiannan Zhu, and Ji-Rong Wen.
A Category-aware Multi-interest Model for Personalized Product Search.
In Proceedings of the ACM Web Conference 2022 (WWW ’22), April 25–29, 2022, Virtual Event, Lyon, France. ACM, New York, NY, USA.
(TheWebConf 2022) (CCF A)
(Download |
DOI)
- Qiannan Zhu, Haobo Zhang, Qing He, and Zhicheng Dou*.
A Gain-Tuning Dynamic Negative Sampler for Recommendation.
In Proceedings of the ACM Web Conference 2022 (WWW ’22), April 25–29, 2022, Virtual Event, Lyon, France. ACM, New York, NY, USA.
(TheWebConf 2022) (CCF A)
(Download |
DOI)
-
Xiaochen Zuo, Zhicheng Dou*, Ji-Rong Wen.
Improving Session Search by Modeling Multi-Granularity Historical Query Change.
In Proceedings of the 15th ACM International Conference on Web Search and Data Mining (WSDM '22). ACM, New York, NY, USA.
(WSDM 2022) (CCF B)
(Download |
DOI)
-
Chenlong Deng, Yujia Zhou, and Zhicheng Dou*.
Improving Personalized Search with Dual-Feedback Network.
In Proceedings of the 15th ACM International Conference on Web Search and Data Mining (WSDM '22). ACM, New York, NY, USA,
(WSDM 2022) (CCF B)
(Download |
DOI)
-
Chengzhen Fu, Enrui Hu, Letian Feng, Zhicheng Dou, Yantao Jia, Lei Chen, Pan Yu, and Zhao Cao.
Leveraging Multi-view Inter-passage Interactions for Neural Document Ranking.
In Proceedings of the 15th ACM International Conference on Web Search and Data Mining (WSDM '22). ACM, New York, NY, USA,
(WSDM 2022) (CCF B)
(Download |
DOI)
2021
-
[J] Yutao Zhu, Ruihua Song, Jian-Yun Nie, Pan Du, Zhicheng Dou, and Jin Zhou. 2021.
Leveraging Narrative to Generate Movie Script.
ACM Trans. Inf. Syst. Just Accepted (December 2021).
(TOIS 2021) (CCF A)
(Download |
DOI)
-
[J] Jing Yao, Zhicheng Dou*, and Ji-Rong Wen. 2021.
Clarifying Ambiguous Keywords with Personal Word Embeddings for Personalized Search.
ACM Trans. Inf. Syst. 40, 3, Article 43, 29 pages.
(TOIS 2021) (CCF A)
(Download |
DOI)
-
[J] Jing Yao, Zhicheng Dou*, Jian-Yun Nie, and Ji-Rong Wen.
Looking Back on the Past: Active Learning with Historical Evaluation Results.
in IEEE Transactions on Knowledge and Data Engineering.
(TKDE 2021) (CCF A)
(Download | DOI)
-
[J] Jing Yao, Zhicheng Dou*, Jun Xu, and Ji-Rong Wen.
RLPS: A Reinforcement Learning based Framework for Personalized Search.
in Transactions on Information Systems
(TOIS 2021) (CCF A)
(Download | DOI)
-
Yujia Zhou, Zhicheng Dou*, Yutao Zhu, Ji-Rong Wen. 2021. PSSL: Self-supervised
Learning for Personalized Search with Contrastive Sampling.
In Proceedings of the 30th ACM International Conference on Information
and Knowledge Management (CIKM ’21), November 1–5, 2021, Virtual Event,
QLD, Australia. ACM, New York, NY, USA, 10 pages.
(CIKM 2021) (CCF B)
(Download | DOI)
-
Hongjin Qian, Zhicheng Dou*, Yutao Zhu, Yueyuan Ma, and Ji-Rong Wen.
2021. Learning Implicit User Profiles for Personalized Retrieval-Based Chatbot.
In Proceedings of the 30th ACM International Conference on Information
and Knowledge Management (CIKM ’21), November 1–5, 2021, Virtual
Event, QLD, Australia. ACM, New York, NY, USA, 11 pages.
(CIKM 2021) (CCF B)
(Download | DOI)
-
Jing Yao, Zhicheng Dou*, Ruobing Xie, Yanxiong Lu, Zhiping Wang and Ji-Rong Wen. 2021.
USER: A Unified Information Search and Recom-mendation Model based on Integrated Behavior Sequence. InProceedings ofthe 30th ACM International Conference on Information and Knowledge Man-agement (CIKM ’21), November 1–5, 2021, Virtual Event, QLD, Australia.ACM,New York, NY, USA, 11 pages.
(CIKM 2021) (CCF B)
(Download | DOI)
- Zhengyi Ma, Zhicheng Dou*, Wei Xu, Xinyu Zhang, Hao Jiang, Zhao Cao,and Ji-Rong Wen. 2021.
Pre-training for Ad-hoc Retrieval: Hyperlink is Also You Need. InProceedings of the 30th ACM International Conference onInformation and Knowledge Management (CIKM ’21), November 1–5, 2021,Virtual Event, QLD, Australia.ACM, New York, NY, USA, 10 pages
(CIKM 2021) (CCF B)
(Download | DOI)
- Yutao Zhu, Jian-Yun Nie, Zhicheng Dou*, Zhengyi Ma, Xinyu Zhang, Pan Du,
Xiaochen Zuo, and Hao Jiang. 2021.
Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking . InProceedings of the 30th
ACM International Conference on Information and Knowledge Management (CIKM ’21), New York, NY, USA, 2780–2791.
(CIKM 2021) (CCF B)
(Download |
DOI)
-
Shuqi Lu, Chenyan Xiong, Di He, Guolin Ke, Waleed Malik, Zhicheng Dou, Paul Bennett, Tie-Yan Liu, Arnold Overwijk.
Less is More: Pre-training a Strong Siamese Encoder Using a Weak Decoder.
The 2021 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2021) (CCF B)
- Zhengyi Ma, Zhicheng Dou*, Yutao Zhu, Hanxun Zhong, and Ji-Rong Wen. One Chatbot Per Person: Creating Personalized Chatbots based onImplicit User Profiles . InProceedings of the 44th International ACM SIGIRConference on Research and Development in Information Retrieval (SIGIR ’21),July 11–15, 2021, Virtual Event, Canada.ACM, New York, NY, USA, 10 pages.
(SIGIR 2021) (CCF A)
(Download | DOI)
- Zhan Su, Zhicheng Dou*,Yutao Zhu, Xubo Qin, and Ji-Rong Wen. Modeling Intent Graph for Search Result Diversification. InProceedings of the 44th International ACM SIGIRConference on Research and Development in Information Retrieval (SIGIR ’21),July 11–15, 2021, Virtual Event, Canada.ACM, New York, NY, USA, 10 pages.
(SIGIR 2021) (CCF A)
(Download | DOI)
- Yujia Zhou, Zhicheng Dou*, Bingzheng Wei, Ruobing Xie, and Ji-Rong Wen. Group based Personalized Search by Integrating Search Behaviourand Friend Network. InProceedings of the 44th International ACM SIGIRConference on Research and Development in Information Retrieval (SIGIR ’21),July 11–15, 2021, Virtual Event, Canada.ACM, New York, NY, USA, 10 pages.
(SIGIR 2021) (CCF A)
(Download | DOI)
- Hongjin Qian, Xiaohe Li, Hanxun Zhong, Yu Guo, Yueyuan Ma, Yutao Zhu, Zhanliang Liu, Zhicheng Dou*, and Ji-Rong Wen. Pchatbot: A Large-Scale Dataset for Personalized Chatbot. InProceedings of the 44th International ACM SIGIRConference on Research and Development in Information Retrieval (SIGIR ’21),July 11–15, 2021, Virtual Event, Canada.ACM, New York, NY, USA, 8 pages (Resource Paper).
(SIGIR 2021) (CCF A)
(Download | DOI)
- Xinyu Zhang, Ke Zhan, Enrui Hu, Chengzhen Fu, Lan Luo, Hao Jiang, Yantao Jia, Pan Yu, Zhao Cao, Zhicheng Dou, Lei Chen. Answer Complex Questions: Path Ranker Is All You Need. InProceedings of the 44th International ACM SIGIRConference on Research and Development in Information Retrieval (SIGIR ’21),July 11–15, 2021, Virtual Event, Canada.ACM, New York, NY, USA, 10 pages.
(SIGIR 2021) (CCF A) (Download | DOI)
- Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, Hao Jiang, andZhicheng Dou. Proactive Retrieval-based Chatbots based on Relevant
Knowledge and Goals. InProceedings of the 44th International ACM SIGIRConference on Research and Development in Information Retrieval (SIGIR ’21),July 11–15, 2021, Virtual Event, Canada.ACM, New York, NY, USA, 5 pages (Short Paper).
(SIGIR 2021) (CCF A)
(Download | DOI)
- Jing Yao, Zhicheng Dou*, and Ji-Rong Wen. FedPS: A Privacy Protection Enhanced Personalized Search Framework.
Proceedings of 30th The Web Conference
(WWW 2021) (CCF A)
(Download)
- Yutao Zhu, Kun Zhou, Jian-Yun Nie, Shengchao Liu, and Zhicheng Dou. Neural Sentence Ordering Based on Constraint Graphs.
Proceedings of The Thirty-Fifth AAAI Conference on Artificial Intelligence
(AAAI 2021) (CCF A)
(Download)
- Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, and Zhicheng Dou. Content Selection Network for Document-grounded Retrieval-based Chatbots.
Proceedings of the 43rd edition of the annual BCS-IRSG European Conference on Information Retrieval
(ECIR 2021)
(Download)
- Zhumin Chen, Xueqi Cheng, Shoubin Dong, Zhicheng Dou, Jiafeng Guo, Xuanjing Huang, Yanyan Lan, Chenliang Li, Ru Li, Tie-Yan Liu, Yiqun Liu, Jun Ma, Bing Qin, Mingwen Wang, Ji-Rong Wen, Jun Xu, Min Zhang, Peng Zhang, Qi Zhang:
Information retrieval: a view from the Chinese IR community. Frontiers Comput. Sci. 15(1): 151601 (2021)
(Download | DOI)
- Yuqi Huo, Manli Zhang, Guangzhen Liu, Haoyu Lu, Yizhao Gao, Guoxing Yang, Jingyuan Wen, Heng Zhang, Baogui Xu, Weihao Zheng, Zongzheng Xi, Yueqian Yang, Anwen Hu, Jinming Zhao, Ruichen Li, Yida Zhao, Liang Zhang, Yuqing Song, Xin Hong, Wanqing Cui, Dan Yang Hou, Yingyan Li, Junyi Li, Peiyu Liu, Zheng Gong, Chuhao Jin, Yuchong Sun, Shizhe Chen, Zhiwu Lu, Zhicheng Dou, Qin Jin, Yanyan Lan, Wayne Xin Zhao, Ruihua Song, Ji-Rong Wen:
WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training. https://arxiv.org/pdf/2103.06561.pdf (2021)
- Xubo Qin, Zhicheng Dou, Yutao Zhu, Ji-Rong Wen:
Interaction-Based Document Matching for Implicit Search Result Diversification. CCIR 2021: 3-15
- Han Zhang, Zhicheng Dou, Yutao Zhu, Jirong Wen:
Few-Shot Charge Prediction with Multi-grained Features and Mutual Information. CCL 2021: 387-403
-
Xubo Qin, Zhicheng Dou, Yutao Zhu, and Jirong Wen. 2021. 基于双星型自注意力网络的搜索结果多样化方法(Search Result Diversification Framework Based on Dual Star-shaped Self-Attention Network). In Proceedings of the 20th Chinese National Conference on Computational Linguistics, pages 280–292, Huhhot, China. Chinese Information Processing Society of China. (URL)
- 郭宇,窦志成,文继荣.PCC:一个对单用户建模的个性化对话系统[J].中文信息学报,2021,35(12):112-121.
- 张晗,郑伟昊,窦志成,文继荣.融合法律文本结构信息的刑事案件判决预测[J/OL].计算机工程与应用:1-12[2023-01-24]
- 黄宝跃,宋红伟,王宏堃,窦志成.互联网中检察监督线索的监测和发现——以人工智能深度学习算法为视角[J].人民检察,2021(18):45-48.
2020
- Zhengyi Ma, Zhicheng Dou*, Guanyue Bian, and Ji-Rong Wen. PSTIE: Time Information Enhanced Personalized Search.
Proceedings of 29th ACM International Conference on Information and Knowledge Management
(CIKM 2020) (CCF B)
(Download | DOI)
- Xubo Qin, Zhicheng Dou* and Ji-Rong Wen. Diversifying Search Results using Self-Attention Network.
Proceedings of 29th ACM International Conference on Information and Knowledge Management
(CIKM 2020) (CCF B)
(Download | DOI)
- Jing Yao, Zhicheng Dou* and Ji-Rong Wen. Employing Personal Word Embeddings for Personalized Search.
Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
(SIGIR 2020) (CCF A)
(Download | DOI)
- Yujia Zhou, Zhicheng Dou* and Ji-Rong Wen.
Encoding History with Context-aware Representation Learning for Personalized Search.
Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
(SIGIR 2020) (CCF A)
(Download | DOI)
- Jiongnan Liu, Zhicheng Dou*, Xiaojie Wang, Shuqi Lu and Ji-Rong Wen.
DVGAN: A Minimax Game for Search Result Diversification Combining Explicit and Implicit Features.
Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
(SIGIR 2020) (CCF A)
(Download | DOI)
-
Shuqi Lu, Zhicheng Dou*, Chenyan Xiong, Xiaojie Wang and Ji-Rong Wen.
Knowledge Enhanced Personalized Search.
Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
(SIGIR 2020) (CCF A)
(Download | DOI)
-
Jing Yao, Zhicheng Dou*, Jun Xu, and Ji-Rong Wen.
RLPer: A Reinforcement Learning Model for Personalized Search.
Proceedings of The Web Conference 2020, April 20--24, 2020, Taipei, Taiwan (WWW 2020) (CCF A)
(Download | DOI)
-
Anwen Hu, Zhicheng Dou*, Jian-Yun Nie, and Ji-Rong Wen. Leveraging Multi-token Entities in Document-level Named Entity Recognition.
Proceedings of the thirty-Fourth AAAI Conference on Artificial Intelligence
(AAAI 2020) (CCF A)
(Download | DOI)
- Yujia Zhou, Zhicheng Dou*, and Ji-Rong Wen. 2020. Enhancing Re-finding Behavior with External Memories for Personalized Search.
In Proceedings of the 13th ACM International Conference on Web Search and Data Mining (WSDM '20). ACM, New York, NY, USA,
(WSDM 2020) (CCF B)
(Download | DOI)
-
Yutao Zhu, Ruihua Song, Zhicheng Dou*, Jian-Yun Nie, and Jin Zhou. ScriptWriter: Narrative-Guided Script Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational
Linguistics, {ACL} 2020, Online, July 5-10, 2020: 8647-8657 (ACL 2020) (CCF A)
(Download | DOI)
2019
- Shuqi Lu, Zhicheng Dou*, Xu Jun, Jian-Yun Nie, and Ji-Rong Wen. PSGAN: A Minimax Game for Personalized Search with Limited and Noisy Click Data.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval: 555-564 (SIGIR 2019) (CCF A)
(Download | DOI)
-
Zhicheng Dou*, Xue Yang, Diya Li, Ji-Rong Wen, and Tetsuya Sakai. Low-cost, bottom-up measures for evaluating search result diversification. Information Retrieval Journal (2019). (IRJ)
(Download | DOI)
-
Yutao Zhu, Zhicheng Dou*, Jian-Yun Nie, and Ji-Rong Wen. ReBoost: A Retrieval-Boosted Sequence-to-Sequence Model for Neural Response Generation. Information Retrieval Journal (2019). (IRJ)
(Download | DOI)
-
Juan Li, Zhicheng Dou*, Yutao Zhu, Xiaochen Zuo, and Ji-Rong Wen. Deep Cross-platform Product Matching in E-commerce. Information Retrieval Journal (2019). (IRJ)
(Download | DOI)
- Yujia Zhou, , Zhicheng Dou*, Songwei Ge, and Ji-Rong Wen. Dynamic Personalized Search Based on RNN with Attention Mechanism. Chinese Journal of Computer (2019), Vol 42.
(Download | DOI)
- Zhicheng Dou*, Xubo Qin, and Ji-Rong Wen. A Survey on Search Result Diversification. Chinese Journal of Computer (2019), Vol 42.
(Download | DOI)
- Shuqi Lu, Zhicheng Dou*, and Ji-Rong Wen. Research On Structural Data Extraction in Surgical Cases. Chinese Journal of Computer (2019), Vol 42.
(Download | DOI)
- Xiaochen Zuo, Zhicheng Dou*, Zhen Huang, Shuqi Lu. Product Category Mining Associated with Weibo Hot Topics. Journal of Computer Research and Development,2019, 56(09):1927-1938. (Download | DOI)
- Anwen Hu, Zhicheng Dou*, and Ji-Rong Wen. Document-Level Named Entity Recognition by Incorporating Global
and Neighbor Features. Information Retrieval. 25th China Conference, CCIR 2019, Fuzhou, China, September 20–22, 2019, Proceedings.
(Download | DOI)
2018
- Zhengbao Jiang, Zhicheng Dou*, Wayne Xin Zhao, Jian-Yun Nie, Ming Yue, and Ji-Rong Wen.
Supervised Search Result Diversification via Subtopic Attention. IEEE Trans. Knowl. Data Eng. 30(10): 1971-1984 (2018)
(TKDE) (CCF A)
(Download | DOI)
- Xiao-Jie Wang, Ji-Rong Wen, Zhicheng Dou*, Tetsuya Sakai, and Rui Zhang.
Search Result Diversity Evaluation Based on Intent Hierarchies, IEEE Trans. Knowl. Data Eng. 30(1): 156-169 (2018)
(TKDE) (CCF A)
(Download | DOI)
- Songwei Ge, Zhicheng Dou*, Zhengbao Jiang, Jian-Yun Nie, Ji-Rong Wen.
Personalizing Search Results Using Hierarchical RNN with Query-aware Attention.
in Proceedings of the 27th ACM International Conference on Information and Knowledge Management: 347-356
(CIKM 2018) (CCF B)
(Download | DOI)
- Ji-Rong Wen, Zhicheng Dou*, Ruihua Song: Personalized Web Search. Encyclopedia of Database Systems (2nd ed.) 2018
2017
-
Zhengbao Jiang, Ji-Rong Wen, Zhicheng Dou*, Wayne Xin Zhao, Jian-Yun Nie, Ming Yue.
Learning to Diversify Search Results via Subtopic Attention.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
(SIGIR 2017) (CCF A)
(Download | DOI)
- Zhengbao Jiang, Zhicheng Dou*, Ji-Rong Wen.
Generating Query Facets Using Knowledge Bases. IEEE Trans. Knowl. Data Eng. 29(2): 315-329 (2017)
(TKDE) (CCF A)
(Download | DOI)
- Zhicheng Dou, Zhengbao Jiang, Jinxiu Li, Yichun Zhang, and Ji-Rong Wen.
A Method of Mining Query Facets Based on Term Graph Analysis. Chinese Journal of Computer, 2017, 40(3):556-569.
(Download | DOI) (In Chinese)
2016
- Xiaojie Wang, Zhicheng Dou*, Tetsuya Sakai, and Ji-Rong Wen. Evaluating Search Result Diversity using Intent Hierarchies. In Proceedings of SIGIR, 2016. (SIGIR 2016) (CCF A)
(Download | DOI)
- Zhicheng Dou*, Zhengbao Jiang, Sha Hu, Ji-Rong Wen, Ruihua Song: Automatically Mining Facets for Queries from Their Search Results. IEEE Trans. Knowl. Data Eng. (TKDE) 28(2):385-397 (2016) (TKDE) (CCF A)
(Download | DOI)
- Sha Hu, Ji-Rong Wen, Zhicheng Dou, Shuo Shang. Following the dynamic block on the Web. World Wide Web 19(6): 1077-1101 (2016)
(Download | DOI)
- Takehiro Yamamoto, Yiqun Liu, Min Zhang, Zhicheng Dou, Ke Zhou, Ilya Markov, Makoto P. Kato, Hiroaki Ohshima, Sumio Fujita. Overview of the NTCIR-12 IMine-2 Task. NTCIR 2016 (Download)
- Ming Yue, Zhicheng Dou, Sha Hu, Jinxiu Li, Xiao-Jie Wang, Ji-Rong Wen. RUCIR at NTCIR-12 IMINE-2 Task. NTCIR 2016 (Download)
- Shaoping Ma, Ji-Rong Wen, Yiqun Liu, Zhicheng Dou, Min Zhang, Yi Chang, Xin Zhao.
Information Retrieval Technology - 12th Asia Information Retrieval Societies Conference, AIRS 2016, Beijing, China, November 30 - December 2, 2016, Proceedings. Lecture Notes in Computer Science 9994, Springer 2016, ISBN 978-3-319-48050-3 (Download)
2015
- Zhongqi Lu, Zhicheng Dou*, Xing Xie, Jianxun Lian, Qiang Yang. Content-based Collaborative Filtering for News Topic Recommendation. In Proceedings of Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2015), Austin Texas, USA, Jan 25-29, 2015. (AAAI 2015) (CCF A)
(Download | DOI)
- Sha Hu, Zhicheng Dou*, Xiaojie Wang, Tetsuya Sakai, and Ji-Rong Wen. 2015. Search Result Diversification Based on Hierarchical Intents. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM '15). ACM, New York, NY, USA, 63-72. (CIKM 2015) (CCF B)
(Download | DOI)
- Sha Hu, Zhicheng Dou*, Xiao-Jie Wang, Ji-Rong Wen: Search Result Diversification Based on Query Facets. J. Comput. Sci. Technol. (JCST) 30(4):888-901 (2015)
(Download | DOI)
- Zhicheng Dou and Ji-Rong Wen. Web Analytical Engine in the Big Data Era. Big Data Journal, 2015(3). (In Chinese)
(Download | DOI)
↑↑↑↑↑↑After I joined Renmin University of China↑↑↑↑↑↑
2014
- Yiqun Liu, Ruihua Song, Min Zhang, Zhicheng Dou, Takehiro Yamamoto, Makoto Kato, Hiroaki Ohshima, Ke Zhou. Overview of the NTCIR-11 IMine Task. Proceedings of the 11th NTCIR conference. (Download)
- Fei Chen, Yiqun Liu, Zhicheng Dou, Keyang Xu, Yujie Cao, Min Zhang, and Shaoping Ma, Revisiting the Evaluation of Diversified Search Evaluation Metrics with User Preferences. Proceedings of the 10th Asia Information Retrieval Society Conference (AIRS 2014) (Download)
- Jingfei Li, Dawei Song, Peng Zhang, Ji-Rong Wen, and Zhicheng Dou, Personalizing Web Search Results Based on Subspace Projection, Proceedings of the 10th Asia Information Retrieval Society Conference (AIRS 2014) (Download)
- Shu Tang, Zhicheng Dou, Xing Xie, and Jun He,
Detecting and Monitoring Dynamic Content Blocks of a Web Page by Merging its Historical Versions, in SIGIR 2014 Workshop on Temporal, Social and Spatially-aware Information Access (TAIA2014), 2014 (Download)
2013
- Xiao Ding, Zhicheng Dou, Bing Qin, Ting Liu, and Ji-Rong Wen, Improving Web Search Ranking by Incorporating Structured Annotation of Queries, in Proceedings of EMNLP 2013, pages 468-478, October 2013 (EMNLP 2013) (CCF B) (Download)
- Kosetsu Tsukuda, Tetsuya Sakai, Zhicheng Dou, and Katsumi Tanaka, Estimating Intent Types for Search Result Diversification, in Information Retrieval Technology, pages 25-37, Springer Berlin Heidelberg, 2013 (Download)
- Ke Zhou, Tetsuya Sakai, Mounia Lalmas, Zhicheng Dou, and Joemon M. Jose, Evaluating Heterogeneous Information Access, in ACM SIGIR 2013 Workshop on Modeling User Behavior for Information Access Evaluation, 2013 (Download)
- Qinglei Wang, Yanan Qian, Ruihua Song, Zhicheng Dou, Fan Zhang, Tetsuya Sakai, and Qinghua Zheng, Mining Subtopics from Text Fragments for a Web Query, in Information Retrieval 16(4) pages 484-503, 2013 (Download)
- Tetsuya Sakai and Zhicheng Dou, Summaries, Ranked Retrieval and Sessions: A Unified Framework for Information Access Evaluation, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 473-482, ACM, 2013 (The Best Paper Runner-Up Award) (SIGIR 2013) (CCF A) (Download)
- Tetsuya Sakai, Zhicheng Dou, and Carles Clarke, The Impact of Intent Selection on Diversified Search Evaluation, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 921-924, ACM, 2013 (SIGIR 2013) (CCF A) (Download)
- Tetsuya Sakai, Zhicheng Dou, Takehiro Yamamoto, Yiqun Liu, Min Zhang, Makoto Kato, Ruihua Song, and Mayu Iwata, Summary of the NTCIR-10 INTENT-2 Task: Subtopic Mining and Search Result Diversification, in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2013), pages 761 - 764, ACM, 2013 (SIGIR 2013) (CCF A) (Download)
- Tetsuya Sakai, Zhicheng Dou, Takehiro Yamamoto, Yiqun Liu, Min Zhang, and Ruihua Song, Overview of the NTCIR-10 INTENT-2 Task, in Proceedings of the 10th NTCIR Conference, pages 94-123, June 18-21, 2013 (Download)
- Kosetsu Tsukuda, Zhicheng Dou, and Tetsuya Sakai, Microsoft Research Asia at the NTCIR-10 Intent Task, in Proceedings of the 10th NTCIR Conference, June 2013 (Download)
- Kazuya Narita, Tetsuya Sakai, Zhicheng Dou, and Young-In Song, MSRA at NTCIR-10 1CLICK-2, in Proceedings of the 10th NTCIR Conference, 2013 (Download)
2012
- Tetsuya Sakai, Zhicheng Dou, Ruihua song, and Noriko Kando, The Reusability of a Diversified Search Test Collection, in Information Retrieval Technology (AIRS 2012), pages 26-38, Springer Berlin Heidelberg, 20 December 2012 (The Best Paper Award) (Download)
2011
- Zhicheng Dou, Sha Hu, Kun Chen, Ruihua Song, and Ji-Rong Wen, Multi-dimensional Search Result Diversification, in Proceedings of the fourth ACM international conference on Web search and data mining (WSDM 2011), pages 475-484, ACM, February 2011 (WSDM 2011) (CCF B) (Download)
- Zhicheng Dou, Finding Dimensions for Queries, in Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM 2011), pages 1311-1320, ACM, 2011 (CIKM 2011) (CCF B) (Download)
- Jialong Han, Qinglei Wang, Naoki Orii, Zhicheng Dou, Tetsuya Sakai, and Ruihua Song, Microsoft Research Asia at the NTCIR-9 Intent Task, in Proceedings of the 10th NTCIR Conference (NTCIR-9), National Institute of Informatics, 2011 (CIKM 2011) (CCF B) (Download)
2010
- Tetsuya Sakai, Nick Craswell, Ruihua Song, Stephen Robertson, Zhicheng Dou, and Chin-Yew Lin, Simple Evaluation Metrics for Diversified Search Results, in Proceedings of the Third International Workshop on Evaluating Information Access (EVIA), Volumn 26, pages 27, National Institute of Informatics, June 2010 (Download)
- Ruihua Song, Zhicheng Dou, Hsiao-Wuen Hon, and Yong Yu, Learning Query Ambiguity Models by Using Search Logs, Journal of Computer Science and Technology, 25(4), pages 782-738, Springer, July 2010 (Download)
2009
- Zhicheng Dou, Kun Chen, Ruihua Song, Yunxiao Ma, Shuming Shi, and Ji-Rong Wen, Microsoft Research Asia at the Web Track of TREC 2009, in Proceedings of TREC 2009, November 2009 (Download)
- Ji-Rong Wen, Zhicheng Dou, and Ruihua Song, Personalized Web Search, in Encyclopedia of Database Systems, pages 2099-2103, Springer-Verlag, New York, USA, September 2009 (Download)
- Zhicheng Dou, Ruihua Song, Jian-Yun Nie, and Ji-Rong Wen, Using Anchor Texts with Their Hyperlink Structure for Web Search, in Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval(SIGIR 2009), pages 227-234, ACM, July 2009 (SIGIR 2009) (CCF A) (Download)
- Zhicheng Dou, Ruihua Song, Ji-Rong Wen, and Xiaojie Yuan, Evaluating the Effectiveness of Personalized Web Search, in IEEE Transactions on Knowledge and Data Engineering (TKDE), 21(8), pages 1178-1190, IEEE computer Society Digital Library, Aug., 2009 (TKDE) (CCF A) (Download)
2008
- Zhicheng Dou, Ruihua Song, Xiaojie Yuan, and Ji-Rong Wen, Are click-through data adequate for learning web search rankings?, in Proceeding of the 17th ACM conference on Information and knowledge management (CIKM 2008), pages 73-82, ACM, New York, NY, USA, 2008 (CIKM 2008) (CCF B) (Download)
- Zhicheng Dou, Xiaojie Yuan, and Songbai He, Analysis of Query Repetition in Large-scale Chinese Search Log, Computer Engineering, 34(21), Volumn 21, pages 40-44, 2008 (In Chinese) (Download)
- Xiaojie Yuan, Zhicheng Dou, Lu Zhang, and Fang Liu, Automatic User Goals Identification Based on Anchor Text and Click-through Data, in Wuhan University Journal of Natural Sciences (WISA2008), 13(4), pages 495-500, 2008 (In Chinese) (Download)
- Xiaojie Yuan, Zhicheng Dou, Fang Liu, and Lu Zhang, Personalized Web Search Based on Dynamic User Profile, NDBC 2008: Proceedings of the 25th National Database Conference (In Chinese) , 2008 (Download)
- Lu ZHANG, Xiao-jie YUAN, Fang LIU, and Zhicheng Dou,
Research on Distributed Index Mechanism for Large Dataset, Microelectronics & Computer, Volume 10, Pages 037, 2008
2007
- Zhicheng Dou, Ruihua Song, and Ji-Rong Wen, A large-scale evaluation and analysis of personalized search strategies, in Proceedings of the 16th international conference on World Wide Web (WWW2007), pages 581-590, ACM Press, New York, NY, USA, 2007 (WWW 2007) (CCF A) (Download)
- Steering Committee member of Asia Information Retrieval Society (AIRS)
- Co-organizer: NTCIR-10 Intent-2 task, NTCIR-11 IMINE task
and NTCIR-12 IMINE-2 task
- PC/Reviewer
- SIGIR 2016, IJCAI 2016, ICDM 2016
- SIGIR 2015, BigData 2015, CCIR 2015, NLPCC 2015, EMNLP 2015, WWW 2015, AAAI 2015, ICDE 2015, DASFAA 2015 demo track, HIA 2015
- SIGIR 2014, CIKM 2014, WSDM 2014, KDD 2014, BigData 2014, NLPCC 2014
- SIGIR 2013, CIKM 2013, IEEE BIG Data 2013, OAIR 2013, WWW 2013, SDM 2013
- KDD 2012, WIDM 2009, KDD'08, WWW'07, KDD'07, APWeb'07, ICDM'06, et al
- Reviewer, TKDE, TIST, JCST, KAIS, GeoInformatica, Information Retrieval Journal, et al.
- Collaboration Award, Microsoft, 2013
- ACM SIGIR 2013 Best Paper Runner-Up Award
- AIRS 2012 Best Paper Award
- Microsoft Spot Award, 2012
- As one of the best interns at Microsoft Research Asia, visited Bill Gates' home in 2007
- Employing Page Links to Marge Pages of Articles (in publication)
- Information Sensors for Sensing Web Dynamics (in publication)
- Zhicheng Dou, Ruihua Song, and Ji-Rong Wen, Extracting Query Dimensions from Search Results, US-2013-0173605-A1, publication date:July 4, 2013
- Zhicheng Dou, Junyan Chen, Ruihua Song, and Ji-Rong Wen, Using Anchor Text With Hyperlink Structures for Web Searches, US-2011-0238644-A1, US8380722 B2, publication date: Feb 19, 2013
- Ji-Rong Wen, Guomao Xin, Yunxiao Ma, Yu Chen, Qing Yu, Yi Liu, Zhicheng Dou, and Shuming Shi, Data-Centric Search Engine Architecture, US-2011-0137886-A1, publication date: June 9, 2011
- Ji-Rong Wen, Yu Chen, Guomao Xin, Yunxiao Ma, Yi Liu, Zhicheng Dou, Qing Yu, and Shuming Shi, Experimental Web Search System, US-2011-0078131-A1, publication date: Mar 31, 2011