1 Star 2 Fork 0

深层知识处理与语义计算 / knowledge-driven-dialogue

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
README.md 4.91 KB
一键复制 编辑 原始数据 按行查看 历史
Yu 提交于 2020-11-28 17:21 . update README.md.

Proactive Conversation——Knowledge Driven Dialogue

Motivation

Human-machine conversation is one of the most important topics in artificial intelligence (AI) and has received much attention across academia and industry in recent years. Currently dialogue system is still in its infancy, which usually converses passively and utters their words more as a matter of response rather than on their own initiatives, which is different from human-human conversation. Therefore, we set up a new conversation task, named proactive conversation which is a type of knowledge driven dialogue, where machines converse with humans based on a built knowledge graph. It aims at testing machines’ ability to conduct human-like conversations.

Knowledge Graph

We crawled the related knowledge information from the website MTime.com, which records the information of most films, heroes, and heroines in China. We collect both structured knowledge (such as “Harry Potter” is “directed by” “Chris Columbus”) as well as unstructured knowledge including short comments and synopsis. After the raw data collection, we construct a knowledge graph. Our knowledge graph is comprised of multiple SPO (Subject, Predicate, Object) knowledge triplets, where objects can be factoid facts and non-factoid sentences such as comments and synopsis.

Paper

Task Description

Given a dialogue goal g and a set of topic-related background knowledge M = f1 ,f2 ,..., fn , the system is expected to output an utterance "ut" for the current conversation H = u1, u2, ..., ut-1, which keeps the conversation coherent and informative under the guidance of the given goal. During the dialogue, the system is required to proactively lead the conversation from one topic to another. The dialog goal g is given like this: "Start->Topic_A->TOPIC_B", which means the machine should lead the conversation from any start state to topic A and then to topic B. The given background knowledge includes knowledge related to topic A and topic B, and the relations between these two topics.
image Figure1.Proactive Conversation Case. Each utterance of "BOT" could be predicted by system, e.g., utterances with black words represent history H,and utterance with green words represent the response ut predicted by system.

DuConv

We collected around 30k conversations containing 270k utterances named DuConv. Each conversation was created by two random selected crowdsourced workers. One worker was provided with dialogue goal and the associated knowledge to play the role of leader who proactively leads the conversation by sequentially change the discussion topics following the given goal, meanwhile keeping the conversation as natural and engaging as possible. Another worker was provided with nothing but conversation history and only has to respond to the leader.
  We devide the collected conversations into training, development, test1 and test2 splits. The test1 part with reference response is used for local testing such as the automatic evaluation of our paper. The test2 part without reference response is used for online testing such as the competition we had held and the Leader Board which is opened forever in https://ai.baidu.com/broad/leaderboard?dataset=duconv. The dataset is available at https://ai.baidu.com/broad/subordinate?dataset=duconv.

Baseline Performance

We provide retrieval-based and generation-based baseline systems. Both systems were implemented by PaddlePaddle (the Baidu deeplearning framework). The performance of the two systems is as follows:

baseline system F1/BLEU1/BLEU2 DISTINCT1/DISTINCT2
retrieval-based 31.72/0.291/0.156 0.118/0.373
generation-based 32.65/0.300/0.168 0.062/0.128

Competitions

Rank F1/BLEU1/BLEU2 DISTINCT1/DISTINCT2
1 49.22/0.449/0.318 0.118/0.299
2 47.76/0.430/0.296 0.110/0.275
3 46.40/0.422/0.289 0.118/0.303
马建仓 AI 助手
尝试更多
代码解读
代码找茬
代码优化
Python
1
https://gitee.com/semantic_kg/knowledge-driven-dialogue.git
git@gitee.com:semantic_kg/knowledge-driven-dialogue.git
semantic_kg
knowledge-driven-dialogue
knowledge-driven-dialogue
master

搜索帮助

344bd9b3 5694891 D2dac590 5694891