System Architecture¶

This page provides a high-level overview of the architecture of our system. At this level of abstraction, our system constitutes a domain-independent framework for facilitating conversational item recommendation. Thus, even though we will be using movie-related examples for illustration, it is straightforward to adapt the system to other domains.

The system architecture is shown in the figure below, illustrating the core process for each dialogue turn.

Natural Language Understanding¶

The NLU component converts the natural language UserUtterance into a DialogueAct. This process, comprising of intent detection and slot filling, is performed based on the current dialogue state.

Dialogue Manager¶

The DialogueManager tracks the dialogue state and decides what action the system should take. It consists of two sub-components:

DialogueStateTracker, which updates the DialogueState and DialogueContext based on the dialogue acts by both the agent and the user.
DialoguePolicy, which generates a dialogue act by the agent based on the current dialogue state. It defines the flow of the conversation, i.e., what steps an agent must take at every stage.

Natural Language Generation¶

The NLG component converts the output of the DialoguePolicy to a natural language response. Further, this component can (1) summarize the information need back to the user, to help them keep track of their stated preferences and (2) help the user to explore the item space by providing options.