In English

Dialogue modeling using recurrent neural networks

Viktor Anderling ; Jonathan Orrö
Göteborg : Chalmers tekniska högskola, 2016. 61 s.
[Examensarbete på avancerad nivå]

As the importance of computers in everyday life increases, so does the demand for better human-computer interfaces. Natural language, being our most natural form of communication, combined with man’s innate tendency of anthropomorphism, motivates the idea of a talking machine. Existing dialogue systems have the problem of being unable to answer out-of-domain questions as well as being tedious to design. While these systems are developed with hand-crafted rules, the goal of this thesis is to investigate if a dialogue system could be automatically trained to speak instead. Our aim is to test whether a model trained on a dialogue corpus can compare to existing dialogue systems. We trained a recurrent neural network using the sequenceto- sequence method, preserving the state of the model during the course of the conversation. The resulting network is end-to-end trainable. User testing was used to evaluate the model and compare it to the other dialogue systems. The final model can answer appropriately to common phrases such as greetings and valedictions. It also generates replies in correct English. However, the results do not stretch any further than that. Giving the model a more complicated input usually results in a nonsensical reply, which prevents it from having a coherent conversation with the user. We present a few hypotheses as to why we did not get better results, with suggestions on how they could be solved. We display high hopes for future work in the area and present a few suggestions of what could be the next steps.

Nyckelord: chatbot, dialogue, dialogue modeling, dialogue system, artificial neural networks, recurrent neural networks, rnn, lstm, user testing, deep learning

Publikationen registrerades 2016-06-17. Den ändrades senast 2016-06-17

CPL ID: 237861

Detta är en tjänst från Chalmers bibliotek