Created by: golovenkov
Description
Add several ranking matching models for retrieval-based multi-turn response selection:
-
Sequential Matching Network (SMN) [Wu et al., 2017]
-
Deep Attention Matching Network (DAM) [Zhou et al., 2018]
-
Deep Attention Matching Network with Universal Sentence Encoder (DAM-USE-T). This is our new proposed neural ranking architecture based on [Zhou et al., 2018] and [Cer D. et al., 2018]. The model DAM-USE-T outperforms the current state-of-the-art models on the Ubuntu Dialogue Corpus v1 [Lowe et al., 2015] and Ubuntu Dialogue Corpus v2 [Lowe et al., 2017] with the test metrics:
0.7957/0.8932/0.9734
and0.7446/0.8677/0.9738
accordingly.
UPD: added pre-trained models (SMN, DAM, DAM-USE-T) for both Ubuntu Corpus v1 and v2 datasets. There are 6 models in total, see ranking configs, for example, pre-trained model DAM-USE-T for Ubuntu-v1 is here
References
Ryan Lowe, Nissan Pow, Iulian Serban, and Joelle Pineau. 2015. The ubuntu dialogue corpus: A large dataset for research in unstructured multi-turn dialogue systems. arXiv preprint arXiv:1506.08909.
Ryan Thomas Lowe, Nissan Pow, Iulian Vlad Serban, Laurent Charlin, Chia-Wei Liu, and Joelle Pineau. 2017. Training end-to-end dialogue systems with the ubuntu dialogue corpus. Dialogue and Discourse 8(1).
Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dianhai Yu and Hua Wu. 2018. Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1118-1127, ACL. http://aclweb.org/anthology/P18-1103
Yu Wu, Wei Wu, Ming Zhou, and Zhoujun Li. 2017. Sequential match network: A new architecture for multi-turn response selection in retrieval-based chatbots. In ACL, pages 372–381.
Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Brian Strope, Ray Kurzweil. 2018a. Universal Sentence Encoder for English. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Pages 169–174. http://aclweb.org/anthology/D18-2029