
Overview of the seventh Dialog System Technology Challenge: DSTC7

Published: 01 July 2020

Highlights

DSTC7: a dialog challenge to build more robust and accurate end-to-end dialog systems.
Track 1, Sentence selection for multiple domains, including variants with a large number of candidate options and with candidate sets containing zero, one, or multiple correct options.
Track 2, Beyond Chitchat: Generation of informational responses grounded in external knowledge.
Track 3, Audio visual scene-aware dialog systems to allow dynamic conversations about objects and events around users.

Abstract

This paper provides detailed information about the seventh Dialog System Technology Challenge (DSTC7) and its three tracks, which are aimed at exploring the problem of building robust and accurate end-to-end dialog systems. In more detail, DSTC7 focuses on developing and exploring end-to-end technologies for the following three pragmatic challenges: (1) sentence selection for multiple domains, (2) generation of informational responses grounded in external knowledge, and (3) audio visual scene-aware dialog to allow conversations with users about objects and events around them.
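To make the Track 1 setting concrete, below is a minimal sketch, in PyTorch, of the kind of candidate scorer sentence selection calls for: the dialog context and each candidate sentence are encoded separately, and candidates are ranked by their similarity to the context. This is an illustrative assumption, not any participant's system; all class names, dimensions, and the dot-product scoring choice are hypothetical.

```python
import torch
import torch.nn as nn

class LSTMSelector(nn.Module):
    """Score each candidate response against the dialog context (sketch)."""
    def __init__(self, vocab_size, emb_dim=128, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.context_enc = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.candidate_enc = nn.LSTM(emb_dim, hidden_dim, batch_first=True)

    def forward(self, context, candidates):
        # context: (B, Tc) token ids; candidates: (B, N, Tr) token ids
        _, (ctx_h, _) = self.context_enc(self.embed(context))        # (1, B, H)
        B, N, Tr = candidates.shape
        _, (cand_h, _) = self.candidate_enc(
            self.embed(candidates.view(B * N, Tr)))                  # (1, B*N, H)
        cand_vecs = cand_h.view(B, N, -1)                            # (B, N, H)
        # Dot-product similarity between context and each candidate;
        # the zero-correct-answer variant can be handled by thresholding
        # the top score (an assumption, not the tracks' prescribed method).
        return torch.bmm(cand_vecs, ctx_h.squeeze(0).unsqueeze(2)).squeeze(2)

model = LSTMSelector(vocab_size=10_000)
scores = model(torch.randint(0, 10_000, (2, 30)),
               torch.randint(0, 10_000, (2, 100, 15)))
print(scores.shape)  # torch.Size([2, 100]) -- one score per candidate
```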
This paper summarizes the overall setup and results of DSTC7, including detailed descriptions of the different tracks, the provided datasets and annotations, and an overview of the submitted systems and their final results. For Track 1, LSTM-based models performed best across both datasets, effectively handling task variants in which no correct answer was present or multiple paraphrases were included. For Track 2, the best results were obtained by RNN-based architectures augmented to incorporate facts through two types of encoders, a dialog encoder and a fact encoder, combined with attention mechanisms and a pointer-generator approach. Finally, for Track 3, the best model used hierarchical attention mechanisms to combine the text and vision information, obtaining a human rating score 22% better than that of the baseline LSTM system.
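As a rough illustration of the Track 2 recipe described above, the following sketch pairs a dialog encoder with a fact encoder and lets the decoder attend over the encoded facts at each generation step. The pointer-generator copy distribution used by the best systems is omitted for brevity, and every module name and dimension here is an assumption rather than a description of any submitted architecture.

```python
import torch
import torch.nn as nn

class KnowledgeGroundedSeq2Seq(nn.Module):
    """Dialog encoder + fact encoder with fact attention (hypothetical sketch)."""
    def __init__(self, vocab_size, emb_dim=128, hidden=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.dialog_enc = nn.GRU(emb_dim, hidden, batch_first=True)
        self.fact_enc = nn.GRU(emb_dim, hidden, batch_first=True)
        self.decoder = nn.GRUCell(emb_dim + hidden, hidden)
        self.out = nn.Linear(hidden, vocab_size)

    def forward(self, dialog, facts, targets):
        # dialog: (B, Td); facts: (B, Nf, Tf); targets: (B, Tt), teacher forcing
        _, d_h = self.dialog_enc(self.embed(dialog))            # (1, B, H)
        B, Nf, Tf = facts.shape
        _, f_h = self.fact_enc(self.embed(facts.view(B * Nf, Tf)))
        fact_vecs = f_h.view(B, Nf, -1)                         # (B, Nf, H)

        h = d_h.squeeze(0)  # initialize decoder from the dialog encoding
        logits = []
        for t in range(targets.size(1)):
            # attention over fact vectors, conditioned on the decoder state
            attn = torch.softmax((fact_vecs @ h.unsqueeze(2)).squeeze(2), dim=1)
            fact_ctx = (attn.unsqueeze(2) * fact_vecs).sum(1)   # (B, H)
            h = self.decoder(torch.cat([self.embed(targets[:, t]), fact_ctx], 1), h)
            logits.append(self.out(h))
        return torch.stack(logits, 1)                           # (B, Tt, vocab)
```

A pointer-generator extension would mix this vocabulary distribution with a copy distribution over fact tokens, letting the model reproduce rare entities verbatim from the external knowledge.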
More than 220 participants registered, and about 40 teams took part in the final challenge. Thirty-two scientific papers reporting the systems submitted to DSTC7, together with 3 general technical papers on dialog technologies, were presented during the one-day wrap-up workshop at AAAI-19. During the workshop, we reviewed the state-of-the-art systems, shared novel approaches to the DSTC7 tasks, and discussed future directions for the challenge (DSTC8).


Published In

Computer Speech and Language, Volume 62, Issue C (July 2020), 139 pages

Publisher

Academic Press Ltd., United Kingdom

Author Tags

  1. Dialog System Technology Challenge
  2. end-to-end dialog systems
  3. Sentence Selection
  4. Natural Language Generation
  5. Audio Visual Scene-Aware Dialog
