Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/151218
Full metadata record
DC FieldValueLanguage
dc.contributor.authorD'Haro, Luis Fernandoen_US
dc.contributor.authorBanchs, Rafael E.en_US
dc.contributor.authorHori, Chiorien_US
dc.contributor.authorLi, Haizhouen_US
dc.date.accessioned2021-07-02T03:31:40Z-
dc.date.available2021-07-02T03:31:40Z-
dc.date.issued2018-
dc.identifier.citationD'Haro, L. F., Banchs, R. E., Hori, C. & Li, H. (2018). Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics. Computer Speech and Language, 55, 200-215. https://dx.doi.org/10.1016/j.csl.2018.12.004en_US
dc.identifier.issn0885-2308en_US
dc.identifier.other0000-0002-4201-7578-
dc.identifier.urihttps://hdl.handle.net/10356/151218-
dc.description.abstractEnd-to-end dialog systems are gaining interest due to the recent advances of deep neural networks and the availability of large human–human dialog corpora. However, in spite of being of fundamental importance to systematically improve the performance of this kind of systems, automatic evaluation of the generated dialog utterances is still an unsolved problem. Indeed, most of the proposed objective metrics shown low correlation with human evaluations. In this paper, we evaluate a two-dimensional evaluation metric that is designed to operate at sentence level, which considers the syntactic and semantic information carried along the answers generated by an end-to-end dialog system with respect to a set of references. The proposed metric, when applied to outputs generated by the systems participating in track 2 of the DSTC-6 challenge, shows a higher correlation with human evaluations (up to 12.8% relative improvement at the system level) than the best of the alternative state-of-the-art automatic metrics currently available.en_US
dc.language.isoenen_US
dc.relation.ispartofComputer Speech and Languageen_US
dc.rights© 2018 Elsevier Ltd. All rights reserved.en_US
dc.subjectEngineering::Computer science and engineeringen_US
dc.titleAutomatic evaluation of end-to-end dialog systems with adequacy-fluency metricsen_US
dc.typeJournal Articleen
dc.contributor.schoolSchool of Computer Science and Engineeringen_US
dc.identifier.doi10.1016/j.csl.2018.12.004-
dc.identifier.scopus2-s2.0-85059347815-
dc.identifier.volume55en_US
dc.identifier.spage200en_US
dc.identifier.epage215en_US
dc.subject.keywordsAutomatic Evaluation Metricsen_US
dc.subject.keywordsDialog Systemsen_US
item.fulltextNo Fulltext-
item.grantfulltextnone-
Appears in Collections:SCSE Journal Articles

Page view(s)

82
Updated on May 24, 2022

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.