Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/161275
Title: | Variational deep logic network for joint inference of entities and relations | Authors: | Wang, Wenya Pan, Sinno Jialin |
Keywords: | Engineering::Computer science and engineering | Issue Date: | 2021 | Source: | Wang, W. & Pan, S. J. (2021). Variational deep logic network for joint inference of entities and relations. Computational Linguistics, 47(4), 775-812. https://dx.doi.org/10.1162/COLI_a_00415 | Project: | M4081532.020 | Journal: | Computational Linguistics | Abstract: | Currently, deep learning models have been widely adopted and achieved promising results on various application domains. Despite their intriguing performance, most deep learning models function as black boxes, lacking explicit reasoning capabilities and explanations, which are usually essential for complex problems. Take joint inference in information extraction as an example. This task requires the identification of multiple structured knowledge from texts, which is inter-correlated, including entities, events, and the relationships between them. Various deep neural networks have been proposed to jointly perform entity extraction and relation prediction, which only propagate information implicitly via representation learning. However, they fail to encode the intensive correlations between entity types and relations to enforce their coexistence. On the other hand, some approaches adopt rules to explicitly constrain certain relational facts, although the separation of rules with representation learning usually restrains the approaches with error propagation. Moreover, the predefined rules are inflexible and might result in negative effects when data is noisy. To address these limitations, we propose a variational deep logic network that incorporates both representation learning and relational reasoning via the variational EM algorithm. The model consists of a deep neural network to learn high-level features with implicit interactions via the self-attention mechanism and a relational logic network to explicitly exploit target interactions. These two components are trained interactively to bring the best of both worlds. We conduct extensive experiments ranging from fine-grained sentiment terms extraction, end-to-end relation prediction, to end-to-end event extraction to demonstrate the effectiveness of our proposed method. | URI: | https://hdl.handle.net/10356/161275 | ISSN: | 0891-2017 | DOI: | 10.1162/COLI_a_00415 | Schools: | School of Computer Science and Engineering | Rights: | © 2021 Association for Computational Linguistics. Published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license. | Fulltext Permission: | open | Fulltext Availability: | With Fulltext |
Appears in Collections: | SCSE Journal Articles |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
coli_a_00415.pdf | 567.73 kB | Adobe PDF | ![]() View/Open |
SCOPUSTM
Citations
50
1
Updated on Sep 16, 2023
Web of ScienceTM
Citations
50
1
Updated on Sep 22, 2023
Page view(s)
39
Updated on Sep 23, 2023
Download(s) 50
36
Updated on Sep 23, 2023
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.