dc.contributor.author | Bhattarai, Bimal | |
dc.contributor.author | Granmo, Ole-Christoffer | |
dc.contributor.author | Lei, Jiao | |
dc.date.accessioned | 2023-01-09T10:16:39Z | |
dc.date.available | 2023-01-09T10:16:39Z | |
dc.date.created | 2022-04-08T19:38:25Z | |
dc.date.issued | 2022 | |
dc.identifier.citation | Bhattarai, B., Granmo, O.-C. & Lei, J. (2022). Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines. Applied intelligence (Boston), 1-25. doi: | en_US |
dc.identifier.issn | 0924-669X | |
dc.identifier.uri | https://hdl.handle.net/11250/3041885 | |
dc.description.abstract | Recent research in novelty detection focuses mainly on document-level classification, employing deep neural networks (DNN). However, the black-box nature of DNNs makes it difficult to extract an exact explanation of why a document is considered novel. In addition, dealing with novelty at the word level is crucial to provide a more fine-grained analysis than what is available at the document level. In this work, we propose a Tsetlin Machine (TM)-based architecture for scoring individual words according to their contribution to novelty. Our approach encodes a description of the novel documents using the linguistic patterns captured by TM clauses. We then adapt this description to measure how much a word contributes to making documents novel. Our experimental results demonstrate how our approach breaks down novelty into interpretable phrases, successfully measuring novelty. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Springer Nature | en_US |
dc.rights | Navngivelse 4.0 Internasjonal | * |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/deed.no | * |
dc.title | Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines | en_US |
dc.type | Peer reviewed | en_US |
dc.type | Journal article | en_US |
dc.description.version | publishedVersion | en_US |
dc.rights.holder | © 2022 Author(s) | en_US |
dc.subject.nsi | VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550 | en_US |
dc.source.pagenumber | 25 | en_US |
dc.source.journal | Applied intelligence (Boston) | en_US |
dc.identifier.doi | 10.1007/s10489-022-03281-1 | |
dc.identifier.cristin | 2016286 | |
dc.relation.project | Universitetet i Agder: CAIR | en_US |
dc.description.localcode | Paid Open Access | en_US |
cristin.qualitycode | 2 | |