dc.contributor.author: Bhattarai, Bimal
dc.contributor.author: Granmo, Ole-Christoffer
dc.contributor.author: Lei, Jiao
dc.date.accessioned: 2023-01-09T10:16:39Z
dc.date.available: 2023-01-09T10:16:39Z
dc.date.created: 2022-04-08T19:38:25Z
dc.date.issued: 2022
dc.identifier.citation: Bhattarai, B., Granmo, O.-C. & Lei, J. (2022). Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines. Applied intelligence (Boston), 1-25. doi: [en_US]
dc.identifier.issn: 0924-669X
dc.identifier.uri: https://hdl.handle.net/11250/3041885
dc.description.abstract: Recent research in novelty detection focuses mainly on document-level classification, employing deep neural networks (DNN). However, the black-box nature of DNNs makes it difficult to extract an exact explanation of why a document is considered novel. In addition, dealing with novelty at the word level is crucial to provide a more fine-grained analysis than what is available at the document level. In this work, we propose a Tsetlin Machine (TM)-based architecture for scoring individual words according to their contribution to novelty. Our approach encodes a description of the novel documents using the linguistic patterns captured by TM clauses. We then adapt this description to measure how much a word contributes to making documents novel. Our experimental results demonstrate how our approach breaks down novelty into interpretable phrases, successfully measuring novelty. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: Springer Nature [en_US]
dc.rights: Attribution 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/deed.no [*]
dc.title: Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines [en_US]
dc.type: Peer reviewed [en_US]
dc.type: Journal article [en_US]
dc.description.version: publishedVersion [en_US]
dc.rights.holder: © 2022 Author(s) [en_US]
dc.subject.nsi: VDP::Technology: 500::Information and communication technology: 550 [en_US]
dc.source.pagenumber: 25 [en_US]
dc.source.journal: Applied intelligence (Boston) [en_US]
dc.identifier.doi: 10.1007/s10489-022-03281-1
dc.identifier.cristin: 2016286
dc.relation.project: Universitetet i Agder: CAIR [en_US]
dc.description.localcode: Paid Open Access [en_US]
cristin.qualitycode: 2


Except where otherwise noted, this item is licensed under Attribution 4.0 International.