Vis enkel innførsel

dc.contributor.authorYadav, Rohan Kumar
dc.contributor.authorLei, Jiao
dc.contributor.authorGranmo, Ole-Christoffer
dc.contributor.authorGoodwin, Morten
dc.date.accessioned2023-03-09T12:35:30Z
dc.date.available2023-03-09T12:35:30Z
dc.date.created2022-11-22T16:08:46Z
dc.date.issued2022
dc.identifier.citationYadav, R. K., Lei, J., Granmo, O.-C. & Goodwin, M. (2022). Robust Interpretable Text Classification against Spurious Correlations Using AND-rules with Negation. International Joint Conferences on Artificial Intelligence, 4439-4446.en_US
dc.identifier.isbn978-1-956792-00-3
dc.identifier.urihttps://hdl.handle.net/11250/3057374
dc.description.abstractThe state-of-the-art natural language processing models have raised the bar for excellent performance on a variety of tasks in recent years. However, concerns are rising over their primitive sensitivity to distribution biases that reside in the training and testing data. This issue hugely impacts the performance of the models when exposed to out-of-distribution and counterfactual data. The root cause seems to be that many machine learning models are prone to learn the shortcuts, modelling simple correlations rather than more fundamental and general relationships. As a result, such text classifiers tend to perform poorly when a human makes minor modifications to the data, which raises questions regarding their robustness. In this paper, we employ a rule-based architecture called Tsetlin Machine (TM) that learns both simple and complex correlations by ANDing features and their negations. As such, it generates explainable AND-rules using negated and non-negated reasoning. Here, we explore how non-negated reasoning can be more prone to distribution biases than negated reasoning. We further leverage this finding by adapting the TM architecture to mainly perform negated reasoning using the specificity parameter s. As a result, the AND-rules becomes robust to spurious correlations and can also correctly predict counterfactual data. Our empirical investigation of the model's robustness uses the specificity s to control the degree of negated reasoning. Experiments on publicly available Counterfactually-Augmented Data demonstrate that the negated clauses are robust to spurious correlations and outperform Naive Bayes, SVM, and Bi-LSTM by up to 20 %, and ELMo by almost 6 % on counterfactual test data.en_US
dc.language.isoengen_US
dc.publisherInternational Joint Conferences on Artificial Intelligenceen_US
dc.relation.ispartofIJCAI International Joint Conference on Artificial Intelligence
dc.relation.ispartofseriesIJCAI International Joint Conference on Artificial Intelligence
dc.titleRobust Interpretable Text Classification against Spurious Correlations Using AND-rules with Negationen_US
dc.title.alternativeRobust Interpretable Text Classification against Spurious Correlations Using AND-rules with Negationen_US
dc.typePeer revieweden_US
dc.typeConference objecten_US
dc.description.versionacceptedVersionen_US
dc.rights.holder© 2022 IJCAIen_US
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550en_US
dc.source.pagenumber4439-4446en_US
dc.source.issue2022en_US
dc.identifier.doihttps://doi.org/10.24963/ijcai.2022/616
dc.identifier.cristin2080703
cristin.fulltextpostprint


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel