Show simple item record

dc.contributor.authorLien, Henrik
dc.contributor.authorBiermann, Daniel
dc.contributor.authorPalumbo, Fabrizio
dc.contributor.authorGoodwin, Morten
dc.date.accessioned2023-03-28T12:27:37Z
dc.date.available2023-03-28T12:27:37Z
dc.date.created2023-01-24T16:33:31Z
dc.date.issued2022
dc.identifier.citationLien, H., Biermann, D., Palumbo, F. & Goodwin, M. (2022). An Exploration of Semi-supervised Text Classification. In L. Iliadis, C. Jayne, A. Tefas & E. Pimenidis (Eds.), Engineering Applications of Neural Networks, (1600, pp. 477-488). Springer, Cham.en_US
dc.identifier.isbn978-3-031-08223-8
dc.identifier.urihttps://hdl.handle.net/11250/3060734
dc.descriptionAuthor's accepted manuscripten_US
dc.description.abstractGood performance in supervised text classification is usually obtained with the use of large amounts of labeled training data. However, obtaining labeled data is often expensive and time-consuming. To overcome these limitations, researchers have developed Semi-Supervised learning (SSL) algorithms exploiting the use of unlabeled data, which are generally easy and free to access. With SSL, unlabeled and labeled data are combined to outperform Supervised-Learning algorithms. However, setting up SSL neural networks for text classification is cumbersome and frequently based on a trial and error process. We show that the hyperparameter configuration significantly impacts SSL performance, and the learning rate is the most influential parameter. Additionally, increasing model size also improves SSL performance, particularly when less pre-processing data are available. Interestingly, as opposed to feed-forward models, recurrent models generally reach a performance threshold as pre-processing data size increases. This article expands the knowledge on hyperparameters and model size in relation to SSL application in text classification. This work supports the use of SSL work in future NLP projects by optimizing model design and potentially lowering training time, particularly if time-restricted.en_US
dc.language.isoengen_US
dc.publisherSpringeren_US
dc.relation.ispartofEngineering Applications of Neural Networks - 23rd International Conference, EAAAI/EANN 2022
dc.relation.ispartofseriesCommunications in Computer and Information Science;1600
dc.titleAn Exploration of Semi-supervised Text Classificationen_US
dc.typeChapteren_US
dc.typePeer revieweden_US
dc.description.versionacceptedVersionen_US
dc.rights.holder© Springer Nature Switzerland AG 2022en_US
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550en_US
dc.source.pagenumber477-488en_US
dc.source.volume1600en_US
dc.identifier.doihttps://doi.org/10.1007/978-3-031-08223-8_39
dc.identifier.cristin2114218
cristin.qualitycode1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record