Show simple item record

dc.contributor.authorSharif, Md Haidar
dc.contributor.authorLei, Jiao
dc.contributor.authorOmlin, Christian Walter Peter
dc.date.accessioned2023-11-03T08:50:11Z
dc.date.available2023-11-03T08:50:11Z
dc.date.created2023-10-26T09:45:48Z
dc.date.issued2023
dc.identifier.citationSharif, M. H., Lei, J. & Omlin, C. W. P. (2023). CNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detection. Sensors, 23 (18).en_US
dc.identifier.issn1424-8220
dc.identifier.urihttps://hdl.handle.net/11250/3100420
dc.description.abstractVideo anomaly event detection (VAED) is one of the key technologies in computer vision for smart surveillance systems. With the advent of deep learning, contemporary advances in VAED have achieved substantial success. Recently, weakly supervised VAED (WVAED) has become a popular VAED technical route of research. WVAED methods do not depend on a supplementary self-supervised substitute task, yet they can assess anomaly scores straightway. However, the performance of WVAED methods depends on pretrained feature extractors. In this paper, we first address taking advantage of two pretrained feature extractors for CNN (e.g., C3D and I3D) and ViT (e.g., CLIP), for effectively extracting discerning representations. We then consider long-range and short-range temporal dependencies and put forward video snippets of interest by leveraging our proposed temporal self-attention network (TSAN). We design a multiple instance learning (MIL)-based generalized architecture named CNN-ViT-TSAN, by using CNN- and/or ViT-extracted features and TSAN to specify a series of models for the WVAED problem. Experimental results on publicly available popular crowd datasets demonstrated the effectiveness of our CNN-ViT-TSAN.en_US
dc.language.isoengen_US
dc.publisherMDPIen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleCNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detectionen_US
dc.title.alternativeCNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detectionen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.rights.holder© 2023 The Author(s)en_US
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550en_US
dc.source.volume23en_US
dc.source.journalSensorsen_US
dc.source.issue18en_US
dc.identifier.doihttps://doi.org/10.3390/s23187734
dc.identifier.cristin2188641
cristin.qualitycode1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Navngivelse 4.0 Internasjonal
Except where otherwise noted, this item's license is described as Navngivelse 4.0 Internasjonal