dc.contributor.author | Samdal, Ellen Synnøve | |
dc.date.accessioned | 2019-09-25T12:18:17Z | |
dc.date.available | 2019-09-25T12:18:17Z | |
dc.date.issued | 2019 | |
dc.identifier.uri | http://hdl.handle.net/11250/2618743 | |
dc.description | Master's thesis Information- and communication technology IKT590 - University of Agder 2019 | nb_NO |
dc.description.abstract | The research on object detection has come a long way and has in recent years become so efficient andreliable that we see many practical applications in simple detection of objects in pictures to advancesegmentation in video recordings. Despite its popularity, there are no de facto approach availableand several methods are competing to be superior.This thesis focuses on object detection in photos where the desired object is letters and numbers, andthe photo is a PDF-file of an invoice without any use of OCR and any knowledge of the layout. Theresearch compares Tensorflow’s Object Detection API with SSD Inception V2 and YOLO. One of themethods shows promise, though it will need some more work. This thesis shows that SSD seems toneed more training, possibly with more varied data, while YOLO seems to not be a good method forthis type of object detection.The results from SSD varies as it sometimes get the correct location, though not always being certainin its decision. For other invoices it misses completely, and might guess at other short words ornumbers. Sometimes the words and numbers make sense, and could be the start of understandingwhat it is looking for. The tests done with YOLO gives results where it guesses almost the samelocation for every invoice, regardless of where the object actually is. It seems to overfit on the mostpopular formats, which is going against what this thesis is trying to do. | nb_NO |
dc.language.iso | eng | nb_NO |
dc.publisher | Universitetet i Agder ; University of Agder | nb_NO |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/deed.no | * |
dc.subject | IKT590 | nb_NO |
dc.title | A DEEP LEARNING SEGMENTATION APPROACH FOR AUTOMATIC INVOICE IDENTIFICATION | nb_NO |
dc.type | Master thesis | nb_NO |
dc.subject.nsi | VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550 | nb_NO |
dc.source.pagenumber | 64 p. | nb_NO |