Novel methods for writer identification and retrieval

Fiel, Stefan

doi:10.34726/hss.2015.25468

DC Field

Value

Language

dc.contributor.advisor

Sablatnig, Robert

dc.contributor.author

Fiel, Stefan

dc.date.accessioned

2020-06-28T08:33:39Z

dc.date.issued

2015

dc.date.submitted

2016-01

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Fiel, S. (2015). <i>Novel methods for writer identification and retrieval</i> [Dissertation, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2015.25468</div> </div>

dc.identifier.uri

https://doi.org/10.34726/hss.2015.25468

dc.identifier.uri

http://hdl.handle.net/20.500.12708/3591

dc.description

Zusammenfassung in deutscher Sprache

dc.description.abstract

Writer identification is the task of identifying the writer of a handwritten document, based on a set of documents where the authors are known. It can be used e.g. for tasks in forensics and for historical document analysis. In contrast to this, writer retrieval is to receive a ranking of the pages in the set of documents sorted according to the similarity of handwriting and can be used for clustering a not indexed set of documents according to the individual handwriting. State-of-the-art methods calculate features on the contours of the characters, so pre-processing steps are needed to extract this contour. In contrast to this in this thesis, three novel approaches for writer identification and writer retrieval are presented. The first is based on the bag of words approach, which is well known for object recognition. SIFT features are calculated on the handwriting and then an occurrence histogram is generated which is then used for the identification of the writer. The second method is based on the Fisher vector. Again, SIFT features are generated on the handwriting, but this time the gradient vectors of a Gaussian Mixture Model (GMM) are used to generate the feature vector for writer identification. The last method is based on Convolutional Neural Network (CNN). A CNN is trained on image patches and the classification layer is cut off and the second last layer is used as feature vector for this patch. The mean vector of all patches on one page is the feature vector for the handwriting and is used for identification and retrieval. The methods presented are evaluated and compared to the state of the art on different scientific databases and additionally on a historic dataset using common evaluation metrics for writer identification. The evaluations show that the three methods proposed outperform the state of the art on many of the different tasks on these datasets. Advantages and possible weaknesses are discussed. The methods proposed achieve good results (>90%) on every dataset used for evaluation.

dc.language

English

dc.language.iso

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

Writer Identification

dc.subject

Writer Retrieval

dc.subject

Fisher Vector

dc.subject

Deep Learning

dc.subject

Document Analysis

dc.title

Novel methods for writer identification and retrieval

dc.type

Thesis

dc.type

Hochschulschrift

dc.rights.license

In Copyright

dc.rights.license

Urheberrechtsschutz

dc.identifier.doi

10.34726/hss.2015.25468

dc.contributor.affiliation

TU Wien, Österreich

dc.rights.holder

Stefan Fiel

dc.publisher.place

Wien

tuw.version

vor

tuw.thesisinformation

Technische Universität Wien

tuw.publication.orgunit

E183 - Institut für Rechnergestützte Automation

dc.type.qualificationlevel

Doctoral

dc.identifier.libraryid

AC13006439

dc.description.numberOfPages

118

dc.identifier.urn

urn:nbn:at:at-ubtuw:1-89519

dc.thesistype

Dissertation

dc.thesistype

Dissertation

tuw.author.orcid

0000-0001-5033-6723

dc.rights.identifier

In Copyright

dc.rights.identifier

Urheberrechtsschutz

tuw.advisor.staffStatus

staff

tuw.advisor.orcid

0000-0003-4195-1593

item.fulltext

with Fulltext

item.cerifentitytype

Publications

item.mimetype

application/pdf

item.openairecristype

http://purl.org/coar/resource_type/c_db06

item.languageiso639-1

item.openaccessfulltext

Open Access

item.openairetype

doctoral thesis

item.grantfulltext

open

crisitem.author.dept

E193-01 - Forschungsbereich Computer Vision

crisitem.author.parentorg

E193 - Institut für Visual Computing and Human-Centered Technology

Appears in Collections:

Thesis

Fulltext (Version of Record (published version))

Adobe PDF

(18.74 MB)

In Copyright

Show simple item record

Page view(s)

284

checked on Nov 21, 2023

Download(s)

138

checked on Nov 21, 2023

Google Scholar^TM

Check

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM