Signature matching in document image retrieval

Schulz, Thomas

doi:10.34726/hss.2014.22681

Record link:

https://doi.org/10.34726/hss.2014.22681
http://hdl.handle.net/20.500.12708/3074

Title:

Signature matching in document image retrieval

Citation:

Schulz, T. (2014). Signature matching in document image retrieval [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2014.22681

reposiTUm DOI:

10.34726/hss.2014.22681

CatalogPlus:

AC12161017

Publication Type:

Thesis - Diplomarbeit

Language:

English

Authors:

Schulz, Thomas

Advisor:

Sablatnig, Robert

Co-advisor:

Diem, Markus

Organisational Unit:

E183 - Institut für Rechnergestützte Automation

Date (published):

2014

Number of Pages:

Keywords:

signature retrieval; thin-plate spline; shape context

Abstract:

Das automatisierte Abfragen von Dokumentenbildern anhand bestimmter Kriterien (z. B. nur Dokumente die handgeschriebenen Text enthalten) wird verwendet, um in einer Menge unsortierter Bilder von Dokumenten jene zu finden, die für die vorliegende Aufgabe relevant sind. Um etwa Dokumente zu finden, die von der selben Person verfasst wurden, können händische Unterschriften verwendet werden, sofern die Dokumente Unterschriften enthalten. Daher wird in dieser Arbeit ein System präsentiert, das anhand einer Referenzunterschrift Bilder mit Unterschriften der selben Person findet. Hierfür wird ein Algorithmus zum Abgleich von Unterschriften mit einem Vorfilterungsschritt kombiniert um festzustellen, welche Unterschriftenbilder zur selben Person gehören. Es werden für jedes Bild jene Thin-Plate Spline (TPS) Transformationen berechnet, die ein Referenzbild am besten auf die anderen Unterschriften abbilden. Aus diesen Transformationen werden anschließend jeweils folgende vier Distanzmaße berechnet: Die Beugungsenergie, die shape context Distanz, die anisotrope Skalierung und der Restfehler der Registrierung. Die Gesamtdistanz eines Referenzbilds zu den anderen Unterschriften wird berechnet, indem die vier Distanzmaße mit Hilfe von Gewichten kombiniert werden, welche über Lineare Diskriminanzanalyse (LDA) ermittelt werden. Mit Hilfe eines Vorfilterungsschritts, der auf der shape context Distanz basiert, wird der Vorgang beschleunigt, so dass die zeitaufwendige Berechnung der Transformationen und der Distanzmaße nur mehr für 3% der Unterschrifen durchgeführt werden muss. Für die Evaluierung werden zwei Teilmengen der GPDS960signature Datenbank verwendet. Der hybride Ansatz dieser Arbeit erreicht eine um 1,6 Prozentpunkte höhere MRP und eine um 0,9 Prozentpunkte höhere MAP als die Distanzmaße allein, während gleichzeitig die Laufzeit auf der größeren Teilmenge um das 16-fache verringert werden kann. Die Ergebnisse zeigen außerdem, dass die gewichtete Implementierung des Restfehlers der Registrierung, welche in dieser Arbeit vorgeschlagen wird, eine deutlich höhere MAP erreicht (54,8% statt 7,6%), als eine auf exakten Korrespondenzen basierende Implementierung. Weiters wird gezeigt, dass das vorliegende System zum Abgleich von Unterschriftenbildern nicht auf Trainingsdaten angewiesen ist, da die Ergebnisse mit 25% Trainingsdaten höchstens 0,3 Prozentpunkte besser sind als ohne Trainingsdaten.

Document image retrieval is a method used for searching through unsorted images of documents to find the ones which are relevant for a given task. One way to do this is by sorting the documents according to their corresponding author. For documents that contain handwritten signatures this can be done using signature matching techniques. This thesis therefore presents a signature retrieval system that uses query signature images in order to find images with signatures from a specific author. The retrieval system uses a state-of-the-art signature matching algorithm that is combined with a pre-filtering step to determine which signature image belongs to the same author. For this purpose it computes the Thin-Plate Spline (TPS) transformation that best maps one signature image to another for each signature image in a library. This transformation is used to compute four dissimilarity measures, namely the bending energy, the shape context distance, the anisotropic scaling and the registration residual error. The total dissimilarity of the query signature to the other signatures is computed by combining the four dissimilarity measures using weights obtained by means of Linear Discriminant Analysis (LDA). The whole process is sped-up by applying a pre-filtering step based on the shape context distance to the query signature such that the time-consuming computation of the transformation and the dissimilarity measures is done for only 3% of the images in the library. The evaluation of the signature retrieval system is done using two subsets of the GPDS960signature database. The hybrid approach proposed in this thesis achieves a 1.6 percentage points higher MRP and a 0.9 percentage points higher MAP than the dissimilarity measures on their own, while providing a speed-up of factor 16 on the larger set. The results also demonstrate that the weighted implementation of the registration residual error that is proposed yields a significantly higher MAP (i.e. 54.8% compared to 7.6%) than the implementation which uses exact matches. Furthermore, it is shown that the retrieval system is not dependent on the use of training data as the performance with 25% training data is at most 0.3 percentage points better than without any training data.

Additional information:

Abweichender Titel laut Übersetzung der Verfasserin/des Verfassers
Zsfassung in dt. Sprache

License:

In Copyright

Appears in Collections:

Thesis