|
Genealogic Database based on the records of parish registers
|
|
The European diocese
archives bare a giant information
treasure, the almost complete
population derivation and
relationship plan since the beginning of
registration in the 14th/15th
century. To sift and to secure it is a major
social task for the upcoming decades.
The ceaseless race against the constant progress
of natural decay within the storage media forces us to react quickly
and make use of both modern data storage techniques as well as massive
personal resources to stay ahead of final loss by steadily copying and
reorganizing the data stock. While in former times this has been done
by hand writing nowadays and in the future electronic copying onto the
currently technologically optimal medium replaces manual information
backup.
However the first hard step remains transforming the hand writings
into an electronic data representation. Because of the poor quality of
the input data there is no real chance of applying automatizations
like OCR. Therefore there is no alternative to the typing-by-hand
input method.
The overall reward for the effort is a massive improvement of
opportunities to do inquiry and research while at the same time
significantly reducing the mechanical impact on the precious
books.
In cooperation with the Passau Diocese archive
(Map of parishes)
we developed a software system to
systematically build up a population
database from their stored church registers
(Excerpt 100kB)
We hereby concentrated onto the following aspects:
-
High efficient input assistance, because altogether there are
about 6000 church register books with over 4 million entries.
-
A flexibly rearrangeable user interface, on order to be able
to register the strongly varying data records comfortably and
efficiently.
-
Person linkage, i.e. while identifying individual appearances
of the same person at varying places within the registers (maybe
with slightly different name spelling or date of birth) are and
acquiring thereby a logical correlation between the entries is
an important offline task (unfortunately not yet finished but
finally under construction).
As a result of the person linkage an almost full-automatic
family tree compilation is in sight. Arbitrary queries over the
data pool are possible. The archivists benefit already from the
recorded entries, because for the first time they can do searching for
persons without knowing about their habitat region.
The program is almost platform independent because it is implemented
exclusively in the Java programming language (so far tested under
UNIX and Windows NT/2000/XP). It can be field operated within network or on
single user systems. The only requirement is access to a fully
operational ANSI compliant database system.
The Passau archivists do use a UNIX network (Solaris 2.6) with
around 10 clients in operation, five of them are exclusively used
for data input. As of now they have an Oracle 9i database
server. The system can be held working seated upon PostgreSQL
databases meanwhile.
In the first phase of the project the marriages from the church
registers had been recorded. The input of the entire register entries
has also reached an impressive amount while currently the progress has
come to a standstill due to limitations in personal resources.
Instead the archive staff concentrates upon the digitization of the
sources self by making photographic scans with an industry size book
scanner. Future input turns shall then be done by using those
scans. That way uplift and deposition of the touchy books can become
limited considerably easening their long term conservation.
The database content has already been used for medical research
purposes. To track cases of the early onset form of Alzheimer's
disease (supposed to be of genetic etiology) the content of the
population database was evaluated for relationships of persons and for
generations of family trees. These investigations were possible due to
the fact that the
second
patient of Dr. Alzheimer and many of his descents lived in the
former diocese Passau. As a result also AD-suspicious deaths within
that family tree could be found.
The time necessary to put all registers into the database can not
easily be estimated. However we expect it to take at least 10 years of
working time for a whole archivist team.
We hope to widen the project to cover other dioceses (also in
Austria), too. However, there are no concrete plans to do so yet. We
do not plan to offer the data by internet access, because neither the
legal nor the financial situation allow us to do so.
To revive the input progress that has fell asleep
we work hard at building a decentralized
version of the software system i.e. we extract
a version that gives the ability to start data input
from an empty PostgreSQL database which can
become later on merged into the total stock. This
piece of Software will be shipped together with a set of
scans on a removable medium to a
customer The main aim is to offer
interested persons and
organizations the
possibility to take part in the input
process. In return those may keep their results for
further research and usage for free.
Project staff
Project partners
|