2.1 Standardization. Below is an illustration of how the query set is reduced through standardization. The data comes from a research problem in Norway. The assignment of a standard to the various spellings of a given name reduces multiple queries on that name to one. Similarly multiple queries on the patronymic, which was used historically as a surname, may be reduced to one. So if both the given name and the patronymic are needed to identify a person, and the person making the query doesnt know how the name will be found in the record, there are many possible queries. Some of the possible ways to record a certain mans name appear in the following chart. Here there are n possibilities for the given name: GN1, , GNi, , GNn. Similarly there are m possibilities for the surname: SN1, , SNj, , SNm. This means there are n * m total possibilities to form a query. Standardization reduces this to one.
| Class | GN1 Siver | GN2 Syver | GN3 Sivert | GN4 Syvert | GN5 Sigvart | GN6 Sigvaart | GN7 Sirverdt | GN8 Syrverdt | GN9 Sigurd | GN10 Siur | GN11 Siul |
|---|---|---|---|---|---|---|---|---|---|---|---|
| SN1 Ole | Siver Olesen | Syver Olesen | Sivert Olesen | Syvert Olesen | Sigvart Olesen | Sigvaart Olesen | Sirverdt Olesen | Syrverdt Olesen | Sigurd Olesen | Siur Olesen | Siul Olesen |
| SN2 Ola | Siver Olasen | Syver Olasen | Sivert Olasen | Syvert Olasen | Sigvart Olasen | Sigvaart Olasen | Sirverdt Olasen | Syrverdt Olasen | Sigurd Olasen | Siur Olasen | Siul Olasen |
| SN3 Oluf | Siver Olufsen | Syver Olufsen | Sivert Olufsen | Syvert Olufsen | Sigvart Olufsen | Sigvaart Olufsen | Sirverdt Olufsen | Syrverdt Olufsen | Sigurd Olufsen | Siur Olufsen | Siul Olufsen |
| SN4 Olav | Siver Olavsen | Syver Olavsen | Sivert Olavsen | Syvert Olavsen | Sigvart Olavsen | Sigvaart Olavsen | Sirverdt Olavsen | Syrverdt Olavsen | Sigurd Olavsen | Siur Olavsen | Siul Olavsen |
| SN5 Olof | Siver Olofsen | Syver Olofsen | Sivert Olofsen | Syvert Olofsen | Sigvart Olofsen | Sigvaart Olofsen | Sirverdt Olofsen | Syrverdt Olofsen | Sigurd Olofsen | Siur Olofsen | Siul Olofsen |
| SN6 Olaf | Siver Olafsen | Syver Olafsen | Sivert Olafsen | Syvert Olafsen | Sigvart Olafsen | Sigvaart Olafsen | Sirverdt Olafsen | Syrverdt Olafsen | Sigurd Olafsen | Siur Olafsen | Siul Olafsen |
| SN7 Olle | Siver Ollesen | Syver Ollesen | Sivert Ollesen | Syvert Ollesen | Sigvart Ollesen | Sigvaart Ollesen | Sirverdt Ollesen | Syrverdt Ollesen | Sigurd Ollesen | Siur Ollesen | Siul Ollesen |
| SN8 Olla | Siver Ollasen | Syver Ollasen | Sivert Ollasen | Syvert Ollasen | Sigvart Ollasen | Sigvaart Ollasen | Sirverdt Ollasen | Syrverdt Ollasen | Sigurd Ollasen | Siur Ollasen | Siul Ollasen |
| SN9 Ollof | Siver Ollofsen | Syver Ollofsen | Sivert Ollofsen | Syvert Ollofsen | Sigvart Ollofsen | Sigvaart Ollofsen | Sirverdt Ollofsen | Syrverdt Ollofsen | Sigurd Ollofsen | Siur Ollofsen | Siul Ollofsen |
| SN10 Ollov | Siver Ollovsen | Syver Ollovsen | Sivert Ollovsen | Syvert Ollovsen | Sigvart Ollovsen | Sigvaart Ollovsen | Sirverdt Ollovsen | Syrverdt Ollovsen | Sigurd Ollovsen | Siur Ollovsen | Siul Ollovsen |
| SN11 Ollav | Siver Ollavsen | Syver Ollavsen | Sivert Ollavsen | Syvert Ollavsen | Sigvart Ollavsen | Sigvaart Ollavsen | Sirverdt Ollavsen | Syrverdt Ollavsen | Sigurd Ollavsen | Siur Ollavsen | Siul Ollavsen |
| SN12 Ollaus | Siver Ollaussen | Syver Ollaussen | Sivert Ollaussen | Syvert Ollaussen | Sigvart Ollaussen | Sigvaart Ollaussen | Sirverdt Ollaussen | Syrverdt Ollaussen | Sigurd Ollaussen | Siur Ollaussen | Siul Ollaussen |
| SN13 Ollaug | Siver Ollaugsen | Syver Ollaugsen | Sivert Ollaugsen | Syvert Ollaugsen | Sigvart Ollaugsen | Sigvaart Ollaugsen | Sirverdt Ollaugsen | Syrverdt Ollaugsen | Sigurd Ollaugsen | Siur Ollaugsen | Siul Ollaugsen |
| SN14 Olaus | Siver Olaussen | Syver Olaussen | Sivert Olaussen | Syvert Olaussen | Sigvart Olaussen | Sigvaart Olaussen | Sirverdt Olaussen | Syrverdt Olaussen | Sigurd Olaussen | Siur Olaussen | Siul Olaussen |
It is easy to see that the problem is much larger even than this example, when all the variant spellings are taken into consideration. The surname Taylor in the British Isles has more than 150 spellings alone. The place called Wuerttemberg has been found referred to in any of over 2500 different ways of being spelled. The given name Elizabeth in all its variations numbers over 4,000. Such large n values imply a very cumbersome query management.
In actual practice the form of the authoritative standard is irrelevant. Hence, if the query comes in for John Tailor of Wurtemberg, the standard forms may be Johannes Schneider and Wuerttemberg. But, the standard might also be some set of codes, such as, 00003487 for the given name, 00236349 for the surname and 007830 for the place.
