1.6 Parser. The parser used in the standardization process takes the form of a chart parser. The GRAMMAR specifies the possible phrase structures for the names in the culture involved. The parser considers each token of the name string in turn as to its category and class. These then are compared and matched with the patterns specified as grammatical. In the process each possibility is preserved on a chart. After the final token is considered, the parse is complete. Any structures remaining on the chart are possible parses. When there are multiple analyses, the likelihood of each is calculated and the possibilities are ordered accordingly.
The initial release of the common pedigree required unambiguous parses of the data from three fields: Given name, Surname, and Title. The user expert provided the system with the following set of phrase structure rules. They are given here in three sets, but they are not independent some in certain sets referring to elements defined in the others:
Later releases of the common pedigree allowed parses of the name as a single field: Full name phrase. The engineers examined the data and provided the system with the following set of phrase structure rules: