l1.7.A.5 — Find possible names used to refer to individuals. The common name of an individual sometimes significantly differs from the complete, formal name present in Encyclopedia Britannica and Wikipedia. This encyclopedia full name can contain details such as titles, initials and military or nobility standings, which are not commonly used when referring to individual in most publications. Even in simpler cases, when the full name contains only first, middle and last names, there exists no systematic convention on which names to use when talking about an individual. Henry David Thoreau is most commonly referred to by his full name, not “Henry Thoreau” nor “David Thoreau”, whereas Oliver Joseph Lodge is mentioned by his first and last name “Oliver Lodge”, not his full name “Oliver Joseph Lodge’. Given a full name with complex structure potentially containing details such as titles, initials, nobility rights and ranks, in addition to multiple first and last names, we must extract a list of simple names, using three words at most, which can potentially be used to refer to this individual. This set of names is created by generating combinations of names found in the raw name. Furthermore, whenever they appear we systematically exclude common words such as titles or ranks from these names. The query name sets of several individuals are displayed in Table $10. 5) For every record, using the set of raw names, create a set of query names. Query names are (2,3) grams which will be used in order to measure the fame of the individual. The following procedure is iterated on every raw name variant associated with the record. Steps for which the record type is not specified are carried out for both. a. For Encyclopedia Britannica records, truncate the raw name at the second comma, reorder so that the part of name preceding the first comma follows that succeeding the comma. b. For Wikipedia records, replace the underscores with whitespaces. c. Truncate the name string at t