Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initialed first name assumed to be middle initial #41

Open
nathanlenz opened this issue Nov 12, 2018 · 0 comments
Open

Initialed first name assumed to be middle initial #41

nathanlenz opened this issue Nov 12, 2018 · 0 comments

Comments

@nathanlenz
Copy link

I'm not sure if this is an issue or not, it could be interpreted either way. Opening up a discussion.

For example: "J. Edgar Hoover" or "M. Night Shyamalan" are currently parsed as:

Array
(
    [salutation] => 
    [fname] => Edgar
    [initials] => J.
    [lname] => Hoover
    [lname_base] => Hoover
    [lname_compound] => 
    [suffix] => 
)

If this name is re-assembled in another system it would be assumed to be "Edgar J. Hoover" which would be incorrect.

An alternative would be to make fname "J. Edgar" in this situation, with no initials.

I pulled a random sampling of 1000 people from a large database and parsed their names, this script was 96.8% accurate. If this one issue were fixed, 13 additional splits would work, upping the accuracy to 98.1%.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant