Enable to add custom first names to the underlying language dictionaries

Security organizations would like to analyze reports, in these reports officers often use partial person names (only last names for example). In SPSS Text Analytics there in an extraction option for "Group partial and full person names together", but it only works for the first names listed in the underlying language dictionaries, which are not exposed to the user.

This means that SPSS Text Analytics is not able to extract names (also full and partial) from countries outside the supported languages. The workaround soultion could be to list all the first and last names in a dictionary type, with matching criterion Start or End. Since there are loads of names that's not an option.

The perfect solution would be if we can add tokens to the list of first names that apparently exists inside the language files.

Needed by Date

Oct 31, 2019

Post comment

Guest

Reply
| Aug 29, 2019

Modeler Text Analytics allows you to import files into the resource editor.
If you have a list of first names, put it in a file txt, name it `Person.txt`.
One first name = one line

Go to the resource editor (either from the interactive session or from the standalone template editor), select Resources > Import Files > Import Single File..., as shown in appended screenshot.
This will import all the terms into the `Person`type under the `Core Library (<language>)`.
If you want first names to be first or last position, change the Match from `Start`to `Start or End`.
You don't have to change the Match one line after another. You can select all rows and the change will apply to all rows.
Or you can change directly the settings before importing (see third screenshot)
Let me know if this solution helps or if not, let's discuss it further.

Capture d’écra...

Capture d’écra...

Capture d’écra...

Capture d’écran 2019-08-29 à 20.35.41.png
Open full size
Capture d’écran 2019-08-29 à 20.39.38.png
Open full size
Capture d’écran 2019-08-29 à 20.42.12.png
Open full size
Capture d’écran 2019-08-29 à 20.35.41.png Capture d’écran 2019-08-29 à 20.39.38.png Capture d’écran 2019-08-29 à 20.42.12.png

0 reply Hide replies

By clicking the "Post Comment" or "Submit Idea" button, you are agreeing to the IBM Ideas Portal Terms of Use.
Do not place IBM confidential, company confidential, or personal information into any field.

Please enter your email address

RELATED IDEAS

Enable to add custom first names to the underlying language dictionaries