Right now, period + whitespace terminate a sentence and makes it impossible to set relations. A ml-models performance is lowered significantly by the inconsistency of documents that one is using it one, for instance e-mails.
Including workarounds to avoid this limitation cost a lot of time and it is impossible to cover all of it.
|Who would benefit from this IDEA?||As a customer, I want to have control over sentence limitations so I can analyse inconsistently written text like emails.|
NOTICE TO EU RESIDENTS: per EU Data Protection Policy, if you wish to remove your personal information from the IBM ideas portal, please login to the ideas portal using your previously registered information then change your email to "email@example.com" and first name to "anonymous" and last name to "anonymous". This will ensure that IBM will not send any emails to you about all idea submissions