IBM Data and AI

Welcome to the IBM Data and AI Ideas Portal for Clients!

We welcome and appreciate your feedback on IBM Data and AI Products to help make them even better than they are today!
Before you submit an idea, please perform a search first as a similar idea may have already been reported in the portal. If a related idea is not yet listed, please create a new idea and include with it a description which includes expected behavior as well as why having this feature would improve the service and how it would address your use case.
IBM Employees:
Clients:
  • Our team welcomes any feedback and suggestions you have for improving our offerings / products! This forum allows us to connect your offering / product improvement ideas with IBM product and engineering teams.

  • If you have not registered on this portal please click on the following link and register. To complete registration you will need to open the email you will receive from Aha to confirm your identity. http://ibm.biz/IBM-Data-and-AI-Portal-Register

Additional Information:
  • The shorter URL for this site is: https://ibm.biz/IBM-Data-and-AI-Ideas

  • To view our roadmaps: http://ibm.biz/Data-and-AI-Roadmaps

  • Reminder: This is not the place to submit defects or support needs, please use normal support channel for these cases

  • Please do not use the Ideas Portal for reporting bugs - we ask that you report bugs or issues with the product by contacting IBM support.

Smart Document Understanding to produce a readable PDF from original scanned PDF

In solutions where text archives are digitized to become searchable in Watson Discovery, Smart Document Understanding is a promising new feature. However, it does not by side effect produce a searchable (text-based) PDF when OCRing a scanned PDF. This is a feature of IBM's BACA system. It allows for user to open, after finding a relevant document via Discovery,  what looks like the original scanned PDF but as a searchable/text-based PDF where user can now apply control-F to search where the term or phrase of interest occurs, and copy text passages into clipboard etc. 

Potential benefit is not estimated, but the benefit is clear for any Discovery use case when searching large enterprise collections that have been OCRd.

  • Avatar32.5fb70cce7410889e661286fd7f1897de Guest
  • Jan 13 2020
  • Planned
Why is it useful?
Who would benefit from this IDEA? As an analyst, I have searched my company's digitized archives (or previous projects, history, contracts) and now am able to search within the original document the larger context of the passages matching my search results.
How should it work?
Idea Priority
Priority Justification
Customer Name
Submitting Organization
Submitter Tags
  • Attach files
  • Admin
    Christophe Guittet commented
    30 Mar 12:30

    This capability to search within OCRed document using Discovery will be available in May release through the "Rich Preview" feature. https://bigblue.aha.io/features/WDS-789

NOTICE TO EU RESIDENTS: per EU Data Protection Policy, if you wish to remove your personal information from the IBM ideas portal, please login to the ideas portal using your previously registered information then change your email to "anonymous@euprivacy.out" and first name to "anonymous" and last name to "anonymous". This will ensure that IBM will not send any emails to you about all idea submissions