IBM Data and AI

Welcome to the IBM Data and AI Ideas Portal for Clients!

We welcome and appreciate your feedback on IBM Data and AI Products to help make them even better than they are today!
Before you submit an idea, please perform a search first as a similar idea may have already been reported in the portal. If a related idea is not yet listed, please create a new idea and include with it a description which includes expected behavior as well as why having this feature would improve the service and how it would address your use case.
IBM Employees:
  • Our team welcomes any feedback and suggestions you have for improving our offerings / products! This forum allows us to connect your offering / product improvement ideas with IBM product and engineering teams.

  • If you have not registered on this portal please click on the following link and register. To complete registration you will need to open the email you will receive from Aha to confirm your identity.

Additional Information:
  • The shorter URL for this site is:

  • To view our roadmaps:

  • Reminder: This is not the place to submit defects or support needs, please use normal support channel for these cases

  • Please do not use the Ideas Portal for reporting bugs - we ask that you report bugs or issues with the product by contacting IBM support.

Document Text Recognition / OCR

Would like the specific ability to be able to recognize text in documents, such as PDFs.

  • Avatar32.5fb70cce7410889e661286fd7f1897de Guest
  • Sep 28 2017
  • Not Under Consideration
Who would benefit from this IDEA? As a user, I want to be able to extract text from documents.
Idea Priority Low
  • Attach files
  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    31 Jan, 2019 01:49am

    Hi Allie, Is the capability for Handwriting/Text Recognition now available within Watson, I see there is NLP/NLU capabilities, wondering if there was a more updated status on this..


  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    4 Dec, 2017 05:52pm

    @Wolfgang - I would reach out to the DataCap team to hear about their Cloud plans. Full-document reading is not in our current roadmap. As Shantenu said, we are focused on text within photos someone might take (text should generally be 5% of screen...currently optimized for full English words...)

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    2 Dec, 2017 12:12am

    The new Text Model will not be optimized for documents, but focus on larger text you may find on boxes, street signs etc.

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    1 Dec, 2017 08:09am

    yes, it is in the near roadmap. BTW, I have played with the dark beta and it works pretty well (tested on my pay sheet)...

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    30 Nov, 2017 01:58pm

    Hello , I would say that it was once a beta feature of the Visual Recognition service .. It still is but has the status of Black Beta (if you don't know it exists , you won't find it !) Any plan to incorporate an OCR capability in VR ?

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    13 Nov, 2017 08:30pm

    Thank you Allie for your swift response.

    I know DataCap and have used it - however I needed a classic
    infrastructure for it - there is nothing to my knowledge that makes a OCR
    service being consumable on the IBM cloud (other than installing DataCap
    on a virtual machine in the IBM cloud).
    In contrast others have already comparable services:

    What about having DataCap Service running and exposing some services in
    the IBM cloud?

    Mit freundlichen Grüßen / with kind regards

    Wolfgang von Drews
    Leading Technical Sales Professional
    IBM Certified IT Architect

    Client Technical Architect

    IBM Deutschland GmbH
    Hollerithstr. 1
    D-81829 München

    für Commerzbank AG und Sparda Gruppe
    Mobile: +49 (0)7034-643-1336
    Notes: MAYLE@IBMDE

    IBM Financial Services

    WW IT Infrastructure CoP Co-Leader
    The Open Group Master Certified IT Architect
    Member of TEC Central Region

    IBM Deutschland GmbH - Vorsitzender des Aufsichtsrats: Martin Jetter -
    Geschäftsführung: Martina Koederitz (Vorsitzende), Norbert Janzen, Stefan
    Lutz, Nicole Reimer, Dr. Klaus Seifert, Wolfgang Wendt
    Sitz der Gesellschaft: Ehningen - Registergericht: Amtsgericht Stuttgart,
    HRB 14562 - WEEE-Reg.-Nr. DE 99369940

    Beachten Sie bitte, dass jede Form der unautorisierten Nutzung,
    Veröffentlichung, Vervielfältigung oder Weitergabe des Inhalts dieser
    E-Mail nicht gestattet ist.Diese Nachricht ist ausschliesslich fuer den
    bezeichneten Adressaten oder dessen Vertreter bestimmt. Sollten Sie nicht
    der vorgesehene Adressat dieser E-Mail oder dessen Vertreter sein, so
    bitten wir Sie, sich mit dem Absender der E-Mail in Verbindung zu setzen.
    Any form of unauthorised use, publication, reproduction, copying or
    disclosure of the content of this e-mail is not permitted. This message is
    exclusively for the person addressed or their representative. If you are
    not the intended recipient of this message and its contents, please notify
    the sender immediately.

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    13 Nov, 2017 07:20pm

    Wolfgang, this feature exists in IBM land, just not within Watson Visual Recognition. It is under the "DataCap" team, since it is more doc processing and outside image/video/face detection and recognition.

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    13 Nov, 2017 11:09am

    this is highly demanded from my clients as well. Microsoft and others have this capability already consumable on the cloud.

    Therefore enhance visual recognition with this feature.

NOTICE TO EU RESIDENTS: per EU Data Protection Policy, if you wish to remove your personal information from the IBM ideas portal, please login to the ideas portal using your previously registered information then change your email to "anonymous@euprivacy.out" and first name to "anonymous" and last name to "anonymous". This will ensure that IBM will not send any emails to you about all idea submissions