IBM Data and AI

Welcome to the IBM Data and AI Ideas Portal for Clients!

We welcome and appreciate your feedback on IBM Data and AI Products to help make them even better than they are today!
Before you submit an idea, please perform a search first as a similar idea may have already been reported in the portal. If a related idea is not yet listed, please create a new idea and include with it a description which includes expected behavior as well as why having this feature would improve the service and how it would address your use case.
IBM Employees:
  • Our team welcomes any feedback and suggestions you have for improving our offerings / products! This forum allows us to connect your offering / product improvement ideas with IBM product and engineering teams.

  • If you have not registered on this portal please click on the following link and register. To complete registration you will need to open the email you will receive from Aha to confirm your identity.

Additional Information:
  • The shorter URL for this site is:

  • To view our roadmaps:

  • Reminder: This is not the place to submit defects or support needs, please use normal support channel for these cases

  • Please do not use the Ideas Portal for reporting bugs - we ask that you report bugs or issues with the product by contacting IBM support.

Support STT edge use when not connected to IBM Cloud

Our WPA Automotive clients need to use speech to text when not connected to cloud.  Like turn on and turn off the lights commands in the car when for example the car is in a tunnel and doesn't have internet connectivity to WPA and IBM CLoud services.

  • Avatar32.5fb70cce7410889e661286fd7f1897de Guest
  • Sep 11 2017
  • Delivered
  • Attach files
  • Admin
    Marco Noel commented
    26 Mar 09:04pm

    STT on CP4D offering

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    11 Oct, 2017 03:32pm

    Hybrid Model is not just Offline speech -- Need a Hybrid Conversation Framework (that should include STT, Conversation, NLU, TTS) - an Edge Model.  Low energy keyword activation can be an first phase of this entire Edge Model Program. For this we would need to know details of how it can be exposed - i.e. open source, library, compiled code and what programming paradigm will it support: (i.e. IOS, Android, Raspberry Pi, Auto-specific models?)

    Right now this work will not be prioritized for Q4 and Q1. We will start having strategic discussions on this in Q4 and have a goal to have something on roadmap by end or Q4 for edge computing.

  • Avatar40.8f183f721a2c86cd98fddbbe6dc46ec9
    Guest commented
    5 Oct, 2017 01:33pm

    The issue is broader than just speech, the WPA team would like to have (limited) conversational capability offline.

    Most relevant speech related task:
    - create offline STT SDK
    -- well-defined API
    -- documentation
    -- offline capabilities
    --- low-energy keyword activation
    --- speech barge-in into TTS
    --- platform support - most importantly BLAS (or equivalent) libraries on ASM level
    -- automotive acoustic models
    -- tools for LM pruning (basic LM smaller than for service)
    -- more languages :)
    -- platform CI

    - create offline TTS SDK
    -- well-defined API
    -- documentation
    -- offline capabilities
    --- CELP voice support
    --- smaller CELP voices (tooling)
    --- RNN prosody & phrasebreak speed up / reimplementation
    --- solve expressive for embedded
    --- parametric (voice transformation) for embedded

NOTICE TO EU RESIDENTS: per EU Data Protection Policy, if you wish to remove your personal information from the IBM ideas portal, please login to the ideas portal using your previously registered information then change your email to "anonymous@euprivacy.out" and first name to "anonymous" and last name to "anonymous". This will ensure that IBM will not send any emails to you about all idea submissions