IBM Data and AI

Welcome to the IBM Data and AI Ideas Portal for Clients!

We welcome and appreciate your feedback on IBM Data and AI Products to help make them even better than they are today!
Before you submit an idea, please perform a search first as a similar idea may have already been reported in the portal. If a related idea is not yet listed, please create a new idea and include with it a description which includes expected behavior as well as why having this feature would improve the service and how it would address your use case.
IBM Employees:
Clients:
  • Our team welcomes any feedback and suggestions you have for improving our offerings / products! This forum allows us to connect your offering / product improvement ideas with IBM product and engineering teams.

  • If you have not registered on this portal please click on the following link and register. To complete registration you will need to open the email you will receive from Aha to confirm your identity. http://ibm.biz/IBM-Data-and-AI-Portal-Register

Additional Information:
  • The shorter URL for this site is: https://ibm.biz/IBM-Data-and-AI-Ideas

  • To view our roadmaps: http://ibm.biz/Data-and-AI-Roadmaps

  • Reminder: This is not the place to submit defects or support needs, please use normal support channel for these cases

  • Please do not use the Ideas Portal for reporting bugs - we ask that you report bugs or issues with the product by contacting IBM support.

Include Graphframes Spark package in CPD

Graphframes package for Spark provides the capabilities of both the GraphX and the Spark DataFrames.

The recent Apache Spark 3.0 release has several enhancements with respect to the features enablement and performance, especially, for the Spark DataFrames.

Since the GraphFrames are based on Spark DataFrames, there is a direct advantage in using the GraphFrames compared to GraphX, as all the enhancements to DataFrames apply to GraphFrames.

This requirement is for the biggest IBM's bank client in India where the data size is humongous and hence utilizing GraphFrames would definitely help to better the performance.

This extended functionality includes motif finding, DataFrame-based serialization, and highly expressive graph queries.

Currently, in CPD, it is not possible to customize the Spark for Scala/Python environment, and also it is difficult to add a custom library.

  • Avatar32.5fb70cce7410889e661286fd7f1897de Guest
  • Sep 21 2020
  • Under Review
Who would benefit from this IDEA? Data Scientist will be able to utilize GraphFrames package in building graph applications
How should it work?

Add GraphFrames package/jar along with the jars shipped with the respective spark version.

Currently, in CPD, it is not possible to customize the Spark for Scala/Python environment and also it is difficult to add a custom library.

Idea Priority High
Priority Justification This requirement is for the biggest IBM's bank client in India where the data size is humongous and hence utilizing GraphFrames would definitely help to better the performance.
Customer Name State Bank of India
  • Attach files

NOTICE TO EU RESIDENTS: per EU Data Protection Policy, if you wish to remove your personal information from the IBM ideas portal, please login to the ideas portal using your previously registered information then change your email to "anonymous@euprivacy.out" and first name to "anonymous" and last name to "anonymous". This will ensure that IBM will not send any emails to you about all idea submissions