Our company is using Cloud Pak for Data as a managed service, which means we don't have direct access to the OpenShift platform or any of the pods or servers that run the Cloud Pak services, like the DataStage server.
Occasionally with on-prem DataStage we'll build custom scripts in Python or bash that do pre or post processing on data in order to more effectively ingest or, or do do the actual initial ingestion itself (like when using external API's).
To give an example, in the past I have written scripts that invoke external API's like Google DoubleClick to pull the data from an external source via the Execute Command stage in a Sequence job, land it to a temporary file on the DataStage ETL server, and then use those files as input to the normal ingestion process in DataStage.
The current process to get a script to the cloud is to send it over to our IBM support contact and have them upload it for us, but the problem with this approach is we have no real way of testing how it will work in the target system without sending it over so:
IBM support uploads
We try to run it, see the error(s)
Tweak it again
Send it over again, and the process repeats
Some things we can test locally and might just require different filepaths to get it working on that environment, but not being able to just tweak it in the actual environment is a bit like flying blind.
IBM mentioned they have this feature documented, but no roadmap for getting it implemented:
|Who would benefit from this IDEA?||ETL Developers|
How should it work?
Enable a connection to the DataStage server that we're able to connect to via remote tools like ftp/ssh that will allow us to transfer files over to the DataStage box directly
|Priority Justification||This is a priority for us as we'd like to write custom scripts for moving files around in Cloud Object Storage instead of streaming the data through DataStage|
|IBM's success depends on gathering feedback from customers like yourself. Aha Ideas Portal is the third party tool through which IBM Offering Managers gather feedback from customers such as yourself.|
|IBM is a global organization with business processes, management structures, technical systems and service provider networks that cross borders. As such, the information collected through Aha Ideas Portal (Customer Name, Customer Email Address) will be stored by them in the United States, and handled only as per IBM's instructions and policies. Your data (Name and Email Address) will NOT be shared with other IBM customers.|
|In order to safeguard your information in Aha, do not leave your workstation unattended while using this application, log off after using it, and print only if necessary. If you need to make a hardcopy, remember to pick up the print-out immediately, keep it under lock, and destroy it immediately when no longer needed.|
|NOTICE TO EU RESIDENTS: per EU Data Protection Policy, if you wish to remove your personal information from the IBM ideas portal, please login to the ideas portal using your previously registered information then change your email to "email@example.com" and first name to "anonymous" and last name to "anonymous". This will ensure that IBM will not send any emails to you about all idea submissions|