Data Virtualization on Cloud Pak for Data as a Service

Description

Data Virtualization integrates data sources across multiple locations without copying or replicating data, and presents all of this data as one logical data view. This virtual data view makes it easier to get value out of your data. The Data Virtualization service is fully integrated into Cloud Pak for Data as a Service as part of the data fabric.

To get started, create a service instance of Data Virtualization in Cloud Pak for Data as a Service and connect to your data sources. After you create connections to your data sources, you can quickly create views across all of your organization’s data. Because you query the latest data at its source, your analytics are simpler to build, more up to date, and more accurate.

With Data Virtualization, your company can accomplish these goals:

  • Use real-time analytics efficiently and get current analytics across distributed data sources, with no need to store data outside your data center.
  • Accelerate processing times by automatically organizing your data nodes into a collaborative network for computational efficiency.
  • Take advantage of standard SQL through common interfaces, including R, Spark, Python, and Jupyter Notebooks, and work with what appears to be a single data repository where your SQL applications can connect and run.
  • Centralize authentication and authorization for data sources in a trusted environment, where credentials for your private databases are stored in encrypted form on the local device and remain private to that device.
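
The idea of a single logical view over separate sources, queried with standard SQL from Python, can be illustrated with a small local sketch. This is not the Data Virtualization API: it uses Python's built-in sqlite3 module as a stand-in, and the source names (crm_src, sales_src), tables, and the customer_totals view are hypothetical. It only shows the general pattern of joining two independent stores through a view without copying rows, so that each query reflects the current data at its source.

```python
import sqlite3


def open_sources():
    """Open one connection with two separate in-memory databases attached.

    Each attached database stands in for an independent data source.
    All names here are hypothetical and chosen for illustration only.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("ATTACH DATABASE ':memory:' AS crm_src")
    conn.execute("ATTACH DATABASE ':memory:' AS sales_src")

    conn.execute("CREATE TABLE crm_src.customers (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute("CREATE TABLE sales_src.orders (customer_id INTEGER, amount REAL)")
    conn.executemany(
        "INSERT INTO crm_src.customers VALUES (?, ?)",
        [(1, "Acme"), (2, "Globex")],
    )
    conn.executemany(
        "INSERT INTO sales_src.orders VALUES (?, ?)",
        [(1, 100.0), (1, 50.0), (2, 75.0)],
    )

    # A TEMP view may reference tables in several attached databases.
    # The view stores no rows; it is evaluated against the sources on each query.
    conn.execute(
        """
        CREATE TEMP VIEW customer_totals AS
        SELECT c.name AS name, SUM(o.amount) AS total
        FROM crm_src.customers AS c
        JOIN sales_src.orders AS o ON o.customer_id = c.id
        GROUP BY c.name
        """
    )
    return conn


def totals(conn):
    """Query the logical view with ordinary SQL."""
    return conn.execute(
        "SELECT name, total FROM customer_totals ORDER BY name"
    ).fetchall()


conn = open_sources()
before = totals(conn)

# A new row written at the source is visible through the view immediately,
# because the view never held a copy of the data.
conn.execute("INSERT INTO sales_src.orders VALUES (2, 25.0)")
after = totals(conn)

print(before)
print(after)
```

In this sketch the second query picks up the new order without any refresh or reload step, which is the property the goals above describe as getting current analytics across distributed data sources.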

Integrated services

Related services

Compatible data sources

See Connection types for a list of data source services that are compatible.