0 / 0
Troubleshoot Watson Machine Learning

Troubleshoot Watson Machine Learning

Follow these tips to resolve common problems you might encounter when working with Watson Machine Learning.

Insufficient class members in training data for AutoAI experiment

Training data for an AutoAI experiment must have at least 4 members for each class. If your training data has an insufficient number of members in a class, you will encounter this error:

ERROR: ingesting data Message id: AC10011E. Message: Each class must have at least 4 members. The following classes have too few members: ['T'].

To resolve the problem, update the training data to remove the class or add more members.

Batch deployments that use large data volumes as input might fail

If you are scoring a batch job that uses large volumes of data as the input source, the job might fail becase of internal timeout settings. A symptom of this problem might be an error message similar to the following example:

Incorrect input data: Flight returned internal error, with message: CDICO9999E: Internal error occurred: Snowflake sQL logged error: JDBC driver internal error: Timeout waiting for the download of #chunk49(Total chunks: 186) retry=0.

If the timeout occurs when you score your batch deployment, you must configure the data source query level timeout limitation to handle long-running jobs.

Query-level timeout information for data sources is as follows:

Information about query-level time limitation for data sources
Data source Query level time limitation Default time limit Modify default time limit
Apache Cassandra Yes 10 seconds Set the read_timeout_in_ms and write_timeout_in_ms parameters in the Apache Cassandra configuration file or in the Apache Cassandra connection URL to change the default time limit.
Cloud Object Storage No N/A N/A
Db2 Yes N/A Set the QueryTimeout parameter to specify the amount of time (in seconds) that a client waits for a query execution to complete before a client attempts to cancel the execution and return control to the application.
Hive via Execution Engine for Hadoop Yes 60 minutes (3600 seconds) Set the hive.session.query.timeout property in the connection URL to change the default time limit.
Microsoft SQL Server Yes 30 seconds Set the QUERY_TIMEOUT server configuration option to change the default time limit.
MongoDB Yes 30 seconds Set the maxTimeMS parameter in the query options to change the default time limit.
MySQL Yes 0 seconds (No default time limit) Set the timeout property in the connection URL or in the JDBC driver properties to specify a time limit for your query.
Oracle Yes 30 seconds Set the QUERY_TIMEOUT parameter in the Oracle JDBC driver to specify the maximum amount of time a query can run before it is automatically cancelled.
PostgreSQL No N/A Set the queryTimeout property to specify the maximum amount of time that a query can run. The default value of the queryTimeout property is 0.
Snowflake Yes 6 hours Set the queryTimeout parameter to change the default time limit.

To avoid your batch deployments from failing, partition your data set or decrease its size.

Security for file uploads

Files you upload through the Watson Studio or Watson Machine Learning UI are not validated or scanned for potentially malicious content. It is recommended that you run security software, such as an anti-virus application, on all files before uploading to ensure the security of your content.

Deployments with constricted software specifications fail after an upgrade

If you upgrade to a more recent version of IBM Cloud Pak for Data and deploy an R Shiny application asset that was created by using constricted software specifications in FIPS mode, your deployment fails.

For example, deployments that use shiny-r3.6 and shiny-r4.2 software specifications fail after you upgrade from IBM Cloud Pak for Data version 4.7.0 to 4.8.4 or later. You might receive the error message Error 502 - Bad Gateway.

To prevent your deployment from failing, update the constricted specification for your deployed asset to use the latest software specification. For more information, see Managing outdated software specifications or frameworks. You can also delete your application deployment if you no longer need it.

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more