[10:22:13] [INFO] [org.apache.hadoop.io.compress.CodecPool] - Got brand-new compressor [.snappy]
[10:22:13] [INFO] [com.dataiku.dip.dataflow.streaming.DatasetWriter] - Done initializing output writer
[10:22:13] [INFO] [com.dataiku.dip.input.formats.parquet.ParquetOutputWriter] - Processed 0 rows (0 failed)
[10:22:14] [INFO] [org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter] - Saved output of task 'attempt_dss_0000_r_000000_0' to hdfs://xxxxxxxxxxxxx/xxxxxxx/xxxxxxxxxxxx/xx/Xxxxxxxxxxxxxxxxxxxxxxx/xxxxxxxxxxxxxxx/xxxxxx/x/xxxxxxxxxxx
[10:22:14] [INFO] [com.dataiku.dip.dataflow.streaming.DatasetWritingService] - Pushed data to write session QTDj0hf53e : 0 rows
[10:22:14] [INFO] [com.dataiku.dip.dataflow.streaming.DatasetWritingService] - Finished write session: QTDj0hf53e
[10:22:14] [DEBUG] [dku.jobs] - Command /tintercom/datasets/push-data processed in 1156ms
[10:22:14] [DEBUG] [dku.jobs] - Command /tintercom/datasets/wait-write-session processed in 1157ms
[10:22:14] [INFO] [dku.utils] - 0 rows successfully written (QTDj0hf53e)
[10:22:14] [INFO] [dku.utils] - Traceback (most recent call last):
[10:22:14] [INFO] [dku.utils] - File "/usr/lib64/python2.7/runpy.py", line 162, in _run_module_as_main
[10:22:14] [INFO] [dku.utils] - "__main__", fname, loader, pkg_name)
[10:22:14] [INFO] [dku.utils] - File "/usr/lib64/python2.7/runpy.py", line 72, in _run_code
[10:22:14] [INFO] [dku.utils] - exec code in run_globals
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python/dataiku/doctor/prediction/reg_scoring_recipe.py", line 286, in <module>
[10:22:14] [INFO] [dku.utils] - dkujson.load_from_filepath(sys.argv[7]))
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python/dataiku/doctor/prediction/reg_scoring_recipe.py", line 252, in main
[10:22:14] [INFO] [dku.utils] - for output_df in output_generator():
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python/dataiku/doctor/prediction/reg_scoring_recipe.py", line 187, in output_generator
[10:22:14] [INFO] [dku.utils] - output_probas=recipe_desc["outputProbabilities"])
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python/dataiku/doctor/prediction/classification_scoring.py", line 265, in binary_classification_predict
[10:22:14] [INFO] [dku.utils] - (pred_df, proba_df) = binary_classification_predict_single(clf, pipeline, modeling_params, target_map, threshold, data, output_probas)
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python/dataiku/doctor/prediction/classification_scoring.py", line 225, in binary_classification_predict_single
[10:22:14] [INFO] [dku.utils] - probas_raw = clf.predict_proba(features_X)
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python.packages/sklearn/ensemble/forest.py", line 583, in predict_proba
[10:22:14] [INFO] [dku.utils] - X = self._validate_X_predict(X)
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python.packages/sklearn/ensemble/forest.py", line 362, in _validate_X_predict
[10:22:14] [INFO] [dku.utils] - return self.estimators_[0]._validate_X_predict(X, check_input=True)
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python.packages/sklearn/tree/tree.py", line 377, in _validate_X_predict
[10:22:14] [INFO] [dku.utils] - X = check_array(X, dtype=DTYPE, accept_sparse="csr")
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python.packages/sklearn/utils/validation.py", line 573, in check_array
[10:22:14] [INFO] [dku.utils] - allow_nan=force_all_finite == 'allow-nan')
[10:22:14] [INFO] [dku.utils] - File "/home/usrservcloudera/dataiku-dss-7.0.2/python.packages/sklearn/utils/validation.py", line 56, in _assert_all_finite
[10:22:14] [INFO] [dku.utils] - raise ValueError(msg_err.format(type_err, X.dtype))
[10:22:14] [INFO] [dku.utils] - ValueError: Input contains NaN, infinity or a value too large for dtype('float32').
[10:22:14] [INFO] [dku.flow.activity] - Run thread failed for activity score_Cancelacion_Explicita_NP
com.dataiku.dip.exceptions.ProcessDiedException: The Python process failed (exit code: 1). More info might be available in the logs.
    at com.dataiku.dip.dataflow.exec.AbstractCodeBasedActivityRunner.throwSubprocessError(AbstractCodeBasedActivityRunner.java:225)
    at com.dataiku.dip.dataflow.exec.AbstractCodeBasedActivityRunner.handleExecutionResult(AbstractCodeBasedActivityRunner.java:176)
    at com.dataiku.dip.dataflow.exec.AbstractCodeBasedActivityRunner.execute(AbstractCodeBasedActivityRunner.java:109)
    at com.dataiku.dip.dataflow.exec.AbstractPythonRecipeRunner.executeModule(AbstractPythonRecipeRunner.java:64)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner$3.run(PredictionScoringRecipeRunner.java:588)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner.runPython(PredictionScoringRecipeRunner.java:626)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner.runWithOriginalEngine(PredictionScoringRecipeRunner.java:383)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner.run(PredictionScoringRecipeRunner.java:258)
    at com.dataiku.dip.dataflow.jobrunner.ActivityRunner$FlowRunnableThread.run(ActivityRunner.java:380)
[10:22:14] [INFO] [dku.flow.activity] running score_Cancelacion_Explicita_NP - activity is finished
[10:22:14] [ERROR] [dku.flow.activity] running score_Cancelacion_Explicita_NP - Activity failed
com.dataiku.dip.exceptions.ProcessDiedException: The Python process failed (exit code: 1). More info might be available in the logs.
    at com.dataiku.dip.dataflow.exec.AbstractCodeBasedActivityRunner.throwSubprocessError(AbstractCodeBasedActivityRunner.java:225)
    at com.dataiku.dip.dataflow.exec.AbstractCodeBasedActivityRunner.handleExecutionResult(AbstractCodeBasedActivityRunner.java:176)
    at com.dataiku.dip.dataflow.exec.AbstractCodeBasedActivityRunner.execute(AbstractCodeBasedActivityRunner.java:109)
    at com.dataiku.dip.dataflow.exec.AbstractPythonRecipeRunner.executeModule(AbstractPythonRecipeRunner.java:64)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner$3.run(PredictionScoringRecipeRunner.java:588)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner.runPython(PredictionScoringRecipeRunner.java:626)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner.runWithOriginalEngine(PredictionScoringRecipeRunner.java:383)
    at com.dataiku.dip.analysis.ml.prediction.flow.PredictionScoringRecipeRunner.run(PredictionScoringRecipeRunner.java:258)
    at com.dataiku.dip.dataflow.jobrunner.ActivityRunner$FlowRunnableThread.run(ActivityRunner.java:380)
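
The ValueError in the traceback is raised by scikit-learn's finiteness check after the features are converted to float32, so one way to narrow it down might be to scan the input for cells that are missing, NaN, infinite, or beyond the float32 range. The following is only a rough sketch in plain Python 3 with no DSS or scikit-learn calls; `classify_value`, `find_bad_cells`, `sample_rows`, and the column names are hypothetical and not taken from the log.

```python
import math

# Largest finite value representable as float32 (IEEE 754 single precision).
FLOAT32_MAX = 3.4028234663852886e38


def classify_value(value):
    """Return why a value would likely fail a float32 finiteness check, or None."""
    if value is None:
        return "missing"
    value = float(value)
    if math.isnan(value):
        return "nan"
    if math.isinf(value):
        return "infinity"
    if abs(value) > FLOAT32_MAX:
        return "too large for float32"
    return None


def find_bad_cells(rows):
    """Scan a list of dicts (hypothetical feature rows) and list offending cells."""
    problems = []
    for index, row in enumerate(rows):
        for column, value in row.items():
            reason = classify_value(value)
            if reason is not None:
                problems.append((index, column, reason))
    return problems


# Hypothetical sample data; the column names are invented for illustration.
sample_rows = [
    {"age": 42.0, "balance": 1500.5},
    {"age": float("nan"), "balance": 1e39},
    {"age": 30.0, "balance": float("inf")},
]

for index, column, reason in find_bad_cells(sample_rows):
    print(index, column, reason)
```

Running a check like this on a sample of the scoring input could show which columns need imputation or clipping before the recipe runs; whether the preprocessing in the actual model handles those cases is not something the log reveals.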