Re: Split dataset by stratified sampling.
And if you have values of 'label' that appear in only a single record, and you want to make sure those records go to the training set, you need a few more lines: import numpy as np values, counts = n…
Re: Split dataset by stratified sampling.
Well, a 2-liner, the other one being: from sklearn.model_selection import train_test_split
Re: Pivot and Chart-Please help!!!
Things we learned while making this work: * The docs need to be better integrated. Some of the info we needed was at https://doc.dataiku.com/dss/9.0/preparation/processors/fold-columns-by-pattern.htm…
Re: Adding a SAS data source
To elaborate - Add the file to a managed folder, then create a Python code recipe and add lines like the following to the default recipe code: from sas7bdat import SAS7BDAT; myfilename = '<myfile>…
FYI - Academy: Machine Learning Basics error
Dataiku Support has forwarded the following ticket to the Academy team: Course module https://academy.dataiku.com/path/ml-practitioner/machine-learning-basics/546062 states that "Patient Number&…
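
On the stratified-sampling replies at the top of this list: the snippets there are cut off, so here is a rough sketch of the idea they describe, not the original poster's code. It assumes the truncated line calls np.unique with return_counts=True, and the function name, the 'label' column default, and the toy data are all made up for illustration. Labels that occur only once are set aside and appended to the training set, because train_test_split cannot stratify on a class with a single member.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split


def stratified_split_keep_singletons(df, label_col="label", test_size=0.25, seed=0):
    """Stratified train/test split; rows whose label occurs once always go to train.

    Hypothetical helper, written only to illustrate the approach.
    """
    # Count how often each label value occurs.
    values, counts = np.unique(df[label_col].to_numpy(), return_counts=True)
    singleton_labels = values[counts == 1]

    # Set aside rows whose label appears in only one record.
    is_singleton = df[label_col].isin(singleton_labels)
    singletons = df[is_singleton]
    rest = df[~is_singleton]

    # Stratify the remaining rows on the label column.
    train, test = train_test_split(
        rest, test_size=test_size, stratify=rest[label_col], random_state=seed
    )

    # Singleton rows are added to the training set only.
    train = pd.concat([train, singletons])
    return train, test


# Toy data (invented): 12 rows of 'a', 8 rows of 'b', one row of 'c'.
toy = pd.DataFrame({"x": range(21), "label": ["a"] * 12 + ["b"] * 8 + ["c"]})
train_df, test_df = stratified_split_keep_singletons(toy)
```

In a Dataiku Python recipe the DataFrame would come from the input dataset rather than from toy data, and the two results would be written to separate output datasets.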