Core Designer Certificate: HOL Question 5

Level 1

I'm stuck on question 5! My formula is correct, but no data is showing up in the column. I successfully created a card for univariate analysis of the 3 categories but don't know what I'm missing. Any guidance would be greatly appreciated. I'm applying for a sales position and would love to be certified before the interview process starts.

Some columns ( Oil production (Etemad & Luciana) (terawatt-hours), meat_prod_tonnes, and Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes) ) report country total data. Transform these three columns into per-capita data.

• Hint 1: To convert from country "total" data to "per-capita" data, divide by the country's population.
• Hint 2: The Formula page in the reference documentation shows the proper syntax for using formulas to read column values when the column names include spaces.
• Hint 3: You will not need the original columns (including Population) for further analysis.
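For readers who want to see the arithmetic behind the hints outside of DSS, here is a minimal Python sketch. It is not the Dataiku formula and not the certification answer; the sample row, the `Entity` column, and the " per capita" suffix are made up for illustration.

```python
# Column names copied from the question; they contain spaces and punctuation,
# so they are always looked up as exact strings.
TOTAL_COLUMNS = [
    "Oil production (Etemad & Luciana) (terawatt-hours)",
    "meat_prod_tonnes",
    "Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes)",
]


def to_per_capita(row):
    """Return a new row with per-capita columns replacing the totals and Population."""
    population = row["Population"]
    # Hint 3: drop the original total columns and Population.
    result = {
        key: value
        for key, value in row.items()
        if key not in TOTAL_COLUMNS and key != "Population"
    }
    # Hint 1: per-capita value = country total / population.
    for column in TOTAL_COLUMNS:
        result[column + " per capita"] = row[column] / population
    return result


# Hypothetical sample row; the numbers are invented.
sample_row = {
    "Entity": "Exampleland",
    "Population": 1000,
    "Oil production (Etemad & Luciana) (terawatt-hours)": 50.0,
    "meat_prod_tonnes": 200.0,
    "Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes)": 10.0,
}

converted = to_per_capita(sample_row)
print(converted)
```

This sketch assumes every value is present; rows with missing values need separate handling.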

Thanks!

AA

6 Replies
Level 1
Author

Here's the formula I used: numval("Oil production (Etemad & Luciana) (terawatt-hours)") / numval(Population). DSS says the formula is valid, but I'm getting this error in the sample output column: ExpressionError: Cannot parse to number: null. What am I missing? I tried applying the formula to the whole dataset, then to the individual columns, and got nothing.

Dataiker

Hi @admin4,

I would suggest looking at the names of your columns and which ones need to be converted using numval.

Hope that helps!

Best,

Denise, Dataiku Academy

Level 1
Author

Thanks for the tip. I'll apply it and see what happens.

Level 1
Author

Curious whether rounding to integers is the right first step, or does that skew the data? @denisemalan

Level 1

Hello, I am also having the same problem. I tried isolating the variables that needed to be converted using numval but that still didn't work. This is what I have so far:

numval("Oil production (Etemad & Luciana) (terawatt-hours)")/(Population)

Dataiker

Hi, @LeBronJames,

This formula works for me. Keep in mind the Oil production column has some nulls so you won't see output for every row.

Best,

Denise
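
The point about nulls in the last reply can be illustrated with a small Python sketch. It only mirrors the behavior described in the thread (no output for rows where the oil value is missing) and makes no claim about how DSS evaluates formulas internally; the helper name and sample values are hypothetical.

```python
def per_capita(total, population):
    """Divide total by population; return None when a value is missing."""
    # A missing total (or missing/zero population) gives no result for that row.
    if total is None or population is None or population == 0:
        return None
    return total / population


# Hypothetical (oil production, population) pairs; the second row has no oil value.
rows = [(50.0, 1000), (None, 2000)]
results = [per_capita(total, population) for total, population in rows]
print(results)
```

An empty output cell next to an empty input cell is therefore expected and does not by itself mean the formula is wrong.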