Hi Experts,
I am new to Dataiku. I have just passed the 101, 102, and 103 learning paths, and I am already fascinated by the capabilities the platform gives to non-coding analysts. It's gorgeous!
Now I am working through the Core Designer Certificate exercise and am stuck on step 5 of the Hands-On instructions:
"Some columns ( Oil production (Etemad & Luciana) (terawatt-hours), meat_prod_tonnes, and Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes) ) report country total data. Transform these three columns into per-capita data."
I applied the formula: "numval("Oil production (Etemad & Luciana) (terawatt-hours)") / numval("Population), but it's not correct.
I tried to find the correct formula for this task and checked the materials on the "Formula page" reference, but I cannot work out the right formula.
Could you please help me fix this formula?
Many thanks.
Respectfully,
Vyacheslav
When you say your formula is not correct, what do you mean?
- Is the formula producing an error message? If so, what is the error message?
- Or are the numeric results incorrect?
In the example above I notice an extra quote mark " before the formula. I also note you are missing a quote mark " after Population and before the closing parenthesis ).
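For anyone who wants to sanity-check the quoting outside of Dataiku, here is a minimal Python sketch. It removes every well-formed numval("...") call from a formula string and counts the double quotes left over; anything above zero points to a stray or missing quote. The helper name `stray_quotes` is hypothetical and not part of Dataiku, and this is only a rough check, not a parser for the formula language.

```python
import re

# Matches a numval call whose argument is wrapped in a pair of straight double quotes.
NUMVAL_CALL = re.compile(r'numval\("[^"]*"\)')


def stray_quotes(formula: str) -> int:
    """Count double quotes that remain after removing well-formed numval("...") calls."""
    remainder = NUMVAL_CALL.sub("", formula)
    return remainder.count('"')


# The formula as posted: a leading quote, and no closing quote after Population.
posted = '"numval("Oil production (Etemad & Luciana) (terawatt-hours)") / numval("Population)'
# The same formula with every quote paired.
paired = 'numval("Oil production (Etemad & Luciana) (terawatt-hours)") / numval("Population")'

print(stray_quotes(posted))  # 2
print(stray_quotes(paired))  # 0
```

Note that simply counting quotes would not catch this case, because one extra quote and one missing quote still add up to an even number.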
Good day, Tom,
Thanks a lot for your feedback and suggestions.
I adjusted the formula syntax details as was recommended and it worked out well.
Your help is highly appreciated.
Now, could you please advise where I can find explicit instructions on how to create a univariate analysis?
Thanks in advance,
Vyacheslav
Regarding univariate analysis, I'm not clear exactly what type of data you are working with or which type(s) of univariate analysis you are trying to do.
That being said, there are a bunch of tools built into Dataiku for exploratory data analysis that might be useful to you. As a starting point, you might find the statistical worksheets useful. Here is a knowledge base article on their use.
Here is a link to the documentation.
https://doc.dataiku.com/dss/latest/statistics/index.html
You can do all sorts of other types of analysis with Dataiku, and even more using R, Jupyter notebooks, and Python.
Have fun with your data.
Tom,
Thank you very much for the prompt response.
I find it very useful!
Cheers
Dear Tom,
I want to share my Core Designer Certificate with you: https://verify.skilljar.com/c/oa9qsjrrc5o7.
Thanks a lot again for your kind contribution to my very first DSS achievement.
There is more to come.
Respectfully,
Vyacheslav
Excellent to see your achievement.
What studies do you plan to take on next?
--Tom
@tgb417 tgb
Hi Neuron,
Thank you very much for your endorsement, it motivates me a lot!
My next target is ML Practitioner Certificate!
Respectfully.
Congratulations, buddy!
I am also trying to complete this certification and am stuck on the same step.
Could you help me out?
Can anyone help me with Core Designer step 5?
Some columns ( Oil production (Etemad & Luciana) (terawatt-hours), meat_prod_tonnes, and Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes) ) report country total data. Transform these three columns into per-capita data.
Hi, can you please suggest the correct syntax for this?
Thank you.
Hello,
Can someone explain what exactly is meant by step 5? What do I need to do?
"Some columns ( Oil production (Etemad & Luciana) (terawatt-hours), meat_prod_tonnes, and Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes) ) report country total data. Transform these three columns into per-capita data."
Hi rra21 and welcome to the Dataiku Community,
You need to apply a recipe to the dataset that converts the total values to per-capita values, so they can be compared with the other data in the project. The documentation linked from the hints you quoted should give you some indication of what to do in the data preparation step.
If you are still stuck, please share what you have tried so far and where you are not getting any further. We're happy to help.
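As a general illustration of what "per capita" means here (not the certification answer), this is a minimal Python sketch that divides a country total by the population of that country. The function name `per_capita`, the output column name, and the sample numbers are all made up for illustration; only the `meat_prod_tonnes` and `Population` column names come from this thread.

```python
from typing import Optional


def per_capita(total: Optional[float], population: Optional[float]) -> Optional[float]:
    """Divide a country total by its population; return None when it cannot be computed."""
    if total is None or population is None or population == 0:
        return None
    return total / population


# Made-up illustrative row, not taken from the certification dataset.
row = {"meat_prod_tonnes": 500000.0, "Population": 10000000.0}

# Hypothetical output column name holding tonnes per person.
row["meat_prod_tonnes_per_capita"] = per_capita(row["meat_prod_tonnes"], row["Population"])

print(row["meat_prod_tonnes_per_capita"])  # 0.05
```

Returning None for a missing or zero population keeps an empty cell empty instead of raising a division error.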
I have worked out what was meant by it, and now I am at step 9. Any tips for it?
I'm also stuck on this step. This formula seems to be correct but didn't display any values in the new column. Not sure what I'm missing to get the correct conversion.
numval("Oil production (Etemad & Luciana) (terawatt-hours)") / numval(“Population”)
I'm not sure I know what your problem is, and because this has to do with getting a certificate, I'm only going to provide something that might be a hint. I note that your formula has spaces, parentheses, an ampersand, and hyphens or minus signs in the column names. See if making some changes in those areas helps. There are a lot of ways to change that. Also note that the formula language is case sensitive.
I'd also check your quotation marks. It looks like you are not using simple straight quotation marks everywhere; in some places in your formula you have the fancy curly ones, with separate opening and closing marks. This is a "feature" of some operating systems, such as macOS, which substitute these pretty quotation marks automatically. But it can cause problems in some programming languages, like Python and Dataiku's formula language.
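If you suspect curly quotes have crept into a formula you pasted from elsewhere, one way to check is to run the text through a small clean-up step before pasting it back. The following is a minimal Python sketch, assuming only the four common curly quote characters need replacing; the helper name `straighten_quotes` is hypothetical and has nothing to do with Dataiku itself.

```python
# Map the common curly quote characters to their straight equivalents.
CURLY_TO_STRAIGHT = str.maketrans({
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
})


def straighten_quotes(text: str) -> str:
    """Return the text with curly quotes replaced by straight quotes."""
    return text.translate(CURLY_TO_STRAIGHT)


# Fragment of a formula typed with curly quotes around the column name.
pasted = "numval(\u201cPopulation\u201d)"

print(straighten_quotes(pasted))  # numval("Population")
```

Comparing the input and output of the helper is also a quick way to see whether any curly quotes were present at all.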
Sir, actually all of my columns are empty except year. Please help.
I am stuck, please help me.