Solved!


Core Designer Certificate

Hi Experts,

I am new to Dataiku. I just completed the 101, 102, and 103 learning paths and am already fascinated by the capabilities the platform gives to non-coding analysts. It's gorgeous!

I am now working through the Core Designer Certificate exercise and am stuck on step 5 of the hands-on instructions:

"Some columns ( *Oil production (Etemad & Luciana) (terawatt-hours)*, *meat_prod_tonnes*, and *Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes)* ) report country total data. Transform these three columns into per-capita data."

I applied this formula: "numval("Oil production (Etemad & Luciana) (terawatt-hours)") / numval("Population), but it's not correct.

I tried to find the correct formula for this task and checked the material on the "Formula page" reference, but I cannot work out the right one.

Can you please help me fix this formula?

Many thanks.

Respectfully,

Vyacheslav


15 Replies


When you say your formula is not correct, what do you mean?

- Is the formula producing an error message? If so, what is the error message?

- Or are the numeric results incorrect?

In the example above I notice an extra quote mark (") before the formula. I also note that you are missing a quote mark (") after Population and before the closing parenthesis.

--Tom


Good day, Tom,

Thanks a lot for your feedback and suggestions.

I adjusted the formula syntax as you recommended and it worked.

Your help is highly appreciated.

Could you please also advise where I can find explicit instructions on how to create a univariate analysis?

Thanks in advance,

Vyacheslav


Regarding univariate analysis, I’m not clear exactly what type of data you are working with or which type(s) of univariate analysis you are trying to do.

That being said there are a bunch of tools built into Dataiku for exploratory data analysis that might be useful to you. As a starting point you might find the statistical worksheets to be useful. Here is a knowledge base article on the use.

Here is a link to the documentation.

https://doc.dataiku.com/dss/latest/statistics/index.html

You can do all sorts of other types of analysis with Dataiku, and even more using R, Jupyter notebooks, and Python.

Have fun with your data.

--Tom


Tom,

Thank you very much for the prompt response.

I find it very useful!

Cheers


Dear Tom,

I want to share my Core Designer Certificate with you: https://verify.skilljar.com/c/oa9qsjrrc5o7.

Thanks again for your kind contribution to my very first DSS achievement.

There is more to come.

Respectfully,

Vyacheslav


Excellent to see your achievement.

What studies do you plan to take on next?

--Tom



@tgb417

Hi Neuron,

Thank you very much for your endorsement, it motivates me a lot! 🙂

My next target is ML Practitioner Certificate!

Respectfully.


Congratulations buddy,

I am also trying to complete this certification and am stuck on the same step.

Could you help me out?


Hi, can you please suggest the correct syntax for this step?

Thank you.


Hello,

Can someone explain what exactly is meant by step 5? What do I need to do?

"

Some columns ( *Oil production (Etemad & Luciana) (terawatt-hours)*, *meat_prod_tonnes*, and *Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes)* ) report country total data. Transform these three columns into per-capita data.

**Hint 1:** To convert from country “total” data to “per-capita” data divide by the country's population.

**Hint 2:** The Formula page in the reference documentation shows the proper syntax for using formulas to read column values when the column names include spaces.

**Hint 3:** You will not need the original columns (including Population) for further analysis.

"


Hi rra21 and welcome to the Dataiku Community,

You need to apply a recipe to the dataset that converts the total values to per-capita values, so they can be compared with the other data in the project. The documentation linked from the hints you quoted should give you some indication of what to do in the data preparation step.

If you are still stuck, please share what you have tried so far and where you are not getting any further. We're happy to help.
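To make the arithmetic behind Hint 1 concrete, here is a minimal sketch in plain Python, outside Dataiku. It is not the certificate solution: the rows and their numbers are made up for illustration, the helper `per_capita` is hypothetical, and returning `None` for a missing value or a non-positive population is just one possible choice.

```python
# Illustrative rows only: the values are invented, not taken from the exercise dataset.
rows = [
    {"Country": "A", "meat_prod_tonnes": 500.0, "Population": 1000.0},
    {"Country": "B", "meat_prod_tonnes": 120.0, "Population": 0.0},
    {"Country": "C", "meat_prod_tonnes": None, "Population": 2000.0},
]


def per_capita(total, population):
    """Return total / population, or None if a value is missing or population is not positive."""
    if total is None or population is None or population <= 0:
        return None
    return total / population


# Build new rows that keep only the per-capita column; the original total and
# Population columns are dropped, in the spirit of Hint 3.
per_capita_rows = [
    {
        "Country": row["Country"],
        "meat_prod_tonnes_per_capita": per_capita(
            row["meat_prod_tonnes"], row["Population"]
        ),
    }
    for row in rows
]

for row in per_capita_rows:
    print(row)
```

Inside Dataiku the same division would be written as a formula in a Prepare recipe rather than in Python; the sketch only shows what "divide by the country's population" means row by row.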


I have figured out what was meant, and now I am at step 9. Any tips for it?


I'm also stuck on this step. This formula seems to be correct, but it doesn't display any values in the new column. I'm not sure what I'm missing to get the correct conversion.

numval("Oil production (Etemad & Luciana) (terawatt-hours)") / numval(“Population”)


I’m not sure I know what your problem is, and because this has to do with getting a certificate, I’m only going to provide something that might be a hint. I note that your formula has spaces, parentheses, ampersands, and hyphens or minus signs in column names. See if making some changes in those areas might help. There are a lot of ways to do that. Also note that the formula language is case sensitive.

--Tom
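One way to read the hint above is to rename the awkward columns to simple identifiers before using them in a formula. The following is a rough Python sketch of such a renaming rule, not something from the exercise: the function name `simplify_column_name` and the choice to lowercase and join words with underscores are assumptions made for illustration.

```python
import re


def simplify_column_name(name):
    """Collapse every run of non-alphanumeric characters into one underscore and lowercase the result."""
    simplified = re.sub(r"[^0-9A-Za-z]+", "_", name)
    return simplified.strip("_").lower()


columns = [
    "Oil production (Etemad & Luciana) (terawatt-hours)",
    "meat_prod_tonnes",
    "Food Balance Sheets: Eggs - Production (FAO (2017)) (tonnes)",
]

for column in columns:
    print(column, "->", simplify_column_name(column))
```

Because the formula language is case sensitive, whatever naming rule is used has to be applied consistently in both the column names and the formulas that reference them.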


I’d also check your quotation marks. It looks like you are not using plain straight quotation marks everywhere; in some places your formula has curly ("smart") ones, with distinct opening and closing marks. This is a “feature” of some operating systems, like macOS, that substitute these prettier quotation marks as you type. But it can cause problems in programming languages like Python and in Dataiku’s formula language.

--Tom
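As a small illustration of the curly-quote problem, the Python sketch below finds typographic quotes in a formula string and swaps them for straight ones. The helper names `find_curly_quotes` and `straighten_quotes` are hypothetical, and the sample string is the formula posted earlier in this thread; this only cleans up the text of a formula and is not part of Dataiku.

```python
# Map common typographic quotes to their plain ASCII equivalents.
CURLY_TO_STRAIGHT = {
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
}


def find_curly_quotes(text):
    """Return (index, character) pairs for every curly quote found in text."""
    return [(i, ch) for i, ch in enumerate(text) if ch in CURLY_TO_STRAIGHT]


def straighten_quotes(text):
    """Replace curly quotes with straight ones, leaving everything else untouched."""
    return text.translate(str.maketrans(CURLY_TO_STRAIGHT))


# The formula as posted above, with curly quotes around Population.
formula = (
    'numval("Oil production (Etemad & Luciana) (terawatt-hours)")'
    " / numval(\u201cPopulation\u201d)"
)

print(find_curly_quotes(formula))
print(straighten_quotes(formula))
```

Typing the formula directly in the formula editor, rather than pasting it from a word processor or chat window, avoids the substitution in the first place.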