How to select a column by using values of other columns

Tags
Registered Posts: 2

I want to select an existing column but the name of that column will be dynamic. Which means it should be dependent upon another columns' value. For example - Suppose I have a dataset like

ID,Year,Month,Scen,09_2023,12_2023

1,2023,09,34.5,5,6

2,2023,09,56.2,1,3

3,2023,09,51.9,2,7

Now the new dataset would be combination of `Month_Year`, so for this example it would be

ID,Year,Month,Scen,09_2023

1,2023,09,34.5,5

2,2023,09,56.2,1

3,2023,09,51.9,2

Welcome!

It looks like you're new here. Sign in or register to get started.

Answers

  • Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,399 Neuron

    I don't really understand what you are asking. The only difference I see between the two sample datasets is that the last column 12_2003 is removed in the second dataset. This makes no sense with the title of the post "How to select a column by using values of other columns". Can you please clarify what exactly you are trying to achieve? Don't describe the step but the goal, there might be a better way of achieving the goal.

  • Dataiker, Dataiku DSS Core Designer, Registered, Moderator Posts: 753 Dataiker
    edited July 2024

    Hi,

    You can do this with a formula. In formula, the strval(col_name) function returns the value of the column "col_name". So what you have to do is to build the col_name by taking month and year, so with this input:

    IDYearMonthScen09_202312_2023
    120230934.556
    220230956.213
    320231251.927

    You would use formula (written on multiple lines for clarity)

    strval(
      concat(
         strval("Month"),
         "_",
         strval("Year")
      )
    )


    First we use concat to build a string that is the concatenation opf the value of the month, _ and the value of the year. Then we use strval to take the value of this column, which gets us our output:

    image.png

  • Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,399 Neuron

    Now I get it, nice one @Clément_Stenac
    !

  • Registered Posts: 2

    you are almost there, but I want the name of the column (viz `output`) to be `09_2023` too. Also, the value of `Month` and `Year` is same throughout the dataset.

Welcome!

It looks like you're new here. Sign in or register to get started.

Welcome!

It looks like you're new here. Sign in or register to get started.