Can dataiku dss handle json and XML data inputs and outputs?

Solved!
aitoufkm
Level 2
Can dataiku dss handle json and XML data inputs and outputs?

Does Dataiku offer handle input data of types XML (using XSD schema) and json?

Does it also handle the generation of output based on XSLT?

 

Or this should be done programmatically ?

 

Thank you in advance

3 Solutions
ATsao
Dataiker

Hi there,

DSS does indeed support using XML and JSON formats as input. More information can be found in our documentation here:

https://doc.dataiku.com/dss/latest/formats/xml.html

https://doc.dataiku.com/dss/latest/formats/json.html

In terms of your second question, it'd be helpful to get more context and details about your use case but yes you would likely need to do this programmatically. I hope that this helps!

Thanks,

Andrew

View solution in original post

aitoufkm
Level 2
Author

Thank you for your answer

I have checked the documentation when I tried to extract information during the upload of the xml, I couldn't get the information as shown in the example below:

let assume that I have this simple xml file:

<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<document>
<person gender="Male">
<lastName>Doe</LastName>
<firstName>John</firstName>
<phoneNumbers>
<phone type="fix">01 02 03 04 05</phone>
<phone type="Mobile">06 07 08 09 10</phone>
</phoneNumbers>
</person>

<person>..</person>

</document>

I want to extract each person with all its attributes as record using Xpath

 

genderlastNamefirstNamefixmobile
maleDoeJohn01 02 03 04 0506 07 08 09 10

 

when trying  "XPaths to columns" in the schema/Preview it does not extract the expected information.

Thank you

View solution in original post

0 Kudos
aitoufkm
Level 2
Author

I finally figure it out.

the issue was related to Xpath Expressions

just still have "xml_text":  before each content 

Thank you anyway

View solution in original post

0 Kudos
4 Replies
ATsao
Dataiker

Hi there,

DSS does indeed support using XML and JSON formats as input. More information can be found in our documentation here:

https://doc.dataiku.com/dss/latest/formats/xml.html

https://doc.dataiku.com/dss/latest/formats/json.html

In terms of your second question, it'd be helpful to get more context and details about your use case but yes you would likely need to do this programmatically. I hope that this helps!

Thanks,

Andrew

aitoufkm
Level 2
Author

Thank you for your answer

I have checked the documentation when I tried to extract information during the upload of the xml, I couldn't get the information as shown in the example below:

let assume that I have this simple xml file:

<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<document>
<person gender="Male">
<lastName>Doe</LastName>
<firstName>John</firstName>
<phoneNumbers>
<phone type="fix">01 02 03 04 05</phone>
<phone type="Mobile">06 07 08 09 10</phone>
</phoneNumbers>
</person>

<person>..</person>

</document>

I want to extract each person with all its attributes as record using Xpath

 

genderlastNamefirstNamefixmobile
maleDoeJohn01 02 03 04 0506 07 08 09 10

 

when trying  "XPaths to columns" in the schema/Preview it does not extract the expected information.

Thank you

0 Kudos
aitoufkm
Level 2
Author

I finally figure it out.

the issue was related to Xpath Expressions

just still have "xml_text":  before each content 

Thank you anyway

0 Kudos
SUSHIL
Level 3

Hi, 

I also facing issue when uploading the excel file, its reading some columns. 

But in those columns it looks nested array and contains xml_text before each content. 

What will be solution and how you have resolved

0 Kudos