web logs annalysis
Hello,
I write to you again.
There are an other part I don't understand what I have to do here
Feature engineering the referrer URLs
-Split URL in referer, extracting only the hostname
-Use the Find and replace processor on referer_host, replacing t.co with twitter.com and matching on the complete value of the string
-In the same column, replace www. with an empty expression (i.e. no value), matching on substring
-Once more for referer_host, replace \..* with an empty expression, matching on regular expression. This step allows us to later put all traffic from the local Google domains under a single group.
-Reduce clutter by removing eight more columns: server_ts, referer, type, visitor_params, session_params, event_params, br_lang, and tz_off
I don't know how I can do that. Is there anyone who has a course which
Answers
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,166 Neuron
The Prepare recipe should be able to do what you want. Give it a try and let us know.
https://knowledge.dataiku.com/latest/data-preparation/prepare-recipe/concept-prepare-recipe.html
In fact there is a also a documented sample of using Web Logs: