Survey banner
The Dataiku Community is moving to a new home! Some short term disruption starting next week: LEARN MORE

web logs annalysis

Ornicar
Level 1
web logs annalysis

Hello,

I write to you again.
There are an other part I don't understand what I have to do here

Feature engineering the referrer URLs

-Split URL in referer, extracting only the hostname

-Use the Find and replace processor on referer_host, replacing t.co with twitter.com and matching on the complete value of the string

-In the same column, replace www. with an empty expression (i.e. no value), matching on substring

-Once more for referer_host, replace \..* with an empty expression, matching on regular expression. This step allows us to later put all traffic from the local Google domains under a single group.

-Reduce clutter by removing eight more columns: server_ts, referer, type, visitor_params, session_params, event_params, br_lang, and tz_off

I don't know how I can do that. Is there anyone who has a course which

0 Kudos
1 Reply
Turribeach

The Prepare recipe should be able to do what you want. Give it a try and let us know.

https://knowledge.dataiku.com/latest/data-preparation/prepare-recipe/concept-prepare-recipe.html

In fact there is a also a documented sample of using Web Logs:

https://knowledge.dataiku.com/latest/data-preparation/prepare-recipe/tutorial-web-logs-enrichment.ht...

 

0 Kudos