Random forest classification

I am using the Random Forest classification algorithm in Dataiku.

However, the confusion matrix still shows a threshold cut-off, which I thought applied only to regression. Can anyone explain this? Also, how do I choose Random Forest regression so that the output is a probability?

2 Replies

All classification algorithms in DSS are meant to output classes (either binary: 0/1, or multi-class). Most classification algorithms do not predict classes directly: they predict probabilities and then apply a threshold to the probability.

Random Forest Classification is one of these: it predicts a probability and then applies a threshold to it. If you deploy a Random Forest Classification model in DSS, it outputs both the probability and the thresholded predicted class. So if you are only interested in the probability, you can ignore the threshold and simply use the predicted probability columns in the result.
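To make the "probability, then threshold" idea concrete, here is a minimal sketch in plain Python. It is not how DSS is implemented: the vote list, the function names, and the simplification that the probability is the fraction of trees voting for class 1 are all assumptions made for illustration (real implementations often average per-tree probabilities instead).

```python
def forest_probability(tree_votes):
    """Simplified assumption: probability of class 1 is the share of trees voting 1."""
    return sum(tree_votes) / len(tree_votes)


def apply_threshold(probability, threshold=0.5):
    """Turn a probability into a 0/1 class using a cut-off."""
    return 1 if probability >= threshold else 0


# Hypothetical votes from 10 trees for a single row (1 = positive class).
votes = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]

proba = forest_probability(votes)          # the probability column
pred_default = apply_threshold(proba)      # class with a 0.5 cut-off
pred_strict = apply_threshold(proba, 0.8)  # class with a stricter 0.8 cut-off

print(proba, pred_default, pred_strict)
```

The probability is the same in both cases; only the predicted class changes when the cut-off moves. That is why the threshold appears next to the confusion matrix, and why it can be ignored if only the probability is needed.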

In DSS, Random Forest Regression applies only to continuous targets (i.e., predicting a numerical variable such as price, rather than a discrete variable such as color). It does not output a probability.
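For contrast, a regression forest produces a number rather than a probability, so no threshold is involved. The sketch below uses the standard idea of averaging the trees' numeric predictions; the per-tree values and the function name are hypothetical, and this is an illustration rather than the DSS implementation.

```python
from statistics import mean


def forest_regression(tree_predictions):
    """Average the numeric predictions of the individual trees."""
    return mean(tree_predictions)


# Hypothetical price predictions from 4 trees for a single row.
tree_prices = [10.0, 12.0, 11.0, 13.0]

predicted_price = forest_regression(tree_prices)
print(predicted_price)
```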

prediction

Decimal

error

error_decile

abs_error_decile

How do I interpret my result based on the prediction column?