DecisionTreeRegressor 점수 해석?

-1

내가 기능의 관련성을 평가하기 위해 노력하고 내가 DecisionTreeRegressor()DecisionTreeRegressor 점수 해석?

코드의 관련 부분을 사용하고이 아래에 제시 : 나는 print 기능을 실행하면

# TODO: Make a copy of the DataFrame, using the 'drop' function to drop the given feature 
new_data = data.drop(['Frozen'], axis = 1) 

# TODO: Split the data into training and testing sets(0.25) using the given feature as the target 
# TODO: Set a random state. 

from sklearn.model_selection import train_test_split 


X_train, X_test, y_train, y_test = train_test_split(new_data, data['Frozen'], test_size = 0.25, random_state = 1) 

# TODO: Create a decision tree regressor and fit it to the training set 

from sklearn.tree import DecisionTreeRegressor 


regressor = DecisionTreeRegressor(random_state=1) 
regressor.fit(X_train, y_train) 

# TODO: Report the score of the prediction using the testing set 

from sklearn.model_selection import cross_val_score 


#score = cross_val_score(regressor, X_test, y_test) 
score = regressor.score(X_test, y_test) 

print score # python 2.x

, 그것은 주어진 점수를 반환합니다 :

-0.649574327334

당신은 SCO를 찾을 수 있습니다 함수 및 implementatioin here 아래 아래 설명 일부 재 :

는 판정 R 예측의^2 계수를 반환. ... 가능한 가장 좋은 점수는 1.0이며 음수 일 수 있습니다 ( 모델이 임의로 악화 될 수 있기 때문에).

아직 전체 개념을 파악할 수 없어이 설명이 저에게는별로 도움이되지 않습니다. 예를 들어 점수가 음수가 될 수있는 이유와 정확히 무엇이 표시되는지 이해할 수 없습니다 (어떤 것이 제곱 된 경우에만 양수 일 수 있음).

이 점수는 무엇을 나타내며 그 이유는 무엇입니까?

기사를 아는 경우 (처음에는 유용 할 수 있습니다)!

출처

2017-09-10 wasabi

시작 : https://en.wikipedia.org/wiki/Coefficient_of_determination 및 http://scikit-learn.org/stable/modules/generated/sklearn.metrics.r2_score. html –

계수의 기초를 이해하면 [this] (https://stats.stackexchange.com/questions/183265/what-does-negative-r-squared-mean)와 [that] (https://stats.stackexchange.com/questions/12900/when-is-r-squared-negative). 그러나 첫 번째 기사는 배경 이야기를 이해하기에 충분합니다. 좋은 소식은, 너무 우선합니다! –

이 문서는 DecisionTreeRegressor이 구현 된 cross_val_score을 실행합니다. scikitlearn DecisionTreeRegressor의 문서를 살펴보십시오. 기본적으로 표시되는 점수는 R^2 또는 (1-u/v)입니다. U는 예측의 제곱 된 합 잔차이고, v는 총 제곱합 (제곱의 표본 합)입니다. 정말로 나쁜 예측을 할 때, U 및 V는 합계만큼 작은 소정 0으로 단지 수 있지만

U를/V는, 임의의 클 수 제곱 (> = 0)

출처

2017-09-10 10:28:41 chrisckwong821

R^2부터 제외 될 수 잔여 모델이 수평선보다 나쁜 데이터에 적합하면 그 정의 (https://en.wikipedia.org/wiki/Coefficient_of_determination). 기본적으로

R^2 = 1 - SS_res/SS_tot

및 SS_res 및 SS_tot은 항상 긍정적이다. SS_res >> SS_tot 일 경우 음수는 R^2입니다. 이 대답을보십시오 : https://stats.stackexchange.com/questions/12900/when-is-r-squared-negative

출처

2018-01-30 06:16:06

답변

관련 문제