Why is there a significant difference between the r2_score
function in scikit-learn and the formula for the Coefficient of Determination as described in Wikipedia? Which is the correct one?
I'm using with Python 3.5 to predict linear and quadratic models, and one of the measures of goodness of fit that I'm trying out is the . However, while testing, there's a marked difference between the r2_score
metric in scikit-learn
and the calculation provided in Wikipedia.
I'm providing my code here as reference, which computes the example in the Wikipedia page linked above.
from sklearn.metrics import r2_score import numpy y = [1, 2, 3, 4, 5] f = [1.9, 3.7, 5.8, 8.0, 9.6] # Convert to numpy array and ensure double precision to avoid single precision errors observed = numpy.array(y, dtype=numpy.float64) predicted = numpy.array(f, dtype=numpy.float64) scipy_value = r2_score(observed, predicted) >>> scipy_value:
As is evident, the scipy
calculated value is -3.8699999999999992
while the reference value in Wikipedia is 0.998
.
Thank you!
UPDATE: This is different from this question about how R^2 is calculated in scikit-learn as what I'm trying to understand and have clarified is the discrepancy between both results. That question states that the formula used in scikit is the same as Wikipedia's which should not result in different values.
UPDATE #2: It turns out I made a mistake reading the Wikipedia article's example. Answers and comments below mention that the example I provide is for the linear, least squares fit of the (x, y) values in the example. For that, the answer in Wikipedia's article is correct. For that, the R^2 calue provided is 0.998. For the R^2 between both vectors, scikit's answer is also correct. Thanks a lot for your help!
I think you have misinterpreted wikipedia. The example on wikipedia does not state:
y=[1,2,3,4,5]
f=[1.9, 3.7, 5.8, 8.0, 9.6]
R^2 = 0.998
Instead, it says that the R^2
for a linear least-squares fit to the data:
x=[1,2,3,4,5]
y=[1.9, 3.7, 5.8, 8.0, 9.6]
is equal to 0.998
Consider this script, which first uses np.linalg.lstsq
to find the least squares fit, and the uses both methods to find an R^2
of 0.998 for both:
import numpy as np
from sklearn.metrics import r2_score
x=np.arange(1,6,1)
y=np.array([1.9, 3.7, 5.8, 8.0, 9.6])
A=np.vstack([x, np.ones(len(x))]).T
# Use numpy's least squares function
m, c = np.linalg.lstsq(A, y)[0]
print m,c
# 1.97 -0.11
# Define the values of our least squares fit
f=m*x+c
print f
# [ 1.86 3.83 5.8 7.77 9.74]
# Calculate R^2 explicitly
yminusf2=(y-f)**2
sserr=sum(yminusf2)
mean=float(sum(y))/float(len(y))
yminusmean2=(y-mean)**2
sstot=sum(yminusmean2)
R2=1.-(sserr/sstot)
print R2
# 0.99766066838
# Use scikit
print r2_score(y,f)
# 0.99766066838
r2_score(y,f) == R2
# True