GridSearchCV returns a different value
I split my dataset into two parts: a training set and a test set. For now, forget the test set; I use the training set with GridSearchCV from sklearn.model_selection to search for the best parameters for an SVM:
Cs = [0.001, 0.01, 0.1, 1, 10, 100, 1000]
gammas = [0.001, 0.01, 0.1, 1]
# Set the parameters by cross-validation
param_grid = {'kernel': ['rbf'], 'gamma': gammas, 'C': Cs}
clf = GridSearchCV(svm.SVC(), param_grid=param_grid, cv=nfolds, verbose=1)
clf.fit(x_train, labels)
After finding the best C and gamma, I create an SVM and fit it on the same training set that was used for the search:
model = svm.SVC(kernel='rbf', C=clf.best_params_['C'], gamma=clf.best_params_['gamma'])
model.fit(x_train, y_train)
At this point I tried one thing: I called predict() on both the GridSearchCV object and the svm.SVC object:
predicted_label1 = model.predict(x_test)
predicted_label2 = clf.predict(x_test)
and then used classification_report(y_test, predicted_label) to evaluate the two prediction vectors. I expected to get the same values, but that is not what happens. Here is my output:
              precision    recall  f1-score   support
         0.0       0.24      0.97      0.39       357
         1.0       0.00      0.00      0.00       358
         2.0       0.00      0.00      0.00       357
         3.0       0.00      0.00      0.00       357
 avg / total       0.06      0.24      0.10      1429

parameter search finished
training set and test set saved
Create SVM classifier

              precision    recall  f1-score   support
         0.0       0.70      0.63      0.66       357
         1.0       0.89      0.90      0.90       358
         2.0       0.89      0.94      0.91       357
         3.0       0.85      0.88      0.86       357
 avg / total       0.83      0.84      0.83      1429
The first report comes from the GridSearchCV object and the second from the SVC.
Is this normal?
What does GridSearchCV return? Does it refit on the training set that was passed to it?
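For reference, here is a minimal self-contained sketch of the expected behaviour (using sklearn's built-in digits dataset, not the data from the question): with refit=True, which is the default, GridSearchCV refits its best estimator on the full training set, so its predictions should match an SVC refit manually with the best parameters on the same data.

```python
# Sketch (assumed setup on sklearn's digits data): GridSearchCV with
# refit=True should predict exactly like a manually refit SVC, because
# clf.predict delegates to clf.best_estimator_, itself refit on x_train.
from sklearn import datasets, svm
from sklearn.model_selection import GridSearchCV, train_test_split
import numpy as np

digits = datasets.load_digits()
x_train, x_test, y_train, y_test = train_test_split(
    digits.data, digits.target, random_state=0)

param_grid = {'kernel': ['rbf'], 'gamma': [0.001, 0.01], 'C': [1, 10]}
clf = GridSearchCV(svm.SVC(), param_grid=param_grid, cv=3)
clf.fit(x_train, y_train)

# Refit an SVC by hand with the best parameters found above.
model = svm.SVC(kernel='rbf', C=clf.best_params_['C'],
                gamma=clf.best_params_['gamma'])
model.fit(x_train, y_train)

# SVC training is deterministic given identical data and parameters,
# so the two prediction vectors should agree.
print(np.array_equal(clf.predict(x_test), model.predict(x_test)))
```

If this prints False on your data, the two fits were not given identical training arrays.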
python machine-learning scikit-learn svm grid-search
Should be the same... But you made lots of edits to your question, so are you sure the training data is the same between the CV and the final SVC? (train_set vs x/y_train?)
– Matthieu Brucher
Nov 15 '18 at 21:12
yes it is the same
– BossShell
Nov 15 '18 at 21:13
I fixed the text. What do you think I'm doing wrong?
– BossShell
Nov 15 '18 at 21:16
can you add some data?
– seralou
Nov 15 '18 at 23:26
In GridSearchCV you used x_train, labels and in SVC you used x_train, y_train. Are you sure they are exactly the same? Can you make simple reproducible code with some data if possible? I am getting the same scores on sklearn's built-in datasets.
– Vivek Kumar
Nov 16 '18 at 6:43
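Following that suggestion, the first thing worth verifying is that the two target arrays really are identical. A quick sketch (the arrays below are placeholders standing in for the question's labels and y_train):

```python
# Sanity check sketch: 'labels' and 'y_train' here are placeholders for the
# two target arrays from the question. If either assertion fails, the grid
# search and the final SVC were fit against different targets, which would
# explain the mismatched reports.
import numpy as np

labels = np.array([0., 1., 2., 3.])   # placeholder: array passed to GridSearchCV.fit
y_train = np.array([0., 1., 2., 3.])  # placeholder: array passed to SVC.fit

assert labels.shape == y_train.shape, "arrays have different lengths"
assert np.array_equal(labels, y_train), "same length but different values"
print("labels and y_train are identical")
```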
asked Nov 15 '18 at 21:09 by BossShell (edited Nov 15 '18 at 21:12)