Scoring in Gridsearch CV
Master System Design with Codemia
Enhance your system design skills with over 120 practice problems, detailed solutions, and hands-on exercises.
GridSearchCV, a crucial component in the machine learning toolkit, is used to perform hyperparameter tuning, which is the process of searching for the optimal set of hyperparameters for a given model. One of the critical aspects of using GridSearchCV is understanding how scoring works, as it directly influences the evaluation and selection of the best model. This article takes a detailed look into scoring in GridSearchCV, from basic concepts to advanced considerations.
Understanding Scoring
In the context of GridSearchCV, scoring is the strategy used to evaluate the performance of a model using a specific metric or set of metrics. The scoring determines which model is considered the "best" during the search over the hyperparameter space.
Technical Explanation
When performing a parameter search with GridSearchCV, you need to specify a scoring rule. The `scoring` parameter in GridSearchCV accepts several types of input:
- Single `Score` String: Accepts strings like 'accuracy', 'precision', 'recall', etc., which refer to predefined scoring metrics from the `sklearn.metrics` module.
- Callable Function: A user-defined function that takes two parameters: `y_true` and `y_pred`, and returns a floating-point number indicating the performance of the prediction.
- Dictionary of Multiple Metrics: If multiple aspects of model evaluation are needed, a dictionary with the metric names as keys and corresponding scoring rules as values can be provided.
Examples
Consider a simple GridSearchCV example with logistic regression:

