Fine-tuning parameters in Logistic Regression

logistic regression

model tuning

machine learning

parameter optimization

predictive modeling

Fine-tuning parameters in Logistic Regression

Master System Design with Codemia

Enhance your system design skills with over 120 practice problems, detailed solutions, and hands-on exercises.

Start Practicing Learn More

Introduction

Logistic regression is often introduced as a simple baseline model, but its behavior depends heavily on a few high-impact hyperparameters. Good tuning can improve accuracy, stability, calibration, and training speed without giving up the model's interpretability.

The key is to tune the parameters that actually change the optimization problem: regularization strength, penalty type, solver, class weighting, and convergence settings. Everything else is secondary until those pieces are under control.

The Parameters That Matter Most

In scikit-learn, the main tuning knobs are:

'C, the inverse of regularization strength'
'penalty, such as l1, l2, or elasticnet'
'solver, which determines what combinations are allowed and how optimization is performed'
'class_weight, which matters on imbalanced data'
'max_iter, which controls how long the solver may run before stopping'

Smaller C means stronger regularization. That usually shrinks coefficients and can reduce overfitting. Larger C relaxes the penalty and lets the model fit the training data more aggressively.

The solver matters because not every solver supports every penalty. For example, saga is the flexible choice when you want l1 or elasticnet, while lbfgs is a strong default for standard l2 logistic regression.

A Practical Tuning Workflow

The example below uses a pipeline so scaling and logistic regression are tuned together. It runs a grid search over several useful combinations.

python

1from sklearn.datasets import load_breast_cancer
2from sklearn.linear_model import LogisticRegression
3from sklearn.model_selection import GridSearchCV, train_test_split
4from sklearn.pipeline import Pipeline
5from sklearn.preprocessing import StandardScaler
6
7X, y = load_breast_cancer(return_X_y=True)
8
9X_train, X_test, y_train, y_test = train_test_split(
10    X, y, test_size=0.2, random_state=42, stratify=y
11)
12
13pipeline = Pipeline([
14    ("scale", StandardScaler()),
15    ("clf", LogisticRegression(max_iter=5000)),
16])
17
18param_grid = [
19    {
20        "clf__solver": ["lbfgs"],
21        "clf__penalty": ["l2"],
22        "clf__C": [0.01, 0.1, 1.0, 10.0],
23        "clf__class_weight": [None, "balanced"],
24    },
25    {
26        "clf__solver": ["saga"],
27        "clf__penalty": ["l1", "l2"],
28        "clf__C": [0.01, 0.1, 1.0, 10.0],
29        "clf__class_weight": [None, "balanced"],
30    },
31]
32
33search = GridSearchCV(
34    pipeline,
35    param_grid=param_grid,
36    cv=5,
37    scoring="roc_auc",
38    n_jobs=-1,
39)
40
41search.fit(X_train, y_train)
42
43print("best params:", search.best_params_)
44print("best cv score:", search.best_score_)
45print("test score:", search.score(X_test, y_test))

This is a good baseline because it avoids invalid solver and penalty combinations while still exploring meaningful regularization choices.

How to Read the Results

If the best C is very small, your data probably benefits from stronger regularization. If the best result repeatedly lands on the largest C in the grid, try larger values because the search may still be regularizing too aggressively.

If class_weight="balanced" wins clearly, that is a signal the class distribution matters enough that an unweighted model was biased toward the majority class. In that situation, also inspect precision, recall, and the confusion matrix instead of relying on accuracy alone.

For sparse, high-dimensional data such as text features, l1 or elasticnet can be especially valuable because they encourage sparse coefficients. For dense tabular data, l2 often remains the strongest simple default.

Tuning Beyond the Default Score

The best logistic regression parameters depend on the metric that matters to the business problem.

If false negatives are expensive, tune for recall or an fbeta score. If ranking quality matters, use ROC AUC or average precision. If you care about probability quality, evaluate calibration as well.

Hyperparameter tuning should therefore be paired with threshold selection. A well-tuned logistic regression model can still make bad business decisions if you keep the default probability cutoff of 0.5 without thinking about the cost of errors.

Common Pitfalls

The most common pitfall is skipping feature scaling. Logistic regression solvers usually behave much better when numeric features are standardized, especially when regularization is involved.

Another mistake is trying invalid parameter combinations. For example, not every solver supports l1 or elasticnet. Let the search space reflect the solver rules instead of generating combinations that only fail at runtime.

People also stop at accuracy on imbalanced data. A model can appear strong while missing the minority class almost completely. Use class-aware metrics and consider class_weight.

Convergence warnings should not be ignored either. They often mean you need more iterations, better scaling, or a different solver. Treat them as useful feedback, not as harmless noise.

Finally, compare against a simple untuned baseline. If tuning adds complexity without improving the metric that matters, the default model may already be good enough.

Summary

The main logistic regression tuning knobs are C, penalty, solver, class_weight, and max_iter.
Smaller C means stronger regularization and often less overfitting.
'lbfgs is a strong default for l2, while saga is useful for l1 and flexible search spaces.'
Always scale numeric features before tuning logistic regression.
Choose the scoring metric based on business cost, not habit.
Watch convergence warnings and compare tuned results against a simple baseline.