Retraining after Cross Validation with libsvm

machine learning

libsvm

cross validation

model retraining

svm

Retraining after Cross Validation with libsvm

Master System Design with Codemia

Enhance your system design skills with over 120 practice problems, detailed solutions, and hands-on exercises.

Start Practicing Learn More

Introduction

After cross-validation tells you which libsvm hyperparameters perform best, the standard next step is to train one final model on the full training set using those chosen parameters. Cross-validation is for model selection. Deployment or final evaluation should use a retrained model that has seen all available training examples except the final held-out test set.

What Cross-Validation Actually Produces

In libsvm, cross-validation is commonly used to compare values of C, gamma, and sometimes kernel type. That process estimates which parameter settings generalize well, but it does not usually produce the final model you want to ship.

For example, a parameter search might show that these settings work best:

kernel: RBF
'C = 8'
'gamma = 0.125'

That result tells you what to train next. It is not itself the final training artifact.

Retrain on the Full Training Set

Once you choose the hyperparameters, retrain using all of the training data. With libsvm, that usually means calling svm-train again without the cross-validation flag.

bash

svm-train -s 0 -t 2 -c 8 -g 0.125 train.scale final.model

That command trains a fresh SVM model on the entire scaled training set and saves the result as final.model.

If you have a separate test set, evaluate only after this retraining step:

bash

svm-predict test.scale final.model predictions.txt

That is the standard workflow because the final model should benefit from all training examples once hyperparameter selection is finished.

Keep Scaling Consistent

SVMs are sensitive to feature scale. If you used scaling before cross-validation, use the same scaling rules for final training and for the test set.

bash

svm-scale -s scale.params train.txt > train.scale
svm-scale -r scale.params test.txt > test.scale

The crucial detail is that the scaling parameters are fitted on the training data and then reused on the test data. Do not recompute scaling statistics on the test set. That leaks information and invalidates the evaluation.

Separate the Roles of Training, Validation, and Test Data

A disciplined workflow has three distinct stages:

training folds used inside cross-validation to compare parameter settings
full training set used for the final retrain after selection
untouched test set used once for final evaluation

This separation matters because otherwise the reported performance number becomes overly optimistic. If you use the test set while tuning parameters, it stops being a real test set.

Why Retraining Is Better Than Keeping a Fold Model

During k-fold cross-validation, each fitted model sees only a fraction of the training data at a time. Any one fold-specific model is therefore missing part of the information available in the overall dataset.

Retraining with the selected hyperparameters lets the final SVM learn from all of the training examples at once. That is why the final deployed model is typically stronger than any single model trained during the cross-validation loop.

Nested Cross-Validation Is a Different Goal

Sometimes people ask whether the cross-validation performance itself should be reported instead of retraining. The answer depends on the goal.

If the goal is an unbiased performance estimate for research, nested cross-validation is often appropriate. If the goal is to build the best final model after choosing hyperparameters, retraining on the full training set is the normal production step.

Those are different tasks:

performance estimation
final model training

Do not confuse them.

Save the Final Model Deliberately

Store the post-selection retrained model as a distinct artifact. That makes it obvious which file corresponds to the deployable model and which files came from temporary tuning experiments.

For example, keep a record of:

training data version
scaling parameters used
chosen C and gamma
final model filename

That small amount of discipline makes later reproduction much easier.

Common Pitfalls

The most common mistake is treating the best fold model from cross-validation as the final model. That model was trained on only part of the data.

Another mistake is forgetting to reuse the exact same scaling parameters when preparing the final training and test sets. SVM results can shift dramatically if scaling is inconsistent.

Teams also use the test set during parameter search and then report the test accuracy as if it were unbiased. Once the test set influences tuning, it is no longer a valid final evaluation set.

Summary

Cross-validation selects good libsvm hyperparameters, but it is not usually the final training step.
After selecting parameters, retrain on the full training set.
Keep scaling identical across tuning, retraining, and testing.
Use a separate untouched test set for final evaluation.
Save the retrained full-data model as the deployable artifact, not a model from one CV fold.