Monte Carlo#
Monte Carlo cross validation (or repeated random subsampling) is a technique that averages the validation score of a learner over a user-defined number of simulations where the learner is trained and tested on random splits of the dataset. The estimated validation score approaches the actual validation score as the number of simulations goes to infinity, however, only a tiny fraction of all possible simulations are needed to produce a pretty good approximation.
Interfaces: Validator, Parallel
Parameters#
| # | Name | Default | Type | Description | 
|---|---|---|---|---|
| 1 | simulations | 10 | int | The number of simulations i.e. random subsamplings of the dataset. | 
| 2 | ratio | 0.2 | float | The ratio of samples to hold out for testing. | 
Example#
use Rubix\ML\CrossValidation\MonteCarlo;
$validator = new MonteCarlo(30, 0.1);
  
    
      Last update: 2021-01-23