Effect of averaging score from many models rather than using just one model
Stage . | Average Individual ROC AUC . | ROC AUC using average confidence score . |
---|---|---|
PN | 0.591 ± 0.0115 | 0.603 |
2-cell | 0.555 ± 0.00779 | 0.575 |
4-cell | 0.567 ± 0.0114 | 0.581 |
8- to 16-cell | 0.585 ± 0.00779 | 0.585 |
Blastocyst | 0.686 ± 0.00937 | 0.680 |
Stage . | Average Individual ROC AUC . | ROC AUC using average confidence score . |
---|---|---|
PN | 0.591 ± 0.0115 | 0.603 |
2-cell | 0.555 ± 0.00779 | 0.575 |
4-cell | 0.567 ± 0.0114 | 0.581 |
8- to 16-cell | 0.585 ± 0.00779 | 0.585 |
Blastocyst | 0.686 ± 0.00937 | 0.680 |
The average individual ROC AUC scores are the average scores on the test set after 50 training iterations each with a different randomly allocated train/validation/test split with 25 embryos in the successful class and 48 embryos in the unsuccessful class for both the test and validation set. The accompanying errors are the standard error across these 50 training attempts. For the ROC AUC using average confidence score, the average confidence score was first calculated from 50 models (using 5-fold cross-validation and 50 training attempts) and these average confidence scores were then used to calculate the ROC AUC.
Effect of averaging score from many models rather than using just one model
Stage . | Average Individual ROC AUC . | ROC AUC using average confidence score . |
---|---|---|
PN | 0.591 ± 0.0115 | 0.603 |
2-cell | 0.555 ± 0.00779 | 0.575 |
4-cell | 0.567 ± 0.0114 | 0.581 |
8- to 16-cell | 0.585 ± 0.00779 | 0.585 |
Blastocyst | 0.686 ± 0.00937 | 0.680 |
Stage . | Average Individual ROC AUC . | ROC AUC using average confidence score . |
---|---|---|
PN | 0.591 ± 0.0115 | 0.603 |
2-cell | 0.555 ± 0.00779 | 0.575 |
4-cell | 0.567 ± 0.0114 | 0.581 |
8- to 16-cell | 0.585 ± 0.00779 | 0.585 |
Blastocyst | 0.686 ± 0.00937 | 0.680 |
The average individual ROC AUC scores are the average scores on the test set after 50 training iterations each with a different randomly allocated train/validation/test split with 25 embryos in the successful class and 48 embryos in the unsuccessful class for both the test and validation set. The accompanying errors are the standard error across these 50 training attempts. For the ROC AUC using average confidence score, the average confidence score was first calculated from 50 models (using 5-fold cross-validation and 50 training attempts) and these average confidence scores were then used to calculate the ROC AUC.
This PDF is available to Subscribers Only
View Article Abstract & Purchase OptionsFor full access to this pdf, sign in to an existing account, or purchase an annual subscription.