Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset for MsProject and LR model retraining #186

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

AKuederle
Copy link
Contributor

This PR attempts to retrain the model of the Ullrich ML classifier on the MsDataset.

Current state:

Dataset

  • Missing proper cohort labels
  • Need to add loader for clinical info
  • Fix loading missing reference info as None instead of empty array (difference between no information provided and 0 Wbs found)

New Model

  • Script to validate models on TVS
  • Retrained new model on all particpants
  • Updated Algo to use new model
  • New model only HA
  • New model only MS

New model Notes:

  • New Model slightly outperforms old Model on entire TVS dataset
  • However, performance on explcit example data is worse...
  • Most suprising thing is that the new model does not pass the test on the sin-wave dummy data anymore...

This was referenced Nov 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant