Skip to content

alzmcr/kaggle-Fast-Iron

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 

Repository files navigation

Fast-Iron

Kaggle Blue Book for Bulldozers Competition

Windows 7 64bit on Intel QuadCore with 12GB RAM, Python 2.7 with Pandas, Numpy ,Scikit-Learn 0.13.1

##How to train your model ###How to make predictions on a new test set. Before train make predictions, data need to be pre-processed, step below:

  1. Place the training, appendix and test data in the Data folder
  2. Edit prepare_data.py and change the following line with names of training, appendix and test data
  • trainData = "Data\TrainAndValid.csv"
  • testData = "Data\Test.csv"
  • appendixData = "Data\Machine_Appendix.csv",
  1. Run the script. This will create four files in DataProcessed. This step take about 10-15 minutes depending on machine and file sizes.

PREDICT on Test.csv data

Simply run train_and_predict.py will create the output named current_prediction.csv.
train_and_predict.py is already set to run to recreate the output. gradient boosting regressor are serialized and trained. random forest need to be re-trained (too big to attach). Training the random forests takes 102 minutes.

TRAIN on new data

  1. Edit train_and_predict.py
    To train GB models change trainGB_models to True
    To train RF models change trainRF_models to True
    To save the models, change dumpModels to True

About

Kaggle Blue Book for Bulldozers Competition

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages