EE6132-PA1

A Neural Network implementation

Run

To cross-validate:
python main.py --train --cross_validate --hidden_activation <sigmoid or relu> --lr <float> --momentum <float> --model_dir <str: path to directory where model should be stored> --epochs <integer> --splname <str: special name appended to model>
Example: python main.py --train --cross_validate --epochs 8 --hidden_activation relu --lr 0.01 --momentum 0.3 --model_dir ./models/ --splname sample: Initializes 5-fold cross-validation of the model, with 8 epoch of iteration in every fols, with relu activation in hidden layers, having learning rate of 0.01 and momentum of 0.3. The models are saved after every epoch in './models/' directory. The name specified by --splname argument 'sample' is appended to the end of the name by which the model is stored.
To simply train:
python main.py --train --hidden_activation <sigmoid or relu> --lr <float> --momentum <float> --model_dir <str: path to directory where model should be stored> --epochs <integer> --splname <str: special name appended to model>
Everything is same as cross-validation command except that --cross_validation is missing, specifying not to cross validate but to test directly on the test data.
To evaluate model with test data:
python main.py --evaluate --model_path <str: location where model is stored>
To extract features before training: Add --extract_features argument somewhere in the command line
Top n predictions Add --top_predictions k argument somewhere in the command line arguments for evaluating model(3)
Example: python main.py --top_predictions 3 --model_path ./models/model_relu_augment: prints the top 3 predictions made by the model stored in the address './models/model_relu_augment'.

Sample command
python main.py --train --hidden_activation sigmoid --epochs 8 --lr 0.05 --momentum 0.9 --model_dir ./models --splname mom09

Files

main.py: The main file that contains the implementation of experiments carried out. This file will be executed everytime to carry out any experiment.
model.py: Contains the implementation of the Class Model.
functions.py: Contains the implementation of all the functions and gradients used by MLP.
utils.py: Contains all the utility functions needed for experiments
knn_svm.py: Script to run classification of MNIST data using KNN and SVM models.
download_mnist.py: Contains the python script to download mnist data from http://yann.lecun.com/exdb/mnist/
RandomIdx.txt: Contains 20 random indices of test_data. Chosen images for getting top-3 predictions made by all models
data: Directory which contains the dataset
models: Directory in which models are stored after checkpoints.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EE6132-PA1

Run

Files

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
data		data
models		models
results		results
README.md		README.md
RandomIdx.txt		RandomIdx.txt
download_mnist.py		download_mnist.py
functions.py		functions.py
knn_svm.py		knn_svm.py
main.py		main.py
model.py		model.py
report.pdf		report.pdf
utils.py		utils.py

Kaushik-Raghavan/EE6132-PA1

Folders and files

Latest commit

History

Repository files navigation

EE6132-PA1

Run

Files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages