NCSA-CAII-Ashby-Prize-Hackathon-Team-6

Challenge: The objective of this project is to create a machine learning model trained on accurate WRF-PartMC data that predicts climate-relevant aerosol properties from only the features that current GCMs can output. Problem Statement

Analyze Data

Normalize Data

To start solving this question, we first needed to normalize our input and output data points to reduce floating point error and to clean up major discrepancies between the norms of each input params.

Method 1: Normalizing Using Mean and STD
1. If value is zero, we replace it with a minimum non-zero value (so we can log).
2. When calculating mean and standard deviation, we use the log(value) to ensure floating point precision .
3. All variables except 'z' are converted to log space.
4. Global mean is subtracted and normalized by standard deviation for each variable.
5. Added cos(Time) as additional feature.
6. Used for MLP and TabNet.
Method 2: Normalizing for Each height
1. Variables are converted to log space similar to method 1.
2. Mean and standard deviation are calculated at each height instead of global.
3. Dataset used for final TabNet model.

Determine Strong correlation Inputs

Next, we visualized the correlation of the input variables with the output variables at a single timestamp, then over the course of the time range given.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.ipynb_checkpoints		.ipynb_checkpoints
ccc		ccc
hackathon_data		hackathon_data
kp		kp
models_s		models_s
.gitignore		.gitignore
README.md		README.md
abs_correlation_t0_v2.png		abs_correlation_t0_v2.png
ezgif.com-gif-maker.gif		ezgif.com-gif-maker.gif
mean.png		mean.png
mle.png		mle.png
std.png		std.png
t_loop_all_z.gif		t_loop_all_z.gif
tabnet.png		tabnet.png
tabnet2.png		tabnet2.png
tabnet3.png		tabnet3.png
z.gif		z.gif
z2cnn001.gif		z2cnn001.gif
z2cnn003.gif		z2cnn003.gif
z2cnn006.gif		z2cnn006.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NCSA-CAII-Ashby-Prize-Hackathon-Team-6

Analyze Data

Normalize Data

Determine Strong correlation Inputs

Develop Models

MLP - Multilayer Perceptrons

TabNet

Gradient Boosting Tree

Get Predictions

Team:

About

Releases

Packages

Contributors 2

Languages

nunu2021/NCSA-CAII-Ashby-Prize-Hackathon-Team-6

Folders and files

Latest commit

History

Repository files navigation

NCSA-CAII-Ashby-Prize-Hackathon-Team-6

Analyze Data

Normalize Data

Determine Strong correlation Inputs

Develop Models

MLP - Multilayer Perceptrons

TabNet

Gradient Boosting Tree

Get Predictions

Team:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages