Skip to content

Github repository for Coursera's Data Science Specialization

Notifications You must be signed in to change notification settings

fsalhani/datasciencecoursera

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Getting and Cleaning Data Peer-graded Assignment

The repository

In this repository, you can find:

The main script

The R script in this repository reads the data from the train and test files and summarizes it by calculating the mean of each of the values aggregated by subject and activity. The steps it takes are:

  • Read the train and test raw data files
  • Read the label and feature files to make the data easier to read
  • Filter the raw data to keep only the columns that contain mean or standard deviation metrics
  • Merge the test and train datasets into one
  • Change the activity numbers into the correct labels
  • Aggregates the data_frame by activity and subject using the mean() function
  • Creates the tidy_data_set.txt file with the resulting data_frame

About

Github repository for Coursera's Data Science Specialization

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages