This repo contains the peer review assignment for Coursera's "Getting and Cleaning Data" course.
Its purpose is to generate a tidy data set from the source data.
To do so, follow these steps:
- Prepare an R environment and install the `reshape2` package
- Download the source data from https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip
- Unzip the source data into this directory
- Run the `run_analysis.R` script
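
The setup steps above can be sketched in R roughly as follows. This is a minimal sketch, not part of the repo: the helper name `prepare_and_run`, the local file name `dataset.zip`, and the choice of `exdir = "."` are assumptions made for illustration. Only the URL, the `reshape2` package and the `run_analysis.R` script name come from this README.

```r
# URL taken from the steps above.
data_url <- "https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip"

# Hypothetical helper: installs reshape2 if needed, fetches and unzips the
# data, then optionally runs the analysis script.
prepare_and_run <- function(url = data_url, zip_file = "dataset.zip", run = TRUE) {
  # Install reshape2 only when it is not already available.
  if (!requireNamespace("reshape2", quietly = TRUE)) {
    install.packages("reshape2")
  }
  # Download in binary mode so the zip is not corrupted on Windows.
  if (!file.exists(zip_file)) {
    download.file(url, destfile = zip_file, mode = "wb")
  }
  # Assumption: the script expects the archive to be extracted here.
  unzip(zip_file, exdir = ".")
  if (run) {
    source("run_analysis.R")
  }
  invisible(zip_file)
}

# From the repository root, one would then call:
# prepare_and_run()
```

The call is left commented out so that sourcing the snippet does not trigger a download by accident.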
This repo consists of the following files:
- run_analysis.R : main R script that generates the tidy data.
- CodeBook.md : documentation describing the data in detail.
- README.md : this readme file, giving an overview of the repo.

The run_analysis.R script performs the following steps:
- Read the feature name list
- Read the X, y and subject data for both the test set and the training set
- Name the columns of the X data frame using the feature name list
- Combine the X, y and subject data within the test set and within the training set
- Concatenate the test set and the training set
- Extract the mean and standard deviation features
- Replace each activity id with the corresponding activity name
- Compute the mean of each feature, grouped by activity and subject
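
The processing steps above can be illustrated on a tiny made-up data set. This is only a sketch under stated assumptions, not the actual contents of run_analysis.R: all values, the three feature names, the two activity labels and the helper `make_set` are invented for illustration, and the final grouping uses base R's `aggregate` in place of `reshape2` so that the example runs without extra packages.

```r
# Toy stand-ins for the feature list and activity labels (made-up values).
features <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")
activity_labels <- data.frame(id = 1:2, name = c("WALKING", "SITTING"))

# Hypothetical helper: name the X columns, then bind subject, y and X.
make_set <- function(X, y, subject) {
  X <- as.data.frame(X)
  names(X) <- features
  data.frame(subject = subject, activity = y, X, check.names = FALSE)
}

# Toy test and training sets (matrices are filled column by column).
test_set <- make_set(matrix(c(1, 3, 10, 30, 100, 300), nrow = 2),
                     y = c(1, 1), subject = c(1, 1))
train_set <- make_set(matrix(c(2, 4, 6, 20, 40, 60, 200, 400, 600), nrow = 3),
                      y = c(2, 2, 1), subject = c(2, 2, 2))

# Concatenate the test set and the training set.
combined <- rbind(test_set, train_set)

# Keep only the mean() and std() features.
feature_cols <- grep("mean\\(\\)|std\\(\\)", names(combined), value = TRUE)
selected <- combined[, c("subject", "activity", feature_cols)]

# Replace each activity id with its name.
selected$activity <- activity_labels$name[match(selected$activity,
                                                activity_labels$id)]

# Mean of each feature per (subject, activity) group.
tidy <- aggregate(selected[feature_cols],
                  by = list(subject = selected$subject,
                            activity = selected$activity),
                  FUN = mean)
print(tidy)
```

The result has one row per subject and activity combination and one column per retained feature, which is the shape the last step describes.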