tidy/CodeBook.md at master · SilverStats/tidy

The column names are the measures dealing with mean and standard deviation, as explained in study_features_info.txt. The names, taken from study_features.txt, were modified to eliminate unnecessary characters like parentheses.

The activities have been named according to study_activity_labels.txt.

More detailed information is available in STUDY_README.txt.

My transformations are as follows:

changed activity numbers to activity names as coded by activity_labels.txt
added activity and subject information to the test and training datasets
merged the two datasets into one
extracted only the columns dealing with mean and standard deviation (using grep to find "mean" or "std" in the variable names)
renamed the columns to reflect the information they contain, using the key provided in features.txt
melted the data to get observations for each activity/subject combination
recast the data and calculated the average for each combination
wrote the resulting data set to a text file

The result is a single, tidy dataset, containing the averages for measurements dealing with population means and standard deviations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FilesExpand file tree

CodeBook.md

Latest commit

History

CodeBook.md

File metadata and controls