Skip to content

Latest commit

 

History

History
18 lines (13 loc) · 1.11 KB

File metadata and controls

18 lines (13 loc) · 1.11 KB

The column names are the measures dealing with mean and standard deviation, as explained in study_features_info.txt. The names, taken from study_features.txt, were modified to eliminate unnecessary characters like parentheses.

The activities have been named according to study_activity_labels.txt.

More detailed information is available in STUDY_README.txt.

My transformations are as follows:

  • changed activity numbers to activity names as coded by activity_labels.txt
  • added activity and subject information to the test and training datasets
  • merged the two datasets into one
  • extracted only the columns dealing with mean and standard deviation (using grep to find "mean" or "std" in the variable names)
  • renamed the columns to reflect the information they contain, using the key provided in features.txt
  • melted the data to get observations for each activity/subject combination
  • recast the data and calculated the average for each combination
  • wrote the resulting data set to a text file

The result is a single, tidy dataset, containing the averages for measurements dealing with population means and standard deviations.