Skip to content

i-sultan/Pricing-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Pricing-Prediction

This is a Udacity machine learning nanodegree project deliverable, please use in accordance to Udacity honor code.

Project Goals

The goal of this project is to apply basic machine learning concepts on data collected for housing prices in the Boston, Massachusetts area to predict the selling price of a new home by performing the following:

  1. Explore the data to obtain important features and descriptive statistics about the dataset.
  2. Split the data into testing and training subsets.
  3. Determine a suitable performance metric for this problem.
  4. Analyze performance graphs for a learning algorithm with varying parameters and training set sizes.
  5. Pick the optimal model that best generalizes for unseen data.
  6. Finally, test this optimal model on a new sample and compare the predicted selling price to the statistics.

Software and Libraries

The following SW was used in the first part of the project:

  • Python 2.7
  • NumPy
  • scikit-learn

In the last part of this project, R was used as an EDA tool:

  • R 3.2.3
  • corrplot
  • GGally

Data Set Source

Data set can be found at: https://archive.ics.uci.edu/ml/datasets/Housing

Final Report and IPython Notebook

Final report and IPython notebook are included in this repository. IPython notebook is straightforward to use, please refer to http://cs231n.github.io/ipython-tutorial/ for a quick tutorial.

About

Udacity project deliverable: Predicting the value of a given house in the Boston real estate market using various statistical analysis tools.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors