
Data cleaning of a dataset about apartments in Moscow

1. Project description
2. Solving case
3. Data summary
4. Stages of the project work
5. Results

Project description

The data comes from a competition on the Kaggle platform, initiated by Sber and held in 2017. Sber's task was to build a model that predicts housing prices in Moscow based on the parameters of a property.

⬆️To contents

Solving case

Prepare the data for model building, which includes:

I. basic analysis of the data structure
II. detection of missing data
III. processing of missing data
IV. outlier detection and cleaning
V. search for and elimination of duplicates

Quality metric
The output should be clean data: duplicates and outliers removed, and the data converted into a form suitable for analysis.

What we practice
Processing and cleaning data to prepare it for further work.

Data summary

The data is a set of 30,471 records of real properties in Moscow and the Moscow region, with attributes that presumably influence the price. For each record, 61 attributes are provided: the total area of the property, living area, district, price, ecology, information about kindergartens, schools and hospitals, distance to the city center, to subway and train stations, to stores, museums, etc.

⬆️To contents

Stages of the project work

  1. basic analysis of the data structure

Analyzing the data structure and determining the necessary transformations.
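This step can be sketched in pandas. The DataFrame below is a toy stand-in; column names such as `full_sq` and `price_doc` are illustrative assumptions, not confirmed names from the Sber dataset.

```python
import pandas as pd

# Toy stand-in for the housing data (hypothetical columns).
df = pd.DataFrame({
    "full_sq": [50.0, 62.0, None, 45.0],
    "life_sq": [30.0, 40.0, 25.0, None],
    "price_doc": [6_000_000, 8_500_000, 5_200_000, 4_800_000],
})

# Basic structure analysis: dimensions, column types, memory footprint.
print(df.shape)              # (rows, columns)
print(df.dtypes)             # which features are numeric vs. object
df.info(memory_usage="deep") # non-null counts and memory usage per column
```

`df.info()` alone already reveals most of the transformations needed: object columns to convert, and columns with non-null counts below the row count.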

  2. detection of missing data

Missing value detection and analysis: identifying entries/features that need to be removed and those that need to be processed.
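A minimal sketch of this step, again with hypothetical column names; the 0.5 removal threshold is an illustrative assumption, not a rule from the project.

```python
import pandas as pd

df = pd.DataFrame({
    "full_sq": [50.0, 62.0, None, 45.0],
    "life_sq": [30.0, None, None, 38.0],
    "sub_area": ["A", "B", "B", None],
})

# Share of missing values per feature, worst first.
missing_share = df.isna().mean().sort_values(ascending=False)
print(missing_share)

# Features missing in more than half the rows are candidates for removal;
# the rest are candidates for imputation.
to_drop = missing_share[missing_share > 0.5].index.tolist()
to_fill = missing_share[(missing_share > 0) & (missing_share <= 0.5)].index.tolist()
```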

  3. processing of missing data

Analysis of missing data, filling numerical features with median values and categorical features with modes.
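The median/mode imputation described above can be written generically, without knowing the exact columns, by splitting on dtype (the column names below are placeholders):

```python
import pandas as pd

df = pd.DataFrame({
    "full_sq": [50.0, 62.0, None, 45.0],   # numeric feature
    "sub_area": ["A", "B", "B", None],     # categorical feature
})

# Numerical features -> median of the observed values.
for col in df.select_dtypes(include="number"):
    df[col] = df[col].fillna(df[col].median())

# Categorical features -> mode (most frequent observed value).
for col in df.select_dtypes(exclude="number"):
    df[col] = df[col].fillna(df[col].mode()[0])
```

Median and mode are preferred over the mean here because both are insensitive to the outliers that the next stage has not yet removed.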

  4. outlier detection and cleaning

Finding and identifying outliers using domain logic, the three-sigma rule, and Tukey's method.
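Both rules can be sketched on a toy series (the values are made up for illustration). Note that on small or heavily skewed samples the extreme value inflates the standard deviation, so the three-sigma rule can miss an outlier that Tukey's interquartile-range fences catch:

```python
import pandas as pd

s = pd.Series([48, 50, 52, 51, 49, 500])  # 500 is an obvious outlier

# Three-sigma rule: flag values more than 3 standard deviations from the mean.
# Here the outlier itself inflates the std, so nothing exceeds |z| > 3.
z = (s - s.mean()) / s.std()
sigma_outliers = s[z.abs() > 3]

# Tukey's method: flag values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR].
q1, q3 = s.quantile(0.25), s.quantile(0.75)
iqr = q3 - q1
tukey_outliers = s[(s < q1 - 1.5 * iqr) | (s > q3 + 1.5 * iqr)]
```

This robustness to the outliers being hunted is why Tukey's method is usually applied alongside, not instead of, the three-sigma rule.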

  5. search and elimination of duplicates

Searching for and deleting duplicate records.
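A minimal sketch of full-row deduplication (toy data, hypothetical columns):

```python
import pandas as pd

df = pd.DataFrame({
    "full_sq": [50, 62, 50],
    "price_doc": [6_000_000, 8_500_000, 6_000_000],  # row 2 repeats row 0
})

# Count fully duplicated rows, then drop them, keeping the first occurrence.
n_dupes = int(df.duplicated().sum())
df = df.drop_duplicates().reset_index(drop=True)
```

Passing a `subset` of key columns to `duplicated()`/`drop_duplicates()` would additionally catch records that differ only in irrelevant fields.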

⬆️To contents

Results

As a result of the work, we obtained a dataset cleared of duplicates, missing data, and outliers.

⬆️To contents
