Skip to content

kachiyahiren/CodeAlpha_WebScraping

Repository files navigation

CodeAlpha_WebScraping

Project Overview

This project was completed as part of the CodeAlpha Data Analytics Internship.

The objective of this project is to collect data from a website using web scraping techniques and perform data analysis to generate meaningful insights.

Technologies Used

  • Python
  • Requests
  • BeautifulSoup
  • Pandas
  • Matplotlib
  • VS Code

Dataset Information

Total Records: 1000

Columns:

  • Title
  • Price
  • Rating

Project Workflow

Website Scraping

Dataset Creation

Data Cleaning

Statistical Analysis

Data Visualization

Business Insights

Key Findings

  • Average Price: 35.07
  • Maximum Price: 59.99
  • Minimum Price: 10.00
  • Most Common Rating: One Star

Visualizations

Rating Distribution

Rating Distribution

Price Distribution

Price Distribution

Project Structure

CodeAlpha_WebScraping/

├── data/

│ └── books.csv

├── screenshots/

│ ├── Rating_Distribution.png

│ └── Price_Distribution.png

├── report/

│ └── WebScraping_Report.docx

├── scraper.py

├── analysis.py

└── README.md

Conclusion

This project demonstrates the use of web scraping and data analysis techniques to collect, clean, analyze, and visualize data for decision-making purposes.

CodeAlpha_WebScraping

About

Web Scraping and Data Analysis using Python, BeautifulSoup, Pandas and Matplotlib.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages