Skip to content

Latest commit

 

History

History
21 lines (17 loc) · 739 Bytes

File metadata and controls

21 lines (17 loc) · 739 Bytes

Web Scraping Project using Python 🌐

📌 About

This project scrapes the list of largest companies in the United States by revenue from Wikipedia and stores the data in a structured CSV file.

🛠️ Libraries Used

  • requests → Fetching the webpage
  • BeautifulSoup (bs4) → Parsing & extracting HTML data
  • pandas → Storing data in DataFrame

▶️ How It Works

  1. Sends a request to Wikipedia page
  2. Extracts the companies table using BeautifulSoup
  3. Stores data into a pandas DataFrame
  4. Exports the DataFrame to a CSV file

📂 Source

Wikipedia - List of largest companies in the United States by revenue