This project scrapes the list of largest companies in the United States by revenue from Wikipedia and stores the data in a structured CSV file.
- requests → Fetching the webpage
- BeautifulSoup (bs4) → Parsing & extracting HTML data
- pandas → Storing data in a DataFrame and exporting it to CSV
- Sends an HTTP GET request to the Wikipedia page
- Extracts the companies table using BeautifulSoup
- Stores data into a pandas DataFrame
- Exports the DataFrame to a CSV file
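
The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual script. The article URL, the `wikitable` CSS class, the User-Agent string, the output filename `companies.csv`, and the helper names `fetch_html`, `extract_table`, and `scrape_to_csv` are all assumptions made for the example.

```python
import pandas as pd
import requests
from bs4 import BeautifulSoup

# Assumed URL of the Wikipedia article; adjust if the page has moved.
URL = (
    "https://en.wikipedia.org/wiki/"
    "List_of_largest_companies_in_the_United_States_by_revenue"
)


def fetch_html(url: str) -> str:
    """Download the page and return its HTML (hypothetical helper)."""
    # A descriptive User-Agent is polite; the value here is illustrative.
    headers = {"User-Agent": "largest-companies-scraper/0.1 (example)"}
    response = requests.get(url, headers=headers, timeout=30)
    response.raise_for_status()  # fail loudly on HTTP errors
    return response.text


def extract_table(html: str, table_index: int = 0) -> pd.DataFrame:
    """Parse one table into a DataFrame (hypothetical helper).

    Assumes the target table carries the "wikitable" class and that
    its first row holds the column headers.
    """
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find_all("table", class_="wikitable")[table_index]
    rows = table.find_all("tr")

    columns = [th.get_text(strip=True) for th in rows[0].find_all("th")]
    records = []
    for row in rows[1:]:
        cells = [c.get_text(strip=True) for c in row.find_all(["td", "th"])]
        # Skip rows whose cell count does not match the header.
        if len(cells) == len(columns):
            records.append(cells)
    return pd.DataFrame(records, columns=columns)


def scrape_to_csv(url: str = URL, path: str = "companies.csv") -> pd.DataFrame:
    """Fetch, parse, and export the table; returns the DataFrame."""
    df = extract_table(fetch_html(url))
    df.to_csv(path, index=False)  # index=False omits the row-number column
    return df
```

Calling `scrape_to_csv()` would run all four steps in order. Keeping the parsing in `extract_table` separate from the network call makes it possible to check the parsing logic against a saved HTML snippet without contacting Wikipedia.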
Data source: Wikipedia, "List of largest companies in the United States by revenue"