This repository is a collection of Python practice projects covering web crawling, data automation, AI integration, data analysis, and visualization. Its modular examples demonstrate real-world tasks such as scraping popular websites, processing stock and financial data, calling the OpenAI API, and analyzing datasets with pandas and matplotlib.
Language: Python 3.x
Web Crawling: Selenium, BeautifulSoup, requests
Automation & Scheduling: time, threading, os
Data Handling: json, dotenv, pandas, openpyxl
Visualization: matplotlib
AI & LLMs: openai, LangChain
Excel & Image Processing: Pillow (PIL), xlsxwriter
Amazon Website Crawler: Extract product and context data from Amazon using requests and HTML parsing.
Instagram User Info Crawler: Automate login and extract user content with environment variables for secure auth.
Naver Auto Login & Blog Crawler: Automate login, handle CAPTCHA, and scrape blog content using scrolling logic.
CoinOne Cryptocurrency Price Crawler: Real-time price scraping and storage in JSON.
Korea Stock Price Crawler: Scrape and parse South Korean stock market data with multi-threaded logic.
Multithreading Practice: Apply threading to speed up crawling and reduce blocking I/O time.
OpenAI Vision & Text Integration: Use gpt-4, dall-e, and vision APIs to analyze images and generate text.
Data Analysis with Pandas: Clean, group, and analyze data with pandas and visualize with matplotlib.
Regression Analysis: From basic linear regression to advanced model comparison using datasets.
Object-Oriented Programming: Class structure and object management examples (CreateObject.py).
File I/O and Management: Read/write handling for .txt, .json, .xlsx and directory ops.
Image Resizing Automation: Process and resize images using the PIL library.
Time-Based Execution & Control: Scripts using time.sleep(), timing execution, or simulating delays.
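
As a rough illustration of the multithreading entry above, the sketch below runs several blocking "requests" in parallel threads. It is not taken from MultiThread.py: the `fetch` and `crawl_all` helpers and the ticker symbols are hypothetical, and `time.sleep` stands in for a real network call so the example runs offline.

```python
import threading
import time


def fetch(symbol, results, lock):
    """Hypothetical stand-in for one blocking network request."""
    time.sleep(0.1)  # simulates waiting on I/O; a real crawler would send an HTTP request here
    with lock:  # guard the shared dict while writing
        results[symbol] = f"fetched {symbol}"


def crawl_all(symbols):
    """Start one thread per symbol and collect the results in a shared dict."""
    results = {}
    lock = threading.Lock()
    threads = [
        threading.Thread(target=fetch, args=(symbol, results, lock))
        for symbol in symbols
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()  # wait until every worker has finished
    return results


if __name__ == "__main__":
    start = time.perf_counter()
    data = crawl_all(["AAA", "BBB", "CCC"])  # placeholder symbols
    elapsed = time.perf_counter() - start
    print(data)
    print(f"elapsed: {elapsed:.2f}s")
```

Because the threads wait on their simulated I/O at the same time, the total run time should stay close to a single delay rather than the sum of all three.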
Python-Study-Repository/
│
├── Ai/  # OpenAI API and translation-related scripts
│   ├── .env
│   ├── english.xlsx  # Input Excel file for translation
│   ├── OpenAi_img.py  # OpenAI Vision API (image-to-text)
│   ├── OpenAi_text.py  # OpenAI Text API usage
│   ├── output.xlsx  # Output Excel file with translated results
│   ├── test.jpg  # Test image for Vision API
│   ├── Translate_exel.py  # Excel translation handler
│   ├── Translate_http.py  # HTTP-based translation script
│   └── Translate.py  # General-purpose translation logic
│
├── Langchain/  # LangChain-based LLM experiment scripts
│   └── Langchain_1.py  # Sample test using LangChain + ChatModel
│
├── Pandas/  # Data analysis using pandas
│   ├── credit.csv  # Sample dataset
│   ├── Pandas_1.py  # Basic DataFrame operations
│   ├── Pandas_2.py  # Grouping and aggregation
│   ├── Pandas_3.py  # Cleaning and filtering
│   ├── PandasAnalyze.py  # Custom analysis logic
│   └── product.xlsx  # Excel-based product dataset
│
├── Visualization/  # Data visualization and regression analysis
│   ├── california_housing.csv  # Dataset for regression examples
│   ├── income.txt  # Example dataset (income)
│   ├── matplot_Graph.py  # Graph drawing using matplotlib
│   ├── regression_analysis_1.py  # Basic linear regression analysis
│   ├── regression_analysis_2.py  # Multiple regression
│   ├── regression_analysis_3.py  # Regression using scikit-learn
│   ├── regression_analysis_4.py  # Comparison of regression metrics (R², MSE)
│   ├── regression_analysis_5.py  # Comparing different regression models
│   └── StockData.py  # Regression visualization on stock data
│
├── WebCrawler/  # Web scraping scripts by target site
│   ├── Amazon/
│   │   ├── WebContext.txt  # Crawler context or notes
│   │   └── WebsiteCrawle.py  # Amazon crawler
│   │
│   ├── Instagram/
│   │   ├── .env
│   │   ├── .gitignore
│   │   └── InstagramCrawle.py  # Instagram user data crawler
│   │
│   ├── Naver/
│   │   ├── .env
│   │   ├── .gitignore
│   │   ├── BlogScrollCrawler.py  # Naver blog scroll crawler
│   │   └── Logincaptcha.py  # Login + CAPTCHA automation
│   │
│   └── Stock/
│       ├── CoinCrawler.py  # Coin market crawler
│       ├── CrawlerResult.txt  # Example output of crawling
│       ├── MultiThread.py  # Web crawling using multithreading
│       ├── project.py  # Stock crawling project entry
│       ├── StockCrawler.py  # Stock data crawler
│       ├── test.json  # Sample output in JSON format
│       └── WebCrawler.py  # General-purpose web crawler
│
├── Others/  # Miscellaneous files
│
├── testFile/  # Temporary test scripts or data
├── testFile2/  # Another test directory
│
├── CreateObject.py  # Object-oriented programming example
├── FileControll.py  # File read/write control
├── ImageResizing.py  # Image resize operations
├── TimeProcess.py  # Script demonstrating time-related operations
│
├── .gitignore
└── README.md
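
The `.env` files in the tree above hold credentials for the login-based crawlers. The sketch below shows one way a script might read such values from environment variables. It is an assumption, not the repository's actual code: the `load_credentials` helper and the `INSTAGRAM_USERNAME` / `INSTAGRAM_PASSWORD` variable names are hypothetical. With python-dotenv installed, calling `load_dotenv()` first would copy the entries of a `.env` file into `os.environ`; the example itself uses only the standard library.

```python
import os


def load_credentials(prefix):
    """Read <PREFIX>_USERNAME and <PREFIX>_PASSWORD from the environment.

    The variable naming scheme is a hypothetical convention for this sketch.
    """
    username = os.environ.get(f"{prefix}_USERNAME")
    password = os.environ.get(f"{prefix}_PASSWORD")
    if not username or not password:
        raise RuntimeError(
            f"Set {prefix}_USERNAME and {prefix}_PASSWORD before running"
        )
    return username, password


if __name__ == "__main__":
    # Demo values only; in practice these come from the shell or a .env loader.
    os.environ.setdefault("INSTAGRAM_USERNAME", "demo_user")
    os.environ.setdefault("INSTAGRAM_PASSWORD", "demo_pass")
    user, _ = load_credentials("INSTAGRAM")
    print(f"Loaded credentials for {user}")
```

Failing fast when a variable is missing keeps a crawler from attempting a login with empty credentials, and keeping `.env` listed in `.gitignore` keeps the secrets out of version control.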
This project is open for educational and personal use. No specific license is applied.