Reinforcement Learning Interactive Laboratory (강화 학습 인터랙티브 실험실)

A comprehensive, interactive browser-based playground for experimenting with and visualizing various Reinforcement Learning (RL) algorithms in grid-world environments.

Features

Dynamic Environment Editor: Customize grid worlds by placing walls, start/end states, rewards, and penalties.
Multi-Algorithm Support:
- Q-Learning: Off-policy Temporal Difference learning.
- SARSA: On-policy Temporal Difference learning.
- Monte Carlo: Episodic updates based on full returns.
Real-time Visualization:
- Grid World: See the agent move in real-time.
- Value Heatmaps: Color-coded visualization of Q-values (Green for positive, Red for negative).
- Policy Arrows: Visual indicators of the current best action for each state.
- Path Tracing: Visual trail showing the agent's path.
- Episode Statistics: Real-time line chart of cumulative rewards per episode.
Interactive Controls:
- Play, Pause, Step-by-step execution.
- Playback: Replay the last finished episode to analyze behavior.
- Adjustable simulation speed.
- Resettable agent and environment.
Hyperparameter Tuning: Configurable Learning Rate (Alpha), Discount Factor (Gamma), and Exploration Rate (Epsilon).

Demo

To run the project locally:

Clone or Download the repository.
Serve the files using a local web server. Since this project uses ES modules, you cannot simply open index.html file directly in the browser.

Using Python (if installed):
```
# Run in the project root directory
python3 -m http.server 8000
```
Using Node.js (via http-server or similar):
```
npx http-server .
```
Open your browser and navigate to http://localhost:8000.

How to Use

Edit Environment:
- Use the panel on the left to select a tool (Wall, Start, End, Penalty, Reward, Eraser).
- Click or drag on the grid to modify the environment.
Configure Agent:
- Select an algorithm (Q-Learning, SARSA, Monte Carlo).
- Adjust hyperparameters if desired.
Run Simulation:
- Click Start Learning to begin the training process.
- Use the Speed slider to control execution speed.
- Use Step for frame-by-frame analysis (when paused).
Analyze:
- Observe the heatmap developing.
- Watch the episode statistics chart.
- Click Replay Last Episode to review the most recent run.
- Hover over cells to see detailed info (Coordinates, Q-Values).

Project Structure

index.html: Main entry point and layout.
src/: Source code.
- core/: Core RL logic (Environment, Agent, Algorithms).
- vis/: Visualization engine (Canvas rendering).
- ui/: UI management and interaction logic.
assets/: Static assets (CSS).

Technology Stack

JavaScript (ES6+): Core logic and DOM manipulation.
HTML5 Canvas: High-performance grid rendering.
CSS3: Styling and layout.
No external runtime dependencies (Vanilla JS).

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets/css		assets/css
src		src
README.md		README.md
index.html		index.html
package.json		package.json
server.log		server.log

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning Interactive Laboratory (강화 학습 인터랙티브 실험실)

Features

Demo

How to Use

Project Structure

Technology Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Interactive Laboratory (강화 학습 인터랙티브 실험실)

Features

Demo

How to Use

Project Structure

Technology Stack

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages