Fix discount factor to avoid ambiguity with transition probabilities by ethanelasky · Pull Request #9 · BerkeleyAI/textbook

ethanelasky · 2026-01-13T03:18:14Z

Summary

Change discount factor from γ = 0.5 to γ = 0.8 in value iteration and policy iteration examples
Update all calculations and resulting values to reflect the new discount factor

Motivation

The racecar MDP example uses transition probabilities of 0.5 in several places. When the discount factor is also 0.5, the two are easy to confuse: students cannot tell at a glance which 0.5 in the Bellman equations is T(s,a,s') (transition probability) and which is γ (discount factor).
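
For reference, the standard value iteration update is sketched below. The symbols R(s,a,s') and V_k follow the conventional notation and are assumed here, not quoted from the textbook pages this PR touches. T(s,a,s') weights the whole bracket, while γ scales only the future value inside it:

```latex
V_{k+1}(s) \leftarrow \max_{a} \sum_{s'} T(s,a,s') \left[ R(s,a,s') + \gamma \, V_k(s') \right]
```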

For example, with γ = 0.5:

0.5 * [2 + 0.5 * V(s')]  # Which 0.5 is which?

With γ = 0.8:

0.5 * [2 + 0.8 * V(s')]  # Clear: 0.5 is T, 0.8 is γ

This change improves clarity for students learning MDP algorithms.
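
As a rough illustration of the point above, here is a minimal sketch of a single Bellman backup in Python. The helper `q_value`, the state names, the rewards, and the value estimates are all hypothetical, chosen only to show that T and γ play different roles; they are not taken from the textbook's worked example.

```python
# Hypothetical one-step Bellman backup. The states, rewards, and value
# estimates below are illustrative assumptions, not the textbook's numbers.

def q_value(transitions, values, gamma):
    """Return sum over outcomes of T * (R + gamma * V(s'))."""
    return sum(
        prob * (reward + gamma * values[next_state])
        for next_state, prob, reward in transitions
    )

# Assumed outcomes of one action, as (next_state, T, R) triples.
outcomes = [("cool", 0.5, 2.0), ("warm", 0.5, 2.0)]

# Assumed current value estimates V_k(s').
values = {"cool": 2.0, "warm": 1.0}

# Same transition probabilities, two different discount factors.
q_old = q_value(outcomes, values, gamma=0.5)  # T and gamma are both 0.5
q_new = q_value(outcomes, values, gamma=0.8)  # gamma is now distinct from T

print(q_old, q_new)
```

With γ = 0.8, every 0.5 in the expanded sum can only be a transition probability, which is the readability gain this PR is after.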

Changes

mdp/value-iteration.md: Updated γ from 0.5 to 0.8, recalculated all values
mdp/policies-iteration.md: Updated γ from 0.5 to 0.8, recalculated all values

Change the discount factor from γ = 0.5 to γ = 0.8 in both the value iteration and policy iteration examples. This makes it easier for students to match the formula to the worked example and keeps them from confusing the discount factor with the 0.5 transition probabilities used in the racecar example. All example calculations and values are updated to stay correct under the new discount factor.

ethanelasky wants to merge 1 commit into BerkeleyAI:main from ethanelasky:fix-mdp-discount-factor
