Skip to content

Commit 66b9fa8

Browse files
authored
updated
1 parent 67d78a4 commit 66b9fa8

File tree

1 file changed

+9
-9
lines changed

1 file changed

+9
-9
lines changed

reinforcement_learning.html

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -172,15 +172,15 @@ <h2>Introduction</h2>
172172
<div class="section" id="core-idea">
173173
<h2>Core Components of Reinforcement Learning:</h2>
174174
<ul>
175-
<li><strong>Agent:</strong> The learner or decision-maker.</li>
176-
<li><strong>Environment:</strong> Everything the agent interacts with.</li>
177-
<li><strong>State (S):</strong> A representation of the current situation.</li>
178-
<li><strong>Action (A):</strong> All possible moves the agent can take.</li>
179-
<li><strong>Reward (R):</strong> A scalar feedback signal; guides the agent.</li>
180-
<li><strong>Policy (π):</strong> Strategy used by the agent to decide actions.</li>
181-
<li><strong>Value Function (V):</strong> Predicts future rewards.</li>
182-
<li><strong>Q-Function (Q):</strong> Predicts future rewards for action-state pairs.</li>
183-
<li><strong>Model (optional):</strong> Predicts the next state and reward.</li>
175+
<li><strong>Agent: </strong> The learner or decision-maker.</li>
176+
<li><strong>Environment: </strong> Everything the agent interacts with.</li>
177+
<li><strong>State (S): </strong> A representation of the current situation.</li>
178+
<li><strong>Action (A): </strong> All possible moves the agent can take.</li>
179+
<li><strong>Reward (R): </strong> A scalar feedback signal; guides the agent.</li>
180+
<li><strong>Policy (π): </strong> Strategy used by the agent to decide actions.</li>
181+
<li><strong>Value Function (V): </strong> Predicts future rewards.</li>
182+
<li><strong>Q-Function (Q): </strong> Predicts future rewards for action-state pairs.</li>
183+
<li><strong>Model (optional): </strong> Predicts the next state and reward.</li>
184184
</ul>
185185
<h3>Types of Reinforcement Learning</h3>
186186
<ol>

0 commit comments

Comments
 (0)