This repository contains several implementations of multi-armed bandit (MAB) agents applied to a simulated cricket batting scenario. The simulation models a cricket innings where an agent (the batsman) selects among different shot strategies (arms) with the goal of maximizing runs while minimizing the risk of getting out.
In this project, you will find the following agent types, implemented as part of our exploration of MAB strategies:
- **KL-UCB Survival Agent**
  - Description: Uses a KL-divergence based Upper Confidence Bound (KL-UCB) method. The reward is based on survival (i.e., `1 - wicket`), focusing on minimizing dismissals (see the index sketch after this list).
- **Reward-UCB Agents**
  - Variant 1: Reward-UCB (KL) Agent
    - Description: Computes rewards using an efficiency metric, `(1 - p(out)) * avg_runs`, and applies a KL-UCB approach (see the reward sketch after this list).
  - Variant 2: Reward-UCB (Simple) Agent
    - Description: A simpler variant that computes the reward as `runs / 6`.
  - Variant 3: UCB1 Agent
    - Description: Implements the classic UCB1 algorithm, selecting the arm with the highest average reward plus an exploration bonus (see the sketch after this list).
- **Risk-Adjusted Successive Elimination Agent**
  - Description: Computes a risk-adjusted reward (the ratio of expected reward to risk) and progressively eliminates arms whose confidence bounds show them to be suboptimal, so poorly performing strategies are removed from play (see the sketch after this list).
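To make the KL-UCB Survival Agent concrete, below is a minimal sketch of a Bernoulli KL-UCB index computed over the survival reward `1 - wicket`. The class and function names, the bisection tolerance, and the `log t` exploration schedule are illustrative assumptions, not the repository's actual code; the Reward-UCB (KL) variant can reuse the same index with its efficiency reward in place of the survival reward.

```python
# Sketch only: a Bernoulli KL-UCB index over survival rewards (1 - wicket).
# Identifiers (klucb_index, SurvivalKLUCBAgent) are illustrative placeholders.
import math


def bernoulli_kl(p, q, eps=1e-12):
    """KL divergence between Bernoulli(p) and Bernoulli(q)."""
    p = min(max(p, eps), 1 - eps)
    q = min(max(q, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))


def klucb_index(mean, pulls, total_pulls, tol=1e-6):
    """Largest q >= mean such that KL(mean, q) <= log(t) / pulls."""
    if pulls == 0:
        return 1.0  # force every arm to be tried at least once
    budget = math.log(max(total_pulls, 2)) / pulls
    lo, hi = mean, 1.0
    while hi - lo > tol:  # bisection: KL(mean, q) is increasing in q >= mean
        mid = (lo + hi) / 2
        if bernoulli_kl(mean, mid) > budget:
            hi = mid
        else:
            lo = mid
    return lo


class SurvivalKLUCBAgent:
    """Picks the shot whose empirical survival rate has the highest KL-UCB index."""

    def __init__(self, n_arms):
        self.pulls = [0] * n_arms
        self.survivals = [0] * n_arms
        self.t = 0

    def select_arm(self):
        means = [s / p if p else 0.0 for s, p in zip(self.survivals, self.pulls)]
        indices = [klucb_index(m, p, self.t) for m, p in zip(means, self.pulls)]
        return max(range(len(indices)), key=indices.__getitem__)

    def update(self, arm, wicket):
        self.t += 1
        self.pulls[arm] += 1
        self.survivals[arm] += 1 - wicket  # reward = 1 - wicket
```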
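The reward definitions used by the two Reward-UCB variants can be stated directly; the function names below are placeholders for illustration, not the repository's identifiers.

```python
# Sketches of the Reward-UCB reward definitions described above.

def efficiency_reward(p_out, avg_runs):
    """KL variant: (1 - p(out)) * avg_runs, where p(out) is the estimated
    dismissal probability of the shot and avg_runs its average runs per ball."""
    return (1 - p_out) * avg_runs


def simple_reward(runs):
    """Simple variant: runs off the ball divided by 6, keeping the reward in [0, 1]."""
    return runs / 6
```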
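The UCB1 variant follows the classic rule of picking the arm with the highest empirical mean reward plus a `sqrt(2 ln t / n_i)` bonus. The sketch below shows that rule under the assumption of rewards in [0, 1] (e.g. `runs / 6`); the class structure is illustrative.

```python
# Sketch only: classic UCB1 selection with an exploration bonus.
import math


class UCB1Agent:
    def __init__(self, n_arms):
        self.pulls = [0] * n_arms
        self.total_reward = [0.0] * n_arms
        self.t = 0

    def select_arm(self):
        # Pull every arm once before applying the UCB1 formula.
        for arm, n in enumerate(self.pulls):
            if n == 0:
                return arm
        ucb = [
            self.total_reward[i] / self.pulls[i]
            + math.sqrt(2 * math.log(self.t) / self.pulls[i])
            for i in range(len(self.pulls))
        ]
        return max(range(len(ucb)), key=ucb.__getitem__)

    def update(self, arm, reward):
        self.t += 1
        self.pulls[arm] += 1
        self.total_reward[arm] += reward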
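Finally, a minimal sketch of the risk-adjusted successive elimination idea: here "risk" is assumed to be the empirical dismissal probability of a shot (floored to avoid division by zero), the score is expected reward divided by that risk, and an arm is eliminated when its upper confidence bound on the score falls below the best arm's lower bound. The names, the confidence radius, and the risk definition are assumptions for illustration, not the repository's implementation.

```python
# Sketch only: risk-adjusted successive elimination over shot strategies.
import math


class RiskAdjustedSEAgent:
    def __init__(self, n_arms, eps=1e-3):
        self.active = set(range(n_arms))
        self.pulls = [0] * n_arms
        self.reward_sum = [0.0] * n_arms
        self.wickets = [0] * n_arms
        self.t = 0
        self.eps = eps  # floor on risk to avoid division by zero

    def _bounds(self, arm):
        """Lower/upper confidence bounds on the risk-adjusted score of an arm."""
        n = self.pulls[arm]
        if n == 0:
            return -math.inf, math.inf
        expected_reward = self.reward_sum[arm] / n
        risk = max(self.wickets[arm] / n, self.eps)
        score = expected_reward / risk
        radius = math.sqrt(2 * math.log(max(self.t, 2)) / n)
        return score - radius, score + radius

    def select_arm(self):
        # Round-robin over surviving arms: pull the least-sampled one.
        return min(self.active, key=lambda a: self.pulls[a])

    def update(self, arm, runs, wicket):
        self.t += 1
        self.pulls[arm] += 1
        self.reward_sum[arm] += runs / 6  # normalised runs as the reward (assumption)
        self.wickets[arm] += wicket
        self._eliminate()

    def _eliminate(self):
        bounds = {a: self._bounds(a) for a in self.active}
        best_lower = max(lo for lo, _ in bounds.values())
        # Drop arms whose optimistic score cannot reach the best pessimistic score.
        self.active = {a for a, (_, hi) in bounds.items() if hi >= best_lower}
```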