Skip to content

Experiment: Sampling for Class Imbalance

Shakleen Ishfar edited this page May 5, 2024 · 1 revision

tldr; Not worth it

Experiment

  1. Pink: Over-sample score 1 and 6, under-sample the rest.
  2. Yellow: Over-sample only.

Outcome

Performance for scores 1 and 6 doesn't meaningfully improve, as evidenced by the metrics shown below. In fact it gets worse when doing any sampling. Moreover, performance of other scores also degrades. Training takes longer due to more samples.

QWK Scores

W B Chart 5_5_2024, 9_31_41 AM

Confusion Matrix

W B Chart 5_5_2024, 9_34_33 AM

F1 Score

W B Chart 5_5_2024, 9_35_55 AM

Recall

W B Chart 5_5_2024, 9_36_14 AM

Precision

W B Chart 5_5_2024, 9_36_06 AM

Clone this wiki locally