Skip to content

Actions: JudgmentLabs/judgeval

CI

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
768 workflow run results
768 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

CI
CI #43: submitted by JCamyre
January 16, 2025 01:19 1m 38s
January 16, 2025 01:19 1m 38s
Remove telemetry
CI #42: Pull request #39 opened by SecroLoL
January 15, 2025 23:40 1m 40s remove_telemetry
January 15, 2025 23:40 1m 40s
Add Custom Judge Models for Custom Scorers
CI #41: Pull request #38 opened by SecroLoL
January 15, 2025 01:44 1m 31s add_custom_judge
January 15, 2025 01:44 1m 31s
CI
CI #40: submitted by SecroLoL
January 13, 2025 01:45 1m 49s
January 13, 2025 01:45 1m 49s
CI
CI #39: submitted by JCamyre
January 11, 2025 22:39 1m 38s
January 11, 2025 22:39 1m 38s
Add json correctness scorer
CI #38: Pull request #37 opened by SecroLoL
January 11, 2025 09:48 1m 28s add_json_correctness_scorer
January 11, 2025 09:48 1m 28s
Refactor default judges
CI #37: Pull request #36 opened by SecroLoL
January 11, 2025 06:58 1m 40s refactor_default_judges
January 11, 2025 06:58 1m 40s
CI
CI #36: submitted by SecroLoL
January 10, 2025 22:38 1m 38s
January 10, 2025 22:38 1m 38s
CI
CI #35: submitted by SecroLoL
January 10, 2025 22:37 1m 32s
January 10, 2025 22:37 1m 32s
CI
CI #34: submitted by SecroLoL
January 10, 2025 22:36 1m 38s
January 10, 2025 22:36 1m 38s
CI
CI #33: submitted by SecroLoL
January 10, 2025 22:36 1m 51s
January 10, 2025 22:36 1m 51s
Add developer docs
CI #32: Pull request #35 opened by SecroLoL
January 9, 2025 06:01 5h 49m 1s add_dev_docs
January 9, 2025 06:01 5h 49m 1s
Span-level evals additional features
CI #31: Pull request #34 opened by JCamyre
January 8, 2025 22:29 1m 35s joseph/span-level-evals
January 8, 2025 22:29 1m 35s
Make evaluation run names unique
CI #30: Pull request #33 opened by JCamyre
January 8, 2025 22:09 1m 41s joseph/eval-run-name-uniqueness
January 8, 2025 22:09 1m 41s
CI
CI #29: submitted by JCamyre
January 7, 2025 19:21 1m 31s
January 7, 2025 19:21 1m 31s
CI
CI #28: submitted by JCamyre
January 7, 2025 19:20 1m 38s
January 7, 2025 19:20 1m 38s
CI
CI #27: submitted by SecroLoL
January 7, 2025 00:07 1m 36s
January 7, 2025 00:07 1m 36s
CI
CI #26: submitted by SecroLoL
January 6, 2025 22:18 1m 39s
January 6, 2025 22:18 1m 39s
Add Span Level Evals (multi-step evaluation)
CI #25: Pull request #32 opened by JCamyre
January 6, 2025 19:15 1m 41s joseph/span-level-evals
January 6, 2025 19:15 1m 41s
CI
CI #24: submitted by JCamyre
January 2, 2025 03:15 1m 39s
January 2, 2025 03:15 1m 39s
CI
CI #23: submitted by JCamyre
December 31, 2024 01:57 1m 32s
December 31, 2024 01:57 1m 32s
CI
CI #22: submitted by JCamyre
December 31, 2024 01:19 1m 37s
December 31, 2024 01:19 1m 37s
Add trace ID to datasets, update UTs accordingly
CI #21: Pull request #31 opened by SecroLoL
December 31, 2024 00:55 1m 42s add_trace_to_datasets
December 31, 2024 00:55 1m 42s
Add UT for loading datasets from files
CI #20: Pull request #30 opened by SecroLoL
December 30, 2024 23:51 1m 44s test_dataset_creation
December 30, 2024 23:51 1m 44s
CI
CI #19: submitted by JCamyre
December 27, 2024 04:45 1m 48s
December 27, 2024 04:45 1m 48s
ProTip! You can narrow down the results and go further in time using created:<2024-12-27 or the other filters available.