
Retrieve all user datasets #65


Merged
13 commits merged into main on Feb 14, 2025

Conversation

@alanzhang25 (Collaborator) commented Feb 12, 2025

@alanzhang25 requested a review from JCamyre on February 12, 2025 at 21:22
@JCamyre (Collaborator) left a comment

Looks solid! A couple tweaks.

@@ -152,6 +152,57 @@ def pull(self, alias: str):
description=f"{progress.tasks[task_id].description} [rgb(25,227,160)]Done!)",
)

def pull_all(judgment_api_key: str) -> dict:
Collaborator:

Make this a method of the EvalDataset class by adding the self parameter and removing the judgment_api_key parameter; the API key is already an EvalDataset attribute.
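
For example, a rough sketch of that refactor (the body is omitted; only the signature change is shown):

    class EvalDataset:
        def __init__(self, judgment_api_key: str):
            self.judgment_api_key = judgment_api_key

        def pull_all(self) -> dict:
            # No judgment_api_key parameter: the key is read from the instance.
            # The actual request logic from the diff would go here.
            ...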

Collaborator (Author):

Oops, I forgot to include part of my change. I actually had a @staticmethod on this, because it didn't make much sense to me for pull_all to be an instance method of EvalDataset when it returns multiple EvalDatasets. Let me know your thoughts on this, though.
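
Roughly, the @staticmethod variant being described (a sketch only; the request body is omitted):

    class EvalDataset:
        @staticmethod
        def pull_all(judgment_api_key: str) -> dict:
            # Returns every dataset belonging to the user, rather than
            # populating a single EvalDataset instance.
            ...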

Contributor:

Quick clarification -- if we are retrieving multiple datasets, shouldn't this be a method under the JudgmentClient object?

Collaborator (Author):

see new changes

    Returns:
        EvalDataset: The retrieved dataset
    """
    return EvalDataset.pull_all(self.judgment_api_key)
Collaborator:

Do something similar here:

        dataset = EvalDataset(judgment_api_key=self.judgment_api_key)
        dataset.pull_all()
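
With that suggestion applied, the client method would end up roughly like the sketch below (the method name and docstring here are placeholders, not the final API):

    def pull_all_user_datasets(self) -> dict:
        """
        Returns:
            dict: All datasets belonging to the user.
        """
        # Instantiate EvalDataset with the client's API key, then delegate.
        dataset = EvalDataset(judgment_api_key=self.judgment_api_key)
        return dataset.pull_all()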

Contributor:

Isn't the point that we're actually getting all user EvalDatasets? Why is this function returning a single EvalDataset?

Collaborator (Author):

see new changes

@alanzhang25 force-pushed the az-all-user-db-endpoint branch from 031caa9 to 160df00 on February 13, 2025 at 20:31
@JCamyre (Collaborator) left a comment

Naming/commenting conventions, as well as an e2etest change.

def test_pull_all_datasets(self, client: JudgmentClient):
    dataset: EvalDataset = client.create_dataset()
    # dataset.add_example(Example(input="input 1", actual_output="output 1"))
    # client.push_dataset(alias="test_dataset_6", dataset=dataset, overwrite=False)
Collaborator:

You need to create a unique dataset (using a random name for the alias), add a few examples and ground truths to it, and then check the count (the count should match how many examples and ground truths you added); see the sketch below.
The issue with the current test is that it relies on everyone who runs the e2etest having test_dataset_6 and test_dataset_7 in their account.
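
A rough sketch of that test, reusing the client calls quoted in the diff; the pull-all method name is a placeholder and may not match the final API:

    import random
    import string

    def test_pull_all_datasets(self, client: JudgmentClient):
        # Random alias so the test does not depend on datasets that already
        # exist in the account running the e2etest.
        alias = "test_dataset_" + "".join(random.choices(string.ascii_lowercase, k=8))

        dataset: EvalDataset = client.create_dataset()
        dataset.add_example(Example(input="input 1", actual_output="output 1"))
        dataset.add_example(Example(input="input 2", actual_output="output 2"))
        # (ground truths would be added similarly)
        client.push_dataset(alias=alias, dataset=dataset, overwrite=False)

        # Placeholder pull-all call; check the pushed dataset is included.
        all_datasets = client.pull_all_user_datasets()
        assert alias in all_datasets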

Contributor:

I think that this file probably belongs in src/judgeval/data/datasets/, right?

Contributor:

We can just reroute the imports

@SecroLoL (Contributor) left a comment

LGTM

@JCamyre (Collaborator) left a comment

LGTM! Thanks for making the changes.

@JCamyre (Collaborator) left a comment

UTs failed; please update these three tests to reflect how we shifted the responsibility of pushing/pulling to the EvalDatasetClient.

>       dataset.pull("test_alias")
E       AttributeError: 'EvalDataset' object has no attribute 'pull'
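
Roughly, the updated calls would look like the sketch below (the EvalDatasetClient constructor and arguments shown are assumptions):

    # Before (fails): pull() no longer exists on EvalDataset.
    # dataset.pull("test_alias")

    # After: route the pull through the EvalDatasetClient instead.
    eval_dataset_client = EvalDatasetClient(judgment_api_key=judgment_api_key)
    dataset = eval_dataset_client.pull("test_alias")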


@alanzhang25 force-pushed the az-all-user-db-endpoint branch from 1d3eb9b to 168180c on February 14, 2025 at 01:58
@alanzhang25 (Collaborator, Author) left a comment

Run UT

@alanzhang25 merged commit d7cc3a9 into main on Feb 14, 2025
3 checks passed
@alanzhang25 deleted the az-all-user-db-endpoint branch on March 25, 2025 at 19:28