ENH Added evaluation #69

merveenoyan · 2022-08-02T14:38:17Z

This PR is a fragment of #26, since #26 is a bit overwhelming. I changed the API and made evaluation a part of the Card class so that users aren't bothered with extra attributes.
@BenjaminBossan @adrinjalali

BenjaminBossan · 2022-08-02T14:57:26Z

@merveenoyan Did you ping us because we should already take a look or just to let us know?

merveenoyan · 2022-08-02T15:03:55Z

@BenjaminBossan I pinged you mainly to look at the UX of the evaluation and let me know if the direction is right.

BenjaminBossan · 2022-08-02T15:57:36Z

Hmm, there's not much I can say about this yet. It is very specific to HF, right? E.g., IIUC, we assume that the metric used by sklearn is the same as the one defined on HF/metrics.

merveenoyan · 2022-08-02T17:00:30Z

@BenjaminBossan Having metrics in the metadata is essentially for the HF Hub, which also passes them to paperswithcode under the associated task and dataset (let's say you have a text-classification model and you'd like to evaluate it on GLUE). This is usually not the case for tabular models IMO, as people split their data on their own. I don't expect heavy usage of this when you publish your model for the outside world, but if someone else wants to benchmark their own models and programmatically access the best metrics, it would be good IMHO.

adrinjalali · 2022-08-02T17:52:10Z

Overall I'd say it's in the right direction. I would change it a bit so that the method accepts the results of a scorer instead of trying to get the name and call it itself. It can be simplified by accepting a float value and a metric name. We probably also want to have it more like:

data description: ...
   metric 1: ...
   metric 2: ...

instead of a list of (data description, metric specs), as it is right now.

But it seems like the eval results can't support that?
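A minimal sketch of the grouped layout described above, assuming results arrive as flat (data description, metric name, float value) records. The helper `group_by_dataset` and the sample values are hypothetical illustrations, not part of skops or modelcards.

```python
from collections import defaultdict


def group_by_dataset(results):
    """Group flat (data description, metric name, value) records by dataset.

    Hypothetical helper for illustration only.
    """
    grouped = defaultdict(dict)
    for data_description, metric_name, value in results:
        # Each data description maps to its own {metric name: value} dict.
        grouped[data_description][metric_name] = value
    return dict(grouped)


# Made-up example values, shown only to illustrate the shape of the data.
flat_results = [
    ("held-out test split", "accuracy", 0.91),
    ("held-out test split", "f1", 0.88),
    ("validation split", "accuracy", 0.93),
]
grouped = group_by_dataset(flat_results)
```

Here each data description appears once and owns its metrics, rather than being repeated in every list entry.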

merveenoyan · 2022-08-03T15:19:17Z

@adrinjalali with #71 I think we can let the user add EvalResults instead, no?

adrinjalali · 2022-08-03T16:16:37Z

Yeah, that would also make sense. I'm impartial on whether to construct EvalResult inside a method or accept one from the user. Note that #71 never really makes the user create a CardData object themselves; the utility function does that.

BenjaminBossan · 2022-08-03T16:27:51Z

I would tend towards not exposing modelcards internals to the user.

merveenoyan · 2022-08-08T16:46:32Z

BTW evaluation metrics can be seen in this repository:
https://huggingface.co/merve/hf_hub_example-d4b188ba-be75-44ed-9c2b-d2cd0640c233

adrinjalali · 2022-08-09T08:23:44Z

skops/card/_model_card.py

+        self : object
+        Card object.
+        """
+        self._eval_results = tabulate(


Should we add to the table or set the table? The current code doesn't allow the user to call this method multiple times.
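A minimal sketch of the "add to" behaviour raised in this comment, assuming metrics are passed as keyword arguments with float values. `MiniCard` and its methods are hypothetical stand-ins, not the actual skops `Card` implementation, and the table is rendered by hand here rather than with `tabulate`.

```python
class MiniCard:
    """Hypothetical stand-in for a model card that accumulates metrics."""

    def __init__(self):
        self._metrics = {}

    def add_metrics(self, **kwargs):
        # Update instead of overwrite, so repeated calls accumulate metrics.
        self._metrics.update(kwargs)
        return self

    def render_metrics(self):
        # Build a small Markdown table from everything added so far.
        lines = ["| Metric | Value |", "|---|---|"]
        lines += [f"| {name} | {value} |" for name, value in self._metrics.items()]
        return "\n".join(lines)


card = MiniCard()
card.add_metrics(accuracy=0.91)  # made-up value
card.add_metrics(f1=0.88)  # a second call keeps the first metric
table = card.render_metrics()
```

Rendering the table only when it is requested, instead of storing a pre-rendered string, is one way to let the method be called several times.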

adrinjalali

Thanks @merveenoyan

BenjaminBossan

This is almost good to go from my point of view. I have a suggestion for better formatting of one of the examples, which you may or may not agree with. However, the test needs to be changed, as I don't think it works as intended.

merveenoyan · 2022-08-10T12:16:57Z

@adrinjalali I addressed your comments.

adrinjalali

Otherwise LGTM.

merveenoyan · 2022-08-10T14:39:57Z

@BenjaminBossan can you review?

BenjaminBossan

Perfect, thanks

merveenoyan added 2 commits August 2, 2022 16:29

evaluate

a26e226

mypy ignore lines

96845b3

merveenoyan added 12 commits August 8, 2022 15:35

changed eval

b040826

Merge branch 'main' into eval_branch

1e52e5e

make style

a75a64b

make style

1179091

doc fix

8c1955b

doc fix

608fcc0

doc fix

41a8563

doc fix

b92a76a

doc fix

850a4ff

ellipsis fix

aa283f5

fixed doctest for good

fd5ff69

fixed doctest for good

5f58065

merveenoyan marked this pull request as ready for review August 8, 2022 14:43

adrinjalali reviewed Aug 9, 2022

merveenoyan and others added 4 commits August 9, 2022 11:44

Update skops/card/_model_card.py

5908828

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Update skops/card/_model_card.py

3fef33c

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Update skops/card/_model_card.py

c010459

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

addressed comments

f6d1738

merveenoyan requested a review from adrinjalali August 9, 2022 10:09

merveenoyan added 2 commits August 9, 2022 12:21

fixed my test

438e89a

updated test name and docs

0e7a058

adrinjalali reviewed Aug 9, 2022

skops/card/_model_card.py (resolved)

skops/card/default_template.md (resolved)

skops/card/_model_card.py (outdated, resolved)

merveenoyan added 4 commits August 9, 2022 15:54

updated test name and docs

2e3a635

updated docs

2301da7

Merge branch 'skops-dev:main' into eval_branch

e557114

added docs

54927a2

merveenoyan requested review from BenjaminBossan and adrinjalali August 9, 2022 14:16

BenjaminBossan requested changes Aug 9, 2022

skops/card/tests/test_card.py (outdated, resolved)

skops/card/_model_card.py (outdated, resolved)

Update skops/card/_model_card.py

aeb6728

Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

merveenoyan mentioned this pull request Aug 9, 2022

ENH Model Examination #26

Closed

adrinjalali reviewed Aug 10, 2022

skops/card/_model_card.py (outdated, resolved)

skops/card/_model_card.py (outdated, resolved)

examples/plot_model_card.py (resolved)

docs/model_card.rst (outdated, resolved)

merveenoyan and others added 3 commits August 10, 2022 14:03

Update skops/card/_model_card.py

6ccea15

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Update skops/card/_model_card.py

d2a4835

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

fixed test and addressed comments

f951ec2

adrinjalali approved these changes Aug 10, 2022

docs/model_card.rst (outdated, resolved)

Update docs/model_card.rst

d384972

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

merveenoyan requested a review from BenjaminBossan August 10, 2022 13:34

adrinjalali approved these changes Aug 10, 2022

BenjaminBossan approved these changes Aug 10, 2022

BenjaminBossan merged commit 2e1b3a6 into skops-dev:main Aug 10, 2022
