Enhance serving evaluation endpoints #595

darrylong · 2024-02-23T09:41:53Z

Description

This PR involves 3 changes to the serving functions:

Add metric_user_results to evaluation results as user_result
Modified /evaluate endpoint to accept evaluation data in the form of a json if data is included. Else, it will follow previous behavior of using feedback data added through the /feedback endpoint.
query is removed from reponse

Sample request for /evaluate:

{
    "metrics": ["RMSE()", "NDCG(k=10)"],
    "data": [
        ["123", "1539", 1],
        ["123", "2", 1],
        ["124", "1", 1]
    ]
}

Response:

{
    "result": {
        "NDCG@10": 0.3175294778309396,
        "RMSE": 2.781925109617526
    },
    "user_result": {
        "NDCG@10": {
            "123": 0.20438239758848611,
            "124": 0.43067655807339306
        },
        "RMSE": {
            "123": 2.244862849697699,
            "124": 3.3189873695373535
        }
    }
}

Related Issues

Checklist:

I have added tests.
I have updated the documentation accordingly.
I have updated README.md (if you are adding a new model).
I have updated examples/README.md (if you are adding a new example).
I have updated datasets/README.md (if you are adding a new dataset).

…ndpoint

qtuantruong · 2024-02-23T18:24:59Z

I think we can remove "query" from the response to minimize the bandwidth because "data" could be quite significant.

qtuantruong · 2024-02-23T18:30:00Z

Also, the returned user_result contains mapped user indices. Shall we try to map them back to the original user IDs, using the mapping in train_set, for it to be consistent with data in the request?

darrylong · 2024-02-27T09:14:28Z

Also, the returned user_result contains mapped user indices. Shall we try to map them back to the original user IDs, using the mapping in train_set, for it to be consistent with data in the request?

Now using mapped user indices, new response is updated in main post.

…esponse

Include metric_user_results in evaluation response, added eval json e…

7bcab56

…ndpoint

darrylong added the feature New feature/enhancement request label Feb 23, 2024

darrylong requested a review from qtuantruong February 23, 2024 09:41

darrylong self-assigned this Feb 23, 2024

darrylong added 2 commits February 26, 2024 14:56

Remove query from response

12aa123

Utilize mapped inversed user id map to get original id in response

1422161

darrylong changed the title ~~Enhance Serving Evaluation endpoints~~ Enhance serving evaluation endpoints Feb 27, 2024

darrylong and others added 2 commits February 27, 2024 17:24

Update serving test case to remove 'query' and add 'user_result' in r…

1f5d0ee

…esponse

simplify user ID mapping

6b86d62

qtuantruong approved these changes Feb 27, 2024

View reviewed changes

darrylong added 5 commits February 29, 2024 17:33

Combined evaluation and evaluation_json endpoints

6267719

Updated abort responses to show plaintext instead of html

2d299ef

Added unit test cases

43257dd

Updated error response for empty data

f56314a

Added unit tests for provided data evaluation

bf7b766

darrylong marked this pull request as ready for review February 29, 2024 16:07

tqtg added 2 commits February 29, 2024 13:56

Update app.py

b61adc0

Update test_app.py

bab99d6

darrylong merged commit 37db3c9 into PreferredAI:master Mar 1, 2024

darrylong deleted the enhance-serving-eval branch March 1, 2024 06:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enhance serving evaluation endpoints #595

Enhance serving evaluation endpoints #595

Uh oh!

darrylong commented Feb 23, 2024 •

edited

Loading

Uh oh!

qtuantruong commented Feb 23, 2024

Uh oh!

qtuantruong commented Feb 23, 2024 •

edited

Loading

Uh oh!

darrylong commented Feb 27, 2024

Uh oh!

Uh oh!

Enhance serving evaluation endpoints #595

Enhance serving evaluation endpoints #595

Uh oh!

Conversation

darrylong commented Feb 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issues

Checklist:

Uh oh!

qtuantruong commented Feb 23, 2024

Uh oh!

qtuantruong commented Feb 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

darrylong commented Feb 27, 2024

Uh oh!

Uh oh!

darrylong commented Feb 23, 2024 •

edited

Loading

qtuantruong commented Feb 23, 2024 •

edited

Loading