
Prompting warnings in widget response when the inference doesn't work #96

Open
merveenoyan opened this issue Aug 31, 2022 · 39 comments
@merveenoyan (Contributor)

merveenoyan commented Aug 31, 2022

Hello,

In scikit-learn pipelines, any warning gets appended to the response, even when inference succeeds. I suggest we check whether the prediction and the response look good, and only return warnings if they don't. Otherwise the warning gets prepended on top of the response and breaks the widget. (What I observed was a version mismatch, which I know doesn't happen in production, but I don't think a version mismatch, or any warning at warning level rather than error level, should concern the user if the predictions come back fine.)
(I observed this for the text classification pipeline, since I repurposed code from the tabular pipelines; let me know if this isn't the case. Feel free to ignore this issue if it doesn't make sense.) I think the code below should be refactored.

for warning in record:
    _warnings.append(f"{warning.category.__name__}({warning.message})")

for warning in self._load_warnings:
    _warnings.append(f"{warning.category.__name__}({warning.message})")

if _warnings:
    for warning in _warnings:
        logger.warning(warning)

    if not exception:
        # we raise an error if there are any warnings, so that routes.py
        # can catch and return a non 200 status code.
        ### THIS IS THE PART I COMPLAIN ABOUT :')
        error = {
            "error": "There were warnings while running the model.",
            "output": res,
        }
        raise ValueError(json.dumps(error))
    else:
        # if there was an exception, we raise it so that routes.py can
        # catch and return a non 200 status code.
        raise exception

return res
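For context, the `record` list in the snippet above typically comes from `warnings.catch_warnings(record=True)`. Below is a minimal, self-contained sketch of that capture-then-raise pattern; the `predict_with_warning_check` and `noisy_predict` names are hypothetical, invented for illustration:

```python
import json
import logging
import warnings

logger = logging.getLogger(__name__)


def predict_with_warning_check(predict, data):
    """Run a prediction while recording any warnings it emits; raise if
    any were emitted, mirroring the capture-then-raise pattern above."""
    with warnings.catch_warnings(record=True) as record:
        warnings.simplefilter("always")  # record every warning occurrence
        res = predict(data)

    _warnings = [f"{w.category.__name__}({w.message})" for w in record]
    for msg in _warnings:
        logger.warning(msg)

    if _warnings:
        # Raise so a route handler could catch this and return a
        # non-200 status code, as routes.py does in the snippet above.
        error = {
            "error": "There were warnings while running the model.",
            "output": res,
        }
        raise ValueError(json.dumps(error))
    return res


def noisy_predict(data):
    # Hypothetical model call that warns, e.g. on a version mismatch.
    warnings.warn("Trying to unpickle estimator from an older version")
    return [5, 5, 7]
```

Any single warning during prediction turns an otherwise valid result into an error response, which is the behavior this issue questions.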

WDYT @adrinjalali @BenjaminBossan

@BenjaminBossan (Member)

what I observed was version mismatch, which doesn't happen in the production, I know, but I don't think version mismatch should concern the person if the predictions are returned well or any warning on warning level and not error level

I think this was exactly the intent by @adrinjalali because there is no guarantee that predictions are correct if versions don't match.

@merveenoyan (Contributor, Author)

Do predictions change across sklearn versions? 😅 Does the implementation change?
A better solution would be to still return the predictions and post the warnings below the widget (they could be put in a separate place in the response that the widget understands and renders below the output, or something along those lines).

@BenjaminBossan (Member)

Do predictions change across sklearn versions? 😅 Does the implementation change?

Without knowing all the details, I think for the vast majority of cases, predictions won't change. However, there is a small chance that they do change, possibly leading to completely nonsensical output. Therefore, it's better to be safe and not return the prediction if correctness cannot be guaranteed.

@adrinjalali (Contributor)

I think the solution is already there. The only thing not implemented is that the widget currently doesn't show the returned warnings, even though the api-inference service returns them. So the fix is on the widget side. This is the corresponding issue on the widget side: huggingface/huggingface.js#318

Regarding changing predictions: sometimes the old model won't even load on the new sklearn, and the other way around, and our tests include such a case. HistGradientBoostingClassifier does exactly that between versions 1.0 and 1.1.

@merveenoyan (Contributor, Author)

> Regarding changing predictions: sometimes the old model won't even load on the new sklearn, and the other way around, and our tests include such a case. HistGradientBoostingClassifier does exactly that between versions 1.0 and 1.1.

@adrinjalali But if the predictions are returned, why would we still need it?

@adrinjalali (Contributor)

We return a non-200 status code with the warnings attached, and the user can decide whether they want to use the output or not. Some warnings can be ignored.

@osanseviero (Member)

cc @mishig25 on this discussion on the widget side

@merveenoyan (Contributor, Author)

@adrinjalali @mishig25 Can we make that response parseable by the widget and print the warnings below it? It would help a lot of people trying to debug their widgets (I get a lot of messages about this, so I'd say it's a common problem).

@osanseviero (Member)

Have we gotten lots of messages related to warnings? Usually messages with questions are more related to errors, which we do show in the widget most of the time.

@adrinjalali (Contributor)

I'm not sure what you mean by parseable here, @merveenoyan. They are JSON, so they can easily be parsed.

@osanseviero from the sklearn side we raise quite a few warnings, and it's quite useful for users to see them in the widgets.

@merveenoyan (Contributor, Author)

I can't click "JSON output" here; is it possible we put the warnings somewhere they're not supposed to be? That's what I mean by parseable. @adrinjalali
[screenshot, 2022-09-05]

@adrinjalali (Contributor)

Ah I see. If you call the API directly (using curl, for example), then you get the full output. skops.hub_utils.get_model_output also gives you the full output.

@merveenoyan (Contributor, Author)

@adrinjalali Yes, I know that; that's why I'm talking about the widget itself. I feel (though I'm not sure) that we're putting the warning in the wrong place, so it doesn't show up there.

@adrinjalali (Contributor)

No, we're not; you might want to re-read this one: #96 (comment) 😁

@merveenoyan (Contributor, Author)

@adrinjalali I can fix it after the text widget PR is done (which I'm a bit stuck on due to 503 errors).

@osanseviero (Member)

cc @mishig25 @beurkinger on internal discussion https://huggingface.slack.com/archives/C0314PXQC3W/p1664296775532499

Currently, most of the tabular model widgets are broken. E.g. for https://huggingface.co/julien-c/wine-quality I see

{"error": "There were warnings while running the model.", "output": [5, 5, 7]}

And on closer inspection of the Network tab, I see

error: "{\"error\": \"There were warnings while running the model.\", \"output\": [5, 5, 7]}"
warnings: [,…]
0: "UserWarning(Trying to unpickle estimator Pipeline from version 0.24.2 when using version 1.1.2. This might lead to breaking code or invalid results. Use at your own risk. For more info please refer to:\nhttps://scikit-learn.org/stable/model_persistence.html#security-maintainability-limitations)"
1: "UserWarning(Trying to unpickle estimator SVC from version 0.24.2 when using version 1.1.2. This might lead to breaking code or invalid results. Use at your own risk. For more info please refer to:\nhttps://scikit-learn.org/stable/model_persistence.html#security-maintainability-limitations)"
2: "UserWarning(Trying to unpickle estimator StandardScaler from version 0.24.2 when using version 1.1.2. This might lead to breaking code or invalid results. Use at your own risk. For more info please refer to:\nhttps://scikit-learn.org/stable/model_persistence.html#security-maintainability-limitations)"
3: "UserWarning(X has feature names, but StandardScaler was fitted without feature names)"

This is not a great UX as it's confusing for users. Even if we show the text of the warning in the widget, this will not be super useful for users (not model uploaders). Having such a strict requirement and breaking the widget for anyone without the same pinned version will lead to the widget not working for most users in the long run, which is undesirable. I don't think we can expect all people to use the same sklearn version as the one pinned in the API.

Should we consider showing the predictions, even if there is a mismatch, and expose the warnings below the widget?
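To make the double-encoding above concrete: a client has to decode the `error` field a second time to recover the predictions. A minimal sketch, where the field names follow the Network-tab output above and `parse_error_payload` is a hypothetical helper:

```python
import json


def parse_error_payload(response):
    """Recover predictions and warnings from the current error response.

    `response` mirrors the shape seen in the Network tab above: an
    'error' field that is itself a JSON string, plus a sibling
    'warnings' list.
    """
    inner = json.loads(response["error"])  # second decode of the inner JSON
    return {
        "output": inner.get("output"),
        "message": inner.get("error"),
        "warnings": response.get("warnings", []),
    }


# Example shaped like the wine-quality response above (warnings shortened):
response = {
    "error": '{"error": "There were warnings while running the model.", '
             '"output": [5, 5, 7]}',
    "warnings": ["UserWarning(Trying to unpickle estimator Pipeline ...)"],
}
parsed = parse_error_payload(response)
```

This is part of why the raw payload is confusing to end users: the actual predictions are buried inside a JSON string inside the error field.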

@osanseviero (Member)

Related issue shared by @BenjaminBossan huggingface/huggingface.js#318

@BenjaminBossan (Member)

Should we consider showing the predictions, even if there is a mismatch, and expose the warnings below the widget?

Just to note, this part of the error message, "output": [5, 5, 7], corresponds to the model predictions. But it's not very obvious.

@BenjaminBossan (Member)

This is not a great UX as it's confusing for users. Even if we show the text of the warning in the widget, this will not be super useful for users (not model uploaders). Having such a strict requirement and breaking the widget for anyone without the same pinned version will lead to the widget not working for most users in the long run, which is undesirable. I don't think we can expect all people to use the same sklearn version as the one pinned in the API.

These are some good points. I wonder if we should treat warnings caused by sklearn version mismatch differently, as they are false positives most of the time. Not sure how best to implement this, as we would still want to display the information that something might be wrong, but these warnings are certainly in a different category from, say, warnings about division by zero.

@mishig25

Right now, the response from https://huggingface.co/julien-c/wine-quality is:

{error: '{"error": "There were warnings while running the model.", "output": [5, 5, 7]}', status: 'error'}
which gets rendered as an error (responses with status: 'error' are currently handled that way):

[screenshot]

The question is:
Option 1: should I handle this response differently (i.e. treat it as a successful response and show the output and warnings despite it having status: 'error')?
Option 2: should the API produce output in a different shape? (for example: {output: [3,4,5], warning: 'xyz', status: 'success'})

Option 1 or Option 2?

@BenjaminBossan (Member)

I was wondering if we can change the logic here:

if _warnings:
    for warning in _warnings:
        logger.warning(warning)

    if not exception:
        # we raise an error if there are any warnings, so that routes.py
        # can catch and return a non 200 status code.
        error = {
            "error": "There were warnings while running the model.",
            "output": res,
        }

Right now, if there are any warnings, we just treat it as an error. Perhaps we can make an exception for warnings caused by version mismatch, return a successful response, and add an extra field with the information that there was a version mismatch.
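One way that idea could be sketched. The substring used here to detect a version-mismatch warning is a heuristic based on the warning text shown earlier in this thread ("Trying to unpickle estimator ..."), not something the codebase does today, and `build_response` is a hypothetical helper:

```python
import json

# Heuristic marker: sklearn's version-mismatch warnings contain this text
# (see the UserWarning messages quoted earlier in the thread).
VERSION_MISMATCH_HINT = "Trying to unpickle estimator"


def split_warnings(messages):
    """Separate version-mismatch warnings from all other warnings."""
    mismatch = [m for m in messages if VERSION_MISMATCH_HINT in m]
    other = [m for m in messages if VERSION_MISMATCH_HINT not in m]
    return mismatch, other


def build_response(res, messages):
    """Fail on 'real' warnings, but let version-mismatch warnings
    through as an extra field on an otherwise successful response."""
    mismatch, other = split_warnings(messages)
    if other:
        # Non-version warnings still produce an error response.
        raise ValueError(json.dumps({
            "error": "There were warnings while running the model.",
            "output": res,
            "warnings": other,
        }))
    # Version-mismatch warnings alone yield a successful response,
    # with the information preserved in an extra field.
    return {"output": res, "version_warnings": mismatch}
```

The trade-off, raised by @adrinjalali below, is that version-mismatch warnings are exactly the ones that can signal wrong predictions, so whether they belong in the "ignorable" bucket is the open question.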

@osanseviero (Member)

I think giving a successful response + adding some warning in an extra field, and then having the widget show the successful widget/table, but with a warning below, makes lots of sense to me.

@mishig25

I think giving a successful response + adding some warning in an extra field, and then having the widget show the successful widget/table, but with a warning below, makes lots of sense to me.

I second that. Treating a response with status: 'error' as a success does not seem right.

@adrinjalali (Contributor)

This is not a great UX as it's confusing for users. Even if we show the text of the warning in the widget, this will not be super useful for users (not model uploaders). Having such a strict requirement and breaking the widget for anyone without the same pinned version will lead to the widget not working for most users in the long run, which is undesirable. I don't think we can expect all people to use the same sklearn version as the one pinned in the API.

That's not true. All skops examples pin the sklearn version, and therefore the widget would work. The pinned version wouldn't change over time. There are no guarantees for the outputs to be correct when versions change, and this is not a big deal when users have specified the versions in the config.json file, which is easily done by skops tools.

I think giving a successful response + adding some warning in an extra field, and then having the widget show the successful widget/table, but with a warning below, makes lots of sense to me.

This would lead to users getting wrong results and relying on them, which would be very bad. The output is not a "successful" output in this case.

@mishig25

mishig25 commented Sep 28, 2022

This would lead to users getting wrong results and relying on them, which would be very bad. The output is not a "successful" output in this case.

In that case, should we still show the response in the error box (but with the warnings added as well)?

[screenshot]

@mishig25

Another question I had: right now, the response from https://huggingface.co/julien-c/wine-quality is:

{error: '{"error": "There were warnings while running the model.", "output": [5, 5, 7]}', status: 'error'}

which gets rendered as an error, since responses with status: 'error' are currently handled that way.

Why are there no warnings in the response at the moment?

@adrinjalali (Contributor)

I personally don't have a strong opinion on one of the two options:

  • widget takes the values from output if that key exists in the response json and puts them in the widget as usual, and shows the errors and warnings if there are any, so the user is warned
  • the widget always fails and shows all the warnings and errors as returned by the server and doesn't take the output key from the response to be put in the table.

IIRC @Narsil was very much in favor of the second option.

@adrinjalali (Contributor)

Why there are no warnings in the response at the moment?

That's something which is worth fixing.

@beurkinger

@mishig25 Personally I would go for option 1. If we don't, most of the tabular classification widgets users can try on the website will be broken, which is kind of ridiculous. There's no point in punishing people who just want to see how the widget works / what kind of results they can expect.

I don't know how server responses are shaped, but it would be nice to get the type of the error, so we can give a more useful message (or to return a better error message on the server side).

@mishig25

mishig25 commented Sep 28, 2022

From my side, I am happy to implement the widget for either Option 1 or Option 2.

However, both options need updates:

  • if we go with Option1, we need to attach the warnings in the response (as asked here)
  • if we go with Option2, we need to change the response shape entirely (as suggested here)

I will submit a PR once one of the options is decided and the necessary api-inference changes are made 👍

@beurkinger

Continuing my previous message: simply dumping the response as JSON when we get an error is not very elegant or useful. I think it would be better to have a concise and to-the-point error/warning message, and give the user the opportunity to see the whole response using the "JSON output" button (which is currently deactivated when getting an error).

@BenjaminBossan (Member)

BenjaminBossan commented Sep 28, 2022

Okay, there seems to be consensus around option 1. I will work on that. To be sure, we expect the response to be something like this:

{error: '{"error": "There were warnings while running the model.", "output": [5, 5, 7], "warnings": ["message 1", "message 2", ...]}', status: 'error'}

?
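Assuming that shape, the server side would double-encode the inner payload; a small sketch of producing and round-tripping it (`make_error_response` is a hypothetical helper, not actual api-inference code):

```python
import json


def make_error_response(output, warning_messages):
    """Build the payload in the shape discussed above: the inner error
    is itself a JSON string, with the warnings alongside the message."""
    inner = json.dumps({
        "error": "There were warnings while running the model.",
        "output": output,
        "warnings": warning_messages,
    })
    return {"error": inner, "status": "error"}


# Round-trip: a client must decode the 'error' field a second time.
response = make_error_response([5, 5, 7], ["message 1", "message 2"])
inner = json.loads(response["error"])
```

Keeping `status: 'error'` preserves the non-200 semantics while still giving the widget everything it needs to render the warnings.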

@mishig25

@BenjaminBossan yes 👍

@merveenoyan (Contributor, Author)

merveenoyan commented Nov 16, 2022

This is very irritating (see the issue I linked above).

Below you will see a prettier version of the errors + warnings. I say we iterate over each and raise them separately. On top of this, during _get_output or __init__, if there's an error related to deserializing the JSON, we should raise that too (in case someone edits the JSON and breaks it, like I previously did when editing NaN values).

[screenshot, 2022-11-16]
@Narsil if you could give me an example of where the warnings should be put so that they get surfaced, I can easily implement this.
I'm not in favor of dumping the whole output JSON full of errors and warnings, but if we want that I can do it too (as discussed above).

@adrinjalali (Contributor)

The warnings should now be included in the response, @BenjaminBossan fixed it in #114

We can now show them on the widget side.

@adrinjalali (Contributor)

@merveenoyan is this done? I think the warnings are returned, but not displayed on the widget side, and that still needs to be done?

@osanseviero (Member)

friendly ping @merveenoyan

Reopening this issue in the meantime

osanseviero reopened this Nov 24, 2022
@Narsil (Contributor)

Narsil commented Nov 24, 2022

Let me know if I can help.

@adrinjalali (Contributor)

If I'm not mistaken, @mishig25 can add them now to the widget.
