Enable type checking for users and add types to nearly everything #156

dhdaines · 2024-07-06T01:55:28Z

Some of the code (pipeline.py) already had type annotations, but they weren't much use to downstream users because the py.typed marker file was not included in the package, so type checkers ignored them.

This PR adds that file, and also adds some more type annotations.
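As a rough way to confirm that the marker file actually ships with an installed package, something like the following could be used. This is only a sketch: the helper name has_py_typed is made up for illustration and is not part of lunr.

```python
import importlib.util
from pathlib import Path


def has_py_typed(package_name: str) -> bool:
    """Return True if the importable package contains a py.typed marker file.

    Hypothetical helper for illustration only; not part of lunr.
    """
    spec = importlib.util.find_spec(package_name)
    # Plain modules (not packages) have no search locations, and an
    # unknown top-level name yields no spec at all.
    if spec is None or not spec.submodule_search_locations:
        return False
    return any(
        (Path(location) / "py.typed").is_file()
        for location in spec.submodule_search_locations
    )
```

Calling has_py_typed("lunr") against an installed build would then show whether the file made it into the distribution.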

I can't quite figure out how to use Protocol correctly for pipeline functions. In practice they have a fixed signature, and the attributes we add are purely for internal use, so it's "# type: ignore" for the time being.
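For reference, one possible shape for such a Protocol is sketched below. It is not what this PR does. The (token, i, tokens) signature, the label attribute, and the register helper are all assumptions made for illustration, and on Python 3.7 Protocol would have to come from typing_extensions instead of typing.

```python
from typing import Callable, List, Optional, Protocol, cast


class PipelineFunction(Protocol):
    """A callable with a fixed signature plus an attribute set at registration."""

    label: str  # assumed internal attribute, attached by register() below

    def __call__(self, token: str, i: int, tokens: List[str]) -> Optional[str]: ...


def register(
    fn: Callable[[str, int, List[str]], Optional[str]], label: str
) -> PipelineFunction:
    """Attach a label to a plain function and present it as a PipelineFunction."""
    # cast() does nothing at runtime; it only tells the type checker that
    # the function object will carry the extra attribute.
    labelled = cast(PipelineFunction, fn)
    labelled.label = label
    return labelled


def lowercase(token: str, i: int, tokens: List[str]) -> Optional[str]:
    return token.lower()


lowercase_fn = register(lowercase, "lowercase")
```

The cast keeps the single unchecked step inside register, so callers see a typed label attribute without needing their own ignore comments.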

dhdaines · 2024-07-06T03:18:05Z

fixes #158

dhdaines · 2024-07-06T14:08:30Z

I got carried away and added types nearly everywhere.

For CompleteSet there is an issue, since its API is quite strange (though I understand the intent).

For Vector this can't be done efficiently because it mixes str and float everywhere, including in the values, so numeric calculations are simply undefined for string vectors (otherwise we'd have to call isinstance repeatedly, which is super slow).

dhdaines · 2024-07-06T14:10:14Z

Final comment: any good reason why boost needs to be an int? It's a number in lunr.js and it just gets multiplied into a float anyway at some point.
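A minimal sketch of why a float annotation would be enough: under PEP 484, type checkers accept an int wherever a float is expected, so existing callers passing integers would still type-check. The function apply_boost is hypothetical and not part of lunr.

```python
def apply_boost(score: float, boost: float = 1) -> float:
    """Scale a score by a boost factor; int boosts are accepted as floats."""
    return score * boost
```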

lunr/utils.py

lunr/tokenizer.py

lunr/token_set_builder.py

lunr/query.py

yeraydiazdiaz

Thanks for taking the time to do this, I've been meaning to add types for a long time.

The PR will likely need reformatting and I suggest squashing the commits as well, but thanks for separating things so it's easier to review.

yeraydiazdiaz · 2024-09-08T11:37:33Z

lunr/match_data.py

@@ -64,5 +67,7 @@ def add(self, term, field, metadata):
            else:
                self.metadata[term][field][key] = metadata[key]

-    def __eq__(self, other):
+    def __eq__(self, other: object):


Why is this not also MatchData as above?

This is because __eq__ by definition has to accept any object as an argument, even if a comparison with that object is not possible - mypy will give a specific warning about this.

I feel like this should be documented in mypy's documentation somewhere but I can't figure out where! It's also explained here: https://stackoverflow.com/questions/54801832/mypy-eq-incompatible-with-supertype-object
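A minimal sketch of the usual pattern, using a hypothetical Metadata class as a stand-in (this is not lunr's actual MatchData): annotate the parameter as object, narrow it with isinstance, and return NotImplemented for anything else.

```python
from typing import Dict


class Metadata:
    """Hypothetical stand-in for illustration; not lunr's MatchData."""

    def __init__(self, data: Dict[str, int]) -> None:
        self.data = data

    def __eq__(self, other: object) -> bool:
        # object.__eq__ accepts any object, so an override has to as well.
        # Narrow the type before touching Metadata-specific attributes.
        if not isinstance(other, Metadata):
            return NotImplemented
        return self.data == other.data
```

Returning NotImplemented lets Python try the other operand's comparison and finally fall back to an identity check, so comparing against an unrelated type evaluates to False instead of raising.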

lunr/query_lexer.py

dhdaines · 2024-09-09T14:19:35Z

Formatted, type-checked, toxed, squashed and force-pushed!

dhdaines · 2024-09-09T14:21:03Z

> The PR will likely need reformatting and I suggest squashing the commits as well, but thanks for separating things so it's easier to review.

Question - should I rebase the other PRs on top of this one? That will take some editing because of all of the types.

codecov-commenter · 2024-09-09T14:30:53Z

Codecov Report

Attention: Patch coverage is 95.60000% with 11 lines in your changes missing coverage. Please review.

Project coverage is 95.82%. Comparing base (d07b60f) to head (5a2f171).

Files with missing lines	Patch %	Lines
lunr/index.py	82.97%	8 Missing ⚠️
lunr/match_data.py	88.88%	1 Missing ⚠️
lunr/query_parser.py	95.83%	1 Missing ⚠️
lunr/token_set.py	95.83%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #156      +/-   ##
==========================================
- Coverage   96.02%   95.82%   -0.21%     
==========================================
  Files          48       48              
  Lines        3171     3257      +86     
==========================================
+ Hits         3045     3121      +76     
- Misses        126      136      +10


dhdaines mentioned this pull request Jul 6, 2024

Missing py.typed #158

Open

dhdaines changed the title ~~Enable type checking for users and add types to basic API~~ Enable type checking for users and add types to nearly everything Jul 6, 2024

feat: enable type checking for users

a44386d

dhdaines force-pushed the py_typed branch from 620e5ca to a44386d Compare September 9, 2024 14:19

fix: maybe maintain python 3.7 compatibility

5c7dcd0

dhdaines added 2 commits September 9, 2024 10:32

fix: use typing-extensions everywhere for 3.7 compat

8f7e942

docs: add changelog entry

5a2f171
