-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH v2] Add partial fit to the correct branch for decisiontreeclassifier #54
Conversation
Signed-off-by: Adam Li <[email protected]>
Signed-off-by: Adam Li <[email protected]>
with nogil: | ||
while not update_stack.empty(): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@PSSF23 does this code affect the normal building process not via partial_fit
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it shouldn't. update_stack only exists when the initial_roots has content.
The CI errors seem related to the code introduced: https://dev.azure.com/neurodata/neurodata/_build/results?buildId=590&view=logs&j=dde5042c-7464-5d47-9507-31bdd2ee0a3a&t=4bd2dad8-62b3-5bf9-08a5-a9880c530c94 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@adam2392 Yes, the classes
param was not properly passed in by the CI, so it was expecting a different kind of error. We can remove the _check_partial_fit_first_call
check, but then there might be other errors associated with partial_fit
api.
By the way, the performance is improved with the new branch, about the same as streaming tree on its own now. So previously the inferior results might be due to some bug in submodulev2
.
Can you elaborate? Perhaps we can move the check to a better place and then have everything pass? |
It appears that they are testing all methods with fitting functions ( The checks for |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For unit tests, sklearn
had built-in checks for partial_fit
from other streaming algorithms, so I didn't prepare any extra ones. We can add new ones as we see fit, but the existing ones should cover most situations.
Signed-off-by: Adam Li <[email protected]>
I see. I took a look and all you have to do is validate the input parameters before running the partial fit check: 63637f7 |
Signed-off-by: Adam Li <[email protected]>
Do you want to add the support for |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@adam2392 Thanks for all the improvements. I need to work on simulations for the cancer project this week so feel free to go ahead. We also found some potential bugs on honest code so I might need to revisit it.
Signed-off-by: Adam Li <[email protected]>
Kay merged! @PSSF23 feel free to open up a PR to enable this for the |
…fier (#54) Supersedes: #50 Implements partial_fit API for all classification decision trees. --------- Signed-off-by: Adam Li <[email protected]> Co-authored-by: Haoyin Xu <[email protected]>
Supersedes: #50
Reference Issues/PRs
What does this implement/fix? Explain your changes.
Any other comments?