TLDR-369 class for full dedoc pipeline running #300

Merged
merged 6 commits into develop from TLDR-369_dedoc_pipeline
Aug 1, 2023

Conversation

NastyBoget
Collaborator

  • some refactoring;
  • remove version parameter from metadata extractors, structure constructors and parsed document methods;
  • add version file and version resolving for the library;
  • add recursive handling of attachments;
  • add parameter for saving attachments in a custom directory;
  • remove dedoc threaded manager;
  • update the documentation to reflect these changes.
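
To illustrate the two attachment-related items above (recursive handling and a custom save directory), here is a minimal, self-contained sketch. It does not use dedoc's API: `Attachment`, `save_attachments`, `attachments_dir`, and `max_depth` are hypothetical names chosen for this example, and the depth limit is an assumption about how such recursion might be bounded.

```python
import tempfile
from dataclasses import dataclass, field
from pathlib import Path
from typing import List, Optional


@dataclass
class Attachment:
    """Hypothetical stand-in for an embedded file (not a dedoc class)."""
    name: str
    data: bytes
    children: List["Attachment"] = field(default_factory=list)


def save_attachments(
    attachments: List[Attachment],
    attachments_dir: Optional[str] = None,
    max_depth: int = 10,
    _depth: int = 0,
    _prefix: str = "",
) -> List[Path]:
    """Save attachments depth-first and return the paths that were written.

    If attachments_dir is None, a temporary directory is created once
    at the top level and reused for all nested attachments.
    """
    if _depth >= max_depth:
        # Assumed safeguard: stop descending once the depth limit is reached.
        return []

    target = Path(attachments_dir) if attachments_dir else Path(tempfile.mkdtemp())
    target.mkdir(parents=True, exist_ok=True)

    saved: List[Path] = []
    for index, attachment in enumerate(attachments):
        # The prefix encodes the position in the tree, so files with the
        # same name at different levels or under different parents do not clash.
        path = target / f"{_prefix}{index}_{attachment.name}"
        path.write_bytes(attachment.data)
        saved.append(path)
        # Recurse into attachments of this attachment, reusing the same directory.
        saved.extend(
            save_attachments(
                attachment.children,
                str(target),
                max_depth,
                _depth + 1,
                f"{_prefix}{index}_",
            )
        )
    return saved
```

Calling `save_attachments(tree, attachments_dir="/some/dir")` writes every nested attachment into that directory, and passing `max_depth=1` limits the result to top-level attachments only.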

dedoc/dedoc_manager.py (review thread, outdated and resolved)
@dronperminov dronperminov merged commit 31fb470 into develop Aug 1, 2023
2 checks passed
@dronperminov dronperminov deleted the TLDR-369_dedoc_pipeline branch August 1, 2023 11:55
dronperminov added a commit that referenced this pull request Aug 1, 2023
* TLDR-386 pdf auto reader bug (#298)

* TLDR-386 Added features importances

* TLDR-386 added script for txtlayer dataset generation

* TLDR-386 move all data to the cloud

* Review fixes

* exclude version and changelog files (#299)

* TLDR-419 add confidence annotation (#301)

* add new annotation

* add confidence extracting

* add test for confidence annotation

* add confidence annotation to documentation

* fix flake

* add mergeable field for annotation

* review fixes

* TLDR-369 class for full dedoc pipeline running (#300)

* DedocPipeline added (work in progress)

* TLDR-369_dedoc_manager

* TLDR-369 fix documentation and add test for attachments recursion

* TLDR-369 change version saving

* TLDR-369 review fixes

* TLDR-369 added temporary file name

* new version 0.10.0 (#302)

---------

Co-authored-by: Bogatenkova Anastasiya <[email protected]>