feat:Alignment during recontruction of image #1657

SkaarFacee · 2024-06-24T15:25:11Z

No description provided.

codecov · 2024-06-24T15:51:52Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.40%. Comparing base (1cea7d8) to head (b912996).
Report is 18 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1657      +/-   ##
==========================================
+ Coverage   96.35%   96.40%   +0.04%     
==========================================
  Files         164      164              
  Lines        7773     7780       +7     
==========================================
+ Hits         7490     7500      +10     
+ Misses        283      280       -3

Flag	Coverage Δ
unittests	`96.40% <100.00%> (+0.04%)`	⬆️


felixdittrich92 · 2024-06-25T07:26:24Z

Hi @SkaarFacee 👋,

Thanks for the PR.

felixdittrich92 · 2024-06-25T07:27:45Z

doctr/utils/reconstitution.py

@@ -38,14 +38,18 @@ def synthesize_page(
    # Draw each word
    for block in page["blocks"]:
        for line in block["lines"]:
+            line_ymin = min(int(round(h * word["geometry"][0][1])) for word in line["words"])


I would suggest the following:

def synthesize_page(
    page: Dict[str, Any],
    draw_proba: bool = False,
    font_family: Optional[str] = None,
    adjust_to_line: bool = False,
) -> np.ndarray:
    """Draw the content of the element page (OCR response) on a blank page.

    Args:
    ----
        page: exported Page object to represent
        draw_proba: if True, draw words in colors to represent confidence. Blue: p=1, red: p=0
        font_size: size of the font, default font = 13
        font_family: family of the font
        adjust_to_line: if True, adjust y coordinates to line geometry

    Returns:
    -------
        the synthesized page
    """
    # Draw template
    h, w = page["dimensions"]
    response = 255 * np.ones((h, w, 3), dtype=np.int32)

    # Draw each word
    for block in page["blocks"]:
        multiline = len(block["lines"]) > 1
        for line in block["lines"]:
            for word in line["words"]:
                # Get absolute word geometry
                (xmin, ymin), (xmax, ymax) = word["geometry"]
                xmin, xmax = int(round(w * xmin)), int(round(w * xmax))
                if multiline and adjust_to_line:
                    ymin = int(round(h * line["geometry"][0][1]))
                    ymax = int(round(h * line["geometry"][1][1]))
                else:
                    ymin, ymax = int(round(h * ymin)), int(round(h * ymax))

This way the user can still decide, and adjusting only makes sense if we have lines (so resolve_lines=True).

SkaarFacee · 2024-06-26

Okay, I will do that right away.
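A minimal, self-contained sketch of the y-coordinate branch from the suggestion above, pulled out into a helper so it can be run in isolation. The helper name `word_box_px` and the sample `block` dictionary are hypothetical and only serve as illustration; the geometry layout (`((xmin, ymin), (xmax, ymax))` in relative coordinates) and the `multiline` / `adjust_to_line` logic are taken from the suggested code.

```python
from typing import Any, Dict, Tuple


def word_box_px(
    word: Dict[str, Any],
    line: Dict[str, Any],
    dims: Tuple[int, int],
    multiline: bool,
    adjust_to_line: bool = False,
) -> Tuple[int, int, int, int]:
    """Return (xmin, ymin, xmax, ymax) in pixels for one word.

    Hypothetical helper, not part of doctr: it mirrors the branch
    from the suggested synthesize_page body.
    """
    h, w = dims
    (xmin, ymin), (xmax, ymax) = word["geometry"]
    # The horizontal extent always comes from the word itself
    xmin_px, xmax_px = int(round(w * xmin)), int(round(w * xmax))
    if multiline and adjust_to_line:
        # Snap the vertical extent to the enclosing line
        ymin_px = int(round(h * line["geometry"][0][1]))
        ymax_px = int(round(h * line["geometry"][1][1]))
    else:
        ymin_px, ymax_px = int(round(h * ymin)), int(round(h * ymax))
    return xmin_px, ymin_px, xmax_px, ymax_px


# Made-up sample data: one block with two lines
block = {
    "lines": [
        {
            "geometry": ((0.1, 0.10), (0.9, 0.20)),
            "words": [
                {"value": "Hello", "geometry": ((0.1, 0.12), (0.3, 0.20))},
                {"value": "world", "geometry": ((0.35, 0.10), (0.6, 0.18))},
            ],
        },
        {
            "geometry": ((0.1, 0.30), (0.9, 0.40)),
            "words": [{"value": "again", "geometry": ((0.1, 0.31), (0.3, 0.39))}],
        },
    ]
}
dims = (1000, 800)  # (height, width), as in page["dimensions"]
multiline = len(block["lines"]) > 1

first_line = block["lines"][0]
plain = [word_box_px(wd, first_line, dims, multiline) for wd in first_line["words"]]
snapped = [
    word_box_px(wd, first_line, dims, multiline, adjust_to_line=True)
    for wd in first_line["words"]
]
print(plain)
print(snapped)
```

With `adjust_to_line=True`, both words in the first line share the line's vertical extent, while the horizontal extent stays per word. With the default `adjust_to_line=False`, each word keeps its own box, so existing callers would see no change.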

felixdittrich92 · 2024-06-25T07:31:02Z

doctr/utils/reconstitution.py

                # White drawing context adapted to font size, 0.75 factor to convert pts --> pix
-                font = get_font(font_family, int(0.75 * (ymax - ymin)))
+                ymin, ymax = line_ymin, line_ymax
+                calculate_font_size = int(0.75 * (ymax - ymin))


This still does not work well: the rendered words still overlap.

SkaarFacee · 2024-06-26

That is interesting. I will debug this and see why the issue exists.

SkaarFacee · 2024-06-30

Hey there, what models did you use for these?

felixdittrich92 · 2024-07-01

Hey :)

fast_base and parseq and db_mobilenet_v3_large and crnn_mobilenet_v3_large
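To make the arithmetic in the two diff hunks concrete, here is a small sketch that derives a line's vertical extent from its words (as `line_ymin` does in the first hunk) and then applies the 0.75 pts to pix factor to get a font size. The helper names and sample values are made up for illustration. `line_ymax` is assumed to be computed symmetrically with `max`, since that line of the diff is not shown in this thread.

```python
from typing import Any, Dict, List, Tuple


def line_extent_px(words: List[Dict[str, Any]], h: int) -> Tuple[int, int]:
    """Vertical extent of a line in pixels, derived from its words."""
    # Same expression as line_ymin in the diff
    line_ymin = min(int(round(h * word["geometry"][0][1])) for word in words)
    # Assumption: line_ymax mirrors line_ymin with max (not shown in the thread)
    line_ymax = max(int(round(h * word["geometry"][1][1])) for word in words)
    return line_ymin, line_ymax


def font_size_px(ymin: int, ymax: int) -> int:
    """Font size from box height, using the 0.75 pts to pix factor from the diff."""
    return int(0.75 * (ymax - ymin))


# Made-up sample: two words on the same line, page height of 1000 px
h = 1000
words = [
    {"value": "Hello", "geometry": ((0.1, 0.12), (0.3, 0.20))},
    {"value": "world", "geometry": ((0.35, 0.10), (0.6, 0.18))},
]

line_ymin, line_ymax = line_extent_px(words, h)

# Per-word font size (behaviour before the change)
per_word = [
    font_size_px(int(round(h * wd["geometry"][0][1])), int(round(h * wd["geometry"][1][1])))
    for wd in words
]
# Line-based font size (behaviour in the diff), shared by every word in the line
per_line = font_size_px(line_ymin, line_ymax)

print(line_ymin, line_ymax, per_word, per_line)
```

In this sample the line-based size is larger than either per-word size, because the line extent is the union of the word boxes. Whether that plays a role in the overlap reported above is not established in this thread.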

felixdittrich92 · 2024-10-10T16:44:02Z

#1750

feat:Alignment during recontruction of image

b912996

felixdittrich92 requested changes Jun 25, 2024

felixdittrich92 linked an issue Jun 25, 2024 that may be closed by this pull request

[reconstitution] Improve synthesize output quality #1528

Open

felixdittrich92 added this to the 0.10.0 milestone Jun 25, 2024

felixdittrich92 added the "type: enhancement" (Improvement) and "module: utils" (Related to doctr.utils) labels Jun 25, 2024

felixdittrich92 marked this pull request as draft June 25, 2024 07:34

felixdittrich92 closed this Oct 10, 2024

felixdittrich92 removed this from the 0.10.0 milestone Oct 10, 2024
