Commit 0bf9e780 authored by Daniel Ecer, committed by GitHub

figure image bounding box annotation for single document (#389)

* enable debug logging for tests

* added cli scaffolding

* extract images from pdf

* fixed type hint

* added bounding box to_list

* converted bounding box to named tuple

* added tests for validate

* implemented bounding box intersection

* implemented finding bounding boxes of single image

* added test for smaller partial image

* added libgl1 for open cv

* linting: use with statement for Popen

* added support for multiple image files

* added support for xml files

* join graphic href with xml dirname

* renamed cv2 to cv

* using ObjectDetectorMatcher

* moved functions to image object matching module

* added TestGetObjectMatch

* added ImageObjectMarchResult

* added test_should_match_smaller_image

* added test_should_match_smaller_rotated_90_image

* fixed typo ImageObjectMatchResult

* moved object_detector_matcher parameter down

* added get_image_list_object_match

* added support for gzipped files

* indent output json

* added annotation file_name

* using sample image as fixture

* reduce size of sample image

* added save_images_as_pdf

* added test using multiple images

* prefer better match based on keyword match count

* added --debug cli arg

* handle case where no homography can be found

* use category based on parent element

* added formula type

* added related_element_id

* added logging tqdm

* added info

* resize larger image to make finding bounding boxes faster

* convert to gray scale to make finding boxes even faster

* made max width height configurable

* cache image features

* increased default max width height

* added tests for get_image_array_with_max_resolution

* allow max width height to be zero

* make grayscale conversion optional

* disabled max resolution by default
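
The bounding box changes listed above (named tuple, to_list, validate, intersection) could look roughly like the following. This is only a minimal sketch: the field names (x, y, width, height), the [x, y, width, height] list order, and the exact behaviour of validate and intersection are assumptions for illustration, not taken from the commit itself.

```python
from typing import List, NamedTuple


class BoundingBox(NamedTuple):
    # Hypothetical field layout; the commit's actual fields may differ.
    x: float
    y: float
    width: float
    height: float

    def to_list(self) -> List[float]:
        # Assumed order: [x, y, width, height]
        return [self.x, self.y, self.width, self.height]

    def validate(self) -> "BoundingBox":
        # Reject boxes with a negative extent; return self to allow chaining.
        if self.width < 0 or self.height < 0:
            raise ValueError("width and height must not be negative: %r" % (self,))
        return self

    def intersection(self, other: "BoundingBox") -> "BoundingBox":
        # Overlap of two axis-aligned boxes; the size is clamped to zero
        # when the boxes do not overlap.
        x1 = max(self.x, other.x)
        y1 = max(self.y, other.y)
        x2 = min(self.x + self.width, other.x + other.width)
        y2 = min(self.y + self.height, other.y + other.height)
        return BoundingBox(x1, y1, max(0, x2 - x1), max(0, y2 - y1))


page_box = BoundingBox(0, 0, 10, 10)
figure_box = BoundingBox(5, 5, 10, 10)
overlap = page_box.intersection(figure_box).validate()
print(overlap.to_list())
```

Using a named tuple keeps the box immutable and comparable by value, which makes it straightforward to assert on in tests.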
parent e113fa00
1379 additions and 56 deletions