- Mar 29, 2022
-
-
Daniel Ecer authored
Error message: ``` TypeError: __init__() got an unexpected keyword argument 'as_tuple' ```
-
- Jan 11, 2022
-
-
dependabot[bot] authored
Bumps [nltk](https://github.com/nltk/nltk) from 3.6.5 to 3.6.7. - [Release notes](https://github.com/nltk/nltk/releases) - [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog) - [Commits](https://github.com/nltk/nltk/compare/3.6.5...3.6.7 ) --- updated-dependencies: - dependency-name: nltk dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Jan 07, 2022
-
-
dependabot[bot] authored
Bumps [types-requests](https://github.com/python/typeshed) from 2.26.1 to 2.26.3. - [Release notes](https://github.com/python/typeshed/releases) - [Commits](https://github.com/python/typeshed/commits ) --- updated-dependencies: - dependency-name: types-requests dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Jan 04, 2022
-
-
dependabot[bot] authored
Bumps [mypy](https://github.com/python/mypy) from 0.910 to 0.930. - [Release notes](https://github.com/python/mypy/releases) - [Commits](https://github.com/python/mypy/compare/v0.910...v0.930 ) --- updated-dependencies: - dependency-name: mypy dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [scikit-image](https://github.com/scikit-image/scikit-image) from 0.18.3 to 0.19.1. - [Release notes](https://github.com/scikit-image/scikit-image/releases) - [Changelog](https://github.com/scikit-image/scikit-image/blob/main/RELEASE.txt) - [Commits](https://github.com/scikit-image/scikit-image/compare/v0.18.3...v0.19.1 ) --- updated-dependencies: - dependency-name: scikit-image dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [gevent](https://github.com/gevent/gevent) from 21.8.0 to 21.12.0. - [Release notes](https://github.com/gevent/gevent/releases) - [Changelog](https://github.com/gevent/gevent/blob/master/docs/changelog_pre.rst) - [Commits](https://github.com/gevent/gevent/compare/21.8.0...21.12.0 ) --- updated-dependencies: - dependency-name: gevent dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Dec 15, 2021
-
-
dependabot[bot] authored
Bumps [types-requests](https://github.com/python/typeshed) from 2.25.11 to 2.26.1. - [Release notes](https://github.com/python/typeshed/releases) - [Commits](https://github.com/python/typeshed/commits ) --- updated-dependencies: - dependency-name: types-requests dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [opencv-python](https://github.com/skvark/opencv-python) from 4.5.3.56 to 4.5.4.60. - [Release notes](https://github.com/skvark/opencv-python/releases) - [Commits](https://github.com/skvark/opencv-python/commits ) --- updated-dependencies: - dependency-name: opencv-python dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [pip](https://github.com/pypa/pip) from 21.3 to 21.3.1. - [Release notes](https://github.com/pypa/pip/releases) - [Changelog](https://github.com/pypa/pip/blob/main/NEWS.rst) - [Commits](https://github.com/pypa/pip/compare/21.3...21.3.1 ) --- updated-dependencies: - dependency-name: pip dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [types-cachetools](https://github.com/python/typeshed) from 4.2.4 to 4.2.6. - [Release notes](https://github.com/python/typeshed/releases) - [Commits](https://github.com/python/typeshed/commits ) --- updated-dependencies: - dependency-name: types-cachetools dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Dec 14, 2021
-
-
dependabot[bot] authored
Bumps [lxml](https://github.com/lxml/lxml) from 4.6.3 to 4.6.5. - [Release notes](https://github.com/lxml/lxml/releases) - [Changelog](https://github.com/lxml/lxml/blob/master/CHANGES.txt) - [Commits](https://github.com/lxml/lxml/compare/lxml-4.6.3...lxml-4.6.5 ) --- updated-dependencies: - dependency-name: lxml dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Oct 21, 2021
-
-
Daniel Ecer authored
* log keypoint match score * reduced default min keypoint match score * reduced the default min keypoint match to 0.01
-
- Oct 19, 2021
-
-
Daniel Ecer authored
* added pdfminer.six dependency * output pt_bbox and coords using pt_bbox * increased version to 0.0.2 * avoid failing cv operation with float bounding box * configure pdfminer logging level (local only) * make canny filter configurable (disable by default) * make min template match score configurable * move constants to the beginning of the module * changed default min template match score to 0.6 * log template matching progress
-
- Oct 18, 2021
-
-
dependabot[bot] authored
Bumps [types-requests](https://github.com/python/typeshed) from 2.25.10 to 2.25.11. - [Release notes](https://github.com/python/typeshed/releases) - [Commits](https://github.com/python/typeshed/commits ) --- updated-dependencies: - dependency-name: types-requests dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Oct 14, 2021
-
-
dependabot[bot] authored
Bumps [flake8](https://github.com/pycqa/flake8) from 3.9.2 to 4.0.1. - [Release notes](https://github.com/pycqa/flake8/releases) - [Commits](https://github.com/pycqa/flake8/compare/3.9.2...4.0.1 ) --- updated-dependencies: - dependency-name: flake8 dependency-type: direct:development update-type: version-update:semver-major ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [nltk](https://github.com/nltk/nltk) from 3.6.3 to 3.6.5. - [Release notes](https://github.com/nltk/nltk/releases) - [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog) - [Commits](https://github.com/nltk/nltk/compare/3.6.3...3.6.5 ) --- updated-dependencies: - dependency-name: nltk dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Daniel Ecer authored
* added get_scale_invariant_template_match * re-enabled TestGetImageListObjectMatch * added test_should_match_smaller_image_with_max_width_height * moved TestGetScaleInvariantTemplateMatch up * fallback to template matching * improved logging * simplified template match result * added check for figure height * handle no best match * prevent pil error with large asset images * fixed trying to calculate similarity of too small image * minor: renamed literals with two underscores * linting * added link to blog
-
- Oct 13, 2021
-
-
Daniel Ecer authored
* replace dagger html entity * strip extra spaces at beginning of xml document
-
dependabot[bot] authored
Bumps [types-requests](https://github.com/python/typeshed) from 2.25.9 to 2.25.10. - [Release notes](https://github.com/python/typeshed/releases) - [Commits](https://github.com/python/typeshed/commits ) --- updated-dependencies: - dependency-name: types-requests dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [pip](https://github.com/pypa/pip) from 21.2.4 to 21.3. - [Release notes](https://github.com/pypa/pip/releases) - [Changelog](https://github.com/pypa/pip/blob/main/NEWS.rst) - [Commits](https://github.com/pypa/pip/compare/21.2.4...21.3 ) --- updated-dependencies: - dependency-name: pip dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Daniel Ecer authored
* log progress while matching figure image to each pdf page * yield empty image object match to keep total * added simple MultiLevelCache * implemented disk cache * added TestRegisterPickleFunction * increased default memory cache size
-
- Oct 11, 2021
-
-
Daniel Ecer authored
* moved get_image_array_with_max_resolution * added --pdf-scale-to argument * convert pdf from local pdf * show progress when loading pdf images * workaround for auto-scaling * log progress while checking for existing files
-
Daniel Ecer authored
-
dependabot[bot] authored
Bumps [apache-beam[gcp]](https://github.com/apache/beam) from 2.32.0 to 2.33.0. - [Release notes](https://github.com/apache/beam/releases) - [Changelog](https://github.com/apache/beam/blob/master/CHANGES.md) - [Commits](https://github.com/apache/beam/compare/v2.32.0...v2.33.0 ) --- updated-dependencies: - dependency-name: apache-beam[gcp] dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Oct 07, 2021
-
-
dependabot[bot] authored
Updates the requirements on [google-cloud-bigquery](https://github.com/googleapis/python-bigquery) to permit the latest version. - [Release notes](https://github.com/googleapis/python-bigquery/releases) - [Changelog](https://github.com/googleapis/python-bigquery/blob/main/CHANGELOG.md) - [Commits](https://github.com/googleapis/python-bigquery/compare/v0.21.0...v2.28.0 ) --- updated-dependencies: - dependency-name: google-cloud-bigquery dependency-type: direct:production ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
* Bump fsspec from 2021.5.0 to 2021.10.0 Bumps [fsspec](https://github.com/intake/filesystem_spec) from 2021.5.0 to 2021.10.0. - [Release notes](https://github.com/intake/filesystem_spec/releases) - [Commits](https://github.com/intake/filesystem_spec/compare/2021.05.0...2021.10.0 ) --- updated-dependencies: - dependency-name: fsspec dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> * also upgrade gcsfs Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by:
Daniel Ecer <de-code@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.62.2 to 4.62.3. - [Release notes](https://github.com/tqdm/tqdm/releases) - [Commits](https://github.com/tqdm/tqdm/compare/v4.62.2...v4.62.3 ) --- updated-dependencies: - dependency-name: tqdm dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [nltk](https://github.com/nltk/nltk) from 3.6.2 to 3.6.3. - [Release notes](https://github.com/nltk/nltk/releases) - [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog) - [Commits](https://github.com/nltk/nltk/compare/3.6.2...3.6.3 ) --- updated-dependencies: - dependency-name: nltk dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [types-requests](https://github.com/python/typeshed) from 2.25.6 to 2.25.9. - [Release notes](https://github.com/python/typeshed/releases) - [Commits](https://github.com/python/typeshed/commits ) --- updated-dependencies: - dependency-name: types-requests dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [flask](https://github.com/pallets/flask) from 2.0.1 to 2.0.2. - [Release notes](https://github.com/pallets/flask/releases) - [Changelog](https://github.com/pallets/flask/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/flask/compare/2.0.1...2.0.2 ) --- updated-dependencies: - dependency-name: flask dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Sep 20, 2021
-
-
Daniel Ecer authored
* renamed --skip-errors to --ignore-unmatched-graphics * implemented --skip-errors * run test_should_annotate_using_jats_xml using beam * fixed MapOrLog usage when skipping errors
-
- Sep 15, 2021
-
-
Daniel Ecer authored
-
Daniel Ecer authored
* moved bbox main to separate module * renamed test module to match main module under test
-
Daniel Ecer authored
* conditionally write dummy copy of xml file * renamed to output_json_path * added coords attribute * using namespace for coords attribute * add namespace to nsmap
-
- Sep 14, 2021
-
-
Daniel Ecer authored
* process pdf and xml file lists * allow sub directories in output path * make output_annotated_images_path relative to output * make sure that output directory is created * use write_bytes in favour of explicit makedirs (cloud ready) * make --pdf-base-path required * added pipeline * not extending ABC due to serialization errors * log pipeline options * added test to check serialization * changed super import to avoid one of the serialization errors in Dataflow * moved most functionality to separate module * added libgl1 to setup.py * added PreventFusion * minor import grouping * added TransformAndCount * reverted super __init__ call * use parse args, not ignoring unknown args * expose all of the worker arguments
-
- Sep 09, 2021
-
-
Daniel Ecer authored
* added BoundingBoxScoreSummary * added TestGetBoundingBoxMatchScoreSummary * implemented algorithm to adjust final bounding box
-
- Sep 08, 2021
-
-
Daniel Ecer authored
* specify cache key for object keypoints * made image id required for lower level functions * added logging to find bounding boxes * 2nd it to find bounding box without bounding box * use fixed size when calculating image similarity
-
- Sep 07, 2021
-
-
Daniel Ecer authored
* calculate structural similarity using skimage * fixed crop_image_to_bounding_box * output score in JSON * optionally output images with bounding boxes * display bbox label inside if not enough space above * sort by score, then key points * fixed cache issue by using explicit cache key prefix (otherwise ids may have been reused after memory being freed)
-
- Sep 03, 2021
-
-
Daniel Ecer authored
* raise GraphicImageNotFoundError * allow skipping errors
-
Daniel Ecer authored
* enable debug logging for tests * added cli scaffolding * extract images from pdf * fixed type hint * added bounding box to_list * converted bounding box to named tuple * added tests for validate * implemented bounding box intersection * implemented finding bounding boxes of single image * added test for smaller partial image * added libgl1 for open cv * linting: use with statement for Popen * added support for multiple image files * added support for xml files * join graphic href with xml dirname * renamed cv2 to cv * using ObjectDetectorMatcher * moved funtions to image object matching module * added TestGetObjectMatch * added ImageObjectMarchResult * added test_should_match_smaller_image * added test_should_match_smaller_rotated_90_image * fixed typo ImageObjectMatchResult * moved object_detector_matcher parameter down * added get_image_list_object_match * added su...
-