- Oct 14, 2021
-
-
dependabot[bot] authored
Bumps [nltk](https://github.com/nltk/nltk) from 3.6.3 to 3.6.5.
- [Release notes](https://github.com/nltk/nltk/releases)
- [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog)
- [Commits](https://github.com/nltk/nltk/compare/3.6.3...3.6.5)

---
updated-dependencies:
- dependency-name: nltk
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Oct 13, 2021
-
-
Daniel Ecer authored
* log progress while matching figure image to each pdf page
* yield empty image object match to keep total
* added simple MultiLevelCache
* implemented disk cache
* added TestRegisterPickleFunction
* increased default memory cache size
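The commit above mentions a simple `MultiLevelCache` backed by a memory cache and a disk cache. The repository's actual implementation is not shown here, so the following is only a minimal sketch of the general idea. The `MemoryCache` and `DiskCache` classes, their `get`/`set` methods, and the pickle-per-key file layout are all assumptions made for illustration.

```python
import hashlib
import pickle
import tempfile
from pathlib import Path
from typing import Any, Dict, List, Optional


class MemoryCache:
    """Hypothetical fastest level: a plain in-process dictionary."""

    def __init__(self) -> None:
        self._data: Dict[str, Any] = {}

    def get(self, key: str) -> Optional[Any]:
        return self._data.get(key)

    def set(self, key: str, value: Any) -> None:
        self._data[key] = value


class DiskCache:
    """Hypothetical slower level: one pickle file per key (assumed layout)."""

    def __init__(self, directory: Path) -> None:
        self._directory = directory
        self._directory.mkdir(parents=True, exist_ok=True)

    def _path(self, key: str) -> Path:
        # Hash the key so that any string maps to a safe file name.
        digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
        return self._directory / (digest + ".pkl")

    def get(self, key: str) -> Optional[Any]:
        path = self._path(key)
        if not path.exists():
            return None
        with path.open("rb") as fp:
            return pickle.load(fp)

    def set(self, key: str, value: Any) -> None:
        with self._path(key).open("wb") as fp:
            pickle.dump(value, fp)


class MultiLevelCache:
    """Checks each level in order; a hit is copied into the faster levels."""

    def __init__(self, levels: List[Any]) -> None:
        self._levels = list(levels)

    def get(self, key: str) -> Optional[Any]:
        for index, level in enumerate(self._levels):
            value = level.get(key)
            if value is not None:  # note: a stored None is treated as a miss
                for faster_level in self._levels[:index]:
                    faster_level.set(key, value)
                return value
        return None

    def set(self, key: str, value: Any) -> None:
        for level in self._levels:
            level.set(key, value)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as temp_dir:
        cache = MultiLevelCache([MemoryCache(), DiskCache(Path(temp_dir))])
        cache.set("page-1", {"score": 0.9})
        print(cache.get("page-1"))
```

Putting the memory level first keeps repeated lookups cheap, while the disk level lets results survive a restart. Only load pickle files from a cache directory you trust.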
-
- Oct 07, 2021
-
-
dependabot[bot] authored
Updates the requirements on [google-cloud-bigquery](https://github.com/googleapis/python-bigquery) to permit the latest version.
- [Release notes](https://github.com/googleapis/python-bigquery/releases)
- [Changelog](https://github.com/googleapis/python-bigquery/blob/main/CHANGELOG.md)
- [Commits](https://github.com/googleapis/python-bigquery/compare/v0.21.0...v2.28.0)

---
updated-dependencies:
- dependency-name: google-cloud-bigquery
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
* Bump fsspec from 2021.5.0 to 2021.10.0

  Bumps [fsspec](https://github.com/intake/filesystem_spec) from 2021.5.0 to 2021.10.0.
  - [Release notes](https://github.com/intake/filesystem_spec/releases)
  - [Commits](https://github.com/intake/filesystem_spec/compare/2021.05.0...2021.10.0)

  ---
  updated-dependencies:
  - dependency-name: fsspec
    dependency-type: direct:production
    update-type: version-update:semver-minor
  ...

  Signed-off-by: dependabot[bot] <support@github.com>

* also upgrade gcsfs

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Daniel Ecer <de-code@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.62.2 to 4.62.3.
- [Release notes](https://github.com/tqdm/tqdm/releases)
- [Commits](https://github.com/tqdm/tqdm/compare/v4.62.2...v4.62.3)

---
updated-dependencies:
- dependency-name: tqdm
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [nltk](https://github.com/nltk/nltk) from 3.6.2 to 3.6.3.
- [Release notes](https://github.com/nltk/nltk/releases)
- [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog)
- [Commits](https://github.com/nltk/nltk/compare/3.6.2...3.6.3)

---
updated-dependencies:
- dependency-name: nltk
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [flask](https://github.com/pallets/flask) from 2.0.1 to 2.0.2.
- [Release notes](https://github.com/pallets/flask/releases)
- [Changelog](https://github.com/pallets/flask/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/flask/compare/2.0.1...2.0.2)

---
updated-dependencies:
- dependency-name: flask
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Sep 07, 2021
-
-
Daniel Ecer authored
* calculate structural similarity using skimage
* fixed crop_image_to_bounding_box
* output score in JSON
* optionally output images with bounding boxes
* display bbox label inside if not enough space above
* sort by score, then key points
* fixed cache issue by using explicit cache key prefix (otherwise ids may have been reused after memory being freed)
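The last item of the commit above describes a cache fix: keys derived from object ids were replaced by an explicit key prefix. In CPython, `id()` values are only guaranteed to be unique among objects that are alive at the same time, so a key built from `id(obj)` can collide once the original object has been freed. The sketch below illustrates the explicit-prefix approach only; `PrefixedCache`, `get_or_compute`, and the example prefixes are hypothetical names, not taken from the repository.

```python
from typing import Any, Callable, Dict, Tuple


class PrefixedCache:
    """Hypothetical cache keyed by (explicit prefix, key) instead of id(obj)."""

    def __init__(self) -> None:
        self._data: Dict[Tuple[str, str], Any] = {}

    def get_or_compute(
        self,
        key_prefix: str,
        key: str,
        compute: Callable[[], Any]
    ) -> Any:
        # The caller names the data source explicitly (e.g. a file name),
        # so the key stays valid even after the source object is freed
        # and another object later receives the same id().
        cache_key = (key_prefix, key)
        if cache_key not in self._data:
            self._data[cache_key] = compute()
        return self._data[cache_key]


if __name__ == "__main__":
    cache = PrefixedCache()
    calls = []

    def detect(name: str) -> int:
        calls.append(name)
        return len(name)

    cache.get_or_compute("figure-1.png", "keypoints", lambda: detect("figure-1.png"))
    cache.get_or_compute("figure-1.png", "keypoints", lambda: detect("figure-1.png"))
    cache.get_or_compute("figure-2.png", "keypoints", lambda: detect("figure-2.png"))
    print(calls)  # each distinct prefix is computed once
```

The trade-off is that the caller has to supply a stable, meaningful prefix, but in return a cache entry can never be attributed to an unrelated object.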
-
- Sep 03, 2021
-
-
Daniel Ecer authored
* enable debug logging for tests
* added cli scaffolding
* extract images from pdf
* fixed type hint
* added bounding box to_list
* converted bounding box to named tuple
* added tests for validate
* implemented bounding box intersection
* implemented finding bounding boxes of single image
* added test for smaller partial image
* added libgl1 for open cv
* linting: use with statement for Popen
* added support for multiple image files
* added support for xml files
* join graphic href with xml dirname
* renamed cv2 to cv
* using ObjectDetectorMatcher
* moved functions to image object matching module
* added TestGetObjectMatch
* added ImageObjectMarchResult
* added test_should_match_smaller_image
* added test_should_match_smaller_rotated_90_image
* fixed typo ImageObjectMatchResult
* moved object_detector_matcher parameter down
* added get_image_list_object_match
* added su...
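Several items in the commit above concern a bounding box type: conversion to a named tuple, a `to_list` method, and an intersection operation. The repository's own class is not reproduced here. The following is a minimal sketch of how such a type could look, assuming an `(x, y, width, height)` layout; the field names and the empty-box convention are assumptions.

```python
from typing import List, NamedTuple


class BoundingBox(NamedTuple):
    """Hypothetical axis-aligned box; the field layout is an assumption."""

    x: float
    y: float
    width: float
    height: float

    def to_list(self) -> List[float]:
        return [self.x, self.y, self.width, self.height]

    def intersection(self, other: "BoundingBox") -> "BoundingBox":
        # The overlap runs from the larger of the two left/top edges
        # to the smaller of the two right/bottom edges.
        left = max(self.x, other.x)
        top = max(self.y, other.y)
        right = min(self.x + self.width, other.x + other.width)
        bottom = min(self.y + self.height, other.y + other.height)
        # Clamp to zero so that non-overlapping boxes yield an empty box.
        return BoundingBox(left, top, max(0, right - left), max(0, bottom - top))


if __name__ == "__main__":
    page_box = BoundingBox(0, 0, 10, 10)
    figure_box = BoundingBox(5, 5, 10, 10)
    print(page_box.intersection(figure_box).to_list())
```

A named tuple keeps the box immutable and hashable, which makes it convenient for tests and usable as a dictionary key, while still allowing helper methods.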
-
- Aug 26, 2021
-
-
Daniel Ecer authored
* added mypy dependency
* added dev-mypy
* added mypy make target
* declare EMPTY class prop
* removed incorrect tensors type hint
* added type hint to excluded_tokens
* removed unused ProcessedWrapper
* replacing backports.tempfile with builtin
* added T_ArgumentParserOrGroup
* fixed iter_tokenized_tokens return type hint
* added types-requests
* added type to DEFAULT_ANNOTATORS
* removed blank line
* ignore distutils import
* replaced T_Element with etree.ElementBase
* changed type check back to etree._Element
* replaced project_tests.sh
* removed second mypy make target dependency
-
dependabot[bot] authored
Bumps [tensorflow-transform](https://github.com/tensorflow/transform) from 0.25.0 to 1.3.0.
- [Release notes](https://github.com/tensorflow/transform/releases)
- [Changelog](https://github.com/tensorflow/transform/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/transform/compare/v0.25.0...v1.3.0)

---
updated-dependencies:
- dependency-name: tensorflow-transform
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
* Bump pylint from 2.8.3 to 2.10.2

  Bumps [pylint](https://github.com/PyCQA/pylint) from 2.8.3 to 2.10.2.
  - [Release notes](https://github.com/PyCQA/pylint/releases)
  - [Changelog](https://github.com/PyCQA/pylint/blob/main/ChangeLog)
  - [Commits](https://github.com/PyCQA/pylint/compare/v2.8.3...v2.10.2)

  ---
  updated-dependencies:
  - dependency-name: pylint
    dependency-type: direct:development
    update-type: version-update:semver-minor
  ...

  Signed-off-by: dependabot[bot] <support@github.com>

* make installing dependencies more predictable
* downgrade numpy due to conflict with apache beam
* downgraded numpy further due to conflict with tensorflow
* linting: use dict literal
* linting: pass in encoding to open function
* linting: pcoll renamed to input_or_inputs
* linting: iterate over list
* linting: use from .. import
* added pyarrow as explicit dependency
* downgrade google-cloud-bigquery

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Daniel Ecer <de-code@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [gevent](https://github.com/gevent/gevent) from 21.1.2 to 21.8.0.
- [Release notes](https://github.com/gevent/gevent/releases)
- [Changelog](https://github.com/gevent/gevent/blob/master/docs/changelog_pre.rst)
- [Commits](https://github.com/gevent/gevent/compare/21.1.2...21.8.0)

---
updated-dependencies:
- dependency-name: gevent
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.62.0 to 4.62.2.
- [Release notes](https://github.com/tqdm/tqdm/releases)
- [Commits](https://github.com/tqdm/tqdm/compare/v4.62.0...v4.62.2)

---
updated-dependencies:
- dependency-name: tqdm
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [numpy](https://github.com/numpy/numpy) from 1.21.1 to 1.21.2.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.21.1...v1.21.2)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- Aug 02, 2021
-
-
dependabot-preview[bot] authored
-
- Jul 19, 2021
-
-
dependabot-preview[bot] authored
-
- Jul 14, 2021
-
-
dependabot-preview[bot] authored
-
- Jul 07, 2021
-
-
dependabot-preview[bot] authored
-
- Jun 23, 2021
-
-
dependabot-preview[bot] authored
-
- Jun 14, 2021
-
-
dependabot-preview[bot] authored
-
- May 25, 2021
-
-
dependabot-preview[bot] authored
-
- May 24, 2021
-
-
dependabot-preview[bot] authored
-
- May 18, 2021
-
-
Daniel Ecer authored
* initial create vocabulary utility
* extract vocabulary from embeddings
* renamed to --output-word-count-file
* added main call
* extracted iter_tokenized_tokens
* avoid empty tokens
* using tokenizer from delft
* optionally sort by count
* added file list support
* added support for remote files
* added limit argument
* added fsspec dependency
* optionally use multi threading or processing
* included full github link
* renamed to create_vocabulary
* moved to tools vocabulary
* filter embeddings
* renamed to embeddings
* using fsspec to open embeddings file when extracting
* use fsspec when filtering embeddings
* document tools
* added link to tools.md
-
- May 13, 2021
-
-
dependabot-preview[bot] authored
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.5 to 1.20.3.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.19.5...v1.20.3)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
Co-authored-by: dependabot-preview[bot] <27856297+dependabot-preview[bot]@users.noreply.github.com>
-
Daniel Ecer authored
* removed usage of six
* removed six dependency
-
Daniel Ecer authored
This reverts commit 612a2c3f.
-
dependabot-preview[bot] authored
Bumps [tensorflow-transform](https://github.com/tensorflow/transform) from 0.25.0 to 0.30.0.
- [Release notes](https://github.com/tensorflow/transform/releases)
- [Changelog](https://github.com/tensorflow/transform/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/transform/compare/v0.25.0...v0.30.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
Co-authored-by: dependabot-preview[bot] <27856297+dependabot-preview[bot]@users.noreply.github.com>
-
dependabot-preview[bot] authored
Bumps [gunicorn](https://github.com/benoitc/gunicorn) from 20.0.4 to 20.1.0.
- [Release notes](https://github.com/benoitc/gunicorn/releases)
- [Commits](https://github.com/benoitc/gunicorn/compare/20.0.4...20.1.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
Co-authored-by: dependabot-preview[bot] <27856297+dependabot-preview[bot]@users.noreply.github.com>
-
- May 12, 2021
-
-
dependabot-preview[bot] authored
-
dependabot-preview[bot] authored
-
- May 06, 2021
-
-
dependabot-preview[bot] authored
-
- Apr 22, 2021
-
-
dependabot-preview[bot] authored
-
- Apr 14, 2021
-
-
dependabot-preview[bot] authored
-
- Apr 08, 2021
-
-
dependabot-preview[bot] authored
-
- Apr 06, 2021
-
-
dependabot-preview[bot] authored
-
- Mar 22, 2021
-
-
dependabot-preview[bot] authored
-
- Mar 08, 2021
-
-
dependabot-preview[bot] authored
-
- Feb 26, 2021
-
-
dependabot-preview[bot] authored
-
- Feb 22, 2021
-
-
dependabot-preview[bot] authored
-