Skip to content
Snippets Groups Projects
  1. Sep 03, 2021
    • Daniel Ecer's avatar
      figure image bounding box annotation for single document (#389) · 0bf9e780
      Daniel Ecer authored
      * enable debug logging for tests
      
      * added cli scaffolding
      
      * extract images from pdf
      
      * fixed type hint
      
      * added bounding box to_list
      
      * converted bounding box to named tuple
      
      * added tests for validate
      
      * implemented bounding box intersection
      
      * implemented finding bounding boxes of single image
      
      * added test for smaller partial image
      
      * added libgl1 for open cv
      
      * linting: use with statement for Popen
      
      * added support for multiple image files
      
      * added support for xml files
      
      * join graphic href with xml dirname
      
      * renamed cv2 to cv
      
      * using ObjectDetectorMatcher
      
      * moved funtions to image object matching module
      
      * added TestGetObjectMatch
      
      * added ImageObjectMarchResult
      
      * added test_should_match_smaller_image
      
      * added test_should_match_smaller_rotated_90_image
      
      * fixed typo ImageObjectMatchResult
      
      * moved object_detector_matcher parameter down
      
      * added get_image_list_object_match
      
      * added su...
  2. Aug 31, 2021
    • Daniel Ecer's avatar
      refactored docker build (#394) · 53327b3a
      Daniel Ecer authored
      * using single dockerfile
      
      * updated docker compose config version
      
      * refactored dockerfile
      
      * calling dev tools via python module
      
      * removed install_dev argument
      
      * fixed DOCKER_DEV_RUN placeholder
      
      * minor refactory of apt-get command
      
      * using builder image
      
      * updated nltk data directory
  3. May 13, 2021
  4. Dec 07, 2020
    • Daniel Ecer's avatar
      fix serialisation issue (#280) · 3d7ba502
      Daniel Ecer authored
      * added lxml end-to-end test
      
      * configured test logging
      
      * prefer /usr/bin/timeout
      
      * added debug logging
      
      * added pdf end to end preprocessing test
      
      * fixed serialization error
      
      * added debug logging
      
      * check pipeline results
      
      * test png output
      
      * fixed element type check
      
      * install poppler-utils into docker image
      
      * add --assume-yes
  5. Jan 17, 2020
  6. Jan 07, 2020
  7. Sep 09, 2019
    • Daniel Ecer's avatar
      switched to python3 (#145) · b3473e4c
      Daniel Ecer authored
      * ugraded to python 3
      
      * upgrade pylint and pytest
      
      * replaced StandardError
      
      * exclude useless-object-inheritance
      
      * python3 compatibilities uncovered by linting
      
      * fixed tests
      
      * fixed more python3 test incompatibilities
  8. Jun 10, 2019
  9. Jun 05, 2019
    • Daniel Ecer's avatar
      added autocut model (#106) · 85754f2d
      Daniel Ecer authored
      * added dev-venv target
      
      * added subextract model
      
      * added nltk dependency
      
      * flake8 ignore line break before binary operator
      
      * moved dev dependencies up
      
      * added nltk punkt download
      
      * added nltk download to dev-venv; pytest and pytest-not-slow target
      
      * added subextract training pipeline
      
      * added optional xpath namespaces
      
      * log failed xml file
      
      * use recover parser option
      
      * added subextract app
      
      * start subextract server
      
      * renamed to autocut
      
      * declare slow and very_slow pytest markers
      
      * make autocut main test as slow
      
      * fixed post data
      
      * updated README
      
      * also build non-dev image as part of ci
      
      * added pytest.ini to dev image
  10. Jun 03, 2019
    • Daniel Ecer's avatar
      moved tests (#104) · 43fe414c
      Daniel Ecer authored
    • Daniel Ecer's avatar
      refactor build (#103) · aae39c7a
      Daniel Ecer authored
      * added Makefile; optionally install dev requirements
      
      * added ci-build-and-test
      
      * use ci-build-and-test
      
      * don't install dev dependencies as part of project_tests
      
      * renamed to dev
      
      * use PROJECT_FOLDER for consistency with other projects
      
      * change order of tests run
  11. Nov 02, 2018
    • Daniel Ecer's avatar
      pylint and flake8 checking (#39) · 91e1c0d0
      Daniel Ecer authored
      * added pylint check
      
      * added pylintrc to docker image
      
      * reduced accessive apache beam debug logging
      
      * configured pylint, addressed linting
      
      * enabled flake8 checks
      
      * downgrade pycodestyle to 2.3.1 due to error
      
      * switch to 4 spaces indent
      
      * autopep8
      
      * more flake8
      
      * added new line to .flake8
  12. Nov 01, 2018
  13. Apr 19, 2018
    • Peter Hooper's avatar
      Fix versioning for google-cloud packages. (#28) · 851eb577
      Peter Hooper authored
      * Renamed files to be consistent with 'sciencebeam' repo
      
      * Update Dockerfile and README after filename change
      
      * Update version of tensorflow-transform to 0.6
      
      * Pin versions of oauth2client and httplib2 to prevent errors with google-cloud. Update apache_beam to 2.4 as used by new version of tensorflow-transform
  14. Mar 20, 2018
  15. Nov 30, 2017