improve memory consumption when finding bounding boxes (2nd iteration) (#426)
* log progress while matching figure image to each pdf page
* yield empty image object match to keep total
* added simple MultiLevelCache
* implemented disk cache
* added TestRegisterPickleFunction
* increased default memory cache size
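
The contents of sciencebeam_gym/utils/cache.py are not shown on this page, so the following is only a minimal sketch of what a multi-level cache could look like, not the code from this commit. It assumes each level is a mapping, ordered from fastest to slowest, and that a hit in a slower level is copied back into the faster ones. The `LRU` helper, the `get`/`set` method names, and the plain `dict` standing in for a disk-backed level are all illustrative assumptions.

```python
from collections import OrderedDict
from typing import Any, Hashable, MutableMapping, Sequence


class LRU(OrderedDict):
    """Bounded mapping that evicts the least recently used key (hypothetical helper)."""

    def __init__(self, maxsize: int = 128):
        self.maxsize = maxsize
        super().__init__()

    def __getitem__(self, key):
        value = super().__getitem__(key)
        self.move_to_end(key)  # mark as most recently used
        return value

    def __setitem__(self, key, value):
        if key in self:
            self.move_to_end(key)
        super().__setitem__(key, value)
        if len(self) > self.maxsize:
            oldest = next(iter(self))
            del self[oldest]


class MultiLevelCache:
    """Sketch of a cache that checks several mappings, fastest first (assumed design)."""

    def __init__(self, levels: Sequence[MutableMapping]):
        self.levels = list(levels)

    def get(self, key: Hashable, default: Any = None) -> Any:
        for index, level in enumerate(self.levels):
            try:
                value = level[key]
            except KeyError:
                continue
            # Copy the value into every faster level so the next lookup is cheap.
            for faster in self.levels[:index]:
                faster[key] = value
            return value
        return default

    def set(self, key: Hashable, value: Any) -> None:
        # Write through to all levels.
        for level in self.levels:
            level[key] = value


if __name__ == '__main__':
    memory = LRU(maxsize=2)
    disk = {}  # stand-in for a disk-backed mapping
    cache = MultiLevelCache([memory, disk])
    for name in ['a', 'b', 'c']:
        cache.set(name, name.upper())
    print('a' in memory, cache.get('a'), 'a' in memory)
```

With a small memory level in front of a larger, slower one, entries evicted from memory remain retrievable from the slower level, which is how a bounded memory footprint can coexist with a high hit rate.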
Showing
- requirements.dev.txt: 1 addition, 0 deletions
- requirements.txt: 3 additions, 0 deletions
- sciencebeam_gym/tools/image_annotation/find_bounding_boxes_utils.py: 107 additions, 75 deletions
- sciencebeam_gym/utils/cache.py: 57 additions, 0 deletions
- sciencebeam_gym/utils/image_object_matching.py: 17 additions, 2 deletions
- sciencebeam_gym/utils/pickle_reg.py: 17 additions, 0 deletions
- tests/utils/cache_test.py: 44 additions, 0 deletions
- tests/utils/pickle_reg_test.py: 18 additions, 0 deletions
cachetools==4.2.4
diskcache==5.2.1
Flask==2.0.2
gevent==21.8.0
gunicorn==20.1.0
...
@@ -23,4 +25,5 @@ scikit-image==0.18.3
sklearn-crfsuite==0.3.6
scikit-learn>=0.24.2
tensorflow-transform==1.3.0
typing-extensions==3.10.0.2
tqdm==4.62.3
sciencebeam_gym/utils/cache.py: new file (mode 0 → 100644)
sciencebeam_gym/utils/pickle_reg.py: new file (mode 0 → 100644)
tests/utils/cache_test.py: new file (mode 0 → 100644)
tests/utils/pickle_reg_test.py: new file (mode 0 → 100644)