Skip to content

globalise-huygens/document_feature_extraction

Repository files navigation

This program is desgined to extract document layout features from historical documents. The program accepts a directory of page scans (Jpg's or jpeg's) and returns a JSON for each scan with the region types detected and their coordinates

To run the program:

  1. In terminal navigate to the folder "document_feature_extraction"
  2. Open the script "layout_detect.py" and find the section with the comment "# === HARD‑CODED CONFIG ==="
  3. Enter in directory paths and API key
  4. run the command: "python layout_detect.py" from terminal in the folder "document_feature_extraction"

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published