This program is designed to extract document layout features from historical documents. It accepts a directory of page scans (.jpg or .jpeg files) and returns one JSON file per scan, listing the region types detected and their coordinates.
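
As a rough illustration of the input and output described above, the sketch below collects the JPEG scans in a directory and pairs each with a JSON output path. The function names, the case-insensitive extension matching, and the "same file name with a .json extension" naming are assumptions made for this example; they are not taken from layout_detect.py.

```python
from pathlib import Path

# Extensions the program accepts; matching them case-insensitively is an assumption.
SCAN_EXTENSIONS = {".jpg", ".jpeg"}


def list_scans(input_dir):
    """Return the JPEG page scans found directly inside input_dir, sorted by path."""
    return sorted(
        p for p in Path(input_dir).iterdir()
        if p.is_file() and p.suffix.lower() in SCAN_EXTENSIONS
    )


def output_path_for(scan_path, output_dir):
    """Hypothetical naming scheme: page_001.jpg -> <output_dir>/page_001.json."""
    return Path(output_dir) / (Path(scan_path).stem + ".json")
```

For example, `output_path_for("scans/page_001.jpg", "results")` gives the path `results/page_001.json` under this assumed naming scheme.
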
To run the program:
- In a terminal, navigate to the folder "document_feature_extraction"
- Open the script "layout_detect.py" and find the section with the comment "# === HARD‑CODED CONFIG ==="
- Enter the directory paths and the API key in that section, then save the script
- From the same folder ("document_feature_extraction"), run the command: "python layout_detect.py"
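
The configuration step above might look something like the sketch below. The variable names (INPUT_DIR, OUTPUT_DIR, API_KEY), the placeholder values, and the check_config helper are all hypothetical; check the config section in layout_detect.py for the names it actually uses.

```python
from pathlib import Path

# Hypothetical stand-in for the script's hard-coded config section.
# The real variable names in layout_detect.py may differ.
INPUT_DIR = "path/to/page_scans"     # folder containing the .jpg/.jpeg scans
OUTPUT_DIR = "path/to/json_output"   # folder where the JSON results are written
API_KEY = "your-api-key"             # placeholder; replace with a real key


def check_config(input_dir, output_dir, api_key):
    """Return a list of problems with the config; an empty list means it looks usable."""
    problems = []
    if not Path(input_dir).is_dir():
        problems.append(f"Input directory not found: {input_dir}")
    if Path(output_dir).exists() and not Path(output_dir).is_dir():
        problems.append(f"Output path exists but is not a directory: {output_dir}")
    if not api_key or api_key == "your-api-key":
        problems.append("API key has not been set")
    return problems
```

Calling `check_config(INPUT_DIR, OUTPUT_DIR, API_KEY)` before running the script is one way to catch a mistyped path or a forgotten API key early.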