PDF Comparison and Highlighting Tool

This Python-based tool allows for efficient comparison of two or more PDF documents, highlighting the differences between them. It extracts and compares the words in the PDFs, ignoring whitespace differences, and highlights the changed, added, or missing words.

Features:

Word-based Comparison: Compares text from two or more PDFs, highlighting only added, modified, or deleted words.
Whitespace Ignored: Ignores any differences in whitespace, focusing only on actual word changes.
Precise Highlighting: Highlights the differences in the compared PDF files using custom colors (e.g., red for PDF2 and green for PDF3).
Side-by-Side Merging: Merges the original and highlighted PDFs side by side for easy comparison.

Usage:

Provide paths to the PDF files to be compared.
The tool will extract words from the PDFs, compare them, and highlight the differences.
It saves the highlighted PDFs and a merged output with the original and highlighted PDFs placed side by side for an easy visual comparison.

Dependencies:

PyMuPDF (fitz)
difflib (standard Python library)

Notes:

The tool performs comparison on a page-by-page basis. If the PDF documents differ in the number of pages, it will compare up to the smallest page count.
Text formatting (e.g., font size, style) is not considered in the comparison; only the raw text content is compared.

Example Output:

Original PDF: The untouched source document.
Highlighted PDF: PDFs with added, changed, or missing words highlighted in different colors.
Combined Output: A single PDF containing the original and highlighted versions side by side.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
2pdf_comparison.py		2pdf_comparison.py
3pdf_compare.py		3pdf_compare.py
README.md		README.md
test1.pdf		test1.pdf
test2.pdf		test2.pdf
test3.pdf		test3.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PDF Comparison and Highlighting Tool

Features:

Usage:

Dependencies:

Notes:

Example Output:

Result:

About

Uh oh!

Releases

Packages

Languages

malavika-suresh/multiple_pdf_comparison

Folders and files

Latest commit

History

Repository files navigation

PDF Comparison and Highlighting Tool

Features:

Usage:

Dependencies:

Notes:

Example Output:

Result:

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages