Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

I have about 200 PDF scans that need to have information systematically removed.

ID: 658424 • Letter: I

Question

I have about 200 PDF scans that need to have information systematically removed. This information to remove includes a set of digits that may vary in length.

The current plan is to print each document out remove the information and re-scan it. I was hoping there would be a way to automate this process.

I have tried using Adobe Acrobat Pro's OCR and then find and replace functionality in Word, but I have run into a few problems. I can't figure out how to search for number in multiple Word documents and when I use OCR on the documents in Adobe Acrobat it finds pictures that are on the page and converts them to text.

If there is any software that may be able to automate this it would be really helpful.

Explanation / Answer

From your question/comments I gather that the PDFs contain images only.

1) Extract the images using a PDF image extractor like IweSoft PDF Image Extractor.

2) Blur the relevant text in the images

3) Assemble the images back into a new PDF using any tool you like (for creating PDFs there are hundreds of options, so I won't go into that)

Hire Me For All Your Tutoring Needs
Integrity-first tutoring: clear explanations, guidance, and feedback.
Drop an Email at
drjack9650@gmail.com
Chat Now And Get Quote