I have about 200 PDF scans that need to have information systematically removed.

ID: 658424 • Letter: I

Question

I have about 200 PDF scans that need to have information systematically removed. This information to remove includes a set of digits that may vary in length.

The current plan is to print each document out remove the information and re-scan it. I was hoping there would be a way to automate this process.

I have tried using Adobe Acrobat Pro's OCR and then find and replace functionality in Word, but I have run into a few problems. I can't figure out how to search for number in multiple Word documents and when I use OCR on the documents in Adobe Acrobat it finds pictures that are on the page and converts them to text.

If there is any software that may be able to automate this it would be really helpful.

Explanation / Answer

From your question/comments I gather that the PDFs contain images only.

1) Extract the images using a PDF image extractor like IweSoft PDF Image Extractor.

2) Blur the relevant text in the images

3) Assemble the images back into a new PDF using any tool you like (for creating PDFs there are hundreds of options, so I won't go into that)

Navigate

I have about 15 computers on the same network and most of their installed softwa

I have absolutely no clue how to do this assignment Your supervisor at the Nutri

Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.

I have about 200 PDF scans that need to have information systematically removed.

Question

Explanation / Answer

Related Questions

Navigate