The pdf format usually contains e-books, instructions and other documents. Some of them are protected from copying, that is, the information in them is contained in the form of a picture and the text can be "pulled out" from it only by recognition.
Necessary
- - Abbyy FineReader;
- - Abbyy Screenshot Reader.
Instructions
Step 1
Download and install the Abbyy FineReader program on your computer, for this go to the official website of the application https://www.abbyy.ua/download/, select the desired product and click on the Download link. This program is designed to scan paper documents, but you can use it to recognize a file in pdf format. To do this, start the program, then select the "File" - "Open" menu. Select the file you want to recognize from your computer.
Step 2
Set the recognition settings: language (you can select several languages, for example, when the text is in Russian, but it contains words in English); division of text into blocks (text blocks, images), resolution. Select the desired area of text, right-click and select the type of block (text, picture or table).
Step 3
Then click "Recognize". After that, you can save the resulting text by copying it into a Word document. Recognition of a pdf file can be performed both page by page and for the entire document at once.
Step 4
Install Abbyy Screenshot Reader. After that, the program icon will appear in the tray. This application allows you to recognize text from a pdf document opened on the screen. This also applies to any images, and everything that is generally displayed on the monitor.
Step 5
Open a document, click on the program icon, select the type of source (image, text) and the type of data you want to receive. You can choose text, table or image. For example, if you need to recognize tabular data, select the direction "Text" - "Table".
Step 6
Next, a cross-shaped cursor will appear on the screen, highlight the required information. After recognition, a MS Excel table will appear with the inserted information from the document. To split the received text into table columns use the menu "Tools" - "Split by columns", select a separator (space or tab) and click "OK".