Help on Image to OCR Converter
How to make image and PDF files editable and searchable Using Image to OCR Converter
Help on PDF file types
How to Edit Text in Images Using Image to OCR Converter
Steps
-
-
Execute the downloaded setup file.
-
Input the image or pdf file to be converted. Image files such as BMP, JPG, TIFF, PNG can be converted to text-based HTML, Ms-Word, PDF file formats.
-
PDF file can also be directly converted by either converting whole pdf to image format or by extracting only the images present in the pdf. Read more about
PDF file types
-
Select the desired output formats. Select any desired text-based output format such as HTML, Ms-Word, PDF, Text. One or more file formats can be selected together. However, select less output formats for fast processing.
-
Select the language of input file. Select the same language option as the text language of the input file. Image to OCR Converter recognizes 40 different language. The default is English.
-
Click on Save button to start OCR processing and save to the selected formats.
-
Now you can edit the converted text-based formats as per your needs.
Tips
- Select less output formats for fast processing
PDF file types:
PDF Files |
Sample PDF files |
Portable Document Format (PDF) is a file format for document exchange. PDF file can contain text, fonts, images, and 2D vector graphics (line, circle etc.) in it.
|
|
PDF types:
-
Image-only PDF:
An Image-only pdf file contains images in it. Image-only pdf usually contains scanned images and paper documents. Almost all new scanners have in-bulit capability to create single-page or multi-page Image-only pdf file from scanned pages. The disadvantage of Image-only pdf is that text present in images of Image-only pdf file cannot be edited or searched. Image-only pdf files can be converted to "Seachable PDF" and "Text-only PDF" files using Image to OCR Converter. Image-only pdf has no editing support, gives low quality printouts at high zoom levels and cannot be converted to other text-based formats such as MS Word, Doc, HTML, Text etc. without Image to OCR Converter.
|
|
-
Searchable PDF:
Searchable PDF is a Image-only PDF file with the addition of a text layer beneath the image. The hidden text layer beneath the visible image layer retains the look of the original page while enabling text searchability. This approach enables search capabilities in an image-only pdf and is inexpensive to create than retyping whole image-only pdf to text-only PDF files. This balanced approach is especially suitable for documents that have to be searchable but would be too expensive to recompose. The text layer is created by Image to OCR Converter that scans the text on each page. It then creates a searchable pdf file with the recognized text stored in a layer beneath the image of the text. Searchable pdf provides better look-and-feel than image-only pdf. However, it has reduced pdf editing support, gives low quality printouts at high zoom levels and is also difficult to convert to other text-based formats such as MS Word, Doc, HTML, Text etc.
|
|
-
Text-only PDF:
A "Text-only" PDF contains only text and does not contains images. It may contains vector graphics, lines, rectangles etc. but does not contains raster images and pictures. Text-only PDF is the best pdf format for pdf editing, printing, searching and conversion to other text-based formats such as MS Word, Doc, HTML, Text etc. Image to OCR Converter is the first and only OCR software available that provides OCR conversion to text-only pdf file format. However, as text-only pdf does not contain images, so scanned papers and images containing only textual information or text-based images should be converted for better results. All raster pictures, photographs containing non-textual information are ignored by Image to OCR Converter and images containing textual information are processed and converted to text-only pdf file. Text-only pdf has the best pdf editing support, gives high quality printouts at high zoom levels and is also the best pdf format for conversion to other text-based formats such as MS Word, Doc, HTML, Text etc.
|
|
Useful external links