How to Extract Text from Images with Python?

Last Updated : 26 Dec, 2020

OCR (Optical Character Recognition) is the electronic conversion of digital images into machine-encoded text, where the digital image generally contains regions that resemble the characters of a language. OCR is a field of research in pattern recognition, artificial intelligence and computer vision, because newer OCR engines are trained by running sample data through a machine learning algorithm. This technique is generally used in work environments where the image is known to contain text. In this article, we will learn how to extract text from images using the Python programming language.

To give our Python program character recognition capabilities, we will use pytesseract, a Python wrapper for the Tesseract OCR engine. The library can be installed into the Python environment by running the following command in the command interpreter of the OS:

pip install pytesseract

On Windows, the library also requires the tesseract.exe binary to be installed in order to work. The installer for that executable prompts for an installation path; note this path, as it is used later in the code. For most installations the path is C:\Program Files (x86)\Tesseract-OCR\tesseract.exe, although it may differ on your machine (the examples in this article use C:\Program Files\Tesseract-OCR\tesseract.exe).
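Because the install path varies between machines, it can help to look the binary up instead of hard-coding it. The following is a minimal sketch, not part of the pytesseract API: the helper name find_tesseract and the WINDOWS_CANDIDATES list are hypothetical, and the candidate paths are just the two locations mentioned in this article.

```python
import os
import shutil


def find_tesseract(candidates=()):
    """Return a path to the Tesseract binary, or None if none is found.

    Hypothetical helper for illustration; not part of pytesseract.
    """
    # Prefer whatever is reachable through PATH.
    on_path = shutil.which("tesseract")
    if on_path:
        return on_path
    # Otherwise fall back to the explicit candidate locations, in order.
    for path in candidates:
        if os.path.isfile(path):
            return path
    return None


# Assumed candidate locations; adjust to your own installation.
WINDOWS_CANDIDATES = [
    r"C:\Program Files\Tesseract-OCR\tesseract.exe",
    r"C:\Program Files (x86)\Tesseract-OCR\tesseract.exe",
]

path_to_tesseract = find_tesseract(WINDOWS_CANDIDATES)
```

If path_to_tesseract comes back as None, the binary is probably not installed or lives somewhere else, and the path has to be supplied by hand.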

Explanation:

First, we import the Image module from the PIL library (for opening an image) and the pytesseract module from the pytesseract library (for text extraction). Next, we define the path_to_tesseract variable, which holds the path to the executable binary (tesseract.exe) installed as a prerequisite (this path depends on where the binary is installed), and the image_path variable, which holds the path to the image file. The image path is passed to the Image.open() function to create an image object. We then assign the path stored in path_to_tesseract to pytesseract.tesseract_cmd, which the library uses to find the executable and run it for extraction. The image object (img) is passed to the image_to_string() function, which takes an image object as its argument and returns the text recognized in it. Finally, we display the text found in the image using text[:-1], which drops the additional character (^L, a form feed) that is appended to the output by default.
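Slicing with text[:-1] always removes the last character, so it would cut off a real character if the output did not end with a form feed. As a hedged alternative, the sketch below removes only trailing form feeds; the helper name strip_form_feed is hypothetical and the sample strings are made up for illustration.

```python
def strip_form_feed(text):
    """Remove trailing form-feed characters ("\x0c", shown as ^L) only."""
    # rstrip leaves the string untouched when no form feed is present,
    # whereas text[:-1] would drop the last character unconditionally.
    return text.rstrip("\x0c")


# Illustrative input: text followed by a newline and a form feed.
sample = "Geeksforgeeks\n\x0c"
cleaned = strip_form_feed(sample)
```

For output that does end in a single form feed, this gives the same result as text[:-1].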

Example 1:

Image for demonstration:

An image of white text with black background

Below is the full implementation:

Python3

from PIL import Image 
from pytesseract import pytesseract 
  
# Defining paths to tesseract.exe 
# and the image we would be using 
path_to_tesseract = r"C:\Program Files\Tesseract-OCR\tesseract.exe"
image_path = r"csv\sample_text.png"
  
# Opening the image & storing it in an image object 
img = Image.open(image_path) 
  
# Providing the tesseract executable 
# location to pytesseract library 
pytesseract.tesseract_cmd = path_to_tesseract 
  
# Passing the image object to image_to_string() function 
# This function will extract the text from the image 
text = pytesseract.image_to_string(img) 
  
# Displaying the extracted text 
print(text[:-1])

Output:

now children state should after above same long made such

point run take call together few being would walk give

Example 2:

Image for demonstration:

Code:

Python3

from PIL import Image 
from pytesseract import pytesseract 
  
# Defining paths to tesseract.exe  
# and the image we would be using 
path_to_tesseract = r"C:\Program Files\Tesseract-OCR\tesseract.exe"
image_path = r"csv\d.jpg"
  
# Opening the image & storing it in an image object 
img = Image.open(image_path) 
  
# Providing the tesseract executable
# location to pytesseract library
pytesseract.tesseract_cmd = path_to_tesseract

# Passing the image object to image_to_string() function
# This function will extract the text from the image
text = pytesseract.image_to_string(img) 
  
# Displaying the extracted text 
print(text[:-1])

Output:

Geeksforgeeks
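Both examples repeat the same steps, so they could be folded into a reusable function. The following is a sketch under assumptions rather than a definitive implementation: extract_text and extract_all are hypothetical names, the libraries are imported inside the function so that the helpers can be defined even where they are not installed, and trailing form feeds are stripped with rstrip instead of text[:-1].

```python
def extract_text(image_path, tesseract_cmd=None):
    """Return the text recognized in the image at image_path.

    Hypothetical wrapper around the steps used in the examples above.
    """
    # Imported lazily; both packages must be installed to call this.
    from PIL import Image
    from pytesseract import pytesseract

    # Point pytesseract at the binary only when a path is supplied;
    # otherwise it relies on "tesseract" being reachable through PATH.
    if tesseract_cmd:
        pytesseract.tesseract_cmd = tesseract_cmd

    # The context manager closes the image file after recognition.
    with Image.open(image_path) as img:
        text = pytesseract.image_to_string(img)

    # Drop the trailing form feed (^L), if any.
    return text.rstrip("\x0c")


def extract_all(image_paths, ocr=extract_text):
    """Run an OCR callable over several paths and map each path to its text."""
    return {path: ocr(path) for path in image_paths}
```

With the paths from the examples, a call might look like extract_text(r"csv\d.jpg", r"C:\Program Files\Tesseract-OCR\tesseract.exe").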
