Recognizing a Car License Plate is a very important task for a camera surveillance-based security system. We can extract the license plate from an image using some computer vision techniques and then we can use Optical Character Recognition to recognize the license number. Here I will guide you through the whole procedure of this task.
Requirements:
opencv-python >= 3.4.x
numpy >= 1.17.2
skimage >= 0.16.2
tensorflow >= 2.x.
imutils >= 0.5.3
.
Example:
Input:
Output:
29A33185
- Find all the contours in the image.
- Find the bounding rectangle of every contour.
- Compare and validate the sides ratio and area of every bounding rectangle with an average license plate.
- Apply image segmentation in the image inside the validated contour to find characters in it.
- Recognize characters using an OCR.
Methodology:
1. To reduce the noise we need to blur the input Image with Gaussian Blur and then convert it to grayscale.
2. Find vertical edges in the image.
3. To reveal the plate we have to binarize the image. For this apply Otsu’s Thresholding on the vertical edge image. In other thresholding methods, we have to choose a threshold value to binarize the image but Otsu’s Thresholding determines the value automatically.
4. Apply Closing Morphological Transformation on the thresholded image. Closing is useful to fill small black regions between white regions in a thresholded image. It reveals the rectangular white box of license plates.
5. To detect the plate we need to find contours in the image. It is important to binarize and morph the image before finding contours so that it can find a more relevant and less number of contours in the image. If you draw all the extracted contours on the original image, it would look like this:
6. Now find the minimum area rectangle enclosed by each of the contours and validate their side ratios and area. We have defined the minimum and maximum area of the plate as 4500 and 30000 respectively.
7. Now find the contours in the validated region and validate the side ratios and area of the bounding rectangle of the largest contour in that region. After validating you will get a perfect contour of a license plate. Now extract that contour from the original image. You will get the image of the plate:
8. To recognize the characters on the license plate precisely, we have to apply image segmentation. The first step is to extract the value channel from the HSV format of the plate’s image.
9. Now apply adaptive thresholding on the plate’s value channel image to binarize it and reveal the characters. The image of the plate can have different lighting conditions in different areas, in that case, adaptive thresholding can be more suitable to binarize because it uses different threshold values for different regions based on the brightness of the pixels in the region around it.
10. After binarizing apply bitwise not operation on the image to find the connected components in the image so that we can extract character candidates.
11. Construct a mask to display all the character components and then find contours in the mask. After extracting the contours take the largest one, find its bounding rectangle and validate side ratios.
12. After validating the side ratios find the convex hull of the contour and draw it on the character candidate mask.
13. Now find all the contours in the character candidate mask and extract those contour areas from the plate’s value thresholded image, you will get all the characters separately.
Steps 8 to 13 are performed by the segment_chars function that you can find below in the full source code. The driver code for the functions used in steps 6 to 13 is written in the method check_plate of class PlateFinder.
Full Source Code with its working: First, create a PlateFinder class that finds the license plates and validates their size ratio and area.
import cv2
import numpy as np
from skimage.filters import threshold_local
import tensorflow as tf
from skimage import measure
import imutils
import os
def sort_cont(character_contours):
"""
To sort contours
"""
i = 0
boundingBoxes = [cv2.boundingRect(c) for c in character_contours]
(character_contours, boundingBoxes) = zip ( * sorted ( zip (character_contours,
boundingBoxes),
key = lambda b: b[ 1 ][i],
reverse = False ))
return character_contours
def segment_chars(plate_img, fixed_width):
"""
extract Value channel from the HSV format
of image and apply adaptive thresholding
to reveal the characters on the license plate
"""
V = cv2.split(cv2.cvtColor(plate_img, cv2.COLOR_BGR2HSV))[ 2 ]
thresh = cv2.adaptiveThreshold(V, 255 ,
cv2.ADAPTIVE_THRESH_GAUSSIAN_C,
cv2.THRESH_BINARY,
11 , 2 )
thresh = cv2.bitwise_not(thresh)
# resize the license plate region to
# a canoncial size
plate_img = imutils.resize(plate_img, width = fixed_width)
thresh = imutils.resize(thresh, width = fixed_width)
bgr_thresh = cv2.cvtColor(thresh, cv2.COLOR_GRAY2BGR)
# perform a connected components analysis
# and initialize the mask to store the locations
# of the character candidates
labels = measure.label(thresh, background = 0 )
charCandidates = np.zeros(thresh.shape, dtype = 'uint8' )
# loop over the unique components
characters = []
for label in np.unique(labels):
# if this is the background label, ignore it
if label = = 0 :
continue
# otherwise, construct the label mask to display
# only connected components for the current label,
# then find contours in the label mask
labelMask = np.zeros(thresh.shape, dtype = 'uint8' )
labelMask[labels = = label] = 255
cnts = cv2.findContours(labelMask,
cv2.RETR_EXTERNAL,
cv2.CHAIN_APPROX_SIMPLE)
cnts = cnts[ 1 ] if imutils.is_cv3() else cnts[ 0 ]
# ensure at least one contour was found in the mask
if len (cnts) > 0 :
# grab the largest contour which corresponds
# to the component in the mask, then grab the
# bounding box for the contour
c = max (cnts, key = cv2.contourArea)
(boxX, boxY, boxW, boxH) = cv2.boundingRect(c)
# compute the aspect ratio, solodity, and
# height ration for the component
aspectRatio = boxW / float (boxH)
solidity = cv2.contourArea(c) / float (boxW * boxH)
heightRatio = boxH / float (plate_img.shape[ 0 ])
# determine if the aspect ratio, solidity,
# and height of the contour pass the rules
# tests
keepAspectRatio = aspectRatio < 1.0
keepSolidity = solidity > 0.15
keepHeight = heightRatio > 0.5 and heightRatio < 0.95
# check to see if the component passes
# all the tests
if keepAspectRatio and keepSolidity and keepHeight and boxW > 14 :
# compute the convex hull of the contour
# and draw it on the character candidates
# mask
hull = cv2.convexHull(c)
cv2.drawContours(charCandidates, [hull], - 1 , 255 , - 1 )
contours, hier = cv2.findContours(charCandidates,
cv2.RETR_EXTERNAL,
cv2.CHAIN_APPROX_SIMPLE)
if contours:
contours = sort_cont(contours)
# value to be added to each dimension
# of the character
addPixel = 4 for c in contours:
(x, y, w, h) = cv2.boundingRect(c)
if y > addPixel:
y = y - addPixel
else :
y = 0
if x > addPixel:
x = x - addPixel
else :
x = 0
temp = bgr_thresh[y:y + h + (addPixel * 2 ),
x:x + w + (addPixel * 2 )]
characters.append(temp)
return characters
else :
return None
class PlateFinder:
def __init__( self , minPlateArea, maxPlateArea):
# minimum area of the plate
self .min_area = minPlateArea
# maximum area of the plate
self .max_area = maxPlateArea
self .element_structure = cv2.getStructuringElement(
shape = cv2.MORPH_RECT, ksize = ( 22 , 3 ))
def preprocess( self , input_img):
imgBlurred = cv2.GaussianBlur(input_img, ( 7 , 7 ), 0 )
# convert to gray
gray = cv2.cvtColor(imgBlurred, cv2.COLOR_BGR2GRAY)
# sobelX to get the vertical edges
sobelx = cv2.Sobel(gray, cv2.CV_8U, 1 , 0 , ksize = 3 )
# otsu's thresholding
ret2, threshold_img = cv2.threshold(sobelx, 0 , 255 ,
cv2.THRESH_BINARY + cv2.THRESH_OTSU)
element = self .element_structure
morph_n_thresholded_img = threshold_img.copy()
cv2.morphologyEx(src = threshold_img,
op = cv2.MORPH_CLOSE,
kernel = element,
dst = morph_n_thresholded_img)
return morph_n_thresholded_img
def extract_contours( self , after_preprocess):
contours, _ = cv2.findContours(after_preprocess,
mode = cv2.RETR_EXTERNAL,
method = cv2.CHAIN_APPROX_NONE)
return contours
def clean_plate( self , plate):
gray = cv2.cvtColor(plate, cv2.COLOR_BGR2GRAY)
thresh = cv2.adaptiveThreshold(gray,
255 ,
cv2.ADAPTIVE_THRESH_GAUSSIAN_C,
cv2.THRESH_BINARY,
11 , 2 )
contours, _ = cv2.findContours(thresh.copy(),
cv2.RETR_EXTERNAL,
cv2.CHAIN_APPROX_NONE)
if contours:
areas = [cv2.contourArea(c) for c in contours]
# index of the largest contour in the area
# array
max_index = np.argmax(areas)
max_cnt = contours[max_index]
max_cntArea = areas[max_index]
x, y, w, h = cv2.boundingRect(max_cnt)
rect = cv2.minAreaRect(max_cnt)
if not self .ratioCheck(max_cntArea, plate.shape[ 1 ],
plate.shape[ 0 ]):
return plate, False , None
return plate, True , [x, y, w, h]
else :
return plate, False , None
def check_plate( self , input_img, contour):
min_rect = cv2.minAreaRect(contour)
if self .validateRatio(min_rect):
x, y, w, h = cv2.boundingRect(contour)
after_validation_img = input_img[y:y + h, x:x + w]
after_clean_plate_img, plateFound, coordinates = self .clean_plate(
after_validation_img)
if plateFound:
characters_on_plate = self .find_characters_on_plate(
after_clean_plate_img)
if (characters_on_plate is not None and len (characters_on_plate) = = 8 ):
x1, y1, w1, h1 = coordinates
coordinates = x1 + x, y1 + y
after_check_plate_img = after_clean_plate_img
return after_check_plate_img, characters_on_plate, coordinates
return None , None , None
def find_possible_plates( self , input_img):
"""
Finding all possible contours that can be plates
"""
plates = []
self .char_on_plate = []
self .corresponding_area = []
self .after_preprocess = self .preprocess(input_img)
possible_plate_contours = self .extract_contours( self .after_preprocess)
for cnts in possible_plate_contours:
plate, characters_on_plate, coordinates = self .check_plate(input_img, cnts)
if plate is not None :
plates.append(plate)
self .char_on_plate.append(characters_on_plate)
self .corresponding_area.append(coordinates)
if ( len (plates) > 0 ):
return plates
else :
return None
def find_characters_on_plate( self , plate):
charactersFound = segment_chars(plate, 400 )
if charactersFound:
return charactersFound
# PLATE FEATURES
def ratioCheck( self , area, width, height):
min = self .min_area
max = self .max_area
ratioMin = 3
ratioMax = 6
ratio = float (width) / float (height)
if ratio < 1 :
ratio = 1 / ratio
if (area < min or area > max ) or (ratio < ratioMin or ratio > ratioMax):
return False
return True
def preRatioCheck( self , area, width, height):
min = self .min_area
max = self .max_area
ratioMin = 2.5
ratioMax = 7
ratio = float (width) / float (height)
if ratio < 1 :
ratio = 1 / ratio
if (area < min or area > max ) or (ratio < ratioMin or ratio > ratioMax):
return False
return True
def validateRatio( self , rect):
(x, y), (width, height), rect_angle = rect
if (width > height):
angle = - rect_angle
else :
angle = 90 + rect_angle
if angle > 15 :
return False
if (height = = 0 or width = = 0 ):
return False
area = width * height
if not self .preRatioCheck(area, width, height):
return False
else :
return True
|
Here is the explanation of each and every method of PlateFinder class.
In the preprocessing method, the following step has been done:
- Blur the Image
- Convert to Grayscale
- Find vertical edges
- Threshold of the vertical-edged image.
- Close Morph the Threshold image.
Method extract_contours returns all external contours from the preprocessed image.
Method find_possible_plates preprocess the image with preprocess method then extracts contours by extract_contours method then it checks side ratios and area of all extracted contours and cleans the image inside the contour with check_plate and clean_plate methods. After cleaning the contour image with the clean_plate method, it finds all characters on the plate with the find_characters_on_plate method.
find_characters_on_plate method uses the segment_chars function to find the characters. It finds characters by computing the convex hull of the contours of a thresholded value image and drawing it on the characters to reveal them.
Now use OCR to recognize the character one by one on the extracted license plate.
class OCR:
def __init__( self , modelFile, labelFile):
self .model_file = modelFile
self .label_file = labelFile
self .label = self .load_label( self .label_file)
self .graph = self .load_graph( self .model_file)
self .sess = tf.compat.v1.Session(graph = self .graph,
config = tf.compat.v1.ConfigProto())
def load_graph( self , modelFile):
graph = tf.Graph()
graph_def = tf.compat.v1.GraphDef()
with open (modelFile, "rb" ) as f:
graph_def.ParseFromString(f.read())
with graph.as_default():
tf.import_graph_def(graph_def)
return graph
def load_label( self , labelFile):
label = []
proto_as_ascii_lines = tf.io.gfile.GFile(labelFile).readlines()
for l in proto_as_ascii_lines:
label.append(l.rstrip())
return label
def convert_tensor( self , image, imageSizeOuput):
"""
takes an image and transform it in tensor
"""
image = cv2.resize(image,
dsize = (imageSizeOuput,
imageSizeOuput),
interpolation = cv2.INTER_CUBIC)
np_image_data = np.asarray(image)
np_image_data = cv2.normalize(np_image_data.astype( 'float' ),
None , - 0.5 , . 5 ,
cv2.NORM_MINMAX)
np_final = np.expand_dims(np_image_data, axis = 0 )
return np_final
def label_image( self , tensor):
input_name = "import/input"
output_name = "import/final_result"
input_operation = self .graph.get_operation_by_name(input_name)
output_operation = self .graph.get_operation_by_name(output_name)
results = self .sess.run(output_operation.outputs[ 0 ],
{input_operation.outputs[ 0 ]: tensor})
results = np.squeeze(results)
labels = self .label
top = results.argsort()[ - 1 :][:: - 1 ]
return labels[top[ 0 ]]
def label_image_list( self , listImages, imageSizeOuput):
plate = ""
for img in listImages:
if cv2.waitKey( 25 ) & 0xFF = = ord ( 'q' ):
break
plate = plate + self .label_image( self .convert_tensor(img, imageSizeOuput))
return plate, len (plate)
|
It loads the pre-trained OCR model and its label file in load_graph and load_label functions. label_image_list method transforms the image to a tensor with the convert_tensor method and then predicts the label of the tensor with the label_image_list function and returns the license number.
Code: Create a main function to perform the whole task in a sequence.
if __name__ = = "__main__" :
findPlate = PlateFinder(minPlateArea = 4100 ,
maxPlateArea = 15000 )
model = OCR(modelFile = "model/binary_128_0.50_ver3.pb" ,
labelFile = "model/binary_128_0.50_labels_ver2.txt" )
cap = cv2.VideoCapture( 'test.MOV' )
while (cap.isOpened()):
ret, img = cap.read()
if ret = = True :
cv2.imshow( 'original video' , img)
if cv2.waitKey( 25 ) & 0xFF = = ord ( 'q' ):
break
possible_plates = findPlate.find_possible_plates(img)
if possible_plates is not None :
for i, p in enumerate (possible_plates):
chars_on_plate = findPlate.char_on_plate[i]
recognized_plate, _ = model.label_image_list(
chars_on_plate, imageSizeOuput = 128 )
print (recognized_plate)
cv2.imshow( 'plate' , p)
if cv2.waitKey( 25 ) & 0xFF = = ord ( 'q' ):
break
else :
break
cap.release()
cv2.destroyAllWindows()
|
Now, run this main file to see the output.
This is how the output will look like:
How to improve the model?
- You can set a particular small region in the frame to find the plates inside it. (make sure all vehicles must pass through that region).
- You can train your own machine learning model to recognize characters because the given model doesn’t recognize all the alphabets.
References:
Image preprocessing techniques in OpenCV documentation.