Detect and Recognize Car License Plate from a video in real time

Recognizing a Car License Plate is a very important task for a camera surveillance-based security system. We can extract the license plate from an image using some computer vision techniques and then we can use Optical Character Recognition to recognize the license number. Here I will guide you through the whole procedure of this task.

Requirements:

opencv-python >= 3.4.x
numpy >= 1.17.2
skimage >= 0.16.2
tensorflow >= 2.x.
imutils >= 0.5.3

.
Example:

Input:

Output:

29A33185

Approach:

Find all the contours in the image.
Find the bounding rectangle of every contour.
Compare and validate the sides ratio and area of every bounding rectangle with an average license plate.
Apply image segmentation in the image inside the validated contour to find characters in it.
Recognize characters using an OCR.

Methodology:
1. To reduce the noise we need to blur the input Image with Gaussian Blur and then convert it to grayscale.

2. Find vertical edges in the image.

3. To reveal the plate we have to binarize the image. For this apply Otsu’s Thresholding on the vertical edge image. In other thresholding methods, we have to choose a threshold value to binarize the image but Otsu’s Thresholding determines the value automatically.

4. Apply Closing Morphological Transformation on the thresholded image. Closing is useful to fill small black regions between white regions in a thresholded image. It reveals the rectangular white box of license plates.

5. To detect the plate we need to find contours in the image. It is important to binarize and morph the image before finding contours so that it can find a more relevant and less number of contours in the image. If you draw all the extracted contours on the original image, it would look like this:

6. Now find the minimum area rectangle enclosed by each of the contours and validate their side ratios and area. We have defined the minimum and maximum area of the plate as 4500 and 30000 respectively.

7. Now find the contours in the validated region and validate the side ratios and area of the bounding rectangle of the largest contour in that region. After validating you will get a perfect contour of a license plate. Now extract that contour from the original image. You will get the image of the plate:

This step is performed by clean_plate and ratioCheck method of class PlateFinder.

8. To recognize the characters on the license plate precisely, we have to apply image segmentation. The first step is to extract the value channel from the HSV format of the plate’s image.

9. Now apply adaptive thresholding on the plate’s value channel image to binarize it and reveal the characters. The image of the plate can have different lighting conditions in different areas, in that case, adaptive thresholding can be more suitable to binarize because it uses different threshold values for different regions based on the brightness of the pixels in the region around it.

10. After binarizing apply bitwise not operation on the image to find the connected components in the image so that we can extract character candidates.

11. Construct a mask to display all the character components and then find contours in the mask. After extracting the contours take the largest one, find its bounding rectangle and validate side ratios.

12. After validating the side ratios find the convex hull of the contour and draw it on the character candidate mask.

13. Now find all the contours in the character candidate mask and extract those contour areas from the plate’s value thresholded image, you will get all the characters separately.

Steps 8 to 13 are performed by the segment_chars function that you can find below in the full source code. The driver code for the functions used in steps 6 to 13 is written in the method check_plate of class PlateFinder.
Full Source Code with its working: First, create a PlateFinder class that finds the license plates and validates their size ratio and area.

Python3

import cv2 

import numpy as np 

from skimage.filters import threshold_local 

import tensorflow as tf 

from skimage import measure 

import imutils 

import os 

def sort_cont(character_contours): 

    """ 

    To sort contours 

    """

    i = 0

    boundingBoxes = [cv2.boundingRect(c) for c in character_contours] 

    (character_contours, boundingBoxes) = zip(*sorted(zip(character_contours, 

                                                          boundingBoxes), 

                                                      key = lambda b: b[1][i], 

                                                      reverse = False)) 

    return character_contours 

def segment_chars(plate_img, fixed_width): 

    """ 

    extract Value channel from the HSV format 

    of image and apply adaptive thresholding 

    to reveal the characters on the license plate 

    """

    V = cv2.split(cv2.cvtColor(plate_img, cv2.COLOR_BGR2HSV))[2] 

    thresh = cv2.adaptiveThreshold(V, 255, 

                                   cv2.ADAPTIVE_THRESH_GAUSSIAN_C, 

                                   cv2.THRESH_BINARY, 

                                   11, 2) 

    thresh = cv2.bitwise_not(thresh) 

    # resize the license plate region to 

    # a canoncial size 

    plate_img = imutils.resize(plate_img, width = fixed_width) 

    thresh = imutils.resize(thresh, width = fixed_width) 

    bgr_thresh = cv2.cvtColor(thresh, cv2.COLOR_GRAY2BGR) 

    # perform a connected components analysis 

    # and initialize the mask to store the locations 

    # of the character candidates 

    labels = measure.label(thresh, background = 0) 

    charCandidates = np.zeros(thresh.shape, dtype ='uint8') 

    # loop over the unique components 

    characters = [] 

    for label in np.unique(labels): 

        # if this is the background label, ignore it 

        if label == 0: 

            continue

        # otherwise, construct the label mask to display 

        # only connected components for the current label, 

        # then find contours in the label mask 

        labelMask = np.zeros(thresh.shape, dtype ='uint8') 

        labelMask[labels == label] = 255

        cnts = cv2.findContours(labelMask, 

                     cv2.RETR_EXTERNAL, 

                     cv2.CHAIN_APPROX_SIMPLE) 

        cnts = cnts[1] if imutils.is_cv3() else cnts[0] 

        # ensure at least one contour was found in the mask 

        if len(cnts) > 0: 

            # grab the largest contour which corresponds 

            # to the component in the mask, then grab the 

            # bounding box for the contour 

            c = max(cnts, key = cv2.contourArea) 

            (boxX, boxY, boxW, boxH) = cv2.boundingRect(c) 

            # compute the aspect ratio, solodity, and 

            # height ration for the component 

            aspectRatio = boxW / float(boxH) 

            solidity = cv2.contourArea(c) / float(boxW * boxH) 

            heightRatio = boxH / float(plate_img.shape[0]) 

            # determine if the aspect ratio, solidity, 

            # and height of the contour pass the rules 

            # tests 

            keepAspectRatio = aspectRatio < 1.0

            keepSolidity = solidity > 0.15

            keepHeight = heightRatio > 0.5 and heightRatio < 0.95

            # check to see if the component passes 

            # all the tests 

            if keepAspectRatio and keepSolidity and keepHeight and boxW > 14: 

                # compute the convex hull of the contour 

                # and draw it on the character candidates 

                # mask 

                hull = cv2.convexHull(c) 

                cv2.drawContours(charCandidates, [hull], -1, 255, -1) 

    contours, hier = cv2.findContours(charCandidates, 

                                         cv2.RETR_EXTERNAL, 

                                         cv2.CHAIN_APPROX_SIMPLE) 

    if contours: 

        contours = sort_cont(contours) 

        # value to be added to each dimension 

        # of the character 

        addPixel = 4 

        for c in contours: 

            (x, y, w, h) = cv2.boundingRect(c) 

            if y > addPixel: 

                y = y - addPixel 

            else: 

                y = 0

            if x > addPixel: 

                x = x - addPixel 

            else: 

                x = 0

            temp = bgr_thresh[y:y + h + (addPixel * 2), 

                              x:x + w + (addPixel * 2)] 

            characters.append(temp) 

        return characters 

    else: 

        return None

class PlateFinder: 

    def __init__(self, minPlateArea, maxPlateArea): 

        # minimum area of the plate 

        self.min_area = minPlateArea 

        # maximum area of the plate 

        self.max_area = maxPlateArea  

        self.element_structure = cv2.getStructuringElement( 

                              shape = cv2.MORPH_RECT, ksize =(22, 3)) 

    def preprocess(self, input_img): 

        imgBlurred = cv2.GaussianBlur(input_img, (7, 7), 0) 

        # convert to gray 

        gray = cv2.cvtColor(imgBlurred, cv2.COLOR_BGR2GRAY) 

        # sobelX to get the vertical edges 

        sobelx = cv2.Sobel(gray, cv2.CV_8U, 1, 0, ksize = 3)  

        # otsu's thresholding 

        ret2, threshold_img = cv2.threshold(sobelx, 0, 255, 

                         cv2.THRESH_BINARY + cv2.THRESH_OTSU) 

        element = self.element_structure 

        morph_n_thresholded_img = threshold_img.copy() 

        cv2.morphologyEx(src = threshold_img, 

                         op = cv2.MORPH_CLOSE, 

                         kernel = element, 

                         dst = morph_n_thresholded_img) 

        return morph_n_thresholded_img 

    def extract_contours(self, after_preprocess): 

        contours, _ = cv2.findContours(after_preprocess, 

                                          mode = cv2.RETR_EXTERNAL, 

                                          method = cv2.CHAIN_APPROX_NONE) 

        return contours 

    def clean_plate(self, plate): 

        gray = cv2.cvtColor(plate, cv2.COLOR_BGR2GRAY) 

        thresh = cv2.adaptiveThreshold(gray, 

                                       255, 

                                       cv2.ADAPTIVE_THRESH_GAUSSIAN_C, 

                                       cv2.THRESH_BINARY, 

                                       11, 2) 

        contours, _ = cv2.findContours(thresh.copy(), 

                                          cv2.RETR_EXTERNAL, 

                                          cv2.CHAIN_APPROX_NONE) 

        if contours: 

            areas = [cv2.contourArea(c) for c in contours] 

            # index of the largest contour in the area 

            # array 

            max_index = np.argmax(areas)  

            max_cnt = contours[max_index] 

            max_cntArea = areas[max_index] 

            x, y, w, h = cv2.boundingRect(max_cnt) 

            rect = cv2.minAreaRect(max_cnt) 

            if not self.ratioCheck(max_cntArea, plate.shape[1], 

                                                plate.shape[0]): 

                return plate, False, None

            return plate, True, [x, y, w, h] 

        else: 

            return plate, False, None

    def check_plate(self, input_img, contour): 

        min_rect = cv2.minAreaRect(contour) 

        if self.validateRatio(min_rect): 

            x, y, w, h = cv2.boundingRect(contour) 

            after_validation_img = input_img[y:y + h, x:x + w] 

            after_clean_plate_img, plateFound, coordinates = self.clean_plate( 

                                                        after_validation_img) 

            if plateFound: 

                characters_on_plate = self.find_characters_on_plate( 

                                              after_clean_plate_img) 

                if (characters_on_plate is not None and len(characters_on_plate) == 8): 

                    x1, y1, w1, h1 = coordinates 

                    coordinates = x1 + x, y1 + y 

                    after_check_plate_img = after_clean_plate_img 

                    return after_check_plate_img, characters_on_plate, coordinates 

        return None, None, None

    def find_possible_plates(self, input_img): 

        """ 

        Finding all possible contours that can be plates 

        """

        plates = [] 

        self.char_on_plate = [] 

        self.corresponding_area = [] 

        self.after_preprocess = self.preprocess(input_img) 

        possible_plate_contours = self.extract_contours(self.after_preprocess) 

        for cnts in possible_plate_contours: 

            plate, characters_on_plate, coordinates = self.check_plate(input_img, cnts) 

            if plate is not None: 

                plates.append(plate) 

                self.char_on_plate.append(characters_on_plate) 

                self.corresponding_area.append(coordinates) 

        if (len(plates) > 0): 

            return plates 

        else: 

            return None

    def find_characters_on_plate(self, plate): 

        charactersFound = segment_chars(plate, 400) 

        if charactersFound: 

            return charactersFound 

    # PLATE FEATURES 

    def ratioCheck(self, area, width, height): 

        min = self.min_area 

        max = self.max_area 

        ratioMin = 3

        ratioMax = 6

        ratio = float(width) / float(height) 

        if ratio < 1: 

            ratio = 1 / ratio 

        if (area < min or area > max) or (ratio < ratioMin or ratio > ratioMax): 

            return False

        return True

    def preRatioCheck(self, area, width, height): 

        min = self.min_area 

        max = self.max_area 

        ratioMin = 2.5

        ratioMax = 7

        ratio = float(width) / float(height) 

        if ratio < 1: 

            ratio = 1 / ratio 

        if (area < min or area > max) or (ratio < ratioMin or ratio > ratioMax): 

            return False

        return True

    def validateRatio(self, rect): 

        (x, y), (width, height), rect_angle = rect 

        if (width > height): 

            angle = -rect_angle 

        else: 

            angle = 90 + rect_angle 

        if angle > 15: 

            return False

        if (height == 0 or width == 0): 

            return False

        area = width * height 

        if not self.preRatioCheck(area, width, height): 

            return False

        else: 

            return True

Here is the explanation of each and every method of PlateFinder class.
In the preprocessing method, the following step has been done:

Blur the Image
Convert to Grayscale
Find vertical edges
Threshold of the vertical-edged image.
Close Morph the Threshold image.

Method extract_contours returns all external contours from the preprocessed image.

Method find_possible_plates preprocess the image with preprocess method then extracts contours by extract_contours method then it checks side ratios and area of all extracted contours and cleans the image inside the contour with check_plate and clean_plate methods. After cleaning the contour image with the clean_plate method, it finds all characters on the plate with the find_characters_on_plate method.

find_characters_on_plate method uses the segment_chars function to find the characters. It finds characters by computing the convex hull of the contours of a thresholded value image and drawing it on the characters to reveal them.

Now use OCR to recognize the character one by one on the extracted license plate.

Python3

class OCR: 

    def __init__(self, modelFile, labelFile): 

        self.model_file = modelFile 

        self.label_file = labelFile 

        self.label = self.load_label(self.label_file) 

        self.graph = self.load_graph(self.model_file) 

        self.sess = tf.compat.v1.Session(graph=self.graph,  

                                         config=tf.compat.v1.ConfigProto()) 

    def load_graph(self, modelFile): 

        graph = tf.Graph() 

        graph_def = tf.compat.v1.GraphDef() 

        with open(modelFile, "rb") as f: 

            graph_def.ParseFromString(f.read()) 

        with graph.as_default(): 

            tf.import_graph_def(graph_def) 

        return graph 

    def load_label(self, labelFile): 

        label = [] 

        proto_as_ascii_lines = tf.io.gfile.GFile(labelFile).readlines() 

        for l in proto_as_ascii_lines: 

            label.append(l.rstrip()) 

        return label 

    def convert_tensor(self, image, imageSizeOuput): 

        """ 

        takes an image and transform it in tensor 

        """

        image = cv2.resize(image, 

                           dsize =(imageSizeOuput, 

                                  imageSizeOuput), 

                           interpolation = cv2.INTER_CUBIC) 

        np_image_data = np.asarray(image) 

        np_image_data = cv2.normalize(np_image_data.astype('float'), 

                                      None, -0.5, .5, 

                                      cv2.NORM_MINMAX) 

        np_final = np.expand_dims(np_image_data, axis = 0) 

        return np_final 

    def label_image(self, tensor): 

        input_name = "import/input"

        output_name = "import/final_result"

        input_operation = self.graph.get_operation_by_name(input_name) 

        output_operation = self.graph.get_operation_by_name(output_name) 

        results = self.sess.run(output_operation.outputs[0], 

                                {input_operation.outputs[0]: tensor}) 

        results = np.squeeze(results) 

        labels = self.label 

        top = results.argsort()[-1:][::-1] 

        return labels[top[0]] 

    def label_image_list(self, listImages, imageSizeOuput): 

        plate = "" 

        for img in listImages: 

            if cv2.waitKey(25) & 0xFF == ord('q'): 

                break

            plate = plate + self.label_image(self.convert_tensor(img, imageSizeOuput)) 

        return plate, len(plate)

It loads the pre-trained OCR model and its label file in load_graph and load_label functions. label_image_list method transforms the image to a tensor with the convert_tensor method and then predicts the label of the tensor with the label_image_list function and returns the license number.
Code: Create a main function to perform the whole task in a sequence.

Python3

if __name__ == "__main__": 

    findPlate = PlateFinder(minPlateArea=4100,  

                                maxPlateArea=15000) 

    model = OCR(modelFile="model/binary_128_0.50_ver3.pb", 

                            labelFile="model/binary_128_0.50_labels_ver2.txt") 

    cap = cv2.VideoCapture('test.MOV') 

    while (cap.isOpened()): 

        ret, img = cap.read() 

        if ret == True: 

            cv2.imshow('original video', img) 

            if cv2.waitKey(25) & 0xFF == ord('q'): 

                break

            possible_plates = findPlate.find_possible_plates(img) 

            if possible_plates is not None: 

                for i, p in enumerate(possible_plates): 

                    chars_on_plate = findPlate.char_on_plate[i] 

                    recognized_plate, _ = model.label_image_list( 

                               chars_on_plate, imageSizeOuput = 128) 

                    print(recognized_plate) 

                    cv2.imshow('plate', p) 

                    if cv2.waitKey(25) & 0xFF == ord('q'): 

                        break

        else: 

            break

    cap.release() 

    cv2.destroyAllWindows()

Now, run this main file to see the output.

This is how the output will look like:

How to improve the model?

You can set a particular small region in the frame to find the plates inside it. (make sure all vehicles must pass through that region).
You can train your own machine learning model to recognize characters because the given model doesn’t recognize all the alphabets.

References:
Image preprocessing techniques in OpenCV documentation.

Article Tags :

Computer Subject

Project

Python

Image-Processing

OpenCV

Python-OpenCV