Prerequisite: Read a file line-by-line in Python
Given a text file fname, a number N, the task is to read the last N lines of the file.
As we know, Python provides multiple in-built features and modules for handling files. Let’s discuss different ways to read last N lines of a file using Python.
File:
Method 1: Naive approach
In this approach, the idea is to use a negative iterator with the readlines() function to read all the lines requested by the user from the end of file.
# Python implementation to # read last N lines of a file # Function to read # last N lines of the file def LastNlines(fname, N):
# opening file using with() method
# so that file get closed
# after completing work
with open (fname) as file :
# loop to read iterate
# last n lines and print it
for line in ( file .readlines() [ - N:]):
print (line, end = '')
# Driver Code: if __name__ = = '__main__' :
fname = 'File1.txt'
N = 3
try :
LastNlines(fname, N)
except :
print ( 'File not found'
|
Output:
Eighth line Ninth line Tenth line
Method 2: Using OS module and buffering policy
In this approach, the idea is to work on the buffering policy in the python. A buffer stores a part of data received from a file stream of the operating system for a time period it is used and then more data comes in.
The buffer size determines the size of the data that can be stored at a time until it is used. We have the option to pass an integer to buffering in order to set buffering policy and if we do not specify any policy then the size of the buffer depends upon the device’s block size. Usually, the buffer is 4096 or 8192 bytes long. In this approach size of the buffer is 8192 bytes.
Moreover, the st_size attribute of os.stat() method in the OS module is used to represent the size of the file in bytes.
Below is the implementation of the above approach.
# Python implementation to # read last N lines of a file # Using OS module and buffering policy # importing os module import os
# Function to read # last N lines of the file def LastNlines(fname, N):
# taking buffer size of 8192 bytes
bufsize = 8192
# calculating size of
# file in bytes
fsize = os.stat(fname).st_size
iter = 0
# opening file using with() method
# so that file get closed
# after completing work
with open (fname) as f:
if bufsize > fsize:
# adjusting buffer size
# according to size
# of file
bufsize = fsize - 1
# list to store
# last N lines
fetched_lines = []
# while loop to
# fetch last N lines
while True :
iter + = 1
# moving cursor to
# the last Nth line
# of file
f.seek(fsize - bufsize * iter )
# storing each line
# in list upto
# end of file
fetched_lines.extend(f.readlines())
# halting the program
# when size of list
# is equal or greater to
# the number of lines requested or
# when we reach end of file
if len (fetched_lines) > = N or f.tell() = = 0 :
print (''.join(fetched_lines[ - N:]))
break
# Driver Code: if __name__ = = '__main__' :
fname = 'File1.txt'
N = 3
try :
LastNlines(fname, N)
except :
print ( 'File not found' )
|
Output:
Eighth line Ninth line Tenth line
Method 3: Through Exponential search
In this method, the idea is to use Exponential Search algorithm which is generally used for searching sorted, unbounded or infinite lists. To get information about exponential search click here.
This approach uses assert statement which acts as a debugging tool to checks a condition. The program will continue to execute if the given statement is true otherwise, it generates an AssertionError exception. To get more details of assert statements click here.
Click here to get familiar with different kinds of use of seek() method.
Below is the implementation of the above approach.
# Python implementation to # read last N lines of a file # through Exponential search # Function to read # last N lines of the file def LastNlines(fname, N):
# assert statement check
# a condition
assert N > = 0
# declaring variable
# to implement
# exponential search
pos = N + 1
# list to store
# last N lines
lines = []
# opening file using with() method
# so that file get closed
# after completing work
with open (fname) as f:
# loop which runs
# until size of list
# becomes equal to N
while len (lines) < = N:
# try block
try :
# moving cursor from
# left side to
# pos line from end
f.seek( - pos, 2 )
# exception block
# to handle any run
# time error
except IOError:
f.seek( 0 )
break
# finally block
# to add lines
# to list after
# each iteration
finally :
lines = list (f)
# increasing value
# of variable
# exponentially
pos * = 2
# returning the
# whole list
# which stores last
# N lines
return lines[ - N:]
# Driver Code: if __name__ = = '__main__' :
fname = 'File1.txt'
N = 3
try :
lines = LastNlines(fname, N)
for line in lines:
print (line, end = '')
except :
print ( 'File not found' )
|
Output:
Eighth line Ninth line Tenth line