Python program to extract Email-id from URL text file

Last Updated : 29 Dec, 2020

Prerequisite : Pattern Matching with Python Regex
Given the URL text-file, the task is to extract all the email-ids from that text file and print the urllib.request library can be used to handle all the URL related work.

Example :

Input : 
Hello
This is Geeksforgeeks
review-team@geeksforgeeks.org
review-team@geeksforgeeks.org
GfG is a portal for geeks
feedback@geeksforgeeks.org
careers@geeksforgeeks.org

Output :
[]
[]
['review-team@geeksforgeeks.org']
['review-team@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']

URL text file can be handled using urllib.request. For extracting the emails using regular expressions, re library can be used. For more details of Regular Expression, refer this.

# library that handles the URL stuff 
import urllib.request 
  
# Importing module required for 
# regular expressions 
import re 
  
# Assign urlopen to a file object variable 
fhand = urllib.request.urlopen 
    ('https://media.geeksforgeeks.org/wp-content/uploads/e-mail-1.txt') 
  
for line in fhand: 
    # Getting the text file 
    # content line by line. 
    s = line.decode().strip() 
  
    # regex for extracting all email-ids 
    # from the text file 
    reg = re.findall(r"[A-Za-z0-9._%+-]+"
                     r"@[A-Za-z0-9.-]+"
                     r"\.[A-Za-z]{2,4}", s) 
  
    # printing the list output 
    print(reg) 

Output :

[]
[]
['review-team@geeksforgeeks.org']
['review-team@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']

Suggest improvement

Extract text from PDF File using Python

Share your thoughts in the comments

Python program to extract Email-id from URL text file

Please Login to comment...

Similar Reads

What kind of Experience do you want to share?