Skip to content
Related Articles

Related Articles

Improve Article
Save Article
Like Article

Python program to extract Email-id from URL text file

  • Last Updated : 29 Dec, 2020

Prerequisite : Pattern Matching with Python Regex
Given the URL text-file, the task is to extract all the email-ids from that text file and print the urllib.request library can be used to handle all the URL related work.

Example :

Input : 
Hello
This is Geeksforgeeks
review-team@geeksforgeeks.org
contribute@geeksforgeeks.org
GfG is a portal for geeks
feedback@geeksforgeeks.org
careers@geeksforgeeks.org

Output :
[]
[]
['review-team@geeksforgeeks.org']
['contribute@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']

 
URL text file can be handled using urllib.request. For extracting the emails using regular expressions, re library can be used. For more details of Regular Expression, refer this.




# library that handles the URL stuff
import urllib.request
  
# Importing module required for
# regular expressions
import re
  
# Assign urlopen to a file object variable
fhand = urllib.request.urlopen
  
for line in fhand:
    # Getting the text file
    # content line by line.
    s = line.decode().strip()
  
    # regex for extracting all email-ids
    # from the text file
    reg = re.findall(r"[A-Za-z0-9._%+-]+"
                     r"@[A-Za-z0-9.-]+"
                     r"\.[A-Za-z]{2,4}", s)
  
    # printing the list output
    print(reg)

Output :

[]
[]
['review-team@geeksforgeeks.org']
['contribute@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']
My Personal Notes arrow_drop_up
Recommended Articles
Page :

Start Your Coding Journey Now!