Python program to extract Email-id from URL text file
  • Last Updated : 29 Dec, 2020

Prerequisite : Pattern Matching with Python Regex
Given the URL of a text file, the task is to extract all the email-ids from that text file and print them. The urllib.request library can be used to handle all the URL-related work.

Example :

Input : 
Hello
This is Geeksforgeeks
review-team@geeksforgeeks.org
contribute@geeksforgeeks.org
GfG is a portal for geeks
feedback@geeksforgeeks.org
careers@geeksforgeeks.org

Output :
[]
[]
['review-team@geeksforgeeks.org']
['contribute@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']

 
The text file at the URL can be read using urllib.request. To extract the emails with regular expressions, the re library can be used. For more details on regular expressions, refer to the prerequisite article mentioned above.
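As a quick illustration of the regular-expression step on its own, here is a minimal sketch that applies re.findall-style matching to two single lines taken from the example input. The name EMAIL_PATTERN is introduced here for illustration only; the pattern is the same one used in this article's program.

```python
import re

# Hypothetical name for illustration; the pattern matches
# <local part>@<domain>.<2 to 4 letter suffix>
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,4}")

# A line without an address yields an empty list.
no_match = EMAIL_PATTERN.findall("This is Geeksforgeeks")

# A line containing an address yields a one-element list.
one_match = EMAIL_PATTERN.findall("review-team@geeksforgeeks.org")

print(no_match)
print(one_match)
```

Note that the `{2,4}` quantifier only accepts the first two to four letters of the final domain suffix, so an address with a longer top-level domain would be matched only partially.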




# library that handles the URL stuff
import urllib.request
  
# Importing module required for
# regular expressions
import re
  
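# Open the URL; urlopen() returns a file-like object.
# NOTE: `url` is a placeholder, set it to the address of the text file.
url = "..."
fhand = urllib.request.urlopen(url)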
  
for line in fhand:
    # Getting the text file
    # content line by line.
    s = line.decode().strip()
  
    # regex for extracting all email-ids
    # from the text file
    reg = re.findall(r"[A-Za-z0-9._%+-]+"
                     r"@[A-Za-z0-9.-]+"
                     r"\.[A-Za-z]{2,4}", s)
  
    # printing the list output
    print(reg)

Output :

[]
[]
['review-team@geeksforgeeks.org']
['contribute@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']
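The program above prints one list per line, including empty lists for lines without an address. If a single flat list is preferred, the following is a minimal sketch of one possible variant. The helper name extract_emails and the name EMAIL_PATTERN are hypothetical, and an in-memory io.BytesIO object stands in for the object returned by urlopen so that the example runs without network access.

```python
import io
import re

# Same pattern as in the program above, compiled once (hypothetical name).
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,4}")


def extract_emails(lines):
    """Return all email-ids found in an iterable of byte lines."""
    emails = []
    for line in lines:
        # Decode each byte line to text before matching.
        emails.extend(EMAIL_PATTERN.findall(line.decode().strip()))
    return emails


# Stand-in for the URL response: the example input from this article.
sample = io.BytesIO(
    b"Hello\n"
    b"This is Geeksforgeeks\n"
    b"review-team@geeksforgeeks.org\n"
    b"contribute@geeksforgeeks.org\n"
    b"GfG is a portal for geeks\n"
    b"feedback@geeksforgeeks.org\n"
    b"careers@geeksforgeeks.org\n"
)

found = extract_emails(sample)
print(found)
```

Because the object returned by urllib.request.urlopen can also be iterated line by line as bytes, it could be passed to extract_emails in place of the sample object.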
