Prerequisite : Pattern Matching with Python Regex
Given the URL text-file, the task is to extract all the email-ids from that text file and print the urllib.request
library can be used to handle all the URL related work.
Example :
Input :
Hello
This is Geeksforgeeks
review-team@geeksforgeeks.org
review-team@geeksforgeeks.org
GfG is a portal for geeks
feedback@geeksforgeeks.org
careers@geeksforgeeks.org
Output :
[]
[]
['review-team@geeksforgeeks.org']
['review-team@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']
URL text file can be handled using urllib.request
. For extracting the emails using regular expressions, re
library can be used. For more details of Regular Expression, refer this.
import urllib.request
import re
fhand = urllib.request.urlopen
for line in fhand:
s = line.decode().strip()
reg = re.findall(r "[A-Za-z0-9._%+-]+"
r "@[A-Za-z0-9.-]+"
r "\.[A-Za-z]{2,4}" , s)
print (reg)
|
Output :
[]
[]
['review-team@geeksforgeeks.org']
['review-team@geeksforgeeks.org']
[]
['feedback@geeksforgeeks.org']
['careers@geeksforgeeks.org']