Open In App
Related Articles

Converting HTML to Text with BeautifulSoup

Improve Article
Improve
Save Article
Save
Like Article
Like

Many times while working with web automation we need to convert HTML code into Text. This can be done using the BeautifulSoup. This module provides get_text() function that takes HTML as input and returns text as output.

Example 1:

Python3




# importing the library
from bs4 import BeautifulSoup
 
# Initializing variable
gfg = BeautifulSoup("<b>Section </b><br/>BeautifulSoup<ul>\
<li>Example <b>1</b></li>")
 
# Calculating result
res = gfg.get_text()
 
# Printing the result
print(res)


 

 

Output:

 

Section BeautifulSoupExample 1

 

Example 2: This example extracts data from the live website then converts it into text. In this example, we used the request module from urllib library to read HTML data from URL.

 

Python3




# importing the library
from bs4 import BeautifulSoup
from urllib import request
 
# Initializing variable
gfg = BeautifulSoup(request.urlopen(url).read())
 
# Extracting data for article section
bodyHtml = gfg.find('article', {'class' : 'content'})
 
# Calculating result
res = bodyHtml.get_text()
 
# Printing the result
print(res)


Output:


Whether you're preparing for your first job interview or aiming to upskill in this ever-evolving tech landscape, GeeksforGeeks Courses are your key to success. We provide top-quality content at affordable prices, all geared towards accelerating your growth in a time-bound manner. Join the millions we've already empowered, and we're here to do the same for you. Don't miss out - check it out now!

Last Updated : 16 Apr, 2021
Like Article
Save Article
Previous
Next
Similar Reads
Complete Tutorials