Find the text of the given tag using BeautifulSoup
Last Updated :
30 May, 2022
Web scraping is a process of using software bots called web scrapers in extracting information from HTML or XML content of a web page. Beautiful Soup is a library used for scraping data through python. Beautiful Soup works along with a parser to provide iteration, searching, and modifying the content that the parser provides(in the form of a parse tree). It’s fairly easy to crawl through the web pages and to find the text of a given tag using Beautiful Soup.
In this article, we will discuss finding the text from the given tag.
Step-by-step Approach:
- First import the library.
Python3
from bs4 import BeautifulSoup
import requests
|
- Fetch the raw HTML content from the URL.
Python3
html_content = requests.get(url).text
|
- Now parse through the content.
Python3
soup = BeautifulSoup(html_content, "html.parser" )
|
- After the content is parsed we search for a specific tag and print its text.
Python3
print (soup.find( 'title' ))
|
Below is the complete program.
Python3
from bs4 import BeautifulSoup
import requests
html_content = requests.get(url).text
soup = BeautifulSoup(html_content, "html.parser" )
print (soup.find( 'title' ).text)
|
Output:
Similarly to get all the occurrences of the given tag:
Python3
from bs4 import BeautifulSoup
import requests
html_content = requests.get(url).text
soup = BeautifulSoup(html_content, "html.parser" )
texts = soup.find_all( 'p' )
for text in texts:
print (text.get_text())
|
Output:
Like Article
Suggest improvement
Share your thoughts in the comments
Please Login to comment...