Program to remove HTML tags from a given String

Given a string str which contains some HTML tags, the task is to remove all the tags present in the given string str.


Input: str = “<div><b>Geeks for Geeks</b></div>”
Output: Geeks for Geeks

Input: str = “<a href=””>GFG</a>”
Output: GFG

Approach: The idea is to use the Regular Expression to solve this problem. The following steps can be followed to compute the resultant string:

  1. Get the string.
  2. Since every HTML tags are inclosed in angular brackets(<>). Therefore use replaceAll() function in regex to replace every substring start with “<“ and ends with “>” to empty string.
  3. The function is used as:
    String str;
    str.replaceAll("\\", "");

Below is the implementation of the above approach:






// Java program for the above approach
class GFG {
    // Function to remove the HTML tags
    // from the given tags
    static void RemoveHTMLTags(String str)
        // Use replaceAll function in regex
        // to erase every tags enclosed in <>
        str = str.replaceAll("\\<.*?\\>", "");
        // Print string after removing tags
    // Driver Code
    public static void main(String[] args)
        String str;
        // Given String
        str = "<div><b>Geeks for Geeks</b></div>";
        // Function call to print the
        // HTML string after removing tags



Geeks for Geeks

Attention reader! Don’t stop learning now. Get hold of all the important DSA concepts with the DSA Self Paced Course at a student-friendly price and become industry ready.

My Personal Notes arrow_drop_up

Check out this Author's contributed articles.

If you like GeeksforGeeks and would like to contribute, you can also write an article using or mail your article to See your article appearing on the GeeksforGeeks main page and help other Geeks.

Please Improve this article if you find anything incorrect by clicking on the "Improve Article" button below.