
Java Program to Extract an Image From a PDF


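This program extracts an image from a PDF using Java by rendering the first page of the document as an image and saving it to a file. An external JAR file (Apache PDFBox) must be added to the project for the imports to resolve. The steps and the implementation are given below.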


  • Extract the image using the Apache PDFBox module.
  • Load the existing PDF document using file I/O.
  • Create an object of the PDFRenderer class.
  • Render an image from the PDF document as a BufferedImage.
  • Write the extracted image to a new file.
  • Close the document.

Note: The Apache PDFBox JAR files must be downloaded and added to the project before running the program. For more details, refer to the module's documentation.



// Extracting images from a PDF using Java
import java.awt.image.BufferedImage;
import java.io.File;
import javax.imageio.ImageIO;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.rendering.PDFRenderer;

class GFG {
    public static void main(String[] args) throws Exception
    {
        // Existing PDF document
        // to be loaded using file I/O
        // (PDDocument.load is the PDFBox 2.x API)
        File newFile
            = new File("C:/Documents/GeeksforGeeks.pdf");
        PDDocument pdfDocument = PDDocument.load(newFile);

        // PDFRenderer class to be instantiated
        // i.e. creating its object
        PDFRenderer pdfRenderer
            = new PDFRenderer(pdfDocument);

        // Rendering the first page (index 0)
        // of the PDF document
        // as a BufferedImage
        BufferedImage img = pdfRenderer.renderImage(0);

        // Writing the extracted
        // image to a new file
        // (format name matches the .png extension)
        ImageIO.write(
            img, "PNG",
            new File("C:/Documents/GeeksforGeeks.png"));
        System.out.println(
            "Image has been extracted successfully");

        // Closing the PDF document
        pdfDocument.close();
    }
}
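
The write step is worth a small check. ImageIO.write returns false, rather than throwing, when no writer is registered for the requested format name, and the format name is independent of the file extension. The following is a minimal sketch, not part of the original program or of PDFBox: the class ImageWriteSketch and its methods formatFor and writeImage are hypothetical names introduced here for illustration. It derives the format from the file extension and fails loudly if nothing was written.

```java
import java.awt.image.BufferedImage;
import java.io.File;
import java.io.IOException;
import java.util.Locale;
import javax.imageio.ImageIO;

// Hypothetical helper class, not part of the original program or of PDFBox
class ImageWriteSketch {

    // Returns the lower-case extension of the file name, e.g. "png"
    static String formatFor(File target) {
        String name = target.getName();
        int dot = name.lastIndexOf('.');
        if (dot < 0 || dot == name.length() - 1) {
            throw new IllegalArgumentException(
                "No file extension in: " + name);
        }
        return name.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    // Writes img using the format implied by the file extension
    static void writeImage(BufferedImage img, File target)
        throws IOException {
        String format = formatFor(target);
        // ImageIO.write returns false when no suitable writer is found
        if (!ImageIO.write(img, format, target)) {
            throw new IOException(
                "No ImageIO writer for format: " + format);
        }
    }
}
```

In the program above, the write step could then be expressed as ImageWriteSketch.writeImage(img, new File("C:/Documents/GeeksforGeeks.png")), which keeps the encoded format and the file extension in agreement.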


PDF before execution:

Existing PDF document containing the image to be extracted

Image after extraction:

Extracted Image from the PDF document

Last Updated : 17 Nov, 2020