Extract text from PDFs as a text block list

Debenu Quick PDF Library provides an extensive API for programmatically extracting text from PDF files. This includes the options of just plain text output and also returning the text in a formatted CSV string with details about the font, size and style of the text. The API now includes additional text extraction functions for extracting […]

Incremental Updates in PDF files

Incremental updates provide a method for updating a PDF file without completely re-writing it, according to the PDF specification (1.7), incremental updates work like this: The contents of a PDF file can be updated incrementally without rewriting the entire file. Changes are appended to the end of the file, leaving its original contents intact. This […]

Programmatically find and replace URL links in PDF files

Debenu Quick PDF Library has an extensive API for programmatically working with links in PDF files. In the sample code below we demonstrate how to find all URLs in a PDF and then check to see if it matches our predefined URL and if it does then we replace it. This sample just works with […]

ASP.NET MVC Web Applications and Debenu Quick PDF Library

Debenu Quick PDF Library can be used in ASP.NET MVC Web Application projects to add rich PDF features to web applications. It works in much the same way as it does for regular desktop or server applications. This tutorial is done using Visual Studio 2013 and C#, but the steps are mostly the same if […]

ASP.NET Web Applications and Debenu Quick PDF Library

Debenu Quick PDF Library can be used in ASP.NET Web Application projects to add rich PDF features to web apps. It works in much the same way as it does for regular desktop or server applications. Please note: there is a difference between a ASP.NET Web Site project and a ASP.NET Web Application project. This article deals with […]

Merge content from multiple pages into one page

Sometimes it’s necessary to customize the content of a document on the fly from templates. There’s various different ways this can be done, but one way is to have a template with different content on each page of a document and then based on certain criteria (i.e. content from page 1, 5, 8 is required […]

Programmatically determine the content area of a page in a PDF

It’s easy to determine the size of a page by using the PageHeight and PageWidth, but it’s a little more difficult to determine the rectangular content area of a page. There are functions in the library for returning the text coordinates such as GetPageText and there’s functions for determining the coordinates of images, but there’s […]

Extract text from a defined rectangular area on a page

Debenu Quick PDF Library includes a range of functionality for extracting text from PDF files, but usually it’s for extract text from an entire page. The extract functions which include “area” in the name let you specify a rectangular area from which you wish to extract text. The key functions for this using regular memory […]

Programmatically impose multiple pages on one page (imposition)

There is not a single “imposition” function in Debenu Quick PDF Library but it’s easy to use the CapturePage and DrawCapturedPage functions to do the imposition. There is some sample code below that demonstrates how to take a few different PDFs, merge them together, and then impose each page onto one page in a new […]

Get embedded image coordinates from PDF files

Debenu Quick PDF Library lets you analyze, extract and replace embedded images in PDF files using the extensive image handling functions. The GetPageImageList function returns an ImageListID which you can use in the GetImageListItemDblProperty function. With this function you can get the coordinates for each image in the image list. The GetImageListItemIntProperty function useful for […]