Frequently Asked Question:
How can I get text in a text block and its bounding rectangle?
GetPageText() function gives text line-by-line . I am interested in full text block.
PDF files can be generated using all sorts of methods, fonts, styles etc..
Quick PDF does not currently have functions to report anything more than a line by line basis.
It gets very difficult to determine what should be in the block and what shouldn't. What happens when you have a bolded word in a paragraph. How do you report that ? Also it may be straight forward for letter but what how do we support invoices. Once we start adding a feature like this it would be impossible for it to keep everyone happy due to the endless variations and interpretations of what is a block and what is not.
We are looking at returning a formatted text (.TXT) file version of the file in the next release of Quick PDF library.