Hello,
I need a program which extract text chunks from a PDF file, together with the coordinates of the chunk, the font and its attributes (size, bold, etc). The program should run under Linux from the command line.
Actually, I've already written a C++ program, which uses the library poppler. Unfortunately, it doesn't work for some files, such as "[login to view URL]". If you want, you can investigate what is wrong and implement a fix. All the files are includes, compilation details are in the file "[login to view URL]".
In your bids please describe the approach to the project.
Thanks.