EvilZone

Programming and Scripting => Projects and Discussion => : PsychoRebellious February 19, 2015, 02:10:58 PM

: Need help with grabbing highlights from a pdf document
: PsychoRebellious February 19, 2015, 02:10:58 PM
Look at the screenshot here? First of all my skills. I am good with php and I know C++. Just that the only library I've worked with is allegro c++ and it has been a while. Now I've been looking for some program or someway to extract the highlights in a pdf document as I'm used to of reading and highlighting interesting verse, to be honest, I highlight a lot.
Anyway, I couldn't find an opensource tool that does exactly what I want, and I was wondering if something like that could be accomplished with c++. Since I don't know many c++ libraries and the limitations, I am wondering if there is a library that lets you modify/edit/create and manage pdfs and maybe then I'll be able to make something that extracts highlights/annotations from a pdf document. Anybody with prior experience with this?

EDIT: PS I tried opening the PDF in browser just wondering if there would be some class or other attribute added to the highlighted part but no luck. I can't find anything like that. I was speculating if I could just grab them with javascript.
DIT AGAIN: Found this. Not a free service. Only limited numbers of exports with the free offer.
https://www.sumnotes.net/
: Re: Need help with grabbing highlights from a pdf document
: Deque February 19, 2015, 09:11:30 PM
http://eric-blue.com/2010/12/17/learning-faster-automatically-extract-highlighted-text-from-pdf-documents/

http://socialdatablog.com/extract-pdf-annotations.html