The PDF::Reader library implements a PDF parser conforming as much as possible to the PDF specification from Adobe.http://rubyforge.org/projects/pdf-reader8412013-05-12T13:53:33Z
A tool and library that can extract various areas of text from a PDF, especially a scholarly article PDF.https://github.com/CrossRef/pdfextract702012-04-26T11:15:04Z