PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML format. It also supports compressing data to minimize size. PDF2JSON is available for Windows and Linux.
Usage: pdf2json [options]
[] -f : first page to convert -l : last page to convert -compress : Use compressed mode -q : dont print any messages or errors -h : print usage information -help : print usage information -i : ignore images -noframes : use standard output -xml : output for XML post-processing -split : split the output into separate files on every X page. use "%" as part of output file name to specify where page number should appear (e.g. Paper_%.js) -hidden : output hidden text -enc