Friday, March 27, 2015

Extracting Data from PDF Documents

I could have used this idea a number of years ago, in projects meant to gather and archive enterprise knowledge. Data tables are often embedded in PDF documents, and extracting them systematically, in volume, sometimes ends up as a manual task with potential for error. In CWorld: Tabula, a free, open source tool to do this. I have not tried it.
