Friday, March 27, 2015
Extracting Data from PDF Documents
I could have used this idea a number of years ago, in projects meant to gather and archive enterprise knowledge. Data tables are often embedded in PDF documents, and extracting them systematically, in volume, sometimes ends up as a manual task with potential for error. In CWorld: Tabula, a free, open source tool for doing this. I have not tried it.
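
A minimal sketch of one way the error risk might be reduced after extraction, assuming the tables are exported as CSV (Tabula offers CSV export). This is not Tabula's API: the function name and the sample data are hypothetical, for illustration only. It flags rows whose column count differs from the header, a common sign that a table boundary was misread.

```python
import csv
import io


def find_ragged_rows(csv_text):
    """Return (row_number, column_count) for rows that do not match the header width.

    Row numbers are 1-based and count the header as row 1.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        return []
    expected = len(rows[0])  # the header defines the expected width
    return [
        (number, len(row))
        for number, row in enumerate(rows[1:], start=2)
        if len(row) != expected
    ]


# Hypothetical sample standing in for a table exported from a PDF.
sample = "region,year,units\nEast,2014,120\nWest,2014\nNorth,2014,95\n"
print(find_ragged_rows(sample))  # prints [(3, 2)]
```

A check like this would not catch wrong values, only structural damage, so spot checks against the source PDF would still be needed.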