rtika - R Interface to 'Apache Tika'

Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.

Last updated 2 years ago

extract-metadata extract-text java parse pdf-files peer-reviewed tesseract tika

6.00 score 55 stars 12 scripts 231 downloads