pdftools: Text Extraction, Rendering and Converting of PDF Documents

Utilities based on 'libpoppler' for extracting text, fonts, attachments and metadata from a PDF file. Also supports high-quality rendering of PDF documents to PNG, JPEG, or TIFF format, or to raw bitmap vectors for further processing in R.
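
The capabilities described above can be illustrated with a minimal sketch. It assumes pdftools and its Poppler system dependency are installed, and it uses the package's pdf_info, pdf_text, pdf_fonts, pdf_attachments, pdf_render_page and pdf_convert functions. The sample PDF and the output file name are hypothetical and exist only to keep the example self-contained; consult the reference manual for the authoritative argument lists.

```r
library(pdftools)

# Build a small one-page PDF with base R graphics so the example
# does not depend on any external file (hypothetical sample document).
sample_pdf <- file.path(tempdir(), "sample.pdf")
pdf(sample_pdf, width = 7, height = 5)
plot(1:10, main = "Sample plot")
dev.off()

# Metadata: a list that includes, among other fields, the page count.
info <- pdf_info(sample_pdf)

# Text: a character vector with one element per page.
pages <- pdf_text(sample_pdf)

# Fonts and attachments found in the document.
fonts <- pdf_fonts(sample_pdf)
attachments <- pdf_attachments(sample_pdf)

# Rendering: a raw bitmap array for further processing in R.
bitmap <- pdf_render_page(sample_pdf, page = 1, dpi = 72)

# Conversion: write page 1 to a PNG file and return the file name.
png_file <- pdf_convert(
  sample_pdf,
  format = "png",
  pages = 1,
  dpi = 72,
  filenames = file.path(tempdir(), "sample_page1.png"),
  verbose = FALSE
)
```

The raw bitmap returned by pdf_render_page can be passed on to image packages such as png or webp (both listed under Suggests) when a format other than those handled by pdf_convert is needed.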

Version: 3.4.0
Imports: Rcpp (≥ 0.12.12), qpdf
LinkingTo: Rcpp
Suggests: png, webp, tesseract, testthat
Published: 2023-09-25
Author: Jeroen Ooms [aut, cre]
Maintainer: Jeroen Ooms <jeroen at berkeley.edu>
BugReports: https://github.com/ropensci/pdftools/issues
License: MIT + file LICENSE
URL: https://docs.ropensci.org/pdftools/ (website), https://github.com/ropensci/pdftools#readme (devel), https://poppler.freedesktop.org (upstream)
NeedsCompilation: yes
SystemRequirements: Poppler C++ API: libpoppler-cpp-dev (deb) or poppler-cpp-devel (rpm), and poppler-data (rpm/deb) package.
Materials: NEWS
CRAN checks: pdftools results

Documentation:

Reference manual: pdftools.pdf

Downloads:

Package source: pdftools_3.4.0.tar.gz
Windows binaries: r-devel: pdftools_3.4.0.zip, r-release: pdftools_3.4.0.zip, r-oldrel: pdftools_3.4.0.zip
macOS binaries: r-release (arm64): pdftools_3.4.0.tgz, r-oldrel (arm64): pdftools_3.4.0.tgz, r-release (x86_64): pdftools_3.4.0.tgz
Old sources: pdftools archive

Reverse dependencies:

Reverse imports: bridger, chatAI4R, daiR, disclosuR, doconv, dtrackr, eurlex, findR, gdiff, huito, IDEATools, iheiddown, mapscanner, OMICsPCA, pdfsearch, readtext, speech, staplr, SwimmeR, tesseract, texor, TextForecast, timeLineGraphics, vmeasur
Reverse suggests: caracas, easyr, fairadapt, fixest, flextable, fplot, gMOIP, gridGraphics, hunspell, LexisNexisTools, magick, pagedown, patientProfilesVis, piecepackr, plotgardener, ReactomeContentService4R, ricu, rjtools, rock, RRphylo, seqArchRplus, slickR, spelling, texPreview, tm, xmpdf

Linking:

Please use the canonical form https://CRAN.R-project.org/package=pdftools to link to this page.