textpress: A Lightweight and Versatile NLP Toolkit

A simple Natural Language Processing (NLP) toolkit focused on search-centric workflows with minimal dependencies. The package offers key features for web scraping, text processing, corpus search, and text embedding generation via the 'HuggingFace API' <https://huggingface.co/docs/api-inference/index>.

Version: 1.0.0
Depends: R (≥ 3.5)
Imports: data.table, httr, Matrix, rvest, stringi, stringr, xml2, pbapply, jsonlite, lubridate
Published: 2024-10-14
DOI: 10.32614/CRAN.package.textpress
Author: Jason Timm [aut, cre]
Maintainer: Jason Timm <JaTimm at salud.unm.edu>
BugReports: https://github.com/jaytimm/textpress/issues
License: MIT + file LICENSE
URL: https://github.com/jaytimm/textpress, https://jaytimm.github.io/textpress/
NeedsCompilation: no
Materials: README
CRAN checks: textpress results

Documentation:

Reference manual: textpress.pdf

Downloads:

Package source: textpress_1.0.0.tar.gz
Windows binaries: r-devel: textpress_1.0.0.zip, r-release: textpress_1.0.0.zip, r-oldrel: textpress_1.0.0.zip
macOS binaries: r-release (arm64): textpress_1.0.0.tgz, r-oldrel (arm64): textpress_1.0.0.tgz, r-release (x86_64): textpress_1.0.0.tgz, r-oldrel (x86_64): textpress_1.0.0.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=textpress to link to this page.