Load WARC (Web ARChive) files into Apache Spark using 'sparklyr'. This allows to read files from the Common Crawl project <http://commoncrawl.org/>.
| Version: | 0.1.6 |
| Imports: | DBI, sparklyr, Rcpp |
| LinkingTo: | Rcpp |
| Published: | 2022-01-11 |
| DOI: | 10.32614/CRAN.package.sparkwarc |
| Author: | Javier Luraschi [aut],
Yitao Li |
| Maintainer: | Edgar Ruiz <edgar at rstudio.com> |
| BugReports: | https://github.com/r-spark/sparkwarc |
| License: | Apache License 2.0 |
| NeedsCompilation: | yes |
| SystemRequirements: | C++11 |
| Materials: | README |
| CRAN checks: | sparkwarc results |
| Reference manual: | sparkwarc.html , sparkwarc.pdf |
| Package source: | sparkwarc_0.1.6.tar.gz |
| Windows binaries: | r-devel: sparkwarc_0.1.6.zip, r-release: sparkwarc_0.1.6.zip, r-oldrel: sparkwarc_0.1.6.zip |
| macOS binaries: | r-release (arm64): sparkwarc_0.1.6.tgz, r-oldrel (arm64): sparkwarc_0.1.6.tgz, r-release (x86_64): sparkwarc_0.1.6.tgz, r-oldrel (x86_64): sparkwarc_0.1.6.tgz |
| Old sources: | sparkwarc archive |
Please use the canonical form https://CRAN.R-project.org/package=sparkwarc to link to this page.