gbif_download() now uses minioclient as
a backend, offering dramatically (100x+) better performance, especially
on multi-core machines with high-bandwidth network connections.
gbif_local() now defaults to the duckdb backend, and
uses duckdbfs to streamline the interface. Recent versions of
duckdb perform substantially better than the alternatives.
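
As a rough illustration of the workflow these two functions support, the sketch below defines a small summary helper and then shows, in a block that is not run, how it might be applied to a local snapshot. The helper name count_by_class_year is hypothetical, and the column names phylum, class and year are assumed to match the GBIF occurrence schema. Calling gbif_download() and gbif_local() with no arguments assumes their defaults point at the latest snapshot and the default local directory.

```r
library(dplyr)

# Hypothetical helper: count records per class and year for one phylum.
# Works on any dplyr-compatible table (a data frame or a lazy duckdb table)
# that has `phylum`, `class` and `year` columns.
count_by_class_year <- function(tbl, phylum_name, since) {
  tbl %>%
    filter(phylum == !!phylum_name, year > !!since) %>%
    count(class, year) %>%
    collect()  # only matters for lazy tables; a no-op on data frames
}

# Not run: the snapshot is very large, so this is only a usage outline.
if (FALSE) {
  library(gbifdb)
  gbif_download()        # assumed defaults: latest version, default directory
  gbif <- gbif_local()   # lazy table backed by duckdb
  count_by_class_year(gbif, "Chordata", 1990)
}
```

Because the table is lazy, the filter and count are expected to be pushed down to duckdb, and only the small summary is pulled into R by collect().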
Breaking changes
gbif_conn() is deprecated.

gbif_version() works with local and remote sources, and can also
report all available versions.

gbif_local() now returns a remote table instead of a connection,
paralleling the use of gbif_remote().

gbif_conn() (and thus gbif_local()) gain the ability to use arrow as a
backend to duckdb, and this is now the default. This improves
performance and avoids crashes when all columns are requested.

gbif_download() and gbif_remote())

gbif_download() now automatically detects versions, downloads parquet
files to a path that parallels the remote path (using release-specific
subdirectories), and allows bucket to be configured.

to_duckdb=TRUE is now the default in gbif_remote(), creating a
consistent lazy-table interface with support for windowed functions.

gbif_conn() (and gbif_local()) now automatically detect the path of the
most recent GBIF version in gbif_dir(). There is no longer any need to
manually set the path for the occurrence.parquet/ subfolder.
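
The version lookup and the windowed-function support mentioned above might be used along the following lines. This is only a sketch: the helper top_classes_per_year and the column name class_rank are hypothetical, the local and all arguments to gbif_version() are assumptions about its signature, and the columns year and class are assumed to exist in the occurrence table.

```r
library(dplyr)

# Hypothetical helper: keep the n_top most frequently recorded classes
# within each year, using a window function (rank within a group).
top_classes_per_year <- function(tbl, n_top = 3) {
  tbl %>%
    count(year, class) %>%
    group_by(year) %>%
    mutate(class_rank = min_rank(desc(n))) %>%  # ranked per year
    ungroup() %>%
    filter(class_rank <= !!n_top) %>%
    collect()
}

# Not run: requires network access and scans a very large remote dataset.
if (FALSE) {
  library(gbifdb)
  gbif_version()              # most recent remote version
  gbif_version(local = TRUE)  # assumed argument: most recent local version
  gbif_version(all = TRUE)    # assumed argument: all available versions
  gbif <- gbif_remote()       # lazy table over the remote parquet files
  top_classes_per_year(gbif, n_top = 3)
}
```

On a lazy duckdb-backed table, the grouped min_rank() call is expected to translate to a SQL window function, which is the kind of operation the consistent lazy-table interface is meant to allow.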