eurostat 4.0.0
Major updates
- Add data.table to package Imports and make using data.table
functions optional with get_eurostat()use.data.tableargument. This is especially useful with big
datasets that would otherwise take a long time to go through the
different data cleaning functions or crash R with their large memory
footprint. (issue #277, PR #278)
- switch from httrpackage tohttr2(issue
#273, PR #276)
- Rewritten caching functionalities, making it possible to cache
filtered queries and rely on local caches if the user attempt to filter
a complete dataset that has already been cached. A list of queries and
cached item hashes is stored in a cache_list.json file in cache folder.
This can be viewed with a new function:
list_eurostat_cache_items(). (Affects issues mentioned in
#144, #257, #258, fixed in PR #267)
- Column names in .eurostatTOCobject (returned byget_eurostat_toc()) now use dots instead of spaces in the
style ofbase::make.names(), e.g. turninglast update of datatolast.update.of.data(PR
#271)
- .eurostatTOCobject includes a new hierarchy column
that represents the position of each folder, dataset and table in the
folder structure.
- search_eurostat()includes the option to search Table
of Content items by dataset codes in addition to titles. This makes it
possible to make further queries from similar datasets
(e.g. “nama_10_gdp”, “nama_10r_2gdp”, “nama_10r_3popgdp”) that might
have different titles.
- label_eurostat_tables()has been rewritten to use the
new SDMX API instead of- table_dic.dicfile in Eurostat Bulk
Download Listing (PR #271)
- Remove legacy code related to downloading data from old bulk
download facilities and temporary functions added in package version
3.7.14.
- get_eurostat_geospatial()now leverages on- giscoR::gisco_get_nuts()for downloading geospatial data
(PR #264, thanks to @dieghernan):- 
- "spdf"output class soft-deprecated, it would return a- sfobject with a message.
- make_validparameter soft-deprecated.
- Added ...to the function so additional parametes can
be passed togiscoR::gisco_get_nuts().
- Dataset eurostat_geodata_60_2016updated.
 
- get_eurostat_geospatial()now requires sf package to
work at all (PR #280, thanks to @dieghernan)
Minor updates
- Added suppressWarnings() to some of the tests that use TOC’s
directly or indirectly as the tests are not directly related to TOC
files.
- Use more parameter inheritance in package function documentation to
reduce discrepancies between different functions (DRY-principle) (PR
#270)
- Documentation more explicitly explains how to use filter parameters
in get_eurostat()andget_eurostat_json()functions. The documentation now warns users about potential problems
caused bytime/TIME_PERIODparameters when
used to query datasets that contain quarterly data (issue #260)
- As continuation of the update done in 3.7.14, started to use the new
URL also for dictionary files in get_eurostat_dic()andlabel_eurostat()functions.
- get_bibentry()now outputs “Accessed YYYY-MM-DD” and
“dataset last updated YYYY-MM-DD” in note field as otherwise it would be
sporadically printed or not at all printed from- urldatefield.
- Print more informative API error messages. (issue #261, PR #262,
thanks to @ake123)
- Removed sp,methodsandbroompackages from dependencies.
- Added giscoRto Suggests. (PR #264)
New features
- Added new function: get_eurostat_interactive()for
interactively searching and downloading data from Eurostat SDMX API. The
function aims to make good data citation practices more prominently
visible and also make it easier to explore what different arguments inget_eurostat()function do.
- There is also a new internal function
eurostat:::fixity_checksum()to easily calculate a fixity
checksum for datasets downloaded from Eurostat. The fixity checksum can,
for example, be saved in research notes and reported in as part of data
appendices. Printing the fixity checksum is encouraged by including an
option to print it in everyget_eurostat_interactive()query.
- Added a new internal function clean_eurostat_toc()for
easy removal of TOC objects from .EurostatEnv environment. (PR
#278)
- New internal function check_lang()(PR #270)
- get_eurostat()function now explicity accepts a ‘lang’
argument, for passing onwards to- get_eurostat_json()and- label_eurostat()(PR #270)
- New user facing function: get_eurostat_folder()for
downloading all datasets in a folder. The function is limited to
downloading folders that contain at maximum 20 datasets. This function
relies on new internal helper functions:toc_count_whitespace(),toc_determine_hierarchy(),toc_count_children()andtoc_list_children().
(PR #270)
- EXPERIMENTAL: get_eurostat_toc()andset_eurostat_toc()now have experimental features that
support downloading TOCs in French and German as well. This support, in
turn, is leveraged inget_bibentry()which now has a
language parameter:lang(PR #270)
- Related to updates to get_eurostat_toc(),search_eurostat()now supports searching from French and
German TOC-files as well (PR #270)
Deprecated and defunct
- grepEurostatTOC()is completely marked as defunct and
is enroute to being removed from the package as- search_eurostat()is now the only way to fetch Eurostat TOC
items and search (grep) them (PR #270)
- During the development of the 4.0.0 version there was a temporary
function called label_eurostat_vars2that has been removed
in the final version, as promised earlier: “The old function will be
completely removed after October 2023 when Eurostat Bulk Download
Listing website is retired andlabel_eurostat_vars2will be
renamed tolabel_eurostat_vars()”. The newlabel_eurostat_vars()function uses the new SDMX API to
retrieve names for dataset columns. Function evolution is subject to
ongoing Eurostat API developments. (PR #270)
Bug fixes
- Added a more informatic warning message in situations where TOC
datasets downloaded from Eurostat might not have proper titles. For some
reason this was isolated to German and French language versions of TOC
while English language TOC had proper titles for all items. (PR
#278)
- get_bibentry()returns correct codes for titles and
warns the user if some / all of the requested codes were not found in
the TOC (PR #270)
- get_bibentry()uses the date field with the internal
BibEntry format that can be easily translated to other formats: bibtex,
bibentry (PR #270)
- get_bibentry()now outputs dataset codes in titles
correctly so that- bibtexand- biblatexentries
can be copypasted into bibliographies without adding escape characters
manually (PR #270)
- Fix issue related to downloading quarterly data (issue #260, PR
#271)
- Reduce RAM usage in eurotime2date()when handling big
datasets containing weekly data and tens of millions of rows (dataset
used for testing mentioned in issue #200).
eurostat 3.8.3 (2023-03-07)
Bug fixes
- Fix date handling bug in the get_eurostat_json()andeurotime2date()functions (issue #251, reported by @lz1nwm). Theget_eurostat_json()function uses the temporaryeurotime2date()function for date handling until the old
bulk download API is deprecated.
eurostat 3.8.2 (2023-03-06)
Minor updates
- use curl::curl_downloadon Windows platforms instead ofutils::download.fileas the latter causes the following
error: “error reading from the connection […] invalid or incomplete
compressed data”. This affects only files downloaded from the new
API.
eurostat 3.7.14 (2023-02-22)
Major updates
- Updated get_eurostat()and its assorted functions to
download data from the new dissemination API (related to issues #251,
#243). See Eurostat web page Transition - from Eurostat Bulk Download to
API for a list of differences between old and new data sources:
https://wikis.ec.europa.eu/display/EUROSTATHELP/Transition+-+from+Eurostat+Bulk+Download+to+API
- Added new temporary functions for downloading and handling data from
the new dissemination API: get_eurostat_raw2,tidy_eurostat2,convert_time_col2,eurotime2date2,eurotime2num2andlabel_eurostat2. When the old bulk download facilities are
decommissioned, these functions will replace the old functions with old
naming schemes (without the 2s at the end).
- tidy_eurostat2function is now able to handle multiple
time frequencies in one call: For example, you can download annual,
quarterly, and monthly data simply by using a vector c(“A”, “Q”, “M”) in
select_time instead of using these singular frequencies in separate
calls. The function will also return multiple time series in one dataset
if select_time is NULL (as it is by default). If the dataset contains
multiple time series and these are explicitly downloaded / no
select_time parameter is given, a message will be printed.
- eurotime2numcan now handle monthly and weekly data as
well.
- Added a new parameter to get_eurostat()function:
legacy_bulk_download (default = TRUE). By setting this parameter to
FALSE the user can download data from the new dissemination API. If you
want to test the new API before it becomes the only way to download the
data (and we very much encourage you to do so), set this parameter to
FALSE.
Minor updates
- Removed render-rmarkdown.yaml workflow used for rendering README.md
file. README.md must be generated locally from now on.
eurostat 3.7.13 (2023-02-01)
- Updated get_eurostat_json()to migrate from JSON web
service to API Statistics (addressed in issues #243, #251). Please note
that the output from JSON API is now slightly different than before: the
datasets now contain a freq column to indicate the frequency with which
data has been collected, for example annually “A”, monthly “M” or
quarterly “Q”. See Eurostat - Data browser online help website for more
information:
https://wikis.ec.europa.eu/display/EUROSTATHELP/API+Statistics+-+migrating+from+JSON+web+service+to+API+Statistics
- Minor fixes in get_bibentry()andget_eurostat_geospatial()
eurostat 3.7.12 (2022-06-28)
- Updated included dataset eurostat_geodata_60_2016to
fix the issue of old-style crs object (#237)
- Added information about different variables in
eurostat_geodata_60_2016so that the dataset is more
understandable and usable for testing purposes. Added the same
information toget_eurostat_geospatial()documentation as
well.
- Added the GISCO copyright disclaimer to
eurostat_geodata_60_2016andget_eurostat_geospatial()documentation.
- Get rid of unnecessary “No encoding supplied: defaulting to UTF-8.”
messages in get_eurostat_geospatial()by setting content
encoding to UTF-8 whenhttr::content()function is
called
- dplyr and tidyr namespaces are no longer imported completely, only
selected few functions with importFrom
eurostat 3.7.10 (2022-02-09)
- Fixed URL issues in tests and examples
eurostat 3.7.9 (2020-10-01)
- Function documentation migrated from old \code{},\link{}syntax to markdown (issue #230, PR #231 by @dieghernan)
eurostat 3.7.8 (2020-09-30)
- Package cache management updated: options()command is
no longer needed and the cache dir can be modified persistently with a
custom function (issue #223, PR #228 by @dieghernan)
eurostat 3.7.7 (2020-06-24)
eurostat 3.7.6 (2021-05-20)
- Deprecated add_nuts_level(),harmonize_geo_code(),recode_to_nuts_2016()andrecode_to_nuts_2013(); these functions were moved to
the new package regions. The problem of sub-national geo codes is
explained in the new vignette “Mapping Regional Data, Mapping Metadata
Problems”, which replaces the “Regional data examples for the eurostat R
package” vignette. This is a shared vignette, but the new regions
package has more articles on how to work with sub-national data. (issues
#218 and #219, PR #220 by @antaldaniel)
eurostat 3.7.5 (2020-05-12)
- Moved sf from Imports to Suggests and made
get_eurostat_geospatial()return a message if sf is not
installed. This is to increase compatibility of eurostat-package on
systems that have trouble installing sf (issue #213)
- Wrapped some problem causing examples to \dontrun{}for
a quick CRAN release
eurostat 3.7.3
- Removed outdated dependencies (mapproj, plotrix, rsdmx)
eurostat 3.7.2
- Non-intersecting sf-geometries in get_eurostat_geospatial (PR #202
by @retostauffer)
eurostat 3.6.4 (2020-05-12)
- Fixed stringsAsFactors for R-4.0.0 and moved default to FALSE
eurostat 3.6.3 (2020-04-21)
- Stabilized http requests (PR by @annnvv)
eurostat 3.5.3
- get_eurostat switched to v2.1
eurostat 3.5.2
- internet and proxy setting fixes
- bibentry fix
eurostat 3.4.1
- Fixed vignette
- Added automated error messages to URL download failures
eurostat 3.3.3
- Countries and Country Codes data.frames get label column for country
names in the Eurostat database.
- Fixed vignette duplicate entry issue and smaller issues
- Added get_bibentry
eurostat 3.3.1
- The label_eurostat()has new countrycode and
countrycode_nomatch arguments to label with countrycode package and
custom_dic argument to add custom dictionary.
- Vignette updated
eurostat 3.2.3
Minor features
- dplyr moved from Dependencies to Imports
- curl removed from Imports
- solved geospatial map issues
- eurostat_url moved to options
eurostat 3.2.1
Major updates
- Improved support for sf in map visualization
Minor features
- ./data/ generation script in ./data-raw/ updated to make all data
reproducible
Bug fixes
- Typo corrected from Cisco to Gisco
eurostat 3.1.5
Minor features
- Added new example data set to reduce repeated downloads from
eurostat service
- Now label_eurostat()gives always an error by default,
if labelling introduces duplicated labels. A newfix_duplicatedargument is add to fix duplicated labels
automatically. (#79, #90)
- Shrinked the package tarball size
Bug fixes
- Modified tutorial to accommodate the CRAN error
- Fixed cut_to_classes to generate unique breaks
eurostat 3.1.1
R Journal submission
- Release version associated with the R Journal manuscript 2017 final
version
- Git release added with Zenodo DOI
Minor features
- Changed maintainer email address from louhos to leo
- Added ./docs/ (automated package website generated with
pkgdown)
- Expanded unit tests
- Gitter badge added to README
- Added ./revdep/ to check possible reverse dependencies
automatically
- Cheat sheet added
Bug fixes
- search_eurostat()accepts new argument- fixed: if- TRUE(default),- patternprovided will used as is; if- FALSE,- patternwill be interpreted as a true regex pattern.
- Augmented the list of Suggested packages in the DESCRIPTION file,
including the Cairo package (#70)
- Updated the journal manuscript based on reviewer feedback
eurostat 2.2.20001
- Development version opened
eurostat 2.2.1
- Fixed canonical cran url in README
eurostat 2.1.1
- The complete package now using tibbles
- Rare encoding issues circumvented (#55)
- Improved functionality within firewall-protected systems (#63)
eurostat 2.0
- The get_eurostat()returns tibbles (#52)
- The get_eurostat_dic()andget_eurostat_toc()return tibbles
- Now read_tsv()is used instead ofread.csv()(#29)
eurostat 1.2.27
- Calls to extract_numeric are replaced by as.numeric (#60)
- The column ‘flags’ is not being labelled even if type = “label”
(#61)
eurostat 1.2.22
- The European Commission and the Eurostat generally uses ISO 3166-1
alpha-2 codes with two exceptions: EL (not GR) is used to represent
Greece, and UK (not GB) is used to represent the United Kingdom. This
now can be handled with harmonize_country_code()which
converts the raw data values from EL to GR and from UK to GB.
- Harmonized roxygen documentation to better follow CRAN
conventions
- Changed Windows encoding to UTF for input files
- Improved memory usage
eurostat 1.2.21
- The get_eurostat()can now get data also from the
Eurostat JSON API viaget_eurostat_json(). It also have a
new argumenttypeto select labels for variable values
instead of codes.
- Fix an error after update to tidyr 0.4.0(#47).
eurostat 1.2.13
- New select_timeargument forget_eurostat()to select a time frequency in case of
multi-frequency datasets. Now theget_eurostat()also gives
an error if you try to get multi-frequency with other time formats thantime_format = "raw". (#30)timecolumn is also
now in ascending order.
- get_eurostat()gets a new argument- compress_fileto control compression of the cache file.
Also cache filenames includes now all relevant arguments. (#28)
- For search_eurostat()a new type optiontype = "all"to search all types.
- For label_eurostat()new arguments. Acodeto retain also codes for specified columns. Aeu_orderto
order factor levels in Eurostat order, which uses the new functiondic_order().
- Now label_eurostat_vars(x)gives labels for names, if x
is other than a character or a factor andlabel_eurostat_tables(x)does not accept other than a
character or a factor.
- For get_eurostat()a new argumentstringsAsFactorsto control the factor conversion of
variables.
- eurotime2date(and- get_eurostat) convers
now also daily data.
eurostat 1.0.16
eurostat 1.0.14 (2015-03-19)
- Package largely rewritten
- Vignette added
- Changed the value column to values in the get_eurostat output
eurostat 0.9.1 (2014-04-24)
- Package collected from statfi and smarterpoland