daiR 1.0.1
daiR 1.0.0
- Major revision.
- Added several processor-related functions following an update of the
Google Document AI REST API. These include
list_processor_types(),create_processor(),enable_processor(),disable_processor(), anddelete_processor().
- Added get_text()andget_tables(), which
replacetext_from_dai_response(),text_from_dai_file(),tables_from_dai_response()andtables_from_dai_file().
- Added get_entities()anddraw_entities().
- Removed dai_tab_sync()anddai_tab_async()following Google’s discontinuation of the v1beta2 endpoint on 31 January
2024.
- Modified the parameters of the draw*()functions for
better consistency with other functions.
- Renamed the .Rfiles and regrouped the functions.
- Added prettier messages using the clipackage.
daiR 0.9.9
- Added the function make_hocrto convert DAI output to
hOCR files, thereby facilitating the creation of searchable PDFs.
daiR 0.9.8
- Changed the build_token_df()andbuild_block_df()functions so they can take as input
response objects fromdai_sync()in addition to json files
fromdai_async().
- Changed the build_token_df()andbuild_block_df()functions to include confidence scores in
the dataframe, so as to enable filtering on confidence.
daiR 0.9.7
- Changed the “draw_*” functions so they work with
response objects (fromdai_sync()anddai_sync_tab()) as well as with json files fromdai_async_tab().
- Changed the “draw_*” functions to allow customizing
color and thickness of lines around bounding boxes.
- Modified tables_from_dai_response()so that it handles
response objects fromdai_sync()with form parser
processors.
daiR 0.9.6
- Added three new functions relating to processors:
get_processors(),get_processor_info(), andget_processor_versions().
- Added parameter proc_vtodai_sync()anddai_async, allowing for specification of processor
version.
daiR 0.9.5
- Added two new functions: dai_notify()andmerge_shards()
- Modified text_from_dai_response()andtext_from_dai_file()to allow saving the output straight to
a text file.
- Fixed a bug in dai_status()that caused an error when
processing responses from the v1beta2 endpoint
(dai_tab_async()).
daiR 0.9.3
- Changed the “draw_*” functions to allow custom output
filenames.
daiR 0.9.2
- Added more support documentation in connection with JOSS
release.
daiR 0.9.1
- Added new function (draw_blocks_new()) to inspect block
bounding boxes after reprocessing.
daiR 0.9.0
daiR 0.8.0
- Simplified the auth functions to avoid modifying the global
environment. Tokens are no longer held in an internal .auth object.
Removed dai_has_token(), dai_deauth(), and create_folder().
daiR 0.7.0
- Revised draw_blocks(),draw_paragraphs(),draw_lines(), anddraw_tokens()functions.
These functions no longer require supplying a pdf file, as they get the
images from a base64-encoded string in the json file.
daiR 0.6.0
- New processing functions adapted to the new stable release of
Document AI API (v1). The
dai_sync()/dai_async()functions now access
the new v1 endpoint, which has foreign language support. However, the v1
endpoint currently does not support table extraction, so the old
processing functions (which access the v1beta2 endpoint) are kept under
the new namesdai_sync_tab()anddai_async_tab(). I expect this to be a temporary solution
until DAI’s capabilities are consolidated in a single endpoint, at which
stage the*_tab()functions will be phased out.
daiR 0.4.0
- New table extraction functions.
daiR 0.2.0
- New helper functions and more robust code.
daiR 0.1.0
- Revised and expanded auth functions.
daiR 0.0.1