Pre-process Timor Landings Survey data (step 1)
preprocess_landings_step_1.Rd
Downloads raw structured data from cloud storage services and pre-process into a binary format that is easier to deal with in R.
Usage
preprocess_landings_step_1(log_threshold = logger::DEBUG)
Arguments
- log_threshold
The (standard Apache logj4) log level used as a threshold for the logging infrastructure. See logger::log_levels for more details
Details
In order to not exceed CPU memory limits in Docker containers, the
preprocessing of raw landings data was splitted in two containers
(two separate jobs in GitHub actions), this function process the first half
of raw data, while the function preprocess_landings_step_2
process the second
half.
This function downloads the landings data from a given version (specified in
the config file conf.yml
.The parameters needed are:
surveys:
landings:
api:
survey_id:
token:
file_prefix:
version:
preprocess:
storage:
storage_name:
key:
options:
project:
bucket:
service_account_key:
Progress through the function is tracked using the package logger.