
Preprocess Price Data
preprocess_price_landings.Rd
This function preprocesses raw price data from Google Cloud Storage. It performs various data cleaning and transformation operations, including column renaming, data pivoting, and standardization of fish categories and prices.
Usage
preprocess_price_landings(log_threshold = logger::DEBUG)
Value
No return value. Function processes the data and uploads the result as a Parquet file to Google Cloud Storage.
Details
The function performs the following main operations:
Downloads raw price data from Google Cloud Storage
Renames columns and selects relevant fields (submission_id, landing_site, landing_date, and price fields)
Cleans and standardizes text fields
Pivots price data from wide to long format
Standardizes fish category names and separates size information
Converts data types (datetime, character, numeric)
Removes duplicate entries
Uploads the processed data as a Parquet file to Google Cloud Storage