2026-01-06 06:34:47,597 - data_prep_class.py - INFO - 98 - Get data from Big Query with SELECT firma, datum, fg_sg, fg_bez, ladenvkp, katalogvkp, oeffnungszeit, fg_typ
FROM `test-ww-ki-witt.co_stat_daily_prognosis_bronzelayer.sales`
2026-01-06 06:34:51,621 - data_prep_class.py - INFO - 107 - firma datum fg_sg fg_bez ladenvkp katalogvkp oeffnungszeit fg_typ date
0 Josef Witt GmbH 2025-10-27 194 Michelfeld 558.75 964.68 None Preislandgeschäft 2025-10-27
1 Sieh an! GmbH 2025-10-27 195 Kelheim 136.40 160.00 None Basisfiliale 2025-10-27
2 Josef Witt GmbH 2025-10-27 195 Kelheim 2250.93 2467.58 None Basisfiliale 2025-10-27
3 Josef Witt GmbH 2025-10-27 196 Straubing 2718.26 2995.18 None Basisfiliale 2025-10-27
4 Sieh an! GmbH 2025-10-27 196 Straubing 248.85 325.00 None Basisfiliale 2025-10-27
2026-01-06 06:34:51,636 - dataprep_main.py - DEBUG - 39 - shape: (328471, 9) shops: 123 start: 2021-01-04 end: 2026-01-03 sumladenvkp: 306232534
2026-01-06 06:34:51,713 - data_prep_class.py - DEBUG - 137 - output of shops and codes found in sales data to fg_sg_bez.xlsx
2026-01-06 06:34:51,954 - dataprep_main.py - DEBUG - 50 - shape: (156223, 3) shops: 123 start: 2021-01-04 end: 2026-01-03 sumladenvkp: 306232534
2026-01-06 06:34:51,955 - data_prep_class.py - DEBUG - 180 - max date in dataframe: 2026-01-03 00:00:00
2026-01-06 06:34:51,955 - data_prep_class.py - DEBUG - 193 - date_range: 2026-01-04 00:00:00 2026-03-31 00:00:00
2026-01-06 06:34:52,051 - data_prep_class.py - DEBUG - 206 - size of new dataset: (10701, 3) vs 10701
2026-01-06 06:34:52,064 - data_prep_class.py - DEBUG - 210 - final size: (166924, 3)
2026-01-06 06:34:52,068 - dataprep_main.py - DEBUG - 58 - shape: (166924, 3) shops: 123 start: 2021-01-04 end: 2026-03-31 sumladenvkp: 306232534
2026-01-06 06:34:52,374 - dataprep_main.py - DEBUG - 69 - shape: (162573, 24) shops: 113 start: 2021-01-04 end: 2026-03-31 sumladenvkp: 301999900
2026-01-06 06:34:52,374 - data_prep_class.py - DEBUG - 255 - check Friedrichshafen
2026-01-06 06:34:52,392 - data_prep_class.py - DEBUG - 259 - date fg_sg fg_bez ladenvkp
141400 2024-01-02 183 Friedrichshafen 643.92
2026-01-06 06:35:29,221 - dataprep_main.py - DEBUG - 76 - shape: (162573, 24) shops: 113 start: 2021-01-04 end: 2026-03-31 sumladenvkp: 301999900
2026-01-06 06:35:29,242 - dataprep_main.py - DEBUG - 80 - shape: (162573, 24) shops: 113 start: 2021-01-04 end: 2026-03-31 sumladenvkp: 301999900
2026-01-06 06:35:36,595 - dataprep_main.py - DEBUG - 94 - shape: (115218, 38) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781
2026-01-06 06:35:36,595 - data_prep_class.py - DEBUG - 216 - before removing holidays: (115218, 38)
2026-01-06 06:35:36,609 - data_prep_class.py - DEBUG - 218 - aftere removing holidays: (115218, 38)
2026-01-06 06:35:36,613 - dataprep_main.py - DEBUG - 99 - shape: (115218, 38) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781
2026-01-06 06:35:37,043 - data_prep_class.py - DEBUG - 476 - missing values last year: lastyear_date 37065
lastyear_ladenvkp 37065
lastyear_fg_sg 37065
dtype: int64
2026-01-06 06:35:38,507 - dataprep_main.py - DEBUG - 138 - shape: (115218, 259) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781
2026-01-06 06:35:48,221 - data_prep_class.py - DEBUG - 433 - merge on agp - missing: lastyear_date 31920
lastyear_ladenvkp 31920
lastyear_fg_sg 31920
imputed_lastyear_ladenvkp 28931
imputed2_lastyear_ladenvkp 27534
av_lastyear_ladenvkp 27534
agp 7945
dtype: int64
2026-01-06 06:35:48,225 - dataprep_main.py - DEBUG - 146 - shape: (110073, 260) shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 221824313
2026-01-06 06:35:48,227 - data_prep_class.py - DEBUG - 440 - fill missing agp: 0
2026-01-06 06:35:48,230 - dataprep_main.py - DEBUG - 151 - shape: (110073, 260) shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 221824313