2025-10-06 06:34:27,338 - data_prep_class.py - INFO - 98 - Get data from Big Query with SELECT firma, datum, fg_sg, fg_bez, ladenvkp, katalogvkp, oeffnungszeit, fg_typ
FROM `test-ww-ki-witt.co_stat_daily_prognosis_bronzelayer.sales`
2025-10-06 06:34:31,764 - data_prep_class.py - INFO - 107 - firma datum fg_sg fg_bez ladenvkp katalogvkp oeffnungszeit fg_typ date
0 Sieh an! 2022-08-24 20 Furth im Wald 58.00 130.00 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Preislandgeschäft 2022-08-24
1 Witt 2022-08-24 20 Furth im Wald 933.30 2703.70 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Preislandgeschäft 2022-08-24
2 Witt 2022-08-24 47 Traunstein I 2443.89 3534.33 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-08-24
3 Sieh an! 2022-08-24 47 Traunstein I 22.40 28.00 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-08-24
4 Witt 2022-08-24 102 Freising 1350.54 2807.20 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-08-24
2025-10-06 06:34:31,780 - dataprep_main.py - DEBUG - 39 - shape: (310101, 9) shops: 123 start: 2021-01-04 end: 2025-10-05 sumladenvkp: 288925898
2025-10-06 06:34:31,856 - data_prep_class.py - DEBUG - 137 - output of shops and codes found in sales data to fg_sg_bez.xlsx
2025-10-06 06:34:32,099 - dataprep_main.py - DEBUG - 50 - shape: (148189, 3) shops: 123 start: 2021-01-04 end: 2025-10-05 sumladenvkp: 288925898
2025-10-06 06:34:32,099 - data_prep_class.py - DEBUG - 180 - max date in dataframe: 2025-10-05 00:00:00
2025-10-06 06:34:32,100 - data_prep_class.py - DEBUG - 193 - date_range: 2025-10-06 00:00:00 2025-12-31 00:00:00
2025-10-06 06:34:32,193 - data_prep_class.py - DEBUG - 206 - size of new dataset: (10701, 3) vs 10701
2025-10-06 06:34:32,206 - data_prep_class.py - DEBUG - 210 - final size: (158890, 3)
2025-10-06 06:34:32,210 - dataprep_main.py - DEBUG - 58 - shape: (158890, 3) shops: 123 start: 2021-01-04 end: 2025-12-31 sumladenvkp: 288925898
2025-10-06 06:34:32,509 - dataprep_main.py - DEBUG - 69 - shape: (154612, 24) shops: 113 start: 2021-01-04 end: 2025-12-31 sumladenvkp: 284854001
2025-10-06 06:34:32,509 - data_prep_class.py - DEBUG - 255 - check Friedrichshafen
2025-10-06 06:34:32,526 - data_prep_class.py - DEBUG - 259 - date fg_sg fg_bez ladenvkp
134246 2024-01-02 183 Friedrichshafen 643.92
2025-10-06 06:35:08,602 - dataprep_main.py - DEBUG - 76 - shape: (154612, 24) shops: 113 start: 2021-01-04 end: 2025-12-31 sumladenvkp: 284854001
2025-10-06 06:35:08,620 - dataprep_main.py - DEBUG - 80 - shape: (154612, 24) shops: 113 start: 2021-01-04 end: 2025-12-31 sumladenvkp: 284854001
2025-10-06 06:35:16,102 - dataprep_main.py - DEBUG - 94 - shape: (115218, 38) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781
2025-10-06 06:35:16,103 - data_prep_class.py - DEBUG - 216 - before removing holidays: (115218, 38)
2025-10-06 06:35:16,117 - data_prep_class.py - DEBUG - 218 - aftere removing holidays: (115218, 38)
2025-10-06 06:35:16,120 - dataprep_main.py - DEBUG - 99 - shape: (115218, 38) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781
2025-10-06 06:35:16,557 - data_prep_class.py - DEBUG - 476 - missing values last year: lastyear_date 37065
lastyear_ladenvkp 37065
lastyear_fg_sg 37065
dtype: int64
2025-10-06 06:35:18,041 - dataprep_main.py - DEBUG - 138 - shape: (115218, 259) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781
2025-10-06 06:35:27,618 - data_prep_class.py - DEBUG - 433 - merge on agp - missing: lastyear_date 31920
lastyear_ladenvkp 31920
lastyear_fg_sg 31920
imputed_lastyear_ladenvkp 28931
imputed2_lastyear_ladenvkp 27534
av_lastyear_ladenvkp 27534
agp 7945
dtype: int64
2025-10-06 06:35:27,622 - dataprep_main.py - DEBUG - 146 - shape: (110073, 260) shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 221824313
2025-10-06 06:35:27,624 - data_prep_class.py - DEBUG - 440 - fill missing agp: 0
2025-10-06 06:35:27,627 - dataprep_main.py - DEBUG - 151 - shape: (110073, 260) shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 221824313