2025-03-06 06:34:16,764 - data_prep_class.py - INFO - 94 - Get data from Big Query with SELECT firma, datum, fg_sg, fg_bez, ladenvkp, katalogvkp, oeffnungszeit, fg_typ
FROM `test-ww-ki-witt.co_stat_daily_prognosis_bronzelayer.sales`
2025-03-06 06:34:22,158 - data_prep_class.py - INFO - 102 - firma datum fg_sg fg_bez ladenvkp katalogvkp oeffnungszeit fg_typ date
0 Witt 2022-07-27 20 Furth im Wald 764.94 1742.57 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Preislandgeschäft 2022-07-27
1 Sieh an! 2022-07-27 20 Furth im Wald 130.00 259.98 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Preislandgeschäft 2022-07-27
2 Witt 2022-07-27 47 Traunstein I 883.03 1008.93 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-07-27
3 Witt 2022-07-27 102 Freising 3307.51 5404.46 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-07-27
4 Sieh an! 2022-07-27 102 Freising 36.76 62.00 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-07-27
2025-03-06 06:34:22,172 - dataprep_main.py - DEBUG - 44 - shape: (263549, 9) shops: 122 start: 2021-01-04 end: 2025-03-05 sumladenvkp: 250276461
2025-03-06 06:34:22,236 - data_prep_class.py - DEBUG - 132 - output of shops and codes found in sales data to fg_sg_bez.xlsx
2025-03-06 06:34:22,465 - dataprep_main.py - DEBUG - 55 - shape: (128857, 3) shops: 122 start: 2021-01-04 end: 2025-03-05 sumladenvkp: 250276461
2025-03-06 06:34:22,465 - data_prep_class.py - DEBUG - 175 - max date in dataframe: 2025-03-05 00:00:00
2025-03-06 06:34:22,466 - data_prep_class.py - DEBUG - 188 - date_range: 2025-03-06 00:00:00 2025-05-31 00:00:00
2025-03-06 06:34:22,560 - data_prep_class.py - DEBUG - 201 - size of new dataset: (10614, 3) vs 10614
2025-03-06 06:34:22,571 - data_prep_class.py - DEBUG - 205 - final size: (139471, 3)
2025-03-06 06:34:22,574 - dataprep_main.py - DEBUG - 63 - shape: (139471, 3) shops: 122 start: 2021-01-04 end: 2025-05-31 sumladenvkp: 250276461
2025-03-06 06:34:22,843 - dataprep_main.py - DEBUG - 74 - shape: (135282, 24) shops: 113 start: 2021-01-04 end: 2025-05-31 sumladenvkp: 246211253
2025-03-06 06:34:22,844 - data_prep_class.py - DEBUG - 250 - check Friedrichshafen
2025-03-06 06:34:22,859 - data_prep_class.py - DEBUG - 254 - date fg_sg fg_bez ladenvkp
116856 2024-01-02 183 Friedrichshafen 643.92
2025-03-06 06:34:59,883 - dataprep_main.py - DEBUG - 81 - shape: (135282, 24) shops: 113 start: 2021-01-04 end: 2025-05-31 sumladenvkp: 246211253
2025-03-06 06:34:59,901 - dataprep_main.py - DEBUG - 85 - shape: (135282, 24) shops: 113 start: 2021-01-04 end: 2025-05-31 sumladenvkp: 246211253
2025-03-06 06:35:07,223 - dataprep_main.py - DEBUG - 99 - shape: (112763, 38) shops: 113 start: 2022-01-03 end: 2025-05-31 sumladenvkp: 208440391
2025-03-06 06:35:07,224 - data_prep_class.py - DEBUG - 211 - before removing holidays: (112763, 38)
2025-03-06 06:35:07,237 - data_prep_class.py - DEBUG - 213 - aftere removing holidays: (112763, 38)
2025-03-06 06:35:07,241 - dataprep_main.py - DEBUG - 104 - shape: (112763, 38) shops: 113 start: 2022-01-03 end: 2025-05-31 sumladenvkp: 208440391
2025-03-06 06:35:07,662 - data_prep_class.py - DEBUG - 436 - missing values last year: lastyear_date 37023
lastyear_ladenvkp 37023
lastyear_fg_sg 37023
dtype: int64
2025-03-06 06:35:09,120 - dataprep_main.py - DEBUG - 143 - shape: (112763, 259) shops: 113 start: 2022-01-03 end: 2025-05-31 sumladenvkp: 208440391
2025-03-06 06:35:19,137 - data_prep_class.py - DEBUG - 393 - merge on agp - missing: lastyear_date 31878
lastyear_ladenvkp 31878
lastyear_fg_sg 31878
imputed_lastyear_ladenvkp 28884
imputed2_lastyear_ladenvkp 27534
av_lastyear_ladenvkp 27534
agp 5488
dtype: int64
2025-03-06 06:35:19,142 - dataprep_main.py - DEBUG - 151 - shape: (107618, 260) shops: 113 start: 2022-03-01 end: 2025-05-31 sumladenvkp: 200775923
2025-03-06 06:35:19,144 - data_prep_class.py - DEBUG - 400 - fill missing agp: 0
2025-03-06 06:35:19,148 - dataprep_main.py - DEBUG - 156 - shape: (107618, 260) shops: 113 start: 2022-03-01 end: 2025-05-31 sumladenvkp: 200775923