2025-06-06 06:34:24,610 - data_prep_class.py - INFO - 98 - Get data from Big Query with SELECT firma, datum, fg_sg, fg_bez, ladenvkp, katalogvkp, oeffnungszeit, fg_typ
FROM `test-ww-ki-witt.co_stat_daily_prognosis_bronzelayer.sales`
2025-06-06 06:34:29,524 - data_prep_class.py - INFO - 107 - firma datum fg_sg fg_bez ladenvkp katalogvkp oeffnungszeit fg_typ date
0 Witt 2022-03-30 20 Furth im Wald 713.17 1623.11 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Preislandgeschäft 2022-03-30
1 Sieh an! 2022-03-30 20 Furth im Wald 8.50 15.00 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Preislandgeschäft 2022-03-30
2 Sieh an! 2022-03-30 47 Traunstein I 21.60 27.00 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-03-30
3 Witt 2022-03-30 47 Traunstein I 1388.73 1490.49 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-03-30
4 Witt 2022-03-30 102 Freising 459.13 593.23 Montag bis Freitag von 09.00 bis 18.00 Uhr | S... Basisfiliale 2022-03-30
2025-06-06 06:34:29,539 - dataprep_main.py - DEBUG - 39 - shape: (284056, 9) shops: 122 start: 2021-01-04 end: 2025-06-05 sumladenvkp: 267295402
2025-06-06 06:34:29,609 - data_prep_class.py - DEBUG - 137 - output of shops and codes found in sales data to fg_sg_bez.xlsx
2025-06-06 06:34:29,852 - dataprep_main.py - DEBUG - 50 - shape: (137210, 3) shops: 122 start: 2021-01-04 end: 2025-06-05 sumladenvkp: 267295402
2025-06-06 06:34:29,852 - data_prep_class.py - DEBUG - 180 - max date in dataframe: 2025-06-05 00:00:00
2025-06-06 06:34:29,853 - data_prep_class.py - DEBUG - 193 - date_range: 2025-06-06 00:00:00 2025-08-31 00:00:00
2025-06-06 06:34:29,947 - data_prep_class.py - DEBUG - 206 - size of new dataset: (10614, 3) vs 10614
2025-06-06 06:34:29,959 - data_prep_class.py - DEBUG - 210 - final size: (147824, 3)
2025-06-06 06:34:29,964 - dataprep_main.py - DEBUG - 58 - shape: (147824, 3) shops: 122 start: 2021-01-04 end: 2025-08-31 sumladenvkp: 267295402
2025-06-06 06:34:30,248 - dataprep_main.py - DEBUG - 69 - shape: (143635, 24) shops: 113 start: 2021-01-04 end: 2025-08-31 sumladenvkp: 263230193
2025-06-06 06:34:30,248 - data_prep_class.py - DEBUG - 255 - check Friedrichshafen
2025-06-06 06:34:30,263 - data_prep_class.py - DEBUG - 259 - date fg_sg fg_bez ladenvkp
124376 2024-01-02 183 Friedrichshafen 643.92
2025-06-06 06:35:07,896 - dataprep_main.py - DEBUG - 76 - shape: (143635, 24) shops: 113 start: 2021-01-04 end: 2025-08-31 sumladenvkp: 263230193
2025-06-06 06:35:07,919 - dataprep_main.py - DEBUG - 80 - shape: (143635, 24) shops: 113 start: 2021-01-04 end: 2025-08-31 sumladenvkp: 263230193
2025-06-06 06:35:15,410 - dataprep_main.py - DEBUG - 94 - shape: (115247, 38) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 225293267
2025-06-06 06:35:15,410 - data_prep_class.py - DEBUG - 216 - before removing holidays: (115247, 38)
2025-06-06 06:35:15,424 - data_prep_class.py - DEBUG - 218 - aftere removing holidays: (115247, 38)
2025-06-06 06:35:15,428 - dataprep_main.py - DEBUG - 99 - shape: (115247, 38) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 225293267
2025-06-06 06:35:15,848 - data_prep_class.py - DEBUG - 476 - missing values last year: lastyear_date 37065
lastyear_ladenvkp 37065
lastyear_fg_sg 37065
dtype: int64
2025-06-06 06:35:17,431 - dataprep_main.py - DEBUG - 138 - shape: (115247, 259) shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 225293267
2025-06-06 06:35:27,409 - data_prep_class.py - DEBUG - 433 - merge on agp - missing: lastyear_date 31920
lastyear_ladenvkp 31920
lastyear_fg_sg 31920
imputed_lastyear_ladenvkp 28931
imputed2_lastyear_ladenvkp 27534
av_lastyear_ladenvkp 27534
agp 7974
dtype: int64
2025-06-06 06:35:27,413 - dataprep_main.py - DEBUG - 146 - shape: (110102, 260) shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 217628800
2025-06-06 06:35:27,415 - data_prep_class.py - DEBUG - 440 - fill missing agp: 0
2025-06-06 06:35:27,419 - dataprep_main.py - DEBUG - 151 - shape: (110102, 260) shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 217628800