Logfiles

Logfile Content

2025-05-06 06:34:20,440 - data_prep_class.py - INFO - 98 - Get data from Big Query with SELECT firma, datum, fg_sg, fg_bez, ladenvkp, katalogvkp, oeffnungszeit, fg_typ  
FROM `test-ww-ki-witt.co_stat_daily_prognosis_bronzelayer.sales`

2025-05-06 06:34:24,901 - data_prep_class.py - INFO - 107 -       firma      datum  fg_sg         fg_bez  ladenvkp  katalogvkp                                      oeffnungszeit             fg_typ       date
0      Witt 2022-12-06     20  Furth im Wald    706.42     1208.85  Montag bis Freitag von 09.00 bis 18.00 Uhr | S...  Preislandgeschäft 2022-12-06
1  Sieh an! 2022-12-06     20  Furth im Wald    104.50      116.00  Montag bis Freitag von 09.00 bis 18.00 Uhr | S...  Preislandgeschäft 2022-12-06
2      Witt 2022-12-06     51    Frankenberg   2337.45     5671.02  Montag bis Freitag von 09.00 bis 18.00 Uhr | S...  Preislandgeschäft 2022-12-06
3  Sieh an! 2022-12-06     51    Frankenberg     47.80      101.00  Montag bis Freitag von 09.00 bis 18.00 Uhr | S...  Preislandgeschäft 2022-12-06
4      Witt 2022-03-30     20  Furth im Wald    713.17     1623.11  Montag bis Freitag von 09.00 bis 18.00 Uhr | S...  Preislandgeschäft 2022-03-30

2025-05-06 06:34:24,916 - dataprep_main.py - DEBUG - 39 - shape: (277036, 9)  shops: 122 start: 2021-01-04 end: 2025-05-05 sumladenvkp: 261548378 

2025-05-06 06:34:24,983 - data_prep_class.py - DEBUG - 137 - output of shops and codes found in sales data to fg_sg_bez.xlsx

2025-05-06 06:34:25,221 - dataprep_main.py - DEBUG - 50 - shape: (134339, 3)  shops: 122 start: 2021-01-04 end: 2025-05-05 sumladenvkp: 261548378 

2025-05-06 06:34:25,221 - data_prep_class.py - DEBUG - 180 - max date in dataframe:   2025-05-05 00:00:00 

2025-05-06 06:34:25,222 - data_prep_class.py - DEBUG - 193 - date_range: 2025-05-06 00:00:00 2025-07-31 00:00:00

2025-05-06 06:34:25,316 - data_prep_class.py - DEBUG - 206 - size of new dataset: (10614, 3) vs 10614

2025-05-06 06:34:25,327 - data_prep_class.py - DEBUG - 210 - final size: (144953, 3) 

2025-05-06 06:34:25,331 - dataprep_main.py - DEBUG - 58 - shape: (144953, 3)  shops: 122 start: 2021-01-04 end: 2025-07-31 sumladenvkp: 261548378 

2025-05-06 06:34:25,605 - dataprep_main.py - DEBUG - 69 - shape: (140764, 24)  shops: 113 start: 2021-01-04 end: 2025-07-31 sumladenvkp: 257483170 

2025-05-06 06:34:25,606 - data_prep_class.py - DEBUG - 255 - check Friedrichshafen

2025-05-06 06:34:25,621 - data_prep_class.py - DEBUG - 259 -               date  fg_sg           fg_bez  ladenvkp
121793 2024-01-02    183  Friedrichshafen    643.92

2025-05-06 06:35:01,870 - dataprep_main.py - DEBUG - 76 - shape: (140764, 24)  shops: 113 start: 2021-01-04 end: 2025-07-31 sumladenvkp: 257483170 

2025-05-06 06:35:01,888 - dataprep_main.py - DEBUG - 80 - shape: (140764, 24)  shops: 113 start: 2021-01-04 end: 2025-07-31 sumladenvkp: 257483170 

2025-05-06 06:35:09,183 - dataprep_main.py - DEBUG - 94 - shape: (115282, 38)  shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 219577302 

2025-05-06 06:35:09,184 - data_prep_class.py - DEBUG - 216 - before removing holidays: (115282, 38)

2025-05-06 06:35:09,197 - data_prep_class.py - DEBUG - 218 - aftere removing holidays: (115282, 38)

2025-05-06 06:35:09,200 - dataprep_main.py - DEBUG - 99 - shape: (115282, 38)  shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 219577302 

2025-05-06 06:35:09,620 - data_prep_class.py - DEBUG - 476 - missing values last year: lastyear_date        37068
lastyear_ladenvkp    37068
lastyear_fg_sg       37068
dtype: int64

2025-05-06 06:35:11,121 - dataprep_main.py - DEBUG - 138 - shape: (115282, 259)  shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 219577302 

2025-05-06 06:35:20,815 - data_prep_class.py - DEBUG - 433 - merge on agp - missing: lastyear_date                 31923
lastyear_ladenvkp             31923
lastyear_fg_sg                31923
imputed_lastyear_ladenvkp     28931
imputed2_lastyear_ladenvkp    27534
av_lastyear_ladenvkp          27534
agp                            8009
dtype: int64

2025-05-06 06:35:20,821 - dataprep_main.py - DEBUG - 146 - shape: (110137, 260)  shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 211912834 

2025-05-06 06:35:20,823 - data_prep_class.py - DEBUG - 440 - fill missing agp: 0

2025-05-06 06:35:20,826 - dataprep_main.py - DEBUG - 151 - shape: (110137, 260)  shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 211912834