Logfiles

Logfile Content

2025-07-06 06:34:47,713 - data_prep_class.py - INFO - 98 - Get data from Big Query with SELECT firma, datum, fg_sg, fg_bez, ladenvkp, katalogvkp, oeffnungszeit, fg_typ  
FROM `test-ww-ki-witt.co_stat_daily_prognosis_bronzelayer.sales`

2025-07-06 06:34:54,185 - data_prep_class.py - INFO - 107 -              firma      datum  fg_sg                  fg_bez  ladenvkp  katalogvkp                                      oeffnungszeit             fg_typ       date
0             Witt 2023-04-01    132  Neuburg a. d. Donau II    547.35      748.08  Montag bis Freitag von 09.00 bis 18.00 Uhr | S...  Preislandgeschäft 2023-04-01
1         Sieh an! 2023-04-01    132  Neuburg a. d. Donau II     84.00       93.00  Montag bis Freitag von 09.00 bis 18.00 Uhr | S...  Preislandgeschäft 2023-04-01
2  Josef Witt GmbH 2023-09-27    193              Dingolfing   1872.84     1894.52  Montag bis Freitag von 9.00 bis 18.00 Uhr | Sa...       Basisfiliale 2023-09-27
3    Sieh an! GmbH 2023-09-27    193              Dingolfing     43.00       43.00  Montag bis Freitag von 9.00 bis 18.00 Uhr | Sa...       Basisfiliale 2023-09-27
4  Josef Witt GmbH 2023-09-27     21     Neuburg a. d. Donau   2072.83     2057.62  Montag bis Freitag von 09.00 bis 18.00 Uhr | k...       Basisfiliale 2023-09-27

2025-07-06 06:34:54,201 - dataprep_main.py - DEBUG - 39 - shape: (290041, 9)  shops: 122 start: 2021-01-04 end: 2025-07-03 sumladenvkp: 271969645 

2025-07-06 06:34:54,271 - data_prep_class.py - DEBUG - 137 - output of shops and codes found in sales data to fg_sg_bez.xlsx

2025-07-06 06:34:54,520 - dataprep_main.py - DEBUG - 50 - shape: (139625, 3)  shops: 122 start: 2021-01-04 end: 2025-07-03 sumladenvkp: 271969645 

2025-07-06 06:34:54,520 - data_prep_class.py - DEBUG - 180 - max date in dataframe:   2025-07-03 00:00:00 

2025-07-06 06:34:54,521 - data_prep_class.py - DEBUG - 193 - date_range: 2025-07-04 00:00:00 2025-09-30 00:00:00

2025-07-06 06:34:54,616 - data_prep_class.py - DEBUG - 206 - size of new dataset: (10858, 3) vs 10858

2025-07-06 06:34:54,627 - data_prep_class.py - DEBUG - 210 - final size: (150483, 3) 

2025-07-06 06:34:54,632 - dataprep_main.py - DEBUG - 58 - shape: (150483, 3)  shops: 122 start: 2021-01-04 end: 2025-09-30 sumladenvkp: 271969645 
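
The row counts in the records above are consistent with appending one future row per shop and calendar day: 122 shops over the 89 days from 2025-07-04 to 2025-09-30 give 10858 rows, and 139625 + 10858 = 150483. The following is a minimal sketch of that arithmetic under this assumption. The function name `future_grid` and the placeholder shop ids are hypothetical and are not taken from data_prep_class.py.

```python
from datetime import date, timedelta
from itertools import product


def future_grid(shop_ids, last_observed, horizon_end):
    """Return one (shop, date) pair per shop and per day after last_observed,
    up to and including horizon_end."""
    n_days = (horizon_end - last_observed).days
    dates = [last_observed + timedelta(days=i) for i in range(1, n_days + 1)]
    return list(product(shop_ids, dates))


# Values read from the log: 122 shops, data ends 2025-07-03,
# forecast horizon ends 2025-09-30. The ids 0..121 are placeholders.
grid = future_grid(range(122), date(2025, 7, 3), date(2025, 9, 30))
history_rows = 139625  # rows before the extension, per the log

print(len(grid))                 # rows added by the extension
print(history_rows + len(grid))  # rows after the extension
```

Comparing the generated row count with shops × days, as the "size of new dataset ... vs ..." record appears to do, is a cheap guard against duplicated or missing shop/date combinations.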

2025-07-06 06:34:54,921 - dataprep_main.py - DEBUG - 69 - shape: (146276, 24)  shops: 113 start: 2021-01-04 end: 2025-09-30 sumladenvkp: 267904437 

2025-07-06 06:34:54,921 - data_prep_class.py - DEBUG - 255 - check Friedrichshafen

2025-07-06 06:34:54,937 - data_prep_class.py - DEBUG - 259 -               date  fg_sg           fg_bez  ladenvkp
126549 2024-01-02    183  Friedrichshafen    643.92

2025-07-06 06:35:33,972 - dataprep_main.py - DEBUG - 76 - shape: (146276, 24)  shops: 113 start: 2021-01-04 end: 2025-09-30 sumladenvkp: 267904437 

2025-07-06 06:35:33,990 - dataprep_main.py - DEBUG - 80 - shape: (146276, 24)  shops: 113 start: 2021-01-04 end: 2025-09-30 sumladenvkp: 267904437 

2025-07-06 06:35:41,341 - dataprep_main.py - DEBUG - 94 - shape: (115218, 38)  shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781 

2025-07-06 06:35:41,341 - data_prep_class.py - DEBUG - 216 - before removing holidays: (115218, 38)

2025-07-06 06:35:41,356 - data_prep_class.py - DEBUG - 218 - after removing holidays: (115218, 38)

2025-07-06 06:35:41,360 - dataprep_main.py - DEBUG - 99 - shape: (115218, 38)  shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781 

2025-07-06 06:35:41,820 - data_prep_class.py - DEBUG - 476 - missing values last year: lastyear_date        37065
lastyear_ladenvkp    37065
lastyear_fg_sg       37065
dtype: int64

2025-07-06 06:35:43,453 - dataprep_main.py - DEBUG - 138 - shape: (115218, 259)  shops: 113 start: 2022-01-03 end: 2025-06-30 sumladenvkp: 229488781 

2025-07-06 06:35:53,687 - data_prep_class.py - DEBUG - 433 - merge on agp - missing: lastyear_date                 31920
lastyear_ladenvkp             31920
lastyear_fg_sg                31920
imputed_lastyear_ladenvkp     28931
imputed2_lastyear_ladenvkp    27534
av_lastyear_ladenvkp          27534
agp                            7945
dtype: int64

2025-07-06 06:35:53,691 - dataprep_main.py - DEBUG - 146 - shape: (110073, 260)  shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 221824313 

2025-07-06 06:35:53,693 - data_prep_class.py - DEBUG - 440 - fill missing agp: 0

2025-07-06 06:35:53,697 - dataprep_main.py - DEBUG - 151 - shape: (110073, 260)  shops: 113 start: 2022-03-01 end: 2025-06-30 sumladenvkp: 221824313
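
The records above appear to follow the layout "timestamp - file - level - line number - message", which would match a Python logging format string such as `%(asctime)s - %(filename)s - %(levelname)s - %(lineno)d - %(message)s` (an inference from the output, not confirmed by the source). Under that assumption, the sketch below extracts the recurring "shape:" checkpoints so that row, column, and shop counts can be compared across pipeline steps. The names `RECORD_RE`, `SHAPE_RE`, and `parse_shape_records` are hypothetical.

```python
import re
from datetime import datetime

# Assumed record layout, inferred from the log lines:
# "<timestamp> - <file> - <LEVEL> - <line number> - <message>"
RECORD_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) - "
    r"(?P<file>\S+) - (?P<level>[A-Z]+) - (?P<lineno>\d+) - (?P<msg>.*)$"
)
# Payload of the "shape:" checkpoint messages written by dataprep_main.py.
SHAPE_RE = re.compile(
    r"shape: \((?P<rows>\d+), (?P<cols>\d+)\)\s+shops: (?P<shops>\d+) "
    r"start: (?P<start>\S+) end: (?P<end>\S+) sumladenvkp: (?P<total>\d+)"
)


def parse_shape_records(text):
    """Return one dict per 'shape:' checkpoint found in the log text."""
    records = []
    for line in text.splitlines():
        head = RECORD_RE.match(line)
        if head is None:
            continue  # continuation line (DataFrame dump, SQL text, blank)
        shape = SHAPE_RE.search(head["msg"])
        if shape is None:
            continue  # a log record, but not a shape checkpoint
        records.append({
            "time": datetime.strptime(head["ts"], "%Y-%m-%d %H:%M:%S,%f"),
            "source_line": int(head["lineno"]),
            "rows": int(shape["rows"]),
            "cols": int(shape["cols"]),
            "shops": int(shape["shops"]),
            "start": shape["start"],
            "end": shape["end"],
            "sumladenvkp": int(shape["total"]),
        })
    return records


# Two checkpoint records copied from the log above.
sample = (
    "2025-07-06 06:34:54,201 - dataprep_main.py - DEBUG - 39 - "
    "shape: (290041, 9)  shops: 122 start: 2021-01-04 end: 2025-07-03 "
    "sumladenvkp: 271969645\n"
    "2025-07-06 06:34:54,520 - dataprep_main.py - DEBUG - 50 - "
    "shape: (139625, 3)  shops: 122 start: 2021-01-04 end: 2025-07-03 "
    "sumladenvkp: 271969645\n"
)
checkpoints = parse_shape_records(sample)
for rec in checkpoints:
    print(rec["source_line"], rec["rows"], rec["cols"], rec["shops"])
```

Lines that do not start with a timestamp, such as the printed DataFrame rows and the `dtype: int64` footers, are skipped as continuation lines rather than parsed.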