`y` and `yhat` from ProphetForecast don't make sense, and are very far apart

The [ProphetForecast](https://github.com/mozilla/docker-etl/blob/main/jobs/kpi-forecasting/kpi_forecasting/models/prophet_forecast.py) writes a [components table](https://github.com/mozilla/docker-etl/blob/main/jobs/kpi-forecasting/kpi_forecasting/configs/search_forecasting_ad_clicks.yaml#L68) to BigQuery. The components table has 2 columns called `y` and `yhat`. When I query this table in BigQuery, I see the following issues:

- `y` and `yhat` are very far apart, differing by up to 4 orders of magnitude.
- based on common sense about what these metrics mean, `y` is wrong. this is concerning because `y` is supposedly the target data used to train the model.
- based on how `yhat` is [set](https://github.com/mozilla/docker-etl/blob/main/jobs/kpi-forecasting/kpi_forecasting/models/prophet_forecast.py#L57), `yhat` should be list of 1,000 numbers, but in the BigQuery table, there is only 1 number.

Other suspicious things you could try investigating as a way to resolve the 3 issues above:
- Maybe the wrong [aggregation](https://github.com/mozilla/docker-etl/blob/main/jobs/kpi-forecasting/kpi_forecasting/models/prophet_forecast.py#L177) is getting used on `y`. It's not obvious to me why we'd apply the same aggregations to both `y` and `yhat`.
- This [comment](https://github.com/mozilla/docker-etl/blob/main/jobs/kpi-forecasting/kpi_forecasting/models/prophet_forecast.py#L323) sounds like it's doing the wrong thing. What does this sentence even mean? `In overlapping periods, the forecasted value will always be larger because it is the sum of the observed and forecasted values.`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`y` and `yhat` from ProphetForecast don't make sense, and are very far apart #270

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

y and yhat from ProphetForecast don't make sense, and are very far apart #270

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`y` and `yhat` from ProphetForecast don't make sense, and are very far apart #270