Model Evaluation

Overview

This guide systematically evaluates weather model performance for energy market applications, emphasizing quantitative validation and operational testing.

Evaluation Steps

Focus on metrics most relevant to your operations, such as power or price forecasts powered by Jua. If starting with weather data, prioritize real-life observations over analysis datasets like ERA5, as these often contain internal biases.

Step 1: Model Assessment

Review model performance reports (EPT-1.5, EPT-2)
Decision: Proceed with evaluation.

We recommend starting with our technical report to understand the model's capabilities before proceeding with your own evaluation.

Step 2.1: Evaluate Your Impact Forecast

Generate generation or demand forecasts using model data
Compare with your existing provider
Decision: Proceed to trade evaluation.

Step 2.2: Observations Benchmark

The best results are usually achieved by generating an impact forecast, but if you aim to have a weather comparison, we recommend comparing initial performance vs. station observations
Evaluate basic accuracy metrics
Decision: Continue to detailed testing?

Step 3: Trading Signal Test and PnL Backtest

Simulate historical market scenarios and strategies and conduct paper trading tests.
Decision: Is ROI there?

Example: Weather Model Testing Framework for Spot Markets

Approach 1: Generation or Demand Forecast

Prerequisites:

Existing operational forecast using the current forecast provider
New Impact forecast powered by Jua

Key Metrics:

Wind Power Forecast accuracy (12-36hr horizon)
Solar Power Forecast accuracy (12-36hr horizon)
Demand forecast accuracy (12-36hr horizon)
Price forecast accuracy (12-36hr horizon)

Evaluation Approach, example Day-Ahead Germany:

Jua EPT-1.5 & 3rd party forecast based Wind Power Forecast performance against realized power data in Germany
Compare each model performance against actuals
Calculate improvement metrics between models

Approach 2: Ground Truth Validation

Prerequisites:

Weather station data
Defined forecast window (typically 12-36hr for day-ahead markets)

Testing Protocol:

Forecast Evaluation:
- Compare ECMWF HRES or your preferred model 12-36hr forecasts against ground truth
- Compare Jua 12-36hr forecasts against ground truth
Performance Analysis:
- Calculate error metrics (RMSE, MAE, bias)
- Analyze temporal patterns in forecast accuracy
- Identify systematic biases or errors

Approach 3: Advanced Validation

Option A: Live Strategy Testing & Historical PnL Backtesting

Setup:
- Historical market simulation with trading strategy
- Trading strategy implementation
Analysis:
- Calculate theoretical PnL
- Stress test under different market conditions
- Sensitivity analysis to forecast errors

PreviousRoadmap NextGetting Started

Last updated 4 months ago

hashtagOverview

hashtagEvaluation Steps

hashtagStep 1: Model Assessment

hashtagStep 2.1: Evaluate Your Impact Forecast

hashtagStep 2.2: Observations Benchmark

hashtagStep 3: Trading Signal Test and PnL Backtest

hashtagExample: Weather Model Testing Framework for Spot Markets

hashtagApproach 1: Generation or Demand Forecast

hashtagApproach 2: Ground Truth Validation

hashtagApproach 3: Advanced Validation