HYPERION Freight Forecasting - Technical Report

Executive Summary

Rate Intelligence

HYPERION Freight Forecasting achieves 7‑day R² of 0.70 and MAE of 611 USD/TEU on Far East→US West Coast rates by leveraging a proprietary Trade Imbalance Ratio feature and time‑series validated gradient‑boosted models.

The system produces 7/14/30‑day forecasts, with 14‑day R² of 0.50 for operational planning and 30‑day directional guidance, all backed by a foundry of benchmarked algorithms and rigorous backtesting.

0.70

7-Day R²

611

MAE (USD/TEU)

0.50

14-Day R²

Trade Imbalance

Ratio

Project Scope

Operational Scale

Provide lane‑specific short‑ and mid‑term freight rate forecasts that support procurement timing, contract negotiation, and risk hedging under volatile market regimes.

Focus on robust out‑of‑sample performance and explainability via feature importance and domain‑grounded indicators rather than opaque black‑box signals.

Intention & Objectives

Mission Critical

Move from reactive price‑taking to proactive strategy using defensible, horizon‑specific forecasts and early‑warning indicators.
Codify domain intuition (capacity tension, fuel, demand imbalances) into predictive features with strict leakage controls.

Data Sources

Multi-Modal Fusion

Lane Rates

Far East→US West Coast and reciprocal lane (eastbound/westbound) for imbalance measures, daily since 2018.

Market Proxies

Dry bulk index proxy and Brent fuel prices to capture capacity and cost dynamics across macro conditions.

Target Variables

Output Specifications

Targets: Daily USD/TEU rates at 7/14/30‑day horizons; evaluation on R² and MAE with time‑series splits.
Predictors: Lagged rates (1/7/14/30), fuel and index lags, and the Trade Imbalance Ratio and its lags to encode capacity tension.

Methodology

Pipeline Architecture

Feature engineering: Compute FEUW/UWFE ratio and multi‑lag stacks; exclude contemporaneous target to prevent leakage.

Model foundry: Ridge, Random Forest, LightGBM, XGBoost, and CatBoost benchmarked per horizon with backtesting and holdout.

Selection: XGBoost champions 7‑day; LightGBM strong on 14‑day; ensemble diagnostics retained.

Key Signals

Leading Indicators

Key signals guiding the forecasting system:

Trade Imbalance Ratio

Higher values precede rate spikes:

$$ \text{TIR} = \frac{\text{FEUW}}{\text{UWFE}} $$

Market Dynamics

Market and fuel lags transmit macro and cost pressures; lane‑specific lags capture momentum and mean‑reversion.

Model Architecture

Gradient-Boosted Ensemble

Tree ensembles capture nonlinear interactions among fuel, macro proxies, and lane imbalances while retaining feature attributions.

Model Foundry

Ridge, Random Forest
LightGBM, XGBoost
CatBoost

Horizon Selection

XGBoost champions 7‑day; LightGBM strong on 14‑day; ensemble diagnostics retained.

Core Mathematics

Key Formulas

R²

$$ R^{2} = 1 - \frac{\sum (y-\hat{y})^{2}}{\sum (y-\bar{y})^{2}} $$

Primary variance‑explanation metric

MAE

$$ \text{MAE} = \frac{1}{N}\sum |y-\hat{y}| $$

Procurement‑aligned error in USD/TEU

Trade Imbalance Ratio

With lag features:

$$ \text{TIR}_{t} = \frac{\text{FEUW}_{t}}{\text{UWFE}_{t}} $$

Implementation

Code Snippets

Feature Assembly

df["tir"] = df["feuw"]/df["uwfe"]
lags = [1,7,14,30]
for col in ["feuw","uwfe","bdi_proxy","brent","tir"]:
    for L in lags:
        df[f"{col}_lag_{L}"] = df[col].shift(L)
features = [c for c in df.columns if "lag_" in c]

XGBoost Training

X = df[features].iloc[:-7].dropna()
y = df["feuw"].shift(-7).iloc[:-7].loc[X.index]
model = xgb.XGBRegressor(max_depth=6, n_estimators=400, learning_rate=0.05)
model.fit(X, y)
yhat = model.predict(X_test)
r2 = r2_score(y_test, yhat); mae = mean_absolute_error(y_test, yhat)

Performance Metrics

Results Obtained

7-Day Performance

$$ R^{2}=0.70 $$, $$ \text{MAE}=611 $$ USD/TEU; 14‑day: $$ R^{2}=0.50 $$; 30‑day: directional guidance under higher uncertainty.

Data Coverage

2,698 daily records processed with real‑time pipeline hooks and automated QC for production readiness.

Strategic Impact

Decision Relevance

Forecasts enable rate‑lock timing, spot vs. contract trade‑offs, and budget sensitivity analysis ahead of volatility bursts.
TIR provides interpretable, lane‑physics‑aligned signal that generalizes across regimes better than naive momentum alone.

Technical Foundation

What Makes It Work

Time‑Series Splits

Leakage controls are enforced; horizon‑specific champions avoid one‑size‑fits‑all degradation.

Tree Ensembles

Capture nonlinear interactions among fuel, macro proxies, and lane imbalances while retaining feature attributions.

Deployment

Operationalization & Outputs

Deliverables

7/14/30‑day forecasts (CSV)
Confidence intervals
Feature importance
Backtesting reports

Integration

Live ingestion, automated serving, monitoring for regime breaks, and alerting when predicted spikes exceed thresholds.

Constraints

Limitations & Risks

Structural Regime Shifts

Data feed issues require prompt recalibration; accuracy diminishes beyond 30‑day horizons.

Market Microstructure

Changes can modify TIR elasticities; periodic re‑estimation preserves edge.

Roadmap

Future Scope

Multi‑Lane Expansion

EU–Asia, Trans‑Pacific variants, real‑time API, and contract optimization decision support.

Alerting & Simulators

What‑if simulators for fuel shock or capacity‑shock scenarios using stress‑tested ensembles.