Evgueni Poloukarov & Claude committed
Commit 12f45c0 · Parent: c0dc80e

refactor: convert HF Space from JupyterLab to Gradio API

Architecture Change:
- Replace interactive notebooks with an API endpoint
- HF Space now serves as an inference API (not a development environment)

New Components:
- app.py: Gradio web interface for triggering forecasts
- chronos_inference.py: production inference pipeline with two modes: smoke test (1 border × 7 days) and full forecast (38 borders × 14 days)

Benefits:
- No more SSH sessions killed by resource limits (exit code 137)
- API-first design (callable from a local machine)
- Model caching (loaded once, kept in memory)
- Results downloadable as Parquet files
- Local development → remote GPU inference workflow

Usage:
1. Web UI: https://huggingface.co/spaces/evgueni-p/fbmc-chronos2
2. Python API: gradio_client.Client("evgueni-p/fbmc-chronos2").predict() (see the sketch below)
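
A minimal end-to-end sketch of the Python API route. The positional argument order matches the two Gradio inputs (run date, then forecast type); the border code "AT-DE" is hypothetical, so inspect the parquet columns for real border names:

```python
from gradio_client import Client
import polars as pl

# Connect to the Space and trigger a full 14-day forecast;
# predict() blocks until done and returns a local path to the parquet file
client = Client("evgueni-p/fbmc-chronos2")
result_file = client.predict("2025-09-30", "full_14day")

# Columns follow the pattern {border}_median / {border}_q10 / {border}_q90;
# "AT-DE" is a hypothetical border code used for illustration
df = pl.read_parquet(result_file)
print(df.select(["timestamp", "AT-DE_median", "AT-DE_q10", "AT-DE_q90"]).head())
```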

Co-Authored-By: Claude <[email protected]>

Files changed (4)
  1. README.md +33 -13
  2. app.py +138 -0
  3. requirements.txt +3 -6
  4. src/forecasting/chronos_inference.py +296 -0
README.md CHANGED

@@ -3,32 +3,52 @@ title: FBMC Chronos-2 Forecasting
 emoji: ⚡
 colorFrom: blue
 colorTo: green
-sdk: docker
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
 pinned: false
 tags:
-- jupyterlab
+- forecasting
+- time-series
+- electricity
+- zero-shot
+suggested_hardware: t4-small
 suggested_storage: small
 ---
 
-# FBMC Flow-Based Market Coupling Forecasting
+# FBMC Flow-Based Market Coupling Forecasting API
 
-Zero-shot electricity cross-border flow forecasting for 38 European FBMC borders using Amazon Chronos 2.
+Zero-shot electricity cross-border flow forecasting for 38 European FBMC borders using Amazon Chronos-2.
 
 ## 🚀 Quick Start
 
-This HuggingFace Space provides interactive Jupyter notebooks for running zero-shot forecasts on GPU.
+This HuggingFace Space provides a **Gradio API** for GPU-accelerated zero-shot forecasting.
 
-### Available Notebooks
+### How to Use (Web Interface)
 
-1. **`inference_smoke_test.ipynb`** - Quick validation (1 border × 7 days, ~1 min)
-2. **`inference_full_14day.ipynb`** - Production forecast (38 borders × 14 days, ~5 min)
-3. **`evaluation.ipynb`** - Performance analysis vs actuals
+1. **Select run date**: Choose the forecast date (YYYY-MM-DD format)
+2. **Choose forecast type**:
+   - **Smoke Test**: 1 border × 7 days (~30 seconds)
+   - **Full Forecast**: All 38 borders × 14 days (~5 minutes)
+3. **Click "Run Forecast"**
+4. **Download results**: Parquet file with probabilistic forecasts
 
-### How to Use
+### How to Use (Python API)
 
-1. Open any notebook in JupyterLab
-2. Run all cells (Cell → Run All)
-3. View results and visualizations inline
+```python
+from gradio_client import Client
+
+client = Client("evgueni-p/fbmc-chronos2")
+result_file = client.predict(
+    run_date="2025-09-30",
+    forecast_type="smoke_test"
+)
+
+# Download and analyze locally
+import polars as pl
+df = pl.read_parquet(result_file)
+print(df.head())
+```
 
 ## 📊 Dataset
app.py ADDED

@@ -0,0 +1,138 @@
+#!/usr/bin/env python3
+"""
+FBMC Chronos-2 Forecasting API
+HuggingFace Space Gradio Interface
+"""
+
+import gradio as gr
+import os
+from datetime import datetime
+from src.forecasting.chronos_inference import run_inference
+
+
+# Global configuration
+FORECAST_TYPES = {
+    "smoke_test": "Smoke Test (1 border × 7 days)",
+    "full_14day": "Full Forecast (All borders × 14 days)"
+}
+
+
+def forecast_api(run_date_str, forecast_type):
+    """
+    API endpoint for triggering forecasts.
+
+    Args:
+        run_date_str: Date in YYYY-MM-DD format
+        forecast_type: 'smoke_test' or 'full_14day'
+
+    Returns:
+        Path to downloadable forecast results file
+    """
+    try:
+        # Validate run date (raises ValueError if malformed)
+        run_date = datetime.strptime(run_date_str, "%Y-%m-%d")
+
+        # Run inference
+        result_path = run_inference(
+            run_date=run_date_str,
+            forecast_type=forecast_type,
+            output_dir="/tmp"
+        )
+
+        return result_path
+
+    except Exception as e:
+        error_msg = f"Error: {str(e)}"
+        print(error_msg)
+        # Return error message as text file
+        error_path = "/tmp/error.txt"
+        with open(error_path, 'w') as f:
+            f.write(error_msg)
+        return error_path
+
+
+# Build Gradio interface
+with gr.Blocks(title="FBMC Chronos-2 Forecasting") as demo:
+    gr.Markdown("""
+    # FBMC Chronos-2 Zero-Shot Forecasting API
+
+    **Flow-Based Market Coupling** electricity flow forecasting using Amazon Chronos-2.
+
+    This Space provides GPU-accelerated zero-shot inference for cross-border electricity flows.
+    """)
+
+    with gr.Row():
+        with gr.Column():
+            gr.Markdown("### Configuration")
+
+            run_date_input = gr.Textbox(
+                label="Run Date (YYYY-MM-DD)",
+                value="2025-09-30",
+                placeholder="2025-09-30",
+                info="Date when forecast is made (data up to this date is historical)"
+            )
+
+            forecast_type_input = gr.Radio(
+                choices=list(FORECAST_TYPES.keys()),
+                value="smoke_test",
+                label="Forecast Type",
+                info="Smoke test: Quick validation (1 border, 7 days). Full: Production forecast (all borders, 14 days)"
+            )
+
+            submit_btn = gr.Button("Run Forecast", variant="primary")
+
+        with gr.Column():
+            gr.Markdown("### Results")
+
+            output_file = gr.File(
+                label="Download Forecast Results",
+                type="filepath"
+            )
+
+            gr.Markdown("""
+            **Output format**: Parquet file with columns:
+            - `timestamp`: Hourly timestamps (D+1 to D+7 or D+14)
+            - `{border}_median`: Median forecast (MW)
+            - `{border}_q10`: 10th percentile (MW)
+            - `{border}_q90`: 90th percentile (MW)
+
+            **Inference environment**:
+            - GPU: NVIDIA T4 (16GB VRAM)
+            - Model: Chronos-T5-Large (710M parameters)
+            - Precision: bfloat16
+            """)
+
+    # Wire up the interface
+    submit_btn.click(
+        fn=forecast_api,
+        inputs=[run_date_input, forecast_type_input],
+        outputs=output_file
+    )
+
+    gr.Markdown("""
+    ---
+    ### About
+
+    **Zero-shot forecasting**: No model training required. The pre-trained Chronos-2 model
+    generalizes directly to FBMC cross-border flows using historical patterns and future covariates.
+
+    **Features**:
+    - 2,553 engineered features (weather, CNEC constraints, load forecasts, LTA)
+    - 24-month historical context (Oct 2023 - Oct 2025)
+    - Time-aware extraction (prevents data leakage)
+    - Probabilistic forecasts (10th/50th/90th percentiles)
+
+    **Performance**:
+    - Smoke test: ~30 seconds (1 border × 168 hours)
+    - Full forecast: ~5 minutes (38 borders × 336 hours)
+
+    **Project**: FBMC Flow Forecasting MVP | **Author**: Evgueni Poloukarov
+    """)
+
+# Launch the app
+if __name__ == "__main__":
+    demo.launch(
+        server_name="0.0.0.0",
+        server_port=7860,
+        share=False
+    )
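
One consequence of the error handling in `forecast_api`: a failed run still returns a file, just a `.txt` instead of a `.parquet`. A hedged client-side sketch of checking for that contract before parsing:

```python
from gradio_client import Client

client = Client("evgueni-p/fbmc-chronos2")
result_file = client.predict("2025-09-30", "smoke_test")

# forecast_api writes /tmp/error.txt on failure, so a .txt extension
# signals an error message rather than forecast data
if str(result_file).endswith(".txt"):
    with open(result_file) as f:
        raise RuntimeError(f"Forecast failed: {f.read()}")
```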
requirements.txt CHANGED

@@ -1,7 +1,5 @@
-# JupyterLab (from HF template)
-jupyterlab==4.2.5
-tornado==6.2
-ipywidgets
+# Gradio
+gradio==4.44.0
 
 # Core ML/Data
 torch>=2.0.0
@@ -14,9 +12,8 @@ pyarrow>=13.0.0
 # HuggingFace
 huggingface-hub>=0.19.0
 
-# Visualization
+# Visualization (for local analysis)
 altair>=5.0.0
-vega-datasets
 
 # Utilities
 python-dotenv
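
Since altair stays pinned for local analysis, here is a minimal sketch of plotting one border's forecast band from a downloaded result. The file name follows the export pattern in chronos_inference.py below; "AT-DE" is a hypothetical border code:

```python
import polars as pl
import altair as alt

# Load a downloaded smoke-test result (path is illustrative)
df = pl.read_parquet("forecast_2025-09-30_smoke_test.parquet")

border = "AT-DE"  # hypothetical; use a real border column prefix
base = alt.Chart(df.to_pandas()).encode(x="timestamp:T")

# Shaded q10-q90 band with the median line on top
band = base.mark_area(opacity=0.3).encode(
    y=alt.Y(f"{border}_q10:Q", title="Flow (MW)"),
    y2=f"{border}_q90:Q",
)
median = base.mark_line().encode(y=f"{border}_median:Q")

chart = (band + median).properties(title=f"{border} 7-day forecast")
chart.save("forecast_plot.html")
```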
src/forecasting/chronos_inference.py ADDED

@@ -0,0 +1,296 @@
+#!/usr/bin/env python3
+"""
+Chronos-2 Inference Pipeline
+Standalone inference script for HuggingFace Space deployment.
+"""
+
+import os
+import time
+from typing import List, Dict, Optional
+from datetime import datetime, timedelta
+import numpy as np
+import polars as pl
+import pandas as pd
+import torch
+from datasets import load_dataset
+from chronos import ChronosPipeline
+
+from .dynamic_forecast import DynamicForecast
+from .feature_availability import FeatureAvailability
+
+
+class ChronosInferencePipeline:
+    """
+    Production inference pipeline for Chronos-2 zero-shot forecasting.
+    Designed for deployment as API endpoint on HuggingFace Spaces.
+    """
+
+    def __init__(
+        self,
+        model_name: str = "amazon/chronos-t5-large",
+        device: str = "cuda",
+        dtype: str = "bfloat16"
+    ):
+        """
+        Initialize inference pipeline.
+
+        Args:
+            model_name: HuggingFace model identifier
+            device: Device for inference ('cuda' or 'cpu')
+            dtype: Data type for model weights
+        """
+        self.model_name = model_name
+        self.device = device
+        self.dtype = dtype
+
+        # Model loaded on first inference (lazy loading)
+        self._pipeline = None
+        self._dataset = None
+        self._borders = None
+
+    def _load_model(self):
+        """Load Chronos model (cached after first call)."""
+        if self._pipeline is None:
+            print(f"Loading {self.model_name}...")
+            start_time = time.time()
+
+            dtype_map = {
+                "bfloat16": torch.bfloat16,
+                "float16": torch.float16,
+                "float32": torch.float32
+            }
+
+            self._pipeline = ChronosPipeline.from_pretrained(
+                self.model_name,
+                device_map=self.device,
+                torch_dtype=dtype_map.get(self.dtype, torch.bfloat16)
+            )
+
+            print(f"Model loaded in {time.time() - start_time:.1f}s")
+            print(f"  Device: {next(self._pipeline.model.parameters()).device}")
+
+        return self._pipeline
+
+    def _load_dataset(self):
+        """Load dataset from HuggingFace (cached after first call)."""
+        if self._dataset is None:
+            print("Loading dataset from HuggingFace...")
+            start_time = time.time()
+
+            hf_token = os.getenv("HF_TOKEN")
+            dataset = load_dataset(
+                "evgueni-p/fbmc-features-24month",
+                split="train",
+                token=hf_token
+            )
+
+            # Convert to Polars
+            self._dataset = pl.from_arrow(dataset.data.table)
+
+            # Extract available borders
+            target_cols = [col for col in self._dataset.columns if col.startswith('target_border_')]
+            self._borders = [col.replace('target_border_', '') for col in target_cols]
+
+            print(f"Dataset loaded in {time.time() - start_time:.1f}s")
+            print(f"  Shape: {self._dataset.shape}")
+            print(f"  Borders: {len(self._borders)}")
+
+        return self._dataset, self._borders
+
+    def run_forecast(
+        self,
+        run_date: str,
+        borders: Optional[List[str]] = None,
+        forecast_days: int = 7,
+        context_hours: int = 512,
+        num_samples: int = 20
+    ) -> Dict:
+        """
+        Run zero-shot forecast for specified borders.
+
+        Args:
+            run_date: Forecast run date (YYYY-MM-DD format)
+            borders: List of borders to forecast (None = all borders)
+            forecast_days: Forecast horizon in days (7 or 14)
+            context_hours: Historical context window
+            num_samples: Number of probabilistic samples
+
+        Returns:
+            Dictionary with forecast results and metadata
+        """
+        # Load model and dataset (cached)
+        pipeline = self._load_model()
+        df, all_borders = self._load_dataset()
+
+        # Parse run date
+        run_datetime = datetime.strptime(run_date, "%Y-%m-%d")
+        run_datetime = run_datetime.replace(hour=23, minute=0)
+
+        # Determine borders to forecast
+        forecast_borders = borders if borders else all_borders
+        prediction_hours = forecast_days * 24
+
+        print("\nForecast configuration:")
+        print(f"  Run date: {run_datetime}")
+        print(f"  Borders: {len(forecast_borders)}")
+        print(f"  Forecast horizon: {forecast_days} days ({prediction_hours} hours)")
+        print(f"  Context window: {context_hours} hours")
+
+        # Initialize dynamic forecast system
+        forecaster = DynamicForecast(
+            dataset=df,
+            context_hours=context_hours,
+            forecast_hours=prediction_hours
+        )
+
+        # Run forecasts for each border
+        results = {
+            'run_date': run_date,
+            'forecast_days': forecast_days,
+            'borders': {},
+            'metadata': {
+                'model': self.model_name,
+                'device': self.device,
+                'num_samples': num_samples,
+                'context_hours': context_hours
+            }
+        }
+
+        total_start = time.time()
+
+        for i, border in enumerate(forecast_borders, 1):
+            print(f"\n[{i}/{len(forecast_borders)}] Forecasting {border}...")
+            border_start = time.time()
+
+            try:
+                # Extract data
+                context_data, future_data = forecaster.prepare_forecast_data(
+                    run_date=run_datetime,
+                    border=border
+                )
+
+                # Get target column name
+                target_col = f"target_border_{border}"
+
+                # Extract context values as a tensor (Chronos expects a
+                # torch.Tensor; .to_numpy() works for pandas and polars alike)
+                context = torch.tensor(
+                    context_data[target_col].to_numpy(), dtype=torch.float32
+                )
+
+                # Run inference
+                forecast = pipeline.predict(
+                    context=context,
+                    prediction_length=prediction_hours,
+                    num_samples=num_samples
+                )
+
+                # Drop batch dimension: [1, num_samples, hours] -> [num_samples, hours]
+                forecast_numpy = forecast.numpy()[0]
+
+                # Store quantiles across samples for each timestep
+                results['borders'][border] = {
+                    'median': np.median(forecast_numpy, axis=0).tolist(),
+                    'q10': np.quantile(forecast_numpy, 0.1, axis=0).tolist(),
+                    'q90': np.quantile(forecast_numpy, 0.9, axis=0).tolist(),
+                    'inference_time_s': time.time() - border_start
+                }
+
+                print(f"  ✓ Complete in {time.time() - border_start:.1f}s")
+
+            except Exception as e:
+                print(f"  ✗ Error: {str(e)}")
+                results['borders'][border] = {'error': str(e)}
+
+        # Add summary metadata
+        results['metadata']['total_time_s'] = time.time() - total_start
+        results['metadata']['successful_borders'] = sum(
+            1 for b in results['borders'].values() if 'error' not in b
+        )
+
+        print(f"\n{'='*60}")
+        print("FORECAST COMPLETE")
+        print(f"{'='*60}")
+        print(f"Total time: {results['metadata']['total_time_s']:.1f}s")
+        print(f"Successful: {results['metadata']['successful_borders']}/{len(forecast_borders)} borders")
+
+        return results
+
+    def export_to_parquet(self, results: Dict, output_path: str):
+        """
+        Export forecast results to parquet format.
+
+        Args:
+            results: Forecast results from run_forecast()
+            output_path: Path to save parquet file
+        """
+        # Create forecast timestamps
+        run_datetime = datetime.strptime(results['run_date'], "%Y-%m-%d")
+        forecast_start = run_datetime + timedelta(hours=1)
+        forecast_hours = results['forecast_days'] * 24
+
+        timestamps = [
+            forecast_start + timedelta(hours=h)
+            for h in range(forecast_hours)
+        ]
+
+        # Build DataFrame
+        data = {'timestamp': timestamps}
+
+        for border, forecast_data in results['borders'].items():
+            if 'error' not in forecast_data:
+                data[f'{border}_median'] = forecast_data['median']
+                data[f'{border}_q10'] = forecast_data['q10']
+                data[f'{border}_q90'] = forecast_data['q90']
+
+        df = pl.DataFrame(data)
+        df.write_parquet(output_path)
+
+        print(f"✓ Exported to: {output_path}")
+        print(f"  Shape: {df.shape}")
+
+        return output_path
+
+
+# Convenience function for API usage
+def run_inference(
+    run_date: str,
+    forecast_type: str = "smoke_test",
+    borders: Optional[List[str]] = None,
+    output_dir: str = "/tmp"
+) -> str:
+    """
+    Run forecast and return path to results file.
+
+    Args:
+        run_date: Forecast run date (YYYY-MM-DD)
+        forecast_type: 'smoke_test' (7 days, 1 border) or 'full_14day' (14 days, all borders)
+        borders: Specific borders to forecast (None = use forecast_type defaults)
+        output_dir: Directory to save results
+
+    Returns:
+        Path to forecast results parquet file
+    """
+    # Initialize pipeline
+    pipeline = ChronosInferencePipeline()
+
+    # Configure based on forecast type
+    if forecast_type == "smoke_test":
+        forecast_days = 7
+        if borders is None:
+            # Load just to get first border
+            _, all_borders = pipeline._load_dataset()
+            borders = [all_borders[0]]
+    else:  # full_14day
+        forecast_days = 14
+        # borders = None means all borders
+
+    # Run forecast
+    results = pipeline.run_forecast(
+        run_date=run_date,
+        borders=borders,
+        forecast_days=forecast_days
+    )
+
+    # Export to parquet
+    output_filename = f"forecast_{run_date}_{forecast_type}.parquet"
+    output_path = os.path.join(output_dir, output_filename)
+    pipeline.export_to_parquet(results, output_path)
+
+    return output_path
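
For completeness, a minimal sketch of calling the convenience function directly, bypassing Gradio. This assumes a GPU machine with the chronos package installed and HF_TOKEN set if the features dataset requires authentication; the date is illustrative:

```python
from src.forecasting.chronos_inference import run_inference

# Smoke test: first available border only, 7-day horizon
result_path = run_inference(
    run_date="2025-09-30",
    forecast_type="smoke_test",
    output_dir="/tmp",
)
print(result_path)  # e.g. /tmp/forecast_2025-09-30_smoke_test.parquet
```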