LSTM Forecasting of Bitcoin Implied Volatility (DVOL)

Abstract

This thesis develops and validates a Long Short-Term Memory (LSTM) neural network model for forecasting Bitcoin implied volatility (DVOL), the Deribit 30-day volatility index. Using a unified framework of 17 models (13 linear/tree baselines + 4 LSTM variants), we demonstrate that LSTM models achieve competitive performance (R² = 0.9287) when properly evaluated, narrowing the gap to linear/tree models to only 2%.

Key Contributions

Multi-window normalization analysis: 72-hour (3-day) window is optimal for R² level prediction
Standard Lee-Mykland (2008) implementation: Academically rigorous jump detection with 236 jumps (0.57%)
DataParallel evaluation artifact: Identified 13.3% R² degradation in multi-GPU LSTM evaluation
LSTM competitiveness: With fixed evaluation, LSTM (R²=0.9287) within 2% of best linear/tree models

Model Comparison Results

All 17 models use identical preprocessing (60/20/20 split, 720h rolling normalization):

Model	Type	Features	R²	RMSE
RF (Lags + Jumps)	Tree	11	0.9492	1.65
OLS (Lags + Jumps)	Linear	11	0.9490	1.65
HAR-RV	Linear	3	0.9454	1.71
LSTM market_lags	LSTM	7	0.9287	1.71

Practical Implications

Suitable for: Risk management, option pricing, volatility level estimation, VaR calculations
NOT suitable for: Directional trading, market timing strategies
Directional accuracy ~50% across all models indicates hourly direction is fundamentally unpredictable

Code & Data

The complete codebase, dataset documentation, and reproducibility instructions are available on GitHub.

Overview

Research Objective

Core Predictors

Lagged DVOL

Transaction Volume (USD)

Active Addresses Count

Network Value to Realized Value (NVRV)

DVOL-RV Spread

Technologies

Abstract

Key Contributions

Model Comparison Results

Practical Implications

Code & Data