This repository is a collection of public-domain data on winning Ethereum MEV-Boost bids.
Block data is extracted with cryo through an Infura RPC endpoint and merged with data from the following relays:
We then enrich the data with Flashbots's public builder mapping and with the mevboost.pics proposer mapping.
Data coverage begins on October 11, 2023. We may backfill earlier data over time.
The data is delivered in daily Parquet chunks, so users who keep their datasets up to date only need to download the newly added files.
# Validated with python 3.13.1
import os
import pandas as pd # pandas==2.2.1
base_path = './data/'
file_paths = os.listdir(base_path)
# this assumes that the data directory is in the working directory
dfs = []
for file in file_paths:
    df_tmp = pd.read_parquet(os.path.join(base_path, file))
    dfs.append(df_tmp)
df = pd.concat(dfs)
df.sort_values(by=['block_number', 'bid_timestamp_ms'], ascending=True, inplace=True)
# sorting by block_number first and bid_timestamp_ms second keeps the data
# ordered even for non-MEV blocks, which have no bid_timestamp_ms value.
df.reset_index(inplace=True, drop=True)
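Because the data arrives as daily chunks, a refresh only needs to read files that have not been loaded yet. The following is a minimal sketch of that idea, not part of the repository's tooling: `find_new_chunks` and `already_loaded` are hypothetical names, and it assumes every chunk in the data directory ends in `.parquet`.

```python
from pathlib import Path


def find_new_chunks(base_path, already_loaded):
    """Return Parquet files in base_path that are not in already_loaded.

    already_loaded is a set of file names (hypothetical bookkeeping kept
    by the caller); the result is sorted by file name.
    """
    return sorted(
        p for p in Path(base_path).glob("*.parquet")
        if p.name not in already_loaded
    )
```

Each returned path could then be passed to `pd.read_parquet` and appended to the existing DataFrame with `pd.concat`, followed by the same sort as above.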
block_datetime: datetime64[ns]
slot: int64
block_number: int64
block_hash: str
value_eth: float64
base_fee_per_gas: float64
gas_limit: float64
gas_used: int64
builder_pubkey: str
relay: str
bid_timestamp_ms: float64
slot_time_ms: float64
proposer_pubkey: str
block_timestamp: float64
extra_data: str
builder_label: str
proposer_label: str
lido_node_operator: str
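As an illustration of working with these columns, the sketch below summarizes MEV-Boost blocks per builder. It assumes that non-MEV blocks can be identified by a missing `bid_timestamp_ms`, as the sorting comment in the loading snippet suggests. The function name `summarize_builders` and every value in the sample frame are invented for the example; only the column names come from the schema.

```python
import pandas as pd

# Hand-made sample; values are invented, column names follow the schema.
sample = pd.DataFrame({
    "block_number": [100, 101, 102, 103],
    "value_eth": [0.05, 0.10, None, 0.20],
    "bid_timestamp_ms": [1.0e12, 1.0e12 + 12_000, None, 1.0e12 + 36_000],
    "builder_label": ["builder_a", "builder_b", None, "builder_a"],
})


def summarize_builders(df):
    """Count MEV-Boost blocks and sum winning bid value per builder label."""
    # Assumption: rows without a bid timestamp are non-MEV blocks.
    mev = df[df["bid_timestamp_ms"].notna()]
    summary = (
        mev.groupby("builder_label", dropna=False)
        .agg(blocks=("block_number", "count"),
             total_value_eth=("value_eth", "sum"))
        .sort_values("blocks", ascending=False)
    )
    # Share of MEV-Boost blocks won by each builder.
    summary["block_share"] = summary["blocks"] / len(mev)
    return summary


summary = summarize_builders(sample)
```

Passing `dropna=False` keeps builders without a label in the summary as their own group rather than silently dropping them.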