Data Validation Reference¶
Validation settings are configured under the validation prefix and are parsed by DataSourceFactory.
Validation Keys¶
validation.require_monotonic_timestampsboolean.validation.check_price_boundsboolean.validation.check_gapboolean.validation.max_gap_secondsinteger.validation.check_price_jumpboolean.validation.max_jump_pctdouble.validation.check_future_timestampsboolean.validation.max_future_skew_secondsinteger.validation.check_trading_hoursboolean.validation.trading_start_secondsinteger (seconds since midnight).validation.trading_end_secondsinteger (seconds since midnight).validation.check_volume_boundsboolean.validation.max_volumeinteger.validation.max_pricedouble.validation.check_outliersboolean.validation.outlier_zscoredouble.validation.outlier_warmupinteger.
Actions¶
These keys control how the pipeline reacts:
validation.on_error:skip,fill,continue,fail.validation.on_gap:skip,fill,continue,fail.validation.on_warning:skip,fill,continue,fail.
Example¶
data:
type: csv
data_directory: data
file_pattern: "{symbol}.csv"
has_header: true
validation:
require_monotonic_timestamps: true
check_gap: true
max_gap_seconds: 600
check_outliers: true
outlier_zscore: 4.0
on_error: fail
on_gap: fill
on_warning: continue