BitBank
Blog > Backtests That Survive Live Crypto Markets

Backtests That Survive Live Crypto Markets

Most trading strategies look best before they meet a live exchange. The reason is rarely bad code. It is usually bad assumptions. Crypto backtests often bake in clean fills, stable universes, perfect data, and zero emotional pressure. Live trading removes all of that in a weekend.

Start With a Simple Rule

A backtest is only useful if it answers the question: "Would this process still make sense after costs, delays, and regime changes?" If the answer depends on one idealized assumption, the result is probably fragile.

1. Protect the Timeline

The first failure mode is leakage. If any feature, ranking, normalization, or model fit touches future data, the equity curve becomes fiction.

  • Use chronological splits: never random train/test splits for time series
  • Retrain only on past data: every validation decision should respect what was known at that timestamp
  • Walk forward: test the strategy the way it would actually be updated in production

In crypto, even small leakage matters because many short-horizon edges are small. A tiny timing mistake can look like alpha.

2. Model the Costs That Actually Hurt

There is no serious crypto backtest without friction. The minimum cost stack usually includes:

  • Fees: taker fees especially punish high-turnover systems
  • Slippage: fills get worse when volatility expands
  • Funding: perpetual positions can bleed even if price does not move much
  • Spread crossing: the first tax many naive simulations ignore

If a strategy dies when you add realistic costs, that does not mean the simulator is harsh. It means the original edge was never large enough.

3. Separate Forecast Skill From Execution Skill

Traders often bundle too many things together. A model may predict direction well but be attached to poor entries. Another may forecast volatility correctly but have weak sizing rules. Evaluate the pipeline in layers:

  • Forecast quality: did the model predict the move, range, or regime?
  • Decision logic: were entries and exits consistent with the forecast?
  • Portfolio construction: did correlated positions quietly multiply risk?

This decomposition makes debugging faster. Otherwise everything gets labeled "the strategy."

4. Treat the Trading Universe as Part of the Test

Crypto markets change fast. Pairs delist. Liquidity migrates. New narratives create temporary volume that never returns. A backtest that includes only survivors is cleaner than reality and usually overstates performance.

Survivorship bias matters even more in altcoins because many symbols only look tradeable after you forget the ones that disappeared.

5. Measure Stability, Not Just the Best Outcome

A strong backtest is not just profitable. It is robust across nearby settings and ugly conditions. Ask questions like:

  • Does performance collapse if fees rise slightly?
  • Does one month produce most of the profits?
  • Does the edge survive trending and choppy periods?
  • Does the strategy degrade gracefully, or fail all at once?

Robustness is usually a better signal than peak return.

6. Use Distribution Metrics, Not Just One Summary Number

Net return is the last line, not the whole report. Good reviews also include:

  • Maximum drawdown: what the worst path really looks like
  • Profit factor: whether gross winners pay for gross losers
  • Expectancy after costs: whether the strategy has edge per trade
  • Turnover: whether the edge depends on overtrading
  • Trade distribution: whether the result rests on a few outsized wins

A Practical Standard

If a strategy only works with one exact threshold, one exact date range, and one cost assumption, it is not production-ready. A better standard is boring but useful: walk-forward validation, realistic frictions, stable sizing, and results that remain acceptable after you stress them.

That approach produces fewer exciting screenshots. It also produces systems with a chance of surviving first contact with the market.

Plug: BitBank's live crypto dashboard is built around the same idea: evaluate signals in current market conditions, not just on a polished historical curve.