2026.06 / horse racing tips research
Horse Racing Tips Research: What RaceEdge Has Learned So Far
RaceEdge started as a horse racing tips app, but the research has pushed it in a more useful direction: less “today’s nap”, more transparent evidence lab. The interesting work so far is not whether one runner happened to win yesterday. It is whether a signal can survive real data, paper tracking, realistic prices, drawdown checks, and enough sample size to be worth trusting.
Nothing in this post is betting advice or a promise of profit. RaceEdge is an evidence-first racing research project. Strategies stay research-only until they survive live/paper validation with executable-price evidence.
The current product stance
The live app is intentionally conservative. RaceEdge shows UK and Ireland racecards, ranked selections, reasoning, confidence, risk labels, responsible-gambling copy, and a clear “no mock data” principle. If the real source fails, the app should fail visibly instead of inventing fake runners.
The research dashboard is even stricter. Its current promotion status is still collecting evidence. The live paper record has not reached the guardrails needed to promote anything as a tested strategy. The current threshold is at least 250 settled paper signals, plus positive live paper ROI and drawdown discipline. At the latest check, RaceEdge had 54 settled paper signals, with live paper ROI still negative.
What we are measuring
The key lesson has been to separate different kinds of “evidence”. They are not equal.
- Proxy or SP-only evidence can be useful for smoke tests, but it can flatter a signal that would not have been executable at a real price.
- Betfair delayed or Exchange-style evidence is more useful because it gets closer to the prices a real user could have faced.
- Official historical price validation, using BSP, PPWAP, and MorningWAP-style sources, is a stronger filter for historical slices.
- Forward paper tracking is the product guardrail: a promising backtest still has to prove it can produce signals and settle sensibly in the live system.
This is why RaceEdge reports mode-by-mode and source-by-source results rather than blending everything into one headline number.
The research database is no longer tiny
The VPS research dataset now contains a large historical store separate from the production app. At migration time it held roughly:
- 1.85 million runner training rows.
- 949k Betfair BSP price rows.
- Paper signal and Exchange snapshot tables for live evidence capture.
That does not magically create an edge, but it changes the quality of the questions. Instead of “did this tip win?”, we can ask: does this slice hold across years, countries, score bands, price sources, and drawdown constraints?
The strongest historical finding so far: narrow lay slices
The best current historical candidates are not broad “back the top-rated horse” rules. They are narrow lay slices found in official-price validation across 2023–2025, using BSP, PPWAP, and MorningWAP style sources with commission and slippage assumptions.
Two slices survived the cross-source filter:
- Odds 1.50–1.79, score 60–69, medium risk, GB+IRE: worst-source ROI around +4.96% on liability, average ROI around +6.72%, with positive results across the three price sources.
- Score 70–79, medium risk, GB only: worst-source ROI around +1.10% on liability, average ROI around +4.82%, also positive across the three price sources.
Those sound promising, but the important word is candidate. The first slice had a negative 2025 under some sources, and the second had a weak edge with a negative 2024 across sources. RaceEdge therefore keeps them in paper-only Exchange slice candidate tracking, not in promoted staking.
Historical slice passes filters
↓
Keep it research-only
↓
Capture live/paper Exchange-style prices
↓
Settle separately by mode and source
↓
Only consider promotion after sample-size and drawdown guards
The most useful failure: the proxy ML lay idea
The most valuable research result so far may be a rejection. A proxy model-lay idea looked attractive in sampled historical testing: it had more than 17k proxy lays and roughly +5% ROI on liability, with no losing folds in that proxy view.
Then official Betfair price validation broke it.
- Validated candidate lays: 5,289.
- Matched Betfair lays: roughly 4.15k.
- BSP ROI: -3.27%.
- PPWAP ROI: -2.72%.
- MorningWAP ROI: -11.99%.
- Losing years: 3/3 on every official-price source.
That result is exactly why the research process exists. A signal that looks good under proxy assumptions can be unusable at real market prices. RaceEdge should block those ideas, not bury the bad report.
What the live paper record says right now
The live research dashboard currently shows a mixed and immature evidence picture. The sample is still small. Some lay slice candidates are positive in early paper tracking, but the overall live paper ROI remains negative and the executable Betfair delayed bucket is worse than proxy-only settlement.
That is the warning sign RaceEdge is designed to surface: proxy-only results can look encouraging while executable-price evidence is still weak or negative. Until the live record is larger and more consistent, the honest answer is: keep collecting evidence.
How this changes the app
The research so far has pushed RaceEdge toward a clearer product identity:
- Daily tips are useful as a visible scoring layer, not a guarantee.
- The real product is the feedback loop: score, capture, settle, compare, and reject weak ideas.
- Backtest output should be labelled by evidence level, not presented as a finished tipster system.
- Any mode that cannot survive Betfair-style validation stays blocked or paper-only.
The next research priorities
The next useful RaceEdge work is not to add more hype. It is to improve forward evidence quality:
- Keep capturing Betfair delayed/VPN Exchange snapshots for open paper signals.
- Separate P&L by mode and price source so proxy-only performance cannot hide executable-price weakness.
- Let the candidate slices accumulate enough settled signals before changing any promotion status.
- Use the historical database to find robust, boring, cross-source patterns rather than one-off lucky slices.
- Keep the public app honest with risk warnings, sample-size labels, and responsible-gambling messaging.
That is slower than a normal horse racing tips site, but it is the only version of this project I trust. RaceEdge should earn confidence by showing the work: what passed, what failed, and what is still too early to call.
Live project: raceedge.uk.