Data parts – what they are and how they could be used?

The new SQ build 125 introduces multiple data range parts. Until now, you were able to divide your history data to two parts only:

In Sample – this is where the strategies are evolved using genetic evolution. This means that strategy is evaluated on this part of data, and its performance score (fitness) is computed from metrics in this part of data.This fitness then determines which strategies in population are selected to be crossed and mutated to create a new generation. The best strategies have highest probability to be chosen for this and thanks to this the population as a whole should get better with every generation.
Out Of Sample – this is “unknown” part of data that was not part of evolution. It is used to verify that strategies work also on “unknown” data.
Genetic evolution doesn’t see this part of data.

The new build 125 adds two more possible types, making it 4 in total:

In Sample Training (IST) – this is the same as In Sample that we had until now. Genetic evolution uses this part to determine fitness and rank the strategies in population.
In Sample Validation (ISV) – a new part in SQ X that is used to determine if strategy performance in IST part holds also in ISV part.
In machine learning it is used to determine if models trained on Training set (IST) holds also in Validation set.
In SQ X it can be used to restart genetic evolution when fitness stagnates in this part.
Out of sample – this is as same as before, it represents an “unknown” part of data that was not part of the evolution
No Trade – special part that means that strategy will not trade in this part. It can be used for example to skip a part in the middle of data that has low volatility.

The general recommendation in machine learning is to split the history data to 3 same parts: IST, ISV and OOS.

Another possible split could be 60/20/20, or move Out of Sample period to the front.

Multiple data sections

Another new feature if SQ X B 125 is that you can define multiple In Sample Validation or Out of Sample parts, not just one, and in any order you want.

In the picture above the white parts are all In Sample training – these are the data on which strategies are evolved.

Blue (ISV) parts are data on which strategies are verified, and evolution can be restarted when it is stagnating.

Gray (no trade) part is the one that is left out – strategy doesn’t trade here.

Green parts are OOS parts – they are not part of genetic evolution and strategies are evaluated on unknown data.

Conditions and filtering by each section

All metrics are now computed also independently for each of these parts, and you can use them in your conditions.

For example, you can filter out strategies where: Net profit (OOS1) is worse than 80% of Net profit (OOS2).

This allows you to use “stricter” filtering where strategy has to perform well on multiple parts of data.

Mark Fric

8 Comments

Oldest

Newest Most Voted

Inline Feedbacks

View all comments

Emmanuel

12. 4. 2022 9:14 am

Thank you

murty

17. 6. 2023 10:57 pm

Here, for Net profit (IS) , IS includes IST and ISV1 and ISV2. Am I right?

David

Reply to murty

14. 1. 2024 4:26 pm

Good question. Anybody from the SQ team who want to answer?

tomas262

Admin

Reply to murty

16. 1. 2024 10:29 pm

Yes, correct

Joshua Murlin

4. 12. 2023 9:44 pm

i feel very dumb trying to understand what this means.

tomas262

Admin

Reply to Joshua Murlin

5. 12. 2023 7:53 pm

if you have a specific questions feel free to ask

Juan Perez

6. 4. 2024 5:41 am

Can you explain with more details what’s the difference between ISV and OOS? Maybe is something related to machine learning but I don’t get it

tomas262

Admin

Reply to Juan Perez

12. 4. 2024 8:20 pm

That is a data part in SQ X that is used to determine if strategy performance in IST (training part) holds also in ISV part. In SQ X it can be used to restart genetic evolution when fitness stagnates in this part. So it is kind of semi-OOS used to verify performance before it is verified on OOS

Continue reading

Release of SQX 139 Dev 1 and what’s planned for year 2024

We’d like to announce the release of the new SX 139 Dev 1 version – note that this is a development version for testing, not the final 139 version. Most …

Mark Fric

19. 3. 2024

News

Algo Trading Full Course is now available on our YouTube channel

Dive into Algorithmic Trading Without the Coding Headache! Are you intrigued by algorithmic trading but dread the thought of coding? Today marks the beginning of our exciting series that’s about …

Tomas Vanek

5. 3. 2024

News Tips & Hints

Profitable & verified results of trader Naoufel in 2023

In this interview, we catch up with Naoufel, a seasoned trader, to explore his journey through the stormy market of 2023. Naoufel is successful trader with verfied track record who …

Ellie Souckova

12. 12. 2023

Data parts – what they are and how they could be used?

Multiple data sections

Conditions and filtering by each section

Continue reading

Release of SQX 139 Dev 1 and what’s planned for year 2024

Algo Trading Full Course is now available on our YouTube channel

Profitable & verified results of trader Naoufel in 2023

Products

Resources

Company

Follow us