Skip to content

Commit a1cddf5

Browse files
8orrin9gitbook-bot
authored andcommitted
GITBOOK-23: Data Split
1 parent e6b552b commit a1cddf5

File tree

3 files changed

+19
-0
lines changed

3 files changed

+19
-0
lines changed

docs/.gitbook/assets/image (146).png

81.9 KB
Loading

docs/.gitbook/assets/image (147).png

102 KB
Loading
Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,21 @@
11
# 2. Data Split
22

3+
4+
5+
<figure><img src="../.gitbook/assets/image (146).png" alt="" width="211"><figcaption></figcaption></figure>
6+
7+
1. Click on _**Data Split**_ in the _**Machine Learning**_ category.
8+
9+
10+
11+
<figure><img src="../.gitbook/assets/image (147).png" alt="" width="563"><figcaption></figcaption></figure>
12+
13+
2. _**Input Data**_: Choose whether the target data is included in the input data. If it is, select _**Feature Data**_ and _**Target Data**_ separately. You can also select specific columns from one dataset using the _**funnel icon**_.
14+
3. _**Test Size**_: Select the percentage of input data to use for testing purposes.
15+
4. _**Random State**_: Generate the same random state, ensuring consistent data splits each time. (If not set, data will be randomly split differently each time.)
16+
5. _**Shuffle**_: Shuffle the data randomly to prevent the model from relying on the order of the data, thereby reducing bias and improving generalization performance.
17+
6. _**Stratify**_: Maintain class ratios when splitting the data to prevent over-representation of certain classes (Classification).
18+
7. _**Allocate to**_: Assign variable names to the split data.
19+
8. _**Code View**_: Preview the code that will be output.
20+
9. _**Run**_: Execute the code.
21+

0 commit comments

Comments
 (0)