
Commit 53daaf3

8orrin9 authored and gitbook-bot committed
GITBOOK-26: Regressor
1 parent c136336 commit 53daaf3

36 files changed: +211 -12 lines changed
148 KB
21.8 KB

docs/.gitbook/assets/image (1).png

-106 KB
207 KB

docs/.gitbook/assets/image (10).png

-133 KB
208 KB

docs/.gitbook/assets/image (11).png

-128 KB
205 KB

docs/.gitbook/assets/image (12).png

-116 KB
205 KB

docs/.gitbook/assets/image (13).png

19.6 KB

docs/.gitbook/assets/image (150).png

81.9 KB

docs/.gitbook/assets/image (151).png

73.6 KB

docs/.gitbook/assets/image (152).png

73.7 KB
50 KB

docs/.gitbook/assets/image (2).png

7.99 KB
53.9 KB

docs/.gitbook/assets/image (3).png

-1.56 KB
163 KB

docs/.gitbook/assets/image (4).png

-107 KB
163 KB

docs/.gitbook/assets/image (5).png

-103 KB
143 KB

docs/.gitbook/assets/image (6).png

-67.3 KB
123 KB

docs/.gitbook/assets/image (7).png

-38.7 KB
152 KB

docs/.gitbook/assets/image (8).png

-68.2 KB
37.1 KB

docs/.gitbook/assets/image (9).png

31.5 KB

docs/.gitbook/assets/image.png

-143 KB

docs/data-analysis/3.-data-info.md

Lines changed: 6 additions & 6 deletions
Original file line number | Diff line number | Diff line change
@@ -6,13 +6,13 @@ description: Check the Basic Information of the Data
66

77

88

9-
<figure><img src="../.gitbook/assets/image (8).png" alt="" width="312"><figcaption></figcaption></figure>
9+
<figure><img src="../.gitbook/assets/image (8) (1).png" alt="" width="312"><figcaption></figcaption></figure>
1010

1111
1. Click on _**Data Info**_ in the Data Analysis category.
1212

1313

1414

15-
<figure><img src="../.gitbook/assets/image (9).png" alt=""><figcaption></figcaption></figure>
15+
<figure><img src="../.gitbook/assets/image (9) (1).png" alt=""><figcaption></figcaption></figure>
1616

1717
2. Click on the _**Data tab**_ at the top left to select the DataFrame for which you want to view information.
1818

@@ -28,7 +28,7 @@ description: Check the Basic Information of the Data
2828

2929

3030

31-
<figure><img src="../.gitbook/assets/image (10).png" alt=""><figcaption></figcaption></figure>
31+
<figure><img src="../.gitbook/assets/image (10) (1).png" alt=""><figcaption></figcaption></figure>
3232

3333
1. _**Info**_ provides basic information such as Column, Non-Null Count, Dtype, etc.
3434
2. _**Describe**_ displays basic statistics for each column.
@@ -41,7 +41,7 @@ description: Check the Basic Information of the Data
4141

4242

4343

44-
<figure><img src="../.gitbook/assets/image (11).png" alt=""><figcaption></figcaption></figure>
44+
<figure><img src="../.gitbook/assets/image (11) (1).png" alt=""><figcaption></figcaption></figure>
4545

4646
1. _**Null Count**_ shows the count of Null and Non-Null values for each column.
4747
2. _**Duplicated**_ reveals the count of duplicated values.
@@ -54,7 +54,7 @@ description: Check the Basic Information of the Data
5454

5555

5656

57-
<figure><img src="../.gitbook/assets/image (12).png" alt=""><figcaption></figcaption></figure>
57+
<figure><img src="../.gitbook/assets/image (12) (1).png" alt=""><figcaption></figcaption></figure>
5858

5959
1. Check and confirm desired statistical values.
6060

@@ -68,7 +68,7 @@ description: Check the Basic Information of the Data
6868

6969

7070

71-
<figure><img src="../.gitbook/assets/image (13).png" alt=""><figcaption></figcaption></figure>
71+
<figure><img src="../.gitbook/assets/image (13) (1).png" alt=""><figcaption></figcaption></figure>
7272

7373
1. _**Correlation Table**_ shows a table indicating the correlation between each column.
7474
2. _**Correlation Matrix**_ represents the correlation table as a Heat Map.

docs/data-analysis/4.-frame/4-3.-frame-sort.md

Lines changed: 3 additions & 3 deletions
@@ -2,7 +2,7 @@
22

33

44

5-
<figure><img src="../../.gitbook/assets/image (1) (1).png" alt=""><figcaption></figcaption></figure>
5+
<figure><img src="../../.gitbook/assets/image (1) (1) (1).png" alt=""><figcaption></figcaption></figure>
66

77
1. _**Sort Index**_ sets the ascending/descending order of rows or columns.
88
2. _**Sort Values**_ sorts based on the data entered in each column. You must select one or more columns.
@@ -15,7 +15,7 @@
1515

1616

1717

18-
<figure><img src="../../.gitbook/assets/image (2).png" alt="" width="375"><figcaption></figcaption></figure>
18+
<figure><img src="../../.gitbook/assets/image (2) (1).png" alt="" width="375"><figcaption></figcaption></figure>
1919

2020
1. _**Axis**_: Select whether to change the order of rows or columns.
2121
2. _**Sort by level**_: When the rows or columns have multiple layers (multi-level), choose which layer to use as the basis for changing the order.
@@ -29,7 +29,7 @@
2929

3030

3131

32-
<figure><img src="../../.gitbook/assets/image (3).png" alt="" width="375"><figcaption></figcaption></figure>
32+
<figure><img src="../../.gitbook/assets/image (3) (1).png" alt="" width="375"><figcaption></figcaption></figure>
3333

3434
1. _**Sort by column**_ sets which columns to sort by and their order of priority.
3535

docs/machine-learning/4.-automl.md

Lines changed: 19 additions & 0 deletions
@@ -1,2 +1,21 @@
11
# 4. AutoML
22

3+
4+
5+
<figure><img src="../.gitbook/assets/image (150).png" alt="" width="211"><figcaption></figcaption></figure>
6+
7+
1. Click on _**AutoML**_ in the _**Machine Learning**_ category.
8+
9+
10+
11+
<figure><img src="../.gitbook/assets/image (152).png" alt="" width="563"><figcaption></figcaption></figure>
12+
13+
2. _**Model Type**_: Choose the type of model to generate. You can conveniently install necessary packages through _**Install**_.
14+
3. _**Generation(TPOT)**_: Set the number of generations to run. Each generation creates and evaluates candidate model pipelines, so higher values increase execution time.
15+
4. _**Population Size(TPOT)**_: Set the number of candidate pipelines kept in each generation.
16+
5. _**Cv(TPOT)**_: Determine the number of folds used for cross-validation.
17+
6. _**Random State(TPOT)**_: Set the seed for the random number generator. If not set, results differ from run to run.
18+
7. _**Allocate to**_: Enter the variable name to assign to the generated model.
19+
8. _**Code View**_: Preview the code that will be output.
20+
9. _**Run**_: Execute the code.
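
As a rough guide to how _**Generation**_, _**Population Size**_, and _**Cv**_ drive run time, the sketch below counts model fits. It assumes TPOT's documented behaviour of evaluating `population_size + generations * offspring_size` pipelines, with the offspring size equal to the population size (the TPOT default). `tpot_fit_count` is a hypothetical helper for illustration, not part of TPOT or Visual Python.

```python
def tpot_fit_count(generations: int, population_size: int, cv: int) -> int:
    """Estimate how many model fits a TPOT search performs.

    Assumption: offspring_size == population_size (TPOT's default), so the
    search evaluates population_size + generations * population_size
    pipelines, and each pipeline is fitted once per cross-validation fold.
    """
    pipelines = population_size + generations * population_size
    return pipelines * cv


# Small search: 5 generations, 20 candidates per generation, 5-fold CV.
print(tpot_fit_count(generations=5, population_size=20, cv=5))  # 600
```

Doubling either the number of generations or the population size roughly doubles the work, which is why small values are a sensible starting point.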
21+

docs/machine-learning/5.-regressor.md

Lines changed: 180 additions & 0 deletions
@@ -1,2 +1,182 @@
11
# 5. Regressor
22

3+
4+
5+
<figure><img src="../.gitbook/assets/image.png" alt="" width="209"><figcaption></figcaption></figure>
6+
7+
1. Click on _**Regressor**_ in the _**Machine Learning**_ category.
8+
9+
10+
11+
<figure><img src="../.gitbook/assets/image (1).png" alt="" width="563"><figcaption></figcaption></figure>
12+
13+
2. _**Model Type**_: Choose the regression model.
14+
15+
* [Linear Regression](5.-regressor.md#linear-regression)
16+
* [Ridge / Lasso](5.-regressor.md#ridge-lasso)
17+
* [ElasticNet](5.-regressor.md#elasticnet)
18+
* [SVR(SupportVectorMachine Regressor)](5.-regressor.md#svr-supportvectormachine-regressor)
19+
* [DecisionTree Regressor](5.-regressor.md#decisiontree-regressor)
20+
* [RandomForest Regressor](5.-regressor.md#randomforest-regressor)
21+
* [GradientBoosting Regressor](5.-regressor.md#gradientboosting-regressor)
22+
* [XGB Regressor](5.-regressor.md#xgb-regressor)
23+
* [LGBM Regressor](5.-regressor.md#lgbm-regressor)
24+
* [CatBoost Regressor](5.-regressor.md#catboost-regressor)
25+
26+
3. _**Allocate to**_: Enter the variable name to assign to the created machine learning model.
27+
4. _**Code View**_: Preview the generated code.
28+
5. _**Run**_: Execute the code.
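
The steps above might lead to code along these lines. This is a sketch rather than the exact output of Visual Python: the variable name `model` stands in for whatever you enter in _**Allocate to**_, and the synthetic data from `make_regression` is an assumption made for illustration.

```python
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic data stands in for your own DataFrame (assumption).
X, y = make_regression(n_samples=200, n_features=4, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# "Allocate to" corresponds to the variable name on the left-hand side.
model = LinearRegression()
model.fit(X_train, y_train)

# R^2 on held-out data; values closer to 1.0 indicate a better fit.
r2 = model.score(X_test, y_test)
print(f"R^2 = {r2:.3f}")
```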
29+
30+
31+
32+
***
33+
34+
### Linear Regression
35+
36+
37+
38+
<figure><img src="../.gitbook/assets/image (2).png" alt="" width="563"><figcaption></figcaption></figure>
39+
40+
1. _**Fit Intercept**_: Choose whether to fit an intercept term. If disabled, the model is forced through the origin.
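
To see what the option changes, the following sketch fits scikit-learn's `LinearRegression` twice on data generated from y = 2x + 5. The data and variable names are assumptions made for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Data generated from y = 2x + 5, so the true intercept is 5.
X = np.arange(10, dtype=float).reshape(-1, 1)
y = 2.0 * X.ravel() + 5.0

with_intercept = LinearRegression(fit_intercept=True).fit(X, y)
no_intercept = LinearRegression(fit_intercept=False).fit(X, y)

# With an intercept, the slope and offset are recovered.
print(with_intercept.coef_[0], with_intercept.intercept_)

# Without one, the intercept is fixed at 0.0 and the slope absorbs the offset.
print(no_intercept.coef_[0], no_intercept.intercept_)
```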
41+
42+
43+
44+
***
45+
46+
### Ridge / Lasso
47+
48+
49+
50+
<figure><img src="../.gitbook/assets/image (3).png" alt="" width="563"><figcaption></figcaption></figure>
51+
52+
1. _**Alpha**_: Set the regularization strength. Higher values shrink the coefficients more strongly.
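
The effect of _**Alpha**_ can be sketched by fitting scikit-learn's `Ridge` and `Lasso` at several values. The synthetic data and the chosen alphas are assumptions made for illustration.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso, Ridge

X, y = make_regression(n_samples=100, n_features=10, n_informative=3,
                       noise=5.0, random_state=0)

alphas = (0.1, 10.0, 1000.0)
ridge_norms = []  # L2 norm of the Ridge coefficients for each alpha
lasso_zeros = []  # number of Lasso coefficients that are exactly zero

for alpha in alphas:
    ridge = Ridge(alpha=alpha).fit(X, y)
    lasso = Lasso(alpha=alpha).fit(X, y)
    ridge_norms.append(float(np.linalg.norm(ridge.coef_)))
    lasso_zeros.append(int(np.sum(lasso.coef_ == 0)))
    print(f"alpha={alpha:>7}: ridge norm={ridge_norms[-1]:8.2f}, "
          f"lasso zero coefficients={lasso_zeros[-1]}")
```

Ridge shrinks all coefficients toward zero as alpha grows, while Lasso tends to set some of them exactly to zero.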
53+
54+
55+
56+
***
57+
58+
### ElasticNet
59+
60+
61+
62+
<figure><img src="../.gitbook/assets/image (4).png" alt="" width="563"><figcaption></figcaption></figure>
63+
64+
1. _**Alpha**_: Set the regularization strength. Higher values shrink the coefficients more strongly.
65+
2. _**L1 ratio**_: Sets the mix between _**L1 (Lasso)**_ and _**L2 (Ridge)**_ regularization, from 0 (L2 only) to 1 (L1 only).
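
A minimal sketch of the two options with scikit-learn's `ElasticNet`, assuming synthetic data: with `l1_ratio=1.0` the penalty is pure L1, so the result should match `Lasso` at the same alpha.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import ElasticNet, Lasso

X, y = make_regression(n_samples=100, n_features=10, n_informative=3,
                       noise=5.0, random_state=0)

# l1_ratio=1.0 uses only the L1 penalty, so it should agree with Lasso.
enet_l1 = ElasticNet(alpha=1.0, l1_ratio=1.0).fit(X, y)
lasso = Lasso(alpha=1.0).fit(X, y)
matches_lasso = np.allclose(enet_l1.coef_, lasso.coef_)
print(matches_lasso)

# A mixed penalty: half L1, half L2.
enet_mixed = ElasticNet(alpha=1.0, l1_ratio=0.5).fit(X, y)
print(np.round(enet_mixed.coef_, 2))
```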
66+
67+
68+
69+
***
70+
71+
### SVR(SupportVectorMachine Regressor)
72+
73+
74+
75+
<figure><img src="../.gitbook/assets/image (5).png" alt="" width="563"><figcaption></figcaption></figure>
76+
77+
1. _**C**_: Regularization parameter. The strength of regularization is inversely proportional to C, so higher values fit the training data more closely and make the model more complex.
78+
2. _**Kernel**_: Function mapping data to a higher-dimensional space, controlling model complexity.
79+
80+
* _**Degree(Poly)**_: Determines the degree of polynomial.
81+
* _**Gamma(Poly, rbf, sigmoid)**_: Kernel coefficient. Higher values make the fit follow nearby training samples more closely.
82+
* _**Coef0(Poly, sigmoid)**_: Independent (constant) term in the kernel function.
83+
84+
3. _**Random state**_: Sets the seed value for the random number generator used in model training.
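
The sketch below shows how these options map to scikit-learn's `SVR`, assuming synthetic sine-wave data. As far as the scikit-learn signature goes, `SVR` does not take a `random_state` argument, so it is left out here; how Visual Python handles that field is not covered by this example.

```python
import numpy as np
from sklearn.svm import SVR

# Noisy sine wave as illustrative data (assumption).
rng = np.random.default_rng(0)
X = np.sort(rng.uniform(0, 5, size=(80, 1)), axis=0)
y = np.sin(X).ravel() + rng.normal(scale=0.1, size=80)

# Low C means strong regularization; high C means weak regularization.
low_c = SVR(kernel="rbf", C=0.01, gamma="scale").fit(X, y)
high_c = SVR(kernel="rbf", C=100.0, gamma="scale").fit(X, y)
low_c_score = low_c.score(X, y)
high_c_score = high_c.score(X, y)
print(f"train R^2 with C=0.01: {low_c_score:.3f}")
print(f"train R^2 with C=100:  {high_c_score:.3f}")

# degree and coef0 only take effect for some kernels, such as "poly".
poly = SVR(kernel="poly", degree=3, coef0=1.0, C=1.0)
print(poly.get_params()["degree"], poly.get_params()["coef0"])
```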
85+
86+
87+
88+
***
89+
90+
### DecisionTree Regressor
91+
92+
93+
94+
<figure><img src="../.gitbook/assets/image (6).png" alt="" width="563"><figcaption></figcaption></figure>
95+
96+
1. _**Criterion**_: Specifies the measure used for node splitting.
97+
2. _**Max depth**_: Specifies the maximum depth of the tree.
98+
3. _**Min Samples Split**_: Specifies the minimum number of samples required to split a node.
99+
4. _**Random state**_: Sets the seed value for the random number generator used in model training.
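
A minimal sketch of these options with scikit-learn's `DecisionTreeRegressor`, assuming synthetic data. The criterion is left at its default because the accepted names differ between scikit-learn versions (for example `"squared_error"` in 1.0 and later, `"mse"` before that).

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Noisy sine wave as illustrative data (assumption).
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.2, size=200)

tree = DecisionTreeRegressor(
    max_depth=3,           # Max depth
    min_samples_split=10,  # Min Samples Split
    random_state=42,       # Random state
)
tree.fit(X, y)

# The fitted tree never exceeds the requested depth (at most 2**3 leaves).
print(tree.get_depth(), tree.get_n_leaves())
```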
100+
101+
102+
103+
***
104+
105+
### RandomForest Regressor
106+
107+
108+
109+
<figure><img src="../.gitbook/assets/image (7).png" alt="" width="563"><figcaption></figcaption></figure>
110+
111+
1. _**N estimators**_: Specifies the number of trees in the ensemble.
112+
2. _**Criterion**_: Specifies the measure used for node splitting.
113+
3. _**Max depth**_: Specifies the maximum depth of the tree.
114+
4. _**Min Samples Split**_: Specifies the minimum number of samples required to split a node.
115+
5. _**N jobs**_: Specifies the number of CPU cores or threads to be used during model training.
116+
6. _**Random State**_: Sets the seed value for the random number generator used in model training.
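
The sketch below passes these options to scikit-learn's `RandomForestRegressor` and checks that a fixed _**Random State**_ makes training repeatable. The data and parameter values are assumptions made for illustration; `build_forest` is a hypothetical helper.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)


def build_forest() -> RandomForestRegressor:
    """Create a forest with fixed settings (hypothetical helper)."""
    return RandomForestRegressor(
        n_estimators=50,      # N estimators
        max_depth=5,          # Max depth
        min_samples_split=4,  # Min Samples Split
        n_jobs=-1,            # N jobs: -1 uses all available cores
        random_state=42,      # Random State
    )


first = build_forest().fit(X, y)
second = build_forest().fit(X, y)

# The same seed gives the same trees, so predictions agree.
reproducible = np.allclose(first.predict(X), second.predict(X))
print(len(first.estimators_), reproducible)
```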
117+
118+
119+
120+
***
121+
122+
### GradientBoosting Regressor
123+
124+
125+
126+
<figure><img src="../.gitbook/assets/image (8).png" alt="" width="563"><figcaption></figcaption></figure>
127+
128+
1. _**Loss**_: Specifies the loss function used.
129+
2. _**Learning rate**_: Scales the contribution of each tree. Lower values usually need more estimators to reach the same fit.
130+
3. _**N estimators**_: Specifies the number of trees in the ensemble.
131+
4. _**Criterion**_: Specifies the measure used for node splitting.
132+
5. _**Random State**_: Sets the seed value for the random number generator used in model training.
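
The trade-off between _**Learning rate**_ and _**N estimators**_ can be sketched with scikit-learn's `GradientBoostingRegressor`. The data and the two learning rates are assumptions made for illustration, and the loss and criterion are left at their defaults.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_squared_error

X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)

train_mse = {}
for learning_rate in (0.01, 0.5):
    gbr = GradientBoostingRegressor(
        learning_rate=learning_rate,  # Learning rate
        n_estimators=20,              # N estimators
        random_state=42,              # Random State
    )
    gbr.fit(X, y)
    train_mse[learning_rate] = mean_squared_error(y, gbr.predict(X))
    print(f"learning_rate={learning_rate}: train MSE={train_mse[learning_rate]:.1f}")
```

With only 20 trees, the small learning rate leaves most of the error in place, which is why it is usually paired with a larger number of estimators.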
133+
134+
135+
136+
***
137+
138+
### XGB Regressor
139+
140+
141+
142+
<figure><img src="../.gitbook/assets/image (10).png" alt="" width="563"><figcaption></figcaption></figure>
143+
144+
1. _**N estimators**_: Specifies the number of trees in the ensemble.
145+
2. _**Max depth**_: Specifies the maximum depth of the tree.
146+
3. _**Learning rate**_: Scales the contribution of each new tree. Lower values usually need more estimators.
147+
4. _**Gamma**_: Specifies the minimum loss reduction required to make a further partition.
148+
5. _**Random State**_: Sets the seed value for the random number generator used in model training.
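
As a sketch of how these fields map to code, the dictionary below uses the matching `XGBRegressor` keyword arguments. The values are hypothetical, and the model is only built if the `xgboost` package is installed.

```python
# Hypothetical values; the keys are XGBRegressor keyword arguments.
xgb_params = {
    "n_estimators": 200,   # N estimators
    "max_depth": 4,        # Max depth
    "learning_rate": 0.1,  # Learning rate
    "gamma": 1.0,          # Gamma: minimum loss reduction needed to split
    "random_state": 42,    # Random State
}

try:
    from xgboost import XGBRegressor
except ImportError:  # xgboost is an optional dependency
    XGBRegressor = None

xgb_model = XGBRegressor(**xgb_params) if XGBRegressor is not None else None
print(xgb_model)
```

A higher _**Gamma**_ makes the algorithm more conservative, because a split is only kept when it reduces the loss by at least that amount.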
149+
150+
151+
152+
***
153+
154+
### LGBM Regressor
155+
156+
157+
158+
<figure><img src="../.gitbook/assets/image (11).png" alt="" width="563"><figcaption></figcaption></figure>
159+
160+
1. _**Boosting type**_: Specifies the boosting type used in the algorithm.
161+
2. _**Max depth**_: Specifies the maximum depth of the tree.
162+
3. _**Learning Rate**_: Scales the contribution of each new tree. Lower values usually need more estimators.
163+
4. _**N estimators**_: Specifies the number of trees in the ensemble.
164+
5. _**Random State**_: Sets the seed value for the random number generator used in model training.
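
The same fields expressed as `LGBMRegressor` keyword arguments might look like the sketch below. The values are hypothetical, and the model is only built if the `lightgbm` package is installed.

```python
# Hypothetical values; the keys are LGBMRegressor keyword arguments.
lgbm_params = {
    "boosting_type": "gbdt",  # Boosting type: gradient boosted decision trees
    "max_depth": -1,          # Max depth: -1 means no limit
    "learning_rate": 0.1,     # Learning Rate
    "n_estimators": 100,      # N estimators
    "random_state": 42,       # Random State
}

try:
    from lightgbm import LGBMRegressor
except ImportError:  # lightgbm is an optional dependency
    LGBMRegressor = None

lgbm_model = LGBMRegressor(**lgbm_params) if LGBMRegressor is not None else None
print(lgbm_model)
```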
165+
166+
167+
168+
***
169+
170+
### CatBoost Regressor
171+
172+
173+
174+
<figure><img src="../.gitbook/assets/image (12).png" alt="" width="563"><figcaption></figcaption></figure>
175+
176+
1. _**Learning rate**_: Scales the contribution of each new tree. Lower values usually need more estimators.
177+
2. _**Loss function**_: Specifies the loss function used.
178+
3. _**Task Type**_: Specifies the hardware used for training (CPU or GPU).
179+
4. _**Max Depth**_: Specifies the maximum depth of the tree.
180+
5. _**N estimators**_: Specifies the number of trees in the ensemble.
181+
6. _**Random State**_: Sets the seed value for the random number generator used in model training.
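
A sketch of the corresponding `CatBoostRegressor` arguments follows. The values are hypothetical; `max_depth`, `n_estimators`, and `random_state` are used here as CatBoost's aliases for `depth`, `iterations`, and `random_seed`, and the model is only built if the `catboost` package is installed.

```python
# Hypothetical values; the keys are CatBoostRegressor keyword arguments.
catboost_params = {
    "learning_rate": 0.05,    # Learning rate
    "loss_function": "RMSE",  # Loss function
    "task_type": "CPU",       # Task Type: "CPU" or "GPU"
    "max_depth": 6,           # Max Depth
    "n_estimators": 500,      # N estimators
    "random_state": 42,       # Random State
}

try:
    from catboost import CatBoostRegressor
except ImportError:  # catboost is an optional dependency
    CatBoostRegressor = None

catboost_model = (
    CatBoostRegressor(**catboost_params) if CatBoostRegressor is not None else None
)
print(catboost_model is not None)
```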
182+

docs/visualization/2.-pandas-plot.md

Lines changed: 3 additions & 3 deletions
@@ -2,13 +2,13 @@
22

33

44

5-
<figure><img src="../.gitbook/assets/image (4).png" alt="" width="285"><figcaption></figcaption></figure>
5+
<figure><img src="../.gitbook/assets/image (4) (1).png" alt="" width="285"><figcaption></figcaption></figure>
66

77
1. Click on _**Pandas Plot**_ in the _**Visualization**_ category.
88

99

1010

11-
<figure><img src="../.gitbook/assets/image (6).png" alt="" width="563"><figcaption></figcaption></figure>
11+
<figure><img src="../.gitbook/assets/image (6) (1).png" alt="" width="563"><figcaption></figcaption></figure>
1212

1313
2. Select the DataFrame to be represented as a plot.
1414

@@ -30,7 +30,7 @@
3030

3131

3232

33-
<figure><img src="../.gitbook/assets/image (7).png" alt="" width="563"><figcaption></figcaption></figure>
33+
<figure><img src="../.gitbook/assets/image (7) (1).png" alt="" width="563"><figcaption></figcaption></figure>
3434

3535
12. Set the _**range of values**_ to be displayed on the axes.
3636
13. Specify the _**interval**_ between axis tick marks.
