Stock Price Trend Forecasting Using Supervised Learning Methods
Stock Price Trend Forecasting Using Supervised Learning Methods
Stock Price Trend Forecasting Using Supervised Learning Methods
The fluctuation of stock market is violent and there are many complicated
financial indicators. However, the advancement in technology, provides an
opportunity to gain steady fortune from stock market and also can help experts
to find out the most informative indicators to make better prediction. The
prediction of the market value is of paramount importance to help in maximizing
the profit of stock option purchase while keeping the risk low.
1. Preprocessing and Cleaning
2. Feature Extraction
6. Conclusions
Structure of Data
We have taken these Historical Daily Stock Market Data for 500 companies of
S&P 500 Market since the company came into existence.
Open, High, Low, Close, Volume, Adjusted Close for each company and S&P
index.
Feature Selection
To prune out less useful features, in Feature Selection, we selected features according to
the k highest scores, with the help of a linear model for testing the effect of a single
regressor, sequentially for many regressors. (SelectKBest Algorithm, with f_regression as
the scorer used)
A F-test (Wikipedia) is a way of comparing the significance of the improvement of a model, with
respect to the addition of new variables.
Twitter Sentiment Score as new feature
Social media plays important role in predicting the stock market return values.
Twitter’s Daily Sentiment Score for each company based upon the user’s tweets
about that particular company and also the tweets on that company’s page.
Once we were ready with complete set of features, we normalized our data for
better results.
Models Used
We have used the following regression models for forecasting and comparison :
1. Neural Networks
2. Bagging Regressor
3. Random Forest Regressor
4. Gradient Boosting Regressor
5. Adaboost Regressor
6. K Neighbour Regressor
Results : AdaBoost Regressor
RMSE : 0.909
Results : Gradient Boosting Regressor
RMSE : 0.961
Results : Bagging Regressor
American Airlines Group (AAL)
RMSE : 0.959
Results : K Neighbours Regressor
American Airlines Group (AAL)
R-squared : 0.00039
RMSE : 0.956
Results
Boosting:
1. Sequential ensemble: try to add new models that do well where previous models lack
2. Aim to decrease bias, and not variance
3. Suitable for low variance high bias models
4. An example of a tree based method is gradient boosting
Milestones