A Stock Price Prediction Method Based On Deep Lear
A Stock Price Prediction Method Based On Deep Lear
A Stock Price Prediction Method Based On Deep Lear
https://www.emerald.com/insight/2398-7294.htm
Stock price
A stock price prediction method prediction
based on deep learning technology method
Xuan Ji, Jiachen Wang and Zhijun Yan
School of Management and Economics, Beijing Institute of Technology,
Beijing, China
Received 31 May 2020
Revised 26 August 2020
Abstract Accepted 7 September 2020
Purpose – Stock price prediction is a hot topic and traditional prediction methods are usually based on
statistical and econometric models. However, these models are difficult to deal with nonstationary time series
data. With the rapid development of the internet and the increasing popularity of social media, online news
and comments often reflect investors’ emotions and attitudes toward stocks, which contains a lot of important
information for predicting stock price. This paper aims to develop a stock price prediction method by taking
full advantage of social media data.
Design/methodology/approach – This study proposes a new prediction method based on deep
learning technology, which integrates traditional stock financial index variables and social media text
features as inputs of the prediction model. This study uses Doc2Vec to build long text feature vectors from
social media and then reduce the dimensions of the text feature vectors by stacked auto-encoder to balance the
dimensions between text feature variables and stock financial index variables. Meanwhile, based on wavelet
transform, the time series data of stock price is decomposed to eliminate the random noise caused by stock
market fluctuation. Finally, this study uses long short-term memory model to predict the stock price.
Findings – The experiment results show that the method performs better than all three benchmark models
in all kinds of evaluation indicators and can effectively predict stock price.
Originality/value – In this paper, this study proposes a new stock price prediction model that incorporates
traditional financial features and social media text features which are derived from social media based on
deep learning technology.
Keywords Text mining, Deep learning, Financial social media, Stock price prediction
Paper type Research paper
1. Introduction
Stock is a financial product characterized by high risk, high return and flexible trading,
which is favored by many investors. Investors can get abundant returns by accurately
estimating stock price trends. However, the stock price is influenced by many factors such
as macroeconomic situation, market condition, major social and economic events, investors’
preferences and companies’ managerial decisions. Therefore, prediction of the stock price
has always been the focus and difficult research topic. Statistical and econometric models
© Xuan Ji, Jiachen Wang and Zhijun Yan. Published in the International Journal of Crowd Science.
Published by Emerald Publishing Limited. This article is published under the Creative Commons
Attribution (CC BY 4.0) license. Anyone may reproduce, distribute, translate and create derivative
works of this article (for both commercial and non-commercial purposes), subject to full attribution to
the original publication and authors. The full terms of this license may be seen at http://
creativecommons.org/licences/by/4.0/legalcode
International Journal of Crowd
Funding: This work was supported by National Key Research and Development Plan of China Science
(Grant No: 2017YFB1400101), National Natural Science Foundation of China (Grant No: 71572013, Emerald Publishing Limited
2398-7294
71872013, 72072011) and Beijing Municipal Social Science Foundation (Grant No: 18JDGLB040). DOI 10.1108/IJCS-05-2020-0012
IJCS are generally used in traditional stock price prediction, but these methods cannot deal with
the dynamic and complex environment of the stock market. Since 1970, with the rapid
development of computer technology, researchers have begun using machine learning to
predict stock prices and fluctuations, helping investors determine investment strategies to
reduce risk and increase returns.
The stock market is a highly complex time series scenario and has typical dynamic
characteristics. There will be a lot of stock dynamic trading after the opening of the market
and stock price will change accordingly. Moreover, the stock price is affected by many
unpredicted factors, which results in a typical nonstationary stock price time-series data.
Therefore, stock price prediction is one of the most challenging problems in all kinds of
prediction research. In the past decades, scholars have studied stock price prediction from
many perspectives, where the improvement of prediction models and the selection of model
features are the two most important directions among them. Most of the early studies used
econometric models, such as autoregressive integrated moving average (ARIMA) and
autoregressive conditional heteroskedastic-autoregressive integrated moving average
(ARCH-ARIMA) (Booth et al., 1994; Engle, 2001), to predict stock price. However, it is
difficult for econometric models to consider the impact of other factors on stock price
fluctuations and they have strong assumptions about the data, which are often difficult to
meet (Le and Xie, 2018). Therefore, machine learning has been widely used in stock price
prediction in recent years and many more suitable models for stock prediction have been
proposed. Many studies have shown that deep learning has superior efficiency than other
models (Marmer, 2008) and neural network models excel regression and discriminant
models (Refenes et al., 1994). In terms of feature selection, some scholars explore the
correlation between new features and stock price and some new features, including political
factors, macroeconomic factors and investors’ sentiment, etc., have been incorporated into
the prediction model (Cervello-Royo et al., 2015; Patel et al., 2015).
Previous literature extensively investigates the stock price prediction methods and many
advanced prediction models are proposed. However, existing approaches on stock price
prediction have two main limitations. First, although the text features are used in the
existing models to better incorporate the important information in social media, they are
usually mined based on traditional text mining technologies, such as the bag-of-word model.
These text mining technologies cannot consider the semantic and other information in social
media which are helpful to improve the performance of prediction models. Second, the
feature dimensionality reduction is a basic step when balancing text features and financial
features in stock price prediction. However, previous price prediction methods usually adopt
principal component analysis (PCA) and latent Dirichlet allocation (LDA) to reduce the
feature dimension. PCA method has problems of information loss and is unable to process
nonlinear data, while the LDA method cannot consider semantic information in social
media. Thus, these two methods are not suitable for the stock price prediction (Bao et al.,
2017).
To fill the research gap discussed above, this paper proposes a new stock price prediction
method based on deep learning technology, which integrates Doc2Vec, stacked auto-encoder
(SAE), wavelet transform and long short-term memory (LSTM) model. Feature extraction of
text information in social media can describe the emotional tendency of investors and help to
predict the stock price more accurately. First, we classify the prediction features into two
types, i.e. financial features and text features. We adopt the widely used financial features
and extract text features from social media by deep learning technology. Second, Doc2Vec
model is used to train original social media documents and obtain text feature vectors.
Doc2Vec model can retain semantic information of documents and the relationship between
different words, which overcomes the shortcomings of traditional text feature extraction Stock price
methods (such as dictionary matching method, term frequency–inverse document frequency prediction
and LDA). Third, SAE is adopted to reduce dimension of text feature vectors, which
balances the dimension of text features with financial features. Fourth, wavelet transform is
method
used to transform the target variable (stock price) and to remove the random noise in the
stock price time series data. Finally, stock finance features and excavated text features are
taken as input features, and LSTM is adopted to predict the stock price.
The rest of this paper is structured as follows. We review the literature on stock price
prediction in Section 2 and introduce our method in Section 3. We explain the research data
and experimental process in Section 4. Finally, we conclude the paper with a summary and
possible future research directions in Section 5.
2. Related literature
Our paper studies the stock price prediction method based on deep learning. The related
research work is mainly about the prediction model and feature selection of the prediction
model. This section will review the literature from these two aspects.
&DOFXODWH ILQDQFLDO
*HQHUDWH WH[W YHFWRUV 3UHSURFHVVHG LQGLFWRUV
E\ 'RFYHF GRFXPHQWV
)LQDQFLDO
'LPHQVLRQ UHGXFWLRQ WH[WYHFWRUV IHDWXUHVRIVWRFN
E\ 6$( RIHYHU\GD\
WH[WIHDWXUHV
3ULFH GDWD GHQRLVHG
DIWHUUHGXFWLRQ
3ULFHGDWDDIWHU
3UHGLFW E\ /670
Figure 1. GHQRLVHG
Stock price prediction
method 5HVXOWV
reflect basic information of stock prices. Text features of social media refer to the text
feature vector containing effective information in social media, which includes investors’
comments in financial social media and the news published by companies.
Financial features are described by daily transaction data and financial technical
indicators (Bao et al., 2017) and include 21 features. Daily transaction data include open/
close price, low/high price, trading volume, change amount and change rate, which are daily
first-hand trading information in the stock market. These characteristics can directly reflect
the historical trading situation of stocks. Financial technical indicators refer to indicators
calculated based on stock trading data, including CCI (commodity channel index), ATR
(average true range), SMI (stochastic momentum index), etc. These indicators can often
reflect some regular characteristics of stock movements. Table 1 gives the list of financial
features.
Text features are extracted from social media and can represent investors’ attitudes
toward the stock, the overall trend of public opinions and the company’s decision
information (Nassirtoussi et al., 2014; Kraus and Feuerriegel, 2017) and some samples of text
are shown in excerpt cases of text from social media (Translated).
Hurry up! Buy it, just think it as giving yourself in advance retirement salary. Meinian Health and
Zhonggong Education are the two long-term stocks I followed this year. They didn’t disappoint
me. Although I didn’t buy much, I was very happy.
Meinian health medical treatment develops rapidly, and sets up health physical examination
center all over the country. Physical examination crowd are becoming more and more, and
physical checks must be prepared in advance, so the potential of Meinian health is huge. Its
performance will get better and better, like that of Maotai, which may be the real reason why
Alibaba has become the second largest shareholder.
The proposed Doc-W-LSTM method uses the Doc2Vec model to extract high-dimensional
text feature vectors. However, the trained text feature vectors will have a large dimension,
while the financial feature only has 21 dimensions. The dimension imbalance between these
two kinds of features will affect the prediction effect of the proposed method, thus we will
reduce the dimension of text feature vectors to make two types of features have the same
dimension.
Feature Description
Stock price
prediction
Daily transaction data method
Open/close price Nominal daily open/close price
High/low price Nominal daily highest/lowest price
Volume Daily trading volume
Price_change Change volume
P_change Change rate
Technical indicators
MACD Moving average convergence divergence: displays trend following
characteristics and momentum characteristics
CCI Commodity channel index: helps to find the start and the end of a trend
ATR Average true range: measures the volatility of price
BOLL Bollinger band: provides a relative definition of high and low, which aids in
rigorous pattern recognition
EMA20 20 days exponential moving average
MA5/MA10 5/10 days moving average
V_MA5/V_MA10 5/10 days trading volume average
MTM6/MTM12 6/12 months momentum: helps pinpoint the end of a decline or advance
ROC Price rate of change: shows the speed at which a stock’s price is changing
SMI Stochastic momentum index: shows where the close price is relative to the
midpoint of the same range Table 1.
WVAD Williams’s variable accumulation/distribution: measures the buying and Stock financial
selling pressure features
Figure 2.
Structure of SAE
IJCS As a commonly used noise reduction method, wavelet transform can better deal with non-
stationary time series data and preserve the characteristics of original data as much as
possible. It is widely used in prediction tasks in financial scenarios (Papagiannaki et al.,
2005; Ramsey, 1999). Therefore, we choose Haar wavelet transform as the noise reduction
method for the stock price. This method can decompose data according to time and
frequency and has an acceptable processing time, with the time complexity of O(n)
(Abramovich et al., 2002).
The basic principle of wavelet transform is to generate some wavelet signals which
contain important information and noise after transforming the original data. The signal
coefficient of important information is larger and the signal coefficient of noise is smaller.
The algorithm will automatically select a suitable threshold. The wavelet signals greater
than the threshold is considered to contain important information and should be retained,
while the signals less than the threshold are considered as noise and will be removed.
4.2 Metrics
We used MAE, RMSE and R2 as measures to evaluate the performance of the prediction
methods. MAE measures error without considering the directions of the predicted
values. RMSE measures the average magnitude of estimation error in predicted values.
MAE and RMSE are measures of closeness which evaluates the accuracy of the
predicted value to the actual price. R2 measures the linear correlation between two
variables and eliminates the influence of dimension on different regression problems.
We hope the model has low MAE and RMSE and a high R2. The three metrics are
defined as follows:
m
1X yðiÞ yðiÞ
MAE ¼ true predict (3)
m i¼1
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
u m 2
u 1 X ðiÞ
RMSE ¼ t
ðiÞ
ytrue ypredict (4)
m i¼1
P iðÞ i ðÞ
2
SSresidual i ytrue ypredict
R2 ¼ 1 ¼1
P 2 (5)
SStotal ðiÞ
i y ytrue
ðÞ
i i ðÞ
where ytrue represents the true value of the target variable of sample i, ypredict represents the
predicted value of the target variable of sample i, y represents the mean of the target
variable’s true value of all samples and m represents the total number of samples.
Table 4.
Model MAE RMSE
The comparison of
DM-DBOW and DM-DBOW 1.073 1.339
SAE þ DM-DBOW SAE þ DM-DBOW 0.894 1.239
a lot of noise, which will result in serious over-fitting problems. The introduction of auto- Stock price
encoder can help to handle this problem. Table 4 shows that MAE and RMSE are improved prediction
after the text vector is processed by auto-encoder, which indicates better prediction
performance.
method
We derived the text features of prediction model based on DM-DBOW and SAE model
and integrated them with the financial features to form the feature matrix of the prediction
model. Then the noise of target variable (stock closing price) is smoothed by wavelet
transform. Finally, we used “Meinian Health” as the predicted object. We chronologically
divided the text and financial data of “Meinian Health” into three parts, i.e. the first 80%
data is train set, the middle 10% is validation set and the last 10% data is test set. The train
set and the validation set are used to train the model and to adjust parameters and the test
set is used to verify performance of models.
In the proposed prediction methods, LSTM is chosen as the prediction model, which also
has some parameters to be identified. The most commonly used parameters of LSTM
include hidden layers, dropout, number of neurons, optimizer, batch sizes and epochs. The
hidden layer is set to 1–3 layers according to experience and the computing power of the
machine. Dropout is usually between 0.2 and 0.5. According to Kolmogorov’s theorem,
the number of neurons in the hidden layer is set as double of the input dimensions plus one
(Greff et al., 2017). Finally, we adjusted the parameters of LSTM where the optimal
parameter combination was 2 hidden layers, 7 time windows, 85 neurons, 0.5 dropout, Adam
optimizer, 4 batch sizes and 50 epochs.
4.4 Results
To verify model performance improvement by combining text features and financial
features, we compare the performance of the LSTM model with and without text features.
The LSTM model without text features is denoted as LSTM-F. The experimental results are
shown in Table 5.
The result shows that the performance of our model is better than that of the LSTM
model using only financial features in MAE, RMSE and R2. It suggests that effective text
information mining in social media can effectively improve the performance of the
prediction model. To further test the proposed method’s convergence and robustness, a
piecewise time series prediction method is introduced. Data are divided into 10 groups. Each
group of data is predicted and paired T-test is carried out. Experimental results also prove
the effectiveness of this method as shown in Table 6.
Table 5.
Performance
Model MAE RMSE R2
comparison of Doc-
Doc-W-LSTM 0.019 0.110 0.957 W-LSTM and
LSTM-F 0.046 0.579 0.774 LSTM-F
5. Conclusion
We propose a new method to predict stock prices. This method adopts Doc2Vec to train
financial social media documents and to extract text feature vectors. Then, SAE is used to
reduce the dimension of text vectors to avoid a serious imbalance between text features and
financial features. Moreover, to avoid the impact of random noise in stock price data on the
prediction model, we use Haar wavelet transform to generate denoised stock price time-
series data. Finally, we combine the text features and financial features and use the LSTM
model to predict future stock prices. Experimental results show that the proposed method is
superior to other baseline methods in MAE, RMSE and R2. It suggests that our method
which incorporates text feature information can better predict stock prices.
The main contribution of this paper includes two parts. First, we propose a new stock
price prediction method combining text features from social media, which improves the
performance of traditional methods. Social media content contains a lot of important
information about the stock. The stock financial index variables can only represent the
development trend of the stock price, but the feeling of investors can describe the potential
trend of the stock price, which is usually neglected in traditional prediction methods. We use
the deep learning technology to extract text features, which can represent investors’
sentiment and help to improve prediction performance greatly. Second, we use SAE to solve
the problem of unbalanced stock features and text features, which helps to improve the
accuracy of stock price prediction methods. The traditional dimensionality reduction
methods are mainly statistical methods based on word frequency or PCA, but these methods
will cause information loss in original data (Nassirtoussi et al., 2014). In our method, the
dimension of text features is reduced by the SAE method, which is proved to be an excellent
method to reduce the data dimension and preserve information from the original data as
much as possible (Wang et al., 2016).
This study has several limitations that can provide new directions for future studies.
First, we only collected social media text data from one platform. Although we collected as
Figure 3.
Fitting curve of
different models
IJCS much data as possible from large companies, the investors of other platforms may present
different emotions and one website is less representative. We will try to collect more
financial social media documents from different platforms in the future. Second, only one
stock is selected for prediction in our study. In the future, the relationship among multiple
stocks can be considered and the prices of multiple stocks can be predicted to further verify
the robustness of the proposed method.
References
Abramovich, F., Besbeas, P. and Sapatinas, T. (2002), “Empirical Bayes approach to block
wavelet function estimation”, Computational Statistics and Data Analysis, Vol. 39 No. 4,
pp. 435-451.
Achkar, R., Elias-Sleiman, F., Ezzidine, H., Haidar, N. and Ieee (2018), “Comparison of BPA-MLP and
LSTM-RNN for stocks prediction”, in 2018 6th International Symposium on Computational and
Business Intelligence, pp. 48-51.
Baek, Y. and Kim, H.Y. (2018), “ModAugNet: a new forecasting framework for stock market index
value with an overfitting prevention LSTM module and a prediction LSTM module”, Expert
Systems with Applications, Vol. 113, pp. 457-480.
Bao, W., Yue, J. and Rao, Y.L. (2017), “A deep learning framework for financial time series using
stacked autoencoders and long-short term memory”, Plos One, Vol. 12 No. 7, p. e0180944.
Bollen, J., Mao, H. and Zeng, X. (2011), “Twitter mood predicts the stock market”, Journal of
Computational Science, Vol. 2 No. 1, pp. 1-8.
Booth, G.G., Martikainen, T., Sarkar, S.K., Virtanen, I. and Yliolli, P. (1994), “Nonolinear
dependence in Finnish stock returns”, European Journal of Operational Research, Vol. 74
No. 2, pp. 273-283.
Breidt, F.J., Crato, N. and de Lima, P. (1998), “The detection and estimation of long memory in stochastic
volatility”, Journal of Econometrics, Vol. 83 Nos 1/2, pp. 325-348.
Cervello-Royo, R., Guijarro, F. and Michniuk, K. (2015), “Stock market trading rule based on pattern
recognition and technical analysis: forecasting the DJIA index with intraday data”, Expert
Systems with Applications, Vol. 42 No. 14, pp. 5963-5975.
Chen, Y., Lin, Z., Zhao, X., Wang, G. and Gu, Y. (2014), “Deep learning-based classification of
hyperspectral data”, IEEE Journal of Selected Topics in Applied Earth Observations and Remote
Sensing, Vol. 7 No. 6, pp. 2094-2107.
Delong, J.B., Shleifer, A., Summers, L.H. and Waldmann, R.J. (1990), “Noise trader risk in financial-
markets”, Journal of Political Economy, Vol. 98 No. 4, pp. 703-738.
Ding, X., Zhang, Y., Liu, T. and Duan, J. (2015), “Deep learning for Event-Driven stock prediction”, in
Yang, Q. and Wooldridge, M. (Eds), Proceedings of the Twenty-Fourth International Joint
Conference on Artificial Intelligence, pp. 2327-2333.
Engle, R. (2001), “GARCH 101: the use of ARCH/GARCH models in applied econometrics”, Journal of
Economic Perspectives, Vol. 15 No. 4, pp. 157-168.
Greff, K., Srivastava, R.K., Koutnik, J., Steunebrink, B.R. and Schmidhuber, J. (2017), “LSTM: a search
space odyssey”, IEEE Transactions on Neural Networks and Learning Systems, Vol. 28,
pp. 2222-2232.
Hagenau, M., Liebmann, M. and Neumann, D. (2013), “Automated news reading: stock price prediction
based on financial news using context-capturing features”, Decision Support Systems, Vol. 55
No. 3, pp. 685-697.
Huang, C.J., Liao, J.J., Yang, D.X., Chang, T.Y. and Luo, Y.C. (2010), “Realization of a news
dissemination agent based on weighted association rules and text mining techniques”, Expert
Systems with Applications, Vol. 37 No. 9, pp. 6409-6413.
Jin, F., Self, N., Saraf, P., Butler, P., Wang, W. and Ramakrishnan, N. (2013), “Forex-Foreteller: currency Stock price
trend modeling using news articles”, 19th ACM Sigkdd International Conference on Knowledge
Discovery and Data Mining, pp. 1470-1473.
prediction
Kim, T. and Kim, H.Y. (2019), “Forecasting stock prices with a feature fusion LSTM-CNN model using
method
different representations of the same data”, Plos One, Vol. 14 No. 2, p. e0212320.
Kim, H.K., Kim, H. and Cho, S. (2017), “Bag-of-concepts: comprehending document representation
through clustering words in distributed representation”, Neurocomputing, Vol. 266, pp. 336-352.
Kim, D., Seo, D., Cho, S. and Kang, P. (2019), “Multi-co-training for document classification using
various document representations: TF-IDF, LDA, and Doc2Vec”, Information Sciences, Vol. 477,
pp. 15-29.
Kraus, M. and Feuerriegel, S. (2017), “Decision support from financial disclosures with deep neural
networks and transfer learning”, Decision Support Systems, Vol. 104, pp. 38-48.
Lau, J.H. and Baldwin, T. (2016), “An empirical evaluation of Doc2vec with practical insights into
document embedding generation”, Proceedings of the 1st Workshop on Representation Learning
for NLP, Berlin, Germany, pp. 78-86.
Le, Q.V. and Mikolov, T. (2014), “Distributed representations of sentences and documents”, The 31st
International Conference on Machine Learning (ICML-14), pp. 1188-1196.
Le, L. and Xie, Y. (2018), “Recurrent embedding kernel for predicting stock daily direction”, in Sill, A.
and Spillner, J. (Eds), 2018 IEEE/ACM 5th International Conference on Big Data Computing
Applications and Technologies, pp. 160-166.
Maknickas, A. and Maknickiene, N. (2019), “Support system for trading in exchange market by
distributional forecasting model”, Informatica, Vol. 30 No. 1, pp. 73-90.
Marmer, V. (2008), “Nonlinearity, nonstationarity, and spurious forecasts”, Journal of Econometrics,
Vol. 142 No. 1, pp. 1-27.
M’ng, J.C.P. and Mehralizadeh, M. (2016), “Forecasting east Asian indices futures via a novel hybrid of
Wavelet-PCA denoising and artificial neural network models”, Plos One, Vol. 11, p. e0156338.
Nassirtoussi, A.K., Aghabozorgi, S., Teh, Y.W. and Ngo, D.C.L. (2014), “Text mining for market
prediction: a systematic review”, Expert Systems with Applications, Vol. 41 No. 16, pp. 7653-7670.
Nelson, D.M.Q., Pereira, A.C.M. and de Oliveira, R.A. (2017), “Stock market’s price movement
prediction with LSTM neural networks”, in 2017 International Joint Conference on Neural
Networks, pp. 1419-1426.
Papagiannaki, K., Taft, N., Zhang, Z.L. and Diot, C. (2005), “Long-term forecasting of internet backbone
traffic”, IEEE Transactions on Neural Networks, Vol. 16 No. 5, pp. 1110-1124.
Patel, J., Shah, S., Thakkar, P. and Kotecha, K. (2015), “Predicting stock and stock price index
movement using trend deterministic data preparation and machine learning techniques”, Expert
Systems with Applications, Vol. 42 No. 1, pp. 259-268.
Peng, Y., Liu, Y. and Zhang, R. (2019), “Modeling and analysis of stock price forecast based on LSTM”,
Computer Engineering and Application, Vol. 55, pp. 209-212. (in Chinese).
Quan, Z.Y. (2013), “Stock prediction by searching similar candlestick charts”, in Chan, C.Y., Lu, J.,
Norvag, K. and Tanin, E. (Eds), 2013 IEEE 29th International Conference on Data Engineering
Workshops, pp. 322-325.
Ramsey, J.B. (1999), “The contribution of wavelets to the analysis of economic and financial data”,
Philosophical Transactions of the Royal Society a-Mathematical Physical and Engineering
Sciences, Vol. 357 No. 1760, pp. 2593-2606.
Refenes, A.N., Zapranis, A. and Francis, G. (1994), “Stock performance modeling using neural networks – a
comparative-study with regression-models”, Neural Networks, Vol. 7 No. 2, pp. 375-388.
Schölkopf, B., Platt, J. and Hofmann, T. (2007), “Greedy layer-wise training of deep networks”,
Advances in Neural Information Processing Systems, Vol. 19, pp. 153-160.
IJCS Schumaker, R.P. and Chen, H. (2009), “Textual analysis of stock market prediction using breaking
financial news: the AZFinText system”, ACM Transactions on Information Systems, Vol. 27
No. 2.
Shleifer, A. and Vishny, R.W. (1997), “The limits of arbitrage”, The Journal of Finance, Vol. 52 No. 1,
pp. 35-55.
Singh, R. and Srivastava, S. (2017), “Stock prediction using deep learning”, Multimedia Tools and
Applications, Vol. 76 No. 18, pp. 18569-18584.
Vo, N.N.Y., He, X., Liu, S. and Xu, G. (2019), “Deep learning for decision making and the optimization of
socially responsible investments and portfolio”, Decision Support Systems, Vol. 124,
UNSP 113097.
Wang, Y., Yao, H. and Zhao, S. (2016), “Auto-encoder based dimensionality reduction”,
Neurocomputing, Vol. 184, pp. 232-242.
Xie, X., Lei, X. and Zhao, Y. (2020), “Application of mutual information and improved PCA
dimensionality reduction algorithm in stock price forecasting”, Computer Engineering and
Applications, in Chinese.
Zhang, G.S. and Zhang, X.D. (2016), “A Differential-Information based ARMAD-GARCH stock price
forecasting model”, Systems Engineering – Theory and Practice, Vol. 36, pp. 1136-1145 (in
Chinese).
Zhang, Q., Yang, L.T., Chen, Z. and Li, P. (2018), “A survey on deep learning for big data”, Information
Fusion, Vol. 42, pp. 146-157.
Zhou, Z., Ke, X. and Jichang, Z. (2018), “Tales of emotion and stock in China: volatility, causality and
prediction”, World Wide Web-Internet and Web Information Systems, Vol. 21, pp. 1093-1116.
Zubiaga, A. (2018), “A longitudinal assessment of the persistence of twitter datasets”, Journal of the
Association for Information Science and Technology, Vol. 69 No. 8, pp. 974-984.
Corresponding author
Zhijun Yan can be contacted at: yanzhijun@bit.edu.cn
For instructions on how to order reprints of this article, please visit our website:
www.emeraldgrouppublishing.com/licensing/reprints.htm
Or contact us for further details: permissions@emeraldinsight.com