Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/30390
Title: | Bridging the Gap between Machine and Human in Stock Prediction: Addressing Heterogeneity in Stock Market |
Other Titles: | Institute of Electrical and Electronics Engineers (IEEE) |
Authors: | Lu, X Poon, J Khushi, M |
Keywords: | Bert;BiLSTM;financial markets;heterogeneity analysis;predictive modeling;restricted Boltzmann machine (RBM);stock prediction;technical indicators;textual data;transfer learning;transformer |
Issue Date: | 5-Dec-2024 |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
Citation: | Lu, X., Poon, J. and Khushi, M. (2024) 'Bridging the Gap between Machine and Human in Stock Prediction: Addressing Heterogeneity in Stock Market', IEEE Access, 12, pp. 186171 - 186185. doi: 10.1109/access.2024.3511613. |
Abstract: | Accurately predicting stock prices remains a formidable challenge in financial markets. Traditional predictive models often aggregate data from multiple companies, failing to account for the unique characteristics of each firm, which can hinder the model’s ability to identify company-specific patterns. Moreover, existing research on stock price prediction frequently trains and tests models within the same group of companies, neglecting to assess their generalizability on ‘Out-of-Sample’ companies. This study addresses these limitations by employing BERT to encode business descriptions into vectors, capturing the distinctive attributes of each company. We further enhance the predictive modeling framework by developing features that describe the percentage change of existing indicators, adding significant novelty to the existing research. Additionally, we apply a Restricted Boltzmann Machine (RBM) for dimensionality reduction after the BERT encoding process. In our approach, both the technical indicators and the vectorized descriptions are treated as distinct elements within the transformer encoder. By integrating these representations, our model is better equipped to differentiate between firms and recognize their individual patterns. The proposed model demonstrates superior performance over baseline models, particularly when tested on ‘Out-of-Sample’ companies, highlighting its ability to learn, understand, and analyze company-specific descriptions for more accurate predictions. This research offers novel insights into addressing the heterogeneity in stock price prediction. |
Description: | Data Access: The data for this study has been sourced from the public repository Yahoo Finance. Related code can be found at: https://github.com/xilu5047/Stock_prediction . |
URI: | https://bura.brunel.ac.uk/handle/2438/30390 |
DOI: | https://doi.org/10.1109/access.2024.3511613 |
Other Identifiers: | ORCiD: Xiaobin Lu https://orcid.org/0009-0007-6135-4813 ORCiD: Matloob Khushi https://orcid.org/0000-0001-7792-2327 |
Appears in Collections: | Dept of Computer Science Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © 2024 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. | 6.74 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License