Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/31805
Title: | A Spatio-Temporal Diffusion Model for Missing and Real-Time Financial Data Inference |
Authors: | Fang, Y Liu, R Huang, H Zhao, P Wu, Q |
Keywords: | diffusion model;financial data processing;missing value imputation;real-time nowcasting |
Issue Date: | 21-Oct-2024 |
Publisher: | Association for Computing Machinery (ACM) |
Citation: | Fang, Y. et al. (2024) 'A Spatio-Temporal Diffusion Model for Missing and Real-Time Financial Data Inference', Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM), , pp. 602 - 611. doi: 10.1145/3627673.3679806. |
Abstract: | Missing values and unreleased figures are common but highly important for backtesting and real-time analysis in the financial industry, yet underexploited in the existing literature. In this paper, we focus on the issue of empirical asset pricing, where the cross-section of future asset returns is a function of lagged firm characteristics that vary in time frequencies and missing ratios. Most of the existing imputation methods cannot fully capture the complex and evolving spatio-temporal relations among firm-level characteristics. In particular, these methods fail to explicitly consider the spatial relations and feature structure in the stock network where we have to process granular data of thousands of stocks and hundreds of characteristics for each stock. To address these challenges, we propose a spatio-temporal diffusion model (STDM) that gradually recovers the masked financial data conditioning on high-dimensional stock-and-characteristics historical data. We propose characteristic-specific projection to construct characteristic-level features at both ends of the STDM, meanwhile maintaining firm-level features in the middle of the STDM to largely reduce the computational memory. Moreover, along with the temporal attention, we design a spatial graph convolutional network, making it computationally efficient and effective to learn time-varying spatio-temporal interdependence across firms. We further employ an implicit sampler that greatly accelerates the inference procedure so that the STDM is able to produce high-quality point and density estimates of missing and real-time firm characteristics within a few steps. We evaluate our model on the most comprehensive open-source dataset 'OSAP' and generate state-of-the-art performance in extensive experiments. |
URI: | https://bura.brunel.ac.uk/handle/2438/31805 |
DOI: | https://doi.org/10.1145/3627673.3679806 |
ISBN: | 979-8-4007-0436-9 |
ISSN: | 2155-0751 |
Appears in Collections: | Brunel Business School Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
MeetingAbstract.pdf | © 2024 Copyright held by the owner/author(s). This meeting abstract is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License (https://creativecommons.org/licenses/by-sa/4.0/). | 159.86 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License