Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/31878
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Huang, X | - |
dc.contributor.author | Yan, F | - |
dc.contributor.author | Xu, W | - |
dc.contributor.author | Li, M | - |
dc.date.accessioned | 2025-08-31T09:19:59Z | - |
dc.date.available | 2025-08-31T09:19:59Z | - |
dc.date.issued | 2019-10-14 | - |
dc.identifier | ORCiD: Xin Huang https://orcid.org/0000-0002-5470-1203 | - |
dc.identifier | ORCiD: Maozhen Li https://orcid.org/0000-0002-0820-5487 | - |
dc.identifier.citation | Huang, X. et al. (2019) 'Multi-Attention and Incorporating Background Information Model for Chest X-Ray Image Report Generation', IEEE Access, 7, pp. 154808 - 154817. doi: 10.1109/ACCESS.2019.2947134. | en_US |
dc.identifier.uri | https://bura.brunel.ac.uk/handle/2438/31878 | - |
dc.description.abstract | Chest X-ray images are widely used in clinical practice such as diagnosis and treatment. The automatic radiology report generation system can effectively reduce the rate of misdiagnosis and missed diagnosis. Previous studies were focused on the long text generation problem of image paragraph, ignoring the characteristics of the image and the auxiliary role of patient background information for diagnosis. In this paper, we propose a new hierarchical model with multi-attention considering the background information. The multi-attention mechanism can focus on the image's channel and spatial information simultaneously, and map it to the sentence topic. The patient's background information will be encoded by the neural network first, then it will be aggregated into a vector representation by a multi-layer perception and added to the pre-trained vanilla word embedding, which finally forms a new word embedding after fusion. Our experimental results demonstrated that the model outperforms all baselines, achieving the state-of-the-art performance in terms of accuracy. | en_US |
dc.description.sponsorship | 10.13039/501100003399-Science and Technology Commission of Shanghai Municipality (Grant Number: 16511102800); 10.13039/501100002663-Northwestern Polytechnical University (Grant Number: 22120180117). | en_US |
dc.format.extent | 154808 - 154817 | - |
dc.format.medium | Electronic | - |
dc.language | English | - |
dc.language.iso | en_US | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.rights | Creative Commons Attribution 4.0 International | - |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | - |
dc.subject | attention mechanism | en_US |
dc.subject | deep learning | en_US |
dc.subject | radiology report generation | en_US |
dc.subject | word embedding | en_US |
dc.title | Multi-Attention and Incorporating Background Information Model for Chest X-Ray Image Report Generation | en_US |
dc.type | Article | en_US |
dc.date.dateAccepted | 2019-10-10 | - |
dc.identifier.doi | https://doi.org/10.1109/ACCESS.2019.2947134 | - |
dc.relation.isPartOf | IEEE Access | - |
pubs.publication-status | Published | - |
pubs.volume | 7 | - |
dc.identifier.eissn | 2169-3536 | - |
dc.rights.license | https://creativecommons.org/licenses/by/4.0/legalcode.en | - |
dcterms.dateAccepted | 2019-10-10 | - |
dc.rights.holder | The Author(s) | - |
Appears in Collections: | Dept of Electronic and Electrical Engineering Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © 2019 The Author(s) Published under license by Institute of Electrical and Electronics Engineers (IEEE). This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/ | 10.43 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License