Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/33435
Title: FlashSAM: Lightweight Vision Model for Multi-UAV Token Communication in Low-Altitude Wireless Networks
Authors: Jiang, F
Tu, S
Dong, L
Wang, K
Yang, K
Liu, R
Pan, C
Wang, J
Keywords: low-altitude wireless networks;masked autoencoder;Segment Anything Model;large vision model;Token communication
Issue Date: 25-May-2026
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Citation: Jiang, F. et al. (2026) ‘FlashSAM: Lightweight Vision Model for Multi-UAV Token Communication in Low-Altitude Wireless Networks’, IEEE Journal of Selected Topics in Signal Processing, pp. 1–14. doi:10.1109/JSTSP.2026.3696920.
Abstract: Token Communication (TokenCom) is a promising paradigm for low-altitude wireless networks, as it focuses on transmitting task-relevant core information, particularly in environments with uncertainty, noise, and stringent bandwidth constraints. However, existing TokenCom systems still face several challenges, including inefficient knowledge base construction, ineffective token encoding, and limited support for multi-user token sharing. To address these issues, we propose a Lightweight Vision Model-based Multi-Unmanned Aerial Vehicle (UAV) To ken Communication (LVM-MTC) system. First, we develop a lightweight Segment Anything Model (SAM), termed FlashSAM, which incorporates a set of lightweight convolutional modules to significantly reduce the number of model parameters. Building on FlashSAM, we construct a Lightweight Knowledge Base (LKB) to enable efficient object-level perception. Next, we design an Efficient Token Codec (ETC) based on the Masked Autoencoder (MAE) architecture. ETC improves compression efficiency at both the pixel and token levels, and provides lightweight token decoding tailored for resource-constrained UAVs. Furthermore, we propose a Multi-UAV Token Sharing (MTS) scheme for multi UAV TokenCom. By measuring token similarity across UAVs, MTS consolidates similar tokens and transmits them through broadcast transmission, thereby further improving transmission efficiency. Finally, simulation results validate the feasibility and effectiveness of the proposed LVM-MTC system.
URI: http://bura.brunel.ac.uk/handle/2438/33435
DOI: http://dx.doi.org/10.1109/jstsp.2026.3696920
ISSN: 1932-4553
http://dx.doi.org/10.1109/jstsp.2026.3696920
1941-0484
Appears in Collections:Department of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdf9.17 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.