Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/30805
Title: | Optimized Feature Points and Keyframe Methods for VSLAM in High-Dynamic Indoor Environments |
Authors: | Hu, Z Qi, W Ding, K Qi, H Zhao, Y Zhang, X Wang, M |
Keywords: | visual SLAM;dynamic scene;ORB-SLAM3;YOLOv7-tiny;keyframe sequences;ParC_NetVLAD |
Issue Date: | 13-Jan-2025 |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
Citation: | Hu, Z. et al. (2025) 'Optimized Feature Points and Keyframe Methods for VSLAM in High-Dynamic Indoor Environments', IEEE Transactions on Intelligent Transportation Systems, 0 (early access), pp. 1 - 14. doi: 10.1109/TITS.2024.3520177. |
Abstract: | VSLAM is one of the key technologies for indoor mobile robots, used to perceive the surrounding environment, achieve accurate positioning and mapping. However, traditional VSLAM algorithms based on the assumption of a static environment still face certain challenges. The movement, occlusion, and appearance changes of dynamic objects can lead to feature point-matching errors, making data association difficult and causing biases in motion estimation. In order to address this challenge, this paper proposes a dynamic feature point removal method and a closed-loop detection method for high dynamic scenes, aiming to effectively improve the robustness and positioning accuracy in dynamic environments. First, the YOLOv7-tiny object detection network and LK optical flow algorithm are combined to detect the dynamic area, and the adaptive threshold keyframe selection method is adopted to solve the problem of poor quality of keyframe caused by the existing heuristic threshold selection method. Then, this paper proposes a dynamic keyframe sequence creation method based on the angle difference between keyframes, which reduces the workload of loop back detection and accelerates the efficiency of loop back detection in the system. Next, the ParC_NetVLAD image matching algorithm is proposed. In this paper, ConvNeXt-Tiny network is used for feature extraction of images, and ParC-Net network and CBAM attention mechanism are added to the feature extraction network. Finally, NetVLAD is used to cluster the extracted local features to obtain global features that can represent images. Experiments are conducted on public TUM RGB-D datasets and in real-world situations. The proposed algorithm reduces the ATE (Absolute Trajectory Error) by 96.4% and the RPE (Relative Trajectory Error) by 82.8% on average in highly dynamic scenarios. In the Pittsburgh30k dataset, the average accuracy of loop closure detection has been improved by 2.6%. |
URI: | https://bura.brunel.ac.uk/handle/2438/30805 |
DOI: | https://doi.org/10.1109/TITS.2024.3520177 |
ISSN: | 1524-9050 |
Other Identifiers: | ORCiD: Mingfeng Wang https://orcid.org/0000-0001-6551-0325 |
Appears in Collections: | Dept of Mechanical and Aerospace Engineering Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © 2024 Institute of Electrical and Electronics Engineers (IEEE). Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. See: https://journals.ieeeauthorcenter.ieee.org/become-an-ieee-journal-author/publishing-ethics/guidelines-and-policies/post-publication-policies/ | 3.6 MB | Adobe PDF | View/Open |
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.