Self-supervised 3D Behavior Representation Learning Based on Homotopic Hyperbolic Embedding

Chen, J; Jin, Z; Wang, Q; Meng, H

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/27535

Title:	Self-supervised 3D Behavior Representation Learning Based on Homotopic Hyperbolic Embedding
Authors:	Chen, J Jin, Z Wang, Q Meng, H
Keywords:	spatio-temporal interaction;contrastive learning;Poincaré model;hyperbolic space;homotopic mapping
Issue Date:	2-Nov-2023
Publisher:	Institute of Electrical and Electronics Engineers (IEEE)
Citation:	Chen, J. et al. (2023) 'Self-supervised 3D Behavior Representation Learning Based on Homotopic Hyperbolic Embedding', IEEE Transactions on Image Processing, 32, pp. 6061 - 6074. doi: 10.1109/tip.2023.3328230.
Abstract:	Behavior sequences are generated by a series of spatio-temporal interactions and have a high-dimensional nonlinear manifold structure. Therefore, it is difficult to learn 3D behavior representations without relying on supervised signals. To this end, self-supervised learning methods can be used to explore the rich information contained in the data itself. Context-context contrastive self-supervised methods construct the manifold embedded in Euclidean space by learning the distance relationship between data, and find the geometric distribution of data. However, traditional Euclidean space is difficult to express context joint features. In order to obtain an effective global representation from the relationship between data under unlabeled conditions, this paper adopts contrastive learning to compare global feature, and proposes a self-supervised learning method based on hyperbolic embedding to mine the nonlinear relationship of behavior trajectories. This method adopts the framework of discarding negative samples, which overcomes the shortcomings of the paradigm based on positive and negative samples that pull similar data away in the feature space. Meanwhile, the output of the network is embedded in a hyperbolic space, and a multi-layer perceptron is added to convert the entire module into a homotopic mapping by using the geometric properties of operations in the hyperbolic space, so as to obtain homotopy invariant knowledge. The proposed method combines the geometric properties of hyperbolic manifolds and the equivariance of homotopy groups to promote better supervised signals for the network, which improves the performance of unsupervised learning.
URI:	https://bura.brunel.ac.uk/handle/2438/27535
DOI:	https://doi.org/10.1109/tip.2023.3328230
ISSN:	1057-7149
Other Identifiers:	ORCiD: Jinghong Chen https://orcid.org/0000-0001-8650-790X ORCiD: Qicong Wang https://orcid.org/0000-0001-7324-0433 ORCiD: Hongying Meng https://orcid.org/0000-0002-8836-1382
Appears in Collections:	Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:

File	Description	Size	Format
FullText.pdf	Copyright © 2023 Institute of Electrical and Electronics Engineers (IEEE). Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. See: https://journals.ieeeauthorcenter.ieee.org/become-an-ieee-journal-author/publishing-ethics/guidelines-and-policies/post-publication-policies/	2.5 MB	Adobe PDF	View/Open

Show full item record