Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective

Zhang, Ziwei; Wang, Xin; Zhang, Zeyang; Cui, Peng; Zhu, Wenwu

arXiv:2112.12345 (cs)

[Submitted on 23 Dec 2021 (v1), last revised 27 Oct 2025 (this version, v2)]

Abstract: Deep neural networks have achieved great success in the last decade. When designing neural networks to handle ubiquitous geometric data such as point clouds and graphs, it is critical that the model maintain invariance to transformations such as translation, rotation, and scaling. Most existing graph neural network (GNN) approaches maintain only permutation invariance and fail to guarantee invariance with respect to other transformations. Besides GNNs, other works design sophisticated transformation-invariant layers, which are computationally expensive and difficult to extend. In this paper, we revisit why general neural networks cannot maintain transformation invariance. Our findings show that transformation-invariant and distance-preserving initial point representations are sufficient to achieve transformation invariance, without the need for sophisticated neural layer designs. Motivated by these findings, we propose Transformation Invariant Neural Networks (TinvNN), a straightforward and general plug-in for geometric data. Specifically, we realize transformation-invariant and distance-preserving initial point representations by modifying multi-dimensional scaling, and we feed these representations into existing neural networks. We prove that TinvNN strictly guarantees transformation invariance while remaining general and flexible enough to be combined with existing neural networks. Extensive experimental results on point cloud analysis and combinatorial optimization demonstrate the effectiveness and general applicability of our method. We also extend our method to the equivariant case. Based on these results, we advocate that TinvNN be considered an essential baseline for further studies of transformation-invariant geometric deep learning.
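As a rough illustration of the idea of distance-preserving initial representations, the sketch below applies classical (unmodified) multi-dimensional scaling to a small point cloud. It is not the modified procedure proposed in the paper, whose details are not given in the abstract. The function names `pairwise_distances` and `classical_mds`, the random test data, and the restriction to translation plus an orthogonal transform are all assumptions made for this example. Scaling is not handled here, and the signs of the eigenvectors are left undetermined, so the coordinates themselves may differ between runs even though the distances agree.

```python
import numpy as np

def pairwise_distances(points):
    """Euclidean distance matrix of an (n, d) array of points."""
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def classical_mds(dist, dim):
    """Classical MDS: recover coordinates from a distance matrix.

    Textbook algorithm only; an assumption for illustration,
    not the modified version described in the paper.
    """
    n = dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    # Double-centred Gram matrix built from squared distances.
    gram = -0.5 * centering @ (dist ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    top = np.argsort(eigvals)[::-1][:dim]
    # Clip tiny negative eigenvalues caused by rounding.
    scales = np.sqrt(np.clip(eigvals[top], 0.0, None))
    return eigvecs[:, top] * scales

rng = np.random.default_rng(0)
points = rng.normal(size=(8, 3))

# Random orthogonal transform (rotation, possibly with a reflection)
# followed by a random translation.
orthogonal, _ = np.linalg.qr(rng.normal(size=(3, 3)))
moved = points @ orthogonal.T + rng.normal(size=3)

dist_original = pairwise_distances(points)
dist_moved = pairwise_distances(moved)

# The embedding is computed from distances alone, so it cannot
# depend on how the input cloud was rotated or translated.
embedding = classical_mds(dist_moved, dim=3)
dist_embedding = pairwise_distances(embedding)

print(np.allclose(dist_original, dist_moved))
print(np.allclose(dist_original, dist_embedding))
```

Because the embedding depends on the input only through its distance matrix, any downstream network that consumes it sees the same pairwise geometry regardless of the rigid transformation applied to the raw coordinates.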

Comments:	13 pages; accepted by IEEE TPAMI
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2112.12345 [cs.CV]
	(or arXiv:2112.12345v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2112.12345

Submission history

From: Ziwei Zhang
[v1] Thu, 23 Dec 2021 03:52:33 UTC (2,486 KB)
[v2] Mon, 27 Oct 2025 06:45:09 UTC (4,418 KB)
