SensorLM: Learning the Language of Wearable Sensors

Zhang, Yuwei; Ayush, Kumar; Qiao, Siyuan; Heydari, A. Ali; Narayanswamy, Girish; Xu, Maxwell A.; Metwally, Ahmed A.; Xu, Shawn; Garrison, Jake; Xu, Xuhai; Althoff, Tim; Liu, Yun; Kohli, Pushmeet; Zhan, Jiening; Malhotra, Mark; Patel, Shwetak; Mascolo, Cecilia; Liu, Xin; McDuff, Daniel; Yang, Yuzhe

Computer Science > Machine Learning

arXiv:2506.09108 (cs)

[Submitted on 10 Jun 2025]

Title:SensorLM: Learning the Language of Wearable Sensors

Abstract:We present SensorLM, a family of sensor-language foundation models that enable wearable sensor data understanding with natural language. Despite its pervasive nature, aligning and interpreting sensor data with language remains challenging due to the lack of paired, richly annotated sensor-text descriptions in uncurated, real-world wearable data. We introduce a hierarchical caption generation pipeline designed to capture statistical, structural, and semantic information from sensor data. This approach enabled the curation of the largest sensor-language dataset to date, comprising over 59.7 million hours of data from more than 103,000 people. Furthermore, SensorLM extends prominent multimodal pretraining architectures (e.g., CLIP, CoCa) and recovers them as specific variants within a generic architecture. Extensive experiments on real-world tasks in human activity analysis and healthcare verify the superior performance of SensorLM over state-of-the-art in zero-shot recognition, few-shot learning, and cross-modal retrieval. SensorLM also demonstrates intriguing capabilities including scaling behaviors, label efficiency, sensor captioning, and zero-shot generalization to unseen tasks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2506.09108 [cs.LG]
	(or arXiv:2506.09108v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2506.09108

Submission history

From: Yuzhe Yang [view email]
[v1] Tue, 10 Jun 2025 17:13:09 UTC (8,273 KB)

Computer Science > Machine Learning

Title:SensorLM: Learning the Language of Wearable Sensors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:SensorLM: Learning the Language of Wearable Sensors

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators