Interpreting COVID Lateral Flow Tests' Results with Foundation Models

Pandey, Stuti; Myers-Dean, Josh; Reynolds, Jarek; Gurari, Danna

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.14990 (cs)

[Submitted on 21 Apr 2024]

Title:Interpreting COVID Lateral Flow Tests' Results with Foundation Models

Authors:Stuti Pandey, Josh Myers-Dean, Jarek Reynolds, Danna Gurari

View PDF HTML (experimental)

Abstract:Lateral flow tests (LFTs) enable rapid, low-cost testing for health conditions including Covid, pregnancy, HIV, and malaria. Automated readers of LFT results can yield many benefits including empowering blind people to independently learn about their health and accelerating data entry for large-scale monitoring (e.g., for pandemics such as Covid) by using only a single photograph per LFT test. Accordingly, we explore the abilities of modern foundation vision language models (VLMs) in interpreting such tests. To enable this analysis, we first create a new labeled dataset with hierarchical segmentations of each LFT test and its nested test result window. We call this dataset LFT-Grounding. Next, we benchmark eight modern VLMs in zero-shot settings for analyzing these images. We demonstrate that current VLMs frequently fail to correctly identify the type of LFT test, interpret the test results, locate the nested result window of the LFT tests, and recognize LFT tests when they partially obfuscated. To facilitate community-wide progress towards automated LFT reading, we publicly release our dataset at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2404.14990 [cs.CV]
	(or arXiv:2404.14990v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.14990

Submission history

From: Josh Myers-Dean [view email]
[v1] Sun, 21 Apr 2024 18:32:08 UTC (12,785 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Interpreting COVID Lateral Flow Tests' Results with Foundation Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Interpreting COVID Lateral Flow Tests' Results with Foundation Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators