This document discusses using eye movement data to infer image relevance in a content-based image retrieval system. An experiment was conducted where 10 participants viewed images from 101 categories to perform image searching tasks. Eye movement data including fixation duration, fixation count, and number of revisits was analyzed. The results show that fixation duration and count were significantly higher for positively relevant images compared to irrelevant images. This suggests eye movement data could provide an implicit form of relevance feedback to improve image retrieval systems.
A decision tree was then developed to predict user feedback during the tasks based on the eye tracking measures. It achieved over 87% accuracy, demonstrating the potential of using natural eye movement as a robust source of relevance feedback to bridge the semantic gap in