The document discusses an evaluation of the CITO citation classification tool, focusing on human annotators' behaviors in using its properties. An experiment revealed low inter-rater agreement among annotators, and usability assessments indicated variability in property use. The authors propose improvements to enhance clarity and usability while suggesting future work to refine the Citalo tool for automatic citation classification.