The document discusses methods to measure 'domainhood,' or the specificity of web corpora, with a focus on a case study involving the ecare corpus. It evaluates various statistical tests and metrics to quantify domainhood and establish differences between specialized and general-purpose corpora. The conclusion highlights the need for future work to refine these methodologies and improve gold standard designs for assessing domainhood.