Oscar Castañeda presented on learning to rank datasets for search using Elasticsearch's learning to rank capabilities. He discussed extracting relevance judgements during data pipelines to bootstrap a dataset ranking model, and leveraging click-through data on dataset profiles to continuously improve the model. The demo showed ranking movie datasets based on a relevance judgement file, with movie profile pages representing dataset profiles. Next steps included describing datasets in a structured way, building a knowledge graph to extract insights, and using topic models to understand datasets.
Related topics: