The thesis presents a framework for evaluating recommender systems in real-world scenarios, identifying the challenges and the need for empirical studies to assess factors like accuracy and user preferences. It proposes building an evaluation platform with defined use cases, focusing on user interactions and roles within the system. The conclusion highlights the implementation of a scalable platform already attracting interest from companies and the potential for ongoing research and user interaction development.