This paper presents the coupling of Arom, an object-based knowledge representation system, with V-Storm, a multimedia presentation system, to simplify the authoring of multimedia presentations and keep them consistent. The proposed model lets users build presentations with an intuitive UML-like approach, while Arom's inference mechanisms ensure spatial and temporal consistency. The integration also facilitates video data management and enables users to create interactive multimedia content compliant with the SMIL standard.
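
To make the idea of a temporally consistent, SMIL-compliant presentation concrete, the following minimal sketch checks that every media item fits inside the presentation's duration before serializing it as SMIL. It is an illustration only and does not reproduce the Arom or V-Storm APIs: the `MediaItem` class, the functions `check_temporal_consistency` and `to_smil`, and the file names are all hypothetical, and the single check shown stands in for the much richer inference that Arom performs.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class MediaItem:
    """Hypothetical description of one media object in a presentation."""
    tag: str      # SMIL media element name, e.g. "video" or "img"
    src: str      # media source (illustrative file name)
    begin: float  # start time in seconds from the presentation start
    dur: float    # duration in seconds


def check_temporal_consistency(items, total_dur):
    """Return a list of error messages; an empty list means consistent."""
    errors = []
    for it in items:
        if it.begin < 0 or it.dur <= 0:
            errors.append(f"{it.src}: negative begin or non-positive duration")
        elif it.begin + it.dur > total_dur:
            errors.append(f"{it.src}: ends after the presentation ({total_dur:g}s)")
    return errors


def to_smil(items, total_dur):
    """Serialize the items as a SMIL <par> block, refusing inconsistent input."""
    errors = check_temporal_consistency(items, total_dur)
    if errors:
        raise ValueError("; ".join(errors))
    smil = ET.Element("smil")
    body = ET.SubElement(smil, "body")
    # All items play in parallel, each offset by its own begin time.
    par = ET.SubElement(body, "par", dur=f"{total_dur:g}s")
    for it in items:
        ET.SubElement(par, it.tag, src=it.src,
                      begin=f"{it.begin:g}s", dur=f"{it.dur:g}s")
    return ET.tostring(smil, encoding="unicode")


if __name__ == "__main__":
    items = [
        MediaItem("video", "clip.mpg", begin=0, dur=10),
        MediaItem("img", "title.png", begin=2, dur=5),
    ]
    print(to_smil(items, total_dur=10))
```

In this sketch an inconsistent item (for example an image that begins at 8 s and lasts 5 s in a 10 s presentation) causes `to_smil` to raise a `ValueError` instead of producing an invalid document. In the coupled system described here, that role is played by the knowledge base rather than by hand-written checks.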