The document presents a semantic-based approach for information retrieval from HTML documents using a wrapper induction technique, aimed at enabling easier extraction of unstructured data. By leveraging semantic web technologies, primarily RDF, the proposed model facilitates the extraction, storage, and querying of information across multiple web pages, yielding efficient report generation. The methodology includes pre-processing HTML pages, extracting data, generating ontology, constructing SPARQL queries, and evaluating performance with notable improvements in precision and reduction in report generation time.