This document discusses machine learning techniques for extracting intelligence from unstructured data sources. It proposes a system that applies several machine learning algorithms to extract unstructured data such as text, images, videos, and PDFs and organize it into a structured format that users can access easily. The modules described are CSV extraction; web page extraction, covering text, images, email addresses, and tables; video extraction; image extraction; and PDF extraction. The goal is to take large volumes of unstructured data from varied sources and categorize it in a graph database for later use.
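As a rough illustration of the web page extraction module feeding a graph store, the sketch below pulls visible text, image URLs, and email addresses out of one HTML page and records them as nodes and edges. It is a minimal sketch under stated assumptions, not the system's actual implementation: the summary names no libraries, schema, or graph database, so the names `PageExtractor` and `extract_page`, the edge labels, and the plain in-memory dictionary standing in for the graph database are all hypothetical. The email pattern is a simple approximation and uses no machine learning.

```python
import re
from html.parser import HTMLParser

# Simple approximation of an email address; not a full RFC 5322 matcher.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class PageExtractor(HTMLParser):
    """Collects visible text and image URLs from one HTML page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.images = []
        self._skip = 0  # nesting depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.images.append(src)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if not self._skip and data.strip():
            self.text_parts.append(data.strip())


def extract_page(source_id, html):
    """Return (nodes, edges) for one page; an in-memory stand-in for a graph database."""
    parser = PageExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.text_parts)
    emails = sorted(set(EMAIL_RE.findall(text)))

    nodes = {source_id: {"type": "page", "text": text}}
    edges = []
    for email in emails:
        nodes[email] = {"type": "email"}
        edges.append((source_id, "MENTIONS_EMAIL", email))
    for src in parser.images:
        nodes[src] = {"type": "image"}
        edges.append((source_id, "HAS_IMAGE", src))
    return nodes, edges


sample = """<html><body><h1>Contact</h1>
<p>Write to info@example.org or sales@example.org.</p>
<img src="logo.png"><script>var x = "hidden@example.org";</script>
</body></html>"""

nodes, edges = extract_page("page:1", sample)
for edge in edges:
    print(edge)
```

Keeping the output as `(source, relation, target)` triples means the same records could later be loaded into whichever graph database the system uses, with each extraction module (CSV, video, image, PDF) contributing its own node and edge types.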