This document presents a video shot boundary detection (SBD) method built on a three-stage approach that combines grayscale image transformation, object comparison, and the scale-invariant feature transform (SIFT). The proposed system aims to identify shot transitions accurately while keeping computational complexity and processing time low, and it reports an F-score of 0.97. The research also highlights the challenges of video analysis, in particular the difficulty of distinguishing true shot boundaries from frame changes caused by camera and object movement.
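Since the headline result is an F-score, a minimal sketch of how that metric is usually computed for shot boundaries may help. It uses the standard definition, F = 2PR / (P + R), where P is precision and R is recall. The function name `f_score`, the `tolerance` parameter, and the frame indices are assumptions made for illustration; they are not taken from the document, which does not describe its evaluation protocol.

```python
def f_score(detected, ground_truth, tolerance=0):
    """Return (precision, recall, F1) for detected shot-boundary frame indices.

    `tolerance` is a hypothetical matching window in frames; the source
    does not say whether or how detections are matched with slack.
    """
    unmatched = sorted(ground_truth)
    true_pos = 0
    for frame in sorted(detected):
        # Match each detection to at most one unused ground-truth boundary.
        match = next((g for g in unmatched if abs(g - frame) <= tolerance), None)
        if match is not None:
            unmatched.remove(match)
            true_pos += 1
    precision = true_pos / len(detected) if detected else 0.0
    recall = true_pos / len(ground_truth) if ground_truth else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Hypothetical frame indices, for illustration only (not data from the study).
truth = [120, 340, 560, 900]
found = [121, 340, 700, 900]
p, r, f1 = f_score(found, truth, tolerance=2)
print(f"precision={p:.2f} recall={r:.2f} F1={f1:.2f}")
```

In this toy case three of the four detections fall within two frames of a true boundary, so precision, recall, and F1 all come out to 0.75. An F-score of 0.97 therefore implies that both missed boundaries and false detections are rare.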