The paper presents a novel deep learning methodology for multilingual speech recognition, targeting accurate conversion of mixed-language audio input (Kannada and English) into text. It highlights the limitations of existing speech recognition methods and proposes a next word prediction model to enhance accuracy, achieving a 71% success rate on a custom dataset. The study emphasizes the importance of effectively recognizing and translating multilingual queries to improve communication in linguistically diverse regions like India.