

ML inference

- [Instructor] Having discussed feature engineering and model training, let's discuss ML inference in this video. Inference happens in production settings. Building an inference architecture for such a setting requires careful analysis of the tasks to be executed, the expected performance goals, and the infrastructure needed to achieve those goals. What are the tasks involved in model inference? First, raw data that is provided for inference needs to be pre-processed and prepared for inference. During pre-processing, we need to ensure the security of the model itself and protect it from intentional and unintentional hacks and misuse. Raw data may need to be temporarily stored or cached before it is processed. The feature engineering pipeline used for model training needs to be replicated to perform the same operations on inference data as well. Reliable transformation of data needs to be ensured, either through API calls or queues. Then comes serving. The model is usually in a model repository. The…
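To make the pipeline-replication point concrete, here is a minimal sketch in Python using scikit-learn. Everything in it (the toy data, the StandardScaler and LogisticRegression steps, the predict helper) is an illustrative assumption rather than the course's actual stack; the point is that a single fitted pipeline performs the same feature engineering at both training and inference time.

```python
# Minimal sketch of the inference flow described above (illustrative only).
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

# --- Training time: bundle feature engineering (scaling) with the model
# in one pipeline, so the exact same transformations are replayed at
# inference time instead of being re-implemented by hand.
X_train = np.array([[1.0, 200.0], [2.0, 180.0], [3.0, 240.0], [4.0, 210.0]])
y_train = np.array([0, 0, 1, 1])
pipeline = make_pipeline(StandardScaler(), LogisticRegression())
pipeline.fit(X_train, y_train)

# --- Inference time: guard the model by validating raw input before it
# is processed, then reuse the fitted pipeline so pre-processing matches
# training exactly.
def predict(raw_rows):
    X = np.asarray(raw_rows, dtype=float)
    if X.ndim != 2 or X.shape[1] != X_train.shape[1]:
        raise ValueError("unexpected input shape")  # basic input guarding
    return pipeline.predict(X)

print(predict([[2.5, 220.0]]))  # e.g. [1]
```

In a production setting, the fitted pipeline would typically be serialized (for example with joblib.dump) into the model repository the video mentions, and the serving layer would load it and expose predict() behind an API endpoint or a queue consumer, matching the API-call-or-queue pattern described above.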
