Use FastAPI to expose an HTTP API for fast live predictions using an ONNX Machine Learning Model. FastAPI is a Python web framework that provides easy development ofdocumented HTTP APIs by offering self-documented endpoints with Swagger – a tool to describe, document, and use RESTful web services.
Learn how to quickly put together an API which validates requests, and self-documents its endpoints using OpenAPI via Swagger. Quickly produce a robust interface for others to consumeyour Machine Learning model by following core best-practices of MLOps.
Parts of this video cover the basics of packaging Machine Learning models, as covered in the Practical MLOps book.
* Create a Python project to serve live predictions using FastAPI
* Use a Dockerfile to package the model and the API using Docker containerization
* With minimal Python code, expose an ONNX model to perform sentiment analysis over an HTTP endpoint
* Dynamically interact with the API using the self-documented endpoint in the container.
* Demo Github Repository with sample code
* Practical MLOps book
* FastAPI Intro tutorial
* RoBERTa ONNX Model for sentiment analysis