Useful Links
Computer Science
Artificial Intelligence
Machine Learning
Machine Learning in Production
1. Introduction to MLOps
2. Project Scoping and System Design
3. Data Engineering for Production
4. Model Development for Production
5. Model Deployment
6. Monitoring, Logging, and Maintenance
7. MLOps Infrastructure and Tooling
8. Governance, Ethics, and Security
Model Deployment
Deployment Architecture Design
System Architecture Patterns
Scalability Considerations
Security Requirements
Deployment Patterns
Online Prediction
Low-latency Serving
Synchronous APIs
Real-time Inference
Batch Prediction
Scheduled Batch Jobs
Large-scale Data Processing
Offline Inference
Streaming Prediction
Real-time Data Streams
Event-driven Inference
Stream Processing
Edge Deployment
Mobile Deployment
IoT Device Deployment
Offline Inference
Serving Infrastructure
Model Serialization
Serialization Formats
Pickle
ONNX
TensorFlow SavedModel
PyTorch TorchScript
Compatibility Considerations
Performance Optimization
Creating Prediction Services
REST APIs
API Design Principles
Authentication and Authorization
Rate Limiting
gRPC Services
Protocol Buffers
Performance Considerations
Streaming Support
GraphQL APIs
Schema Design
Query Optimization
Model Servers
Dedicated Model Serving Frameworks
TensorFlow Serving
TorchServe
MLflow Models
Scaling Model Servers
Load Balancing
Caching Strategies
Containerization and Orchestration
Docker for ML Applications
Building Docker Images
Managing Dependencies in Containers
Multi-stage Builds
Image Optimization
Kubernetes for ML Workloads
Deploying Containers to Kubernetes
Autoscaling and Load Balancing
Managing Deployments and Rollbacks
Resource Management
Service Mesh
Traffic Management
Security Policies
Observability
Deployment Strategies
Blue-Green Deployment
Traffic Switching
Rollback Procedures
Testing Strategies
Canary Releases
Gradual Traffic Shifting
Monitoring Canary Performance
Automated Rollback
Shadow Deployment
Parallel Inference
No-impact Testing
Performance Comparison
A/B Testing
Experiment Design
Statistical Analysis of Results
Multi-armed Bandit Testing
Rolling Deployments
Incremental Updates
Zero-downtime Deployment
Health Checks
Continuous Integration and Continuous Delivery for ML
CI/CD Pipeline Design
Pipeline Stages
Artifact Flow
Quality Gates
Automated Testing
Unit Tests for ML Code
Integration Tests for Pipelines
Model Validation Tests
Performance Tests
Automated Build Processes
Automated Build Pipelines
Artifact Generation
Dependency Management
Automated Deployment
Deployment Automation Tools
Rollback and Recovery Automation
Environment Promotion
Infrastructure as Code
Terraform
CloudFormation
Ansible
Previous
4. Model Development for Production
Go to top
Next
6. Monitoring, Logging, and Maintenance