Production¶

Stage after deploying the model to work with live data - Model conversion - Optimization - Performance - Latency - Throughput - Energy-consumption - Security & Privacy - Online learning

Drift¶

Train & live data distributions change over time

Causes - Structural break - Data integrity issues

Types¶

Type	Data Change	Relationship Change	Subtype	Change in	Solution	Example	Example cause
Data Drift	✅	❌	Feature/ Covariate	\(p(x)\)		Applicants from new market	Product launch in new market
			Prior/ Output/ Label	\(p(y)\)		Price of goods increase	Inflation
Concept Drift	❌	✅		\(p(y \vert x)\)	- Give higher sample weight to recent datapoints - Use batch-streaming hybrid - Works when we have the label associated with every data point, such as in Recommender Systems	Price-elasticity of demand changes	New competitor in your existing market

Check with - Adversarial Validation/Domain Classifier - Anomaly Detection

If label and drift happen together and cancel each other out, there is no concept drift.
Else, concept drift will be caused by one/both since they are linked by Bayes' equation

Speed¶

Deployment Checklist¶

IDK¶

Aspect	Type		Advantages	Disadvantages	Comment
Inference	Precomputed		Simple	Computationally-expensive	Recommendation systems to be used for email marketing campaigns
	Realtime/ Inference-time
Training	Batch	Train the model before inference
	Realtime/ Inference-time	Train the model at inference	- Reduces train cost - Reduces train time - Do not train on observations that will never be required - Improves model performance	- Possible latency	Useful if your inference use-case only requires a small subset of training dataset - filter at inference time - train at inference time - predict at inference time Complexity of model should be based on the amount of samples
Retraining	Full Batch	Periodically retrain on entire dataset	Simple	- Computationally-expensive
	New Batch	Periodically update the existing model with new observations
	Online/ Streaming/ On-the-fly	Update model as new observations appear	- Computationally-efficient	- Complex

Model Location¶

	Cloud	Edge/Browser
Cheaper	❌	✅
Small models (Load + Inference)	Slower	Faster
Large models (Load + Inference)	Faster	Slower
Offline support	❌	✅
User Privacy	❌	✅
Model Privacy	✅	❌

Compute requirements¶

CPU/GPU/Memory

Latency, throughput (QPS)¶

Logging¶

Security & Privacy¶

Scenarios of Deployment¶

New product/capability
Automate/assist with manual task
Replace previous ML system

Types of Deployment¶

Type
Canary	Roll out to small fraction of traffic initially Monitor system and ramp up traffic gradually
Blue-Green	Fully deploy new version (green) Keep old model dormant, and rollback to it if required (blue)

Degrees of Automation¶


Human-Only
Shadow Mode
AI Assistance
Partial Automation
Full automation

Monitoring¶

Brainstorm potential problems
Brainstorm appropriate metrics to identify the problems
Software Metrics
- Memory
- Compute
- Latency
- Throughput
- Server load
Data
- Data Distributions
- Input Metrics
  - Average Input length
  - Fraction of rows with missing values
  - Average image brightness
- Output metrics
  - Missing outputs
  - No of times user redoes search
  - CTR (ClickThrough Rate): No of clicks that your ad receives divided by the number of times your ad

Model Serving¶

Adversarial Attacks¶

Fool model by adding noise

This is not a problem with Deep Learning and/or ConvNets. Same issue comes up with Neural Networks in any other modalities. Primary cause of neural networks' vulnerability adversarial perturbation is their linear nature (and very high-dimensional, sparsely-populated input spaces).

The exact adversarial noise can easily be learnt - known model weights: directly - unknown model weights: through backpropagation

eg: Confidently predicting the class even though it is extrapolating

Solution - Data augmentation; not sufficient - Train for adversarial robustness; not sufficient 1. Create adversarial examples 2. Add them to train data, tagged as "adversarial class" - Not clear what is the guaranteed workaround

Last Updated: 2025-07-26 ; Contributors: AhmedThahir, web-flow