- Architecting a Machine Learning Pipeline by Semi Koen
- Uber's ML Platform - Michelangelo
- Architecture of a real-world Machine Learning system by Louis Dorard
- How to deploy Machine Learning models by Christopher Samiullah
- ML in Production by Luigi Patruno
- Chip Huyen's Blog on ML
Just some of the skills ML Engineering professionals use. One may not need to know all of them, but one technology from each area would make you well-rounded.
- Python
- Java
- Scala
- Flask
- SQL / Relational
- NoSQL
- Graph (not critically important)
- SQL
- Spark
- Pandas / Dask
- Pandas
- numpy
- scikit-learn
- AWS: S3, Lambda, DynamoDB, Kinesis, Batch ...
- Serverless frameworks, eg. AWS CDK, serverless, chalice
- Airflow
- Luigi
- Kubeflow Pipelines
- PyTorch
- TensorFlow
- Keras
- FastAPI
- MLFlow
- Sagemaker
- Docker
- Kubernetes
- KFServing (Kubeflow)
- TF-serving (Tensorflow)
- Data Versioning, eg. DVC
- Monitoring Drift, Performance
- Testing: Unit, Integration, Load, ML Robustness