Intermediate Docker
Mike Metzger
Data Engineering Consultant
Consider a data engineering project using the following software:
Possible to use a single image, but we would need to update the image each time we had an update to the ETL or web server setup.
FROM ubuntu
RUN apt update
RUN apt install -y postgresql
RUN apt install -y nginx
RUN apt install -y python3.9
...
bash> docker run -d postgresql:latest
bash> docker run -d nginx:latest ...
docker images
bash> docker images
REPOSITORY TAG SIZE
postgres latest 448MB
postgres 15 442MB
apache/airflow 2.7.1-python3.9 1.4GB
alpine latest 7.73MB
Intermediate Docker