Running Massively Parallel Deep-learning Inference Pipelines on Kubernetes

Dec 2019

Nearmap captures terabytes of aerial imagery daily. With the introduction of artificial intelligence (AI) capabilities, Nearmap has leveraged Kubernetes to generate AI content based on tens of petabytes of images effectively and efficiently.

This talk covers how using Kubernetes as the backbone of our AI infrastructure, allowed us to build a fully automated deep-learning inferential pipeline that despite not being embarrassingly parallel is actually massively parallel. This talk explains the architecture of this auto-scalable solution that has exhausted all K80 spot GPUs across all US data centres of AWS for weeks. This system has already produced semantic content on over a million km2 area at resolution as high as 5cm/pixel in just 2 weeks. In this talk, you will learn about the joys of building and running this system at scale, challenges encountered, their resolution, & future work.

Slides can be found here and video:

« Deep learning meets Kubernetes: Running massively parallel inference pipelines efficiently

Realizing End to End Reproducible Machine Learning on Kubernetes »

Suneeta Mall

Rambling of a curious engineer & data scientist

Running Massively Parallel Deep-learning Inference Pipelines on Kubernetes