Loading…
Edinburgh, Scotland, UK
October 21 & October 25 | Co-Located Events, Tutorials, & Workshops
October 22-24 | Conference
Find out more information for Open Source Summit + Embedded Linux Conference & OpenIoT Summit Europe 2018

Please note that you can view and download presentations on the Open Source Summit and Embedded Linux Conference + OpenIoT Summit slides pages. 
Back To Schedule
Sunday, October 21 • 13:30 - 17:15
Workshop: Building and Operating an OSS Data Science Platform - Jörg Schad, Mesosphere

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
There are many great tutorials for training your deep learning models using TensorFlow, Keras, Spark or one of the many other frameworks. But training is only a small part of the overall deep learning pipeline.

Ever wonder about how to set up a complete end-to-end data science pipeline starting with data storage and preparation, interactive notebooks, distributed training, CI/CD automation, and serving and monitoring the trained models.

In this workshop, we will build an end-to-end OSS data science platform including:
* Data preparation using Apache Spark
* JupyterLab self-service for data scientists
* Data storage using HDFS
* Distributed training
* Automation & CI/CD using Jenkins
* Resource sharing (including GPUs) between multiple user/jobs
* Model and metadata storage
* Model serving and monitoring

Speakers
avatar for Jörg Schad

Jörg Schad

CTO, ArangoDB
Jörg Schad is the CTO at ArangoDB. In a previous life, he has worked on or built machine learning pipelines in healthcare, distributed systems, including early Kubernetes code at Mesosphere, and in-memory databases. He received his Ph.D. for research about distributed databases and... Read More →


Sunday October 21, 2018 13:30 - 17:15 BST
Moorfoot, Level 0