In this blog I will cover how to access file or object storage from an MLrun job. The entire process was done using the Iguazio and Pure Storage MLOps platform as per this post.

As per the documentation MLRun is an open-source MLOps framework that offers an integrative approach to…


In this blog I will cover some recent work done with the great folks at Iguazio

Pure Storage and Iguazio work together to automate MLOps and cut the time to market for data science initiatives by enabling consistent, reliable performance and simplicity at scale. Please refer to the “How to…


Part 3 of the series on Cloudera S3 access to a Pure Storage FlashBlade covering Spark, Hive and distcp.

Part 1 : GUI configuration of Cloudera v7 to use on premise S3 storage

Part 2 : S3 credentials in Cloudera

Spark job using S3 dataset(s)

The following is an example spark job to leverage our…


Part 2 of the series on Cloudera S3 access to a Pure Storage FlashBlade.

In order to access our S3 storage we cannot always leverage the credentials added to our Administration > External Accounts. I will provide here the various other methods available.

Link to Part 1 : GUI configuration…


In this blog I’ll cover the steps to implement a JupyterHub environment with Portworx shared storage, but also how to move your shared data to a Portworx Proxy volume presented from a Pure Storage FlashBlade.

The benefit is the ability to leverage data locality for iterative work and when needed…


A quick step by step on getting Kubeflow up and running with GPU support. In this example I used a single node Kubernetes controller + worker. I will be using docker runtime (nvidia supporr) and kubernetes 1.19.8 with the calico network CNI and the NFS client for storage claims. …


When building out a Kubernetes as a service offering, storage has an integral part to play. …


In this blog I will cover the steps used to configure Pure Storage FlashBlade to output syslog via logstash to an ECK elasticsearch instance.

I am currently running a 7 worker node v1.19.3 Kubernetes cluster onto which both logstash and elasticsearch are deployed.

Elasticsearch is deployed using the ECK operator


In this blog I’ll cover how to use FlashBlade S3 replication with Portworx Backup to recover from a site failure.

Portworx Backup can use Pure Storage FlashBlade as a repository for backup images, providing the fastest time to recovery possible. Leveraging FlashBlade replication functionalities we can configure the recovery of…


Today let’s look at how to leverage Pure Storage FlashBlade as a backup target for Portworx. By using Pure Storage FlashBlade we will benefit from fast backup and rapid recovery for our containerised environments.

FlashBlade the highly parallel all Flash storage platform, built with web-scale applications in mind, is the…

jboothomas

Infrastructure engineering for modern data applications

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store