HPC Rebuild – 12 Months on

It’s great to be able to review work you have done in the past and then be able to re-factor it for the current climate and in 2023 we will be contracted to re-engineer and upgrade the HPC platform we designed and built for one of Australia’s most prestigious Medical Research Institutes back in 2020-2021. …

Build a Lustre High Performance Cluster

Coming Soon using v2.12.5 Overview Building a (Non HA) Cluster Todo – TCP LNET Overview Metadata Server Todo – Installing the Components – e2fsprogs, kmod, lustre Todo – Recommended Storage Needs (RAID-10 stripes) Todo – /etc/fstab entry for mount on boot. Object Storage Servers Todo – Installing the Components – e2fsprogs, kmod, lustre Todo – …

Infiniband Fat Trees

Overview Infiniband is primarily used in High Performance Computing (HPC) and provides a very fast network interconnect with an incredibly small latency. One of the most common topologies implented is the “Fat Tree” layout. The fat tree topology has host nodes connected at the end points of the network via PCI Infiniband cards. The Infiniband …

HPC Re-Engineering

Overview We have been tasked with proposing how we would re-engineer an aging SGI High Performance Computing platform. The platform is a 35 Node cluster running Red Hat 6.7 on each node with a Lustre file system mount and a /home file system presented via NFS from a Hierarchical Storage Node (over an FDR Infiniband …