Skip to main content

Hardware Specifications

Source: Marc Mercer (SRE Lead) — sre-iac repository + Bryan Lee (Engineering Director), Rev 2.0, 2026-03-22

Status Labels

Each server section is labeled with its current operational status. See Architecture Overview for the full context of Current vs Future state.


Server Inventory

UnitHostnameIPiLO IPStatusRole
HP #1pmx-0110.10.96.510.10.96.237ACTIVEProxmox — hosts all K8s VMs and services
HP #2(pending)(off)10.10.96.235🔵 PENDINGOpenStack controller/compute target
HP #3(pending)(off)10.10.96.236🔵 PENDINGOpenStack controller/compute target
HP #4(pending)(off)(offline)🟡 COMING SOONOpen Air GPU Farm (Proxmox, 3× GPU initially)
HP #5(planned)(TBD)(TBD)📋 PLANNEDOpenStack backup/spare node (not yet purchased)

HP #1 — Proxmox Dev Platform [ACTIVE — CURRENT STATE]

The only operational compute node. Hosts all current K8s VMs, application VMs, FreeIPA DCs, and the Proxmox hypervisor.

CPU

ComponentSpecificationPer Server
ProcessorIntel Xeon E5-2697 v42 CPUs
ArchitectureBroadwell, 14nm
Base Clock2.3 GHz
Turbo Clock3.6 GHz
Cores per CPU18 cores36 cores total
Threads per CPU36 threads72 threads total
L3 Cache45 MB per CPU90 MB
TDP145W per CPU290W

CPU features: VT-x, EPT, VT-d, AES-NI, Hyper-Threading, Turbo Boost 2.0.

Memory

ComponentSpecification
Memory TypeDDR4-2133 ECC RDIMM
Module Size32GB
Total Memory512GB (16× 32GB)
Memory Channels8 per CPU (16 total)

Storage

ComponentSpecificationPurpose
OS Drive1TB USB SSDProxmox VE boot and OS
Ceph SSD (ready)8× 1TB enterprise SSDAwaiting Ceph cluster deployment
Ceph HDD (ready)16× 1TB enterprise HDDAwaiting Ceph cluster deployment
Storage ControllerHP Smart Array P440ar (HBA mode)Direct disk access for Ceph

Network Interfaces

ComponentSpecification
Built-in NICIntel I350 Quad Port 1GbE — 4 ports
Bond: bond-neteno1 + eno2 (LACP 802.3ad) — 2 Gbps aggregate
Bond: bond-storeno3 + eno4 (LACP 802.3ad) — 2 Gbps aggregate
Upgrade NIC10Gtek 10Gb Dual SFP+ (Intel 82599ES) — being installed
ManagementDedicated iLO 4 NIC (10.10.96.237)

Power

ComponentSpecification
PSU2× 800W redundant hot-plug
CoolingRedundant hot-swap fans
Form Factor2U tower (rack-mountable)

HP #2 — OpenStack Controller Target [PENDING — POWERED OFF]

Powered off. iLO accessible at 10.10.96.235. Hardware identical to HP #1. Will become an OpenStack controller/compute node when the OpenStack deployment phase begins.

Action required before deployment: Connect to iLO, verify BIOS, RAID controller in HBA mode, memory seating.

Specifications (identical to HP #1)

ComponentSpecification
Processor2× Intel Xeon E5-2697 v4 (36 cores / 72 threads)
Memory512GB DDR4-2133 ECC RDIMM
SSD Storage8× 1TB enterprise SSD (Ceph SSD pool)
HDD Storage16× 1TB enterprise HDD (Ceph HDD pool)
Built-in NICIntel I350 Quad Port 1GbE
Upgrade NIC10Gtek 10Gb Dual SFP+ (Intel 82599ES) — to be installed
ManagementiLO 4 (10.10.96.235)

HP #3 — OpenStack Compute Target [PENDING — POWERED OFF]

Powered off. iLO accessible at 10.10.96.236. Hardware identical to HP #1 and HP #2. Will become an OpenStack compute node in the same deployment phase as HP #2.

Action required before deployment: Connect to iLO (10.10.96.236), verify BIOS, RAID controller mode, memory seating.

Specifications (identical to HP #1 and HP #2)

ComponentSpecification
Processor2× Intel Xeon E5-2697 v4 (36 cores / 72 threads)
Memory512GB DDR4-2133 ECC RDIMM
SSD Storage8× 1TB enterprise SSD (Ceph SSD pool)
HDD Storage16× 1TB enterprise HDD (Ceph HDD pool)
Built-in NICIntel I350 Quad Port 1GbE
Upgrade NIC10Gtek 10Gb Dual SFP+ (Intel 82599ES) — to be installed
ManagementiLO 4 (10.10.96.236)

HP #4 — Open Air GPU Farm [COMING SOON — POWERED OFF]

HP #4 is an HP ProLiant ML350 Gen9 that has been stripped of its outer chassis enclosure and remounted in an open-air rack frame with 12 active cooling fans. The ML350 motherboard, CPUs, RAM, PSUs, and cabling remain unchanged — only the tower enclosure has been removed. This configuration allows NVIDIA GPUs to be mounted externally via PCIe risers, accommodating larger GPU form factors that do not fit inside the standard tower.

This node runs Proxmox VE independently and is not part of the OpenStack cluster. No GPUs reside in any OpenStack node.

CPU and Memory

ComponentSpecification
Processor2× Intel Xeon E5-2683 v4 (32 cores / 64 threads)
Base / Turbo Clock2.1 GHz / 3.0 GHz
L3 Cache40 MB per CPU (80 MB total)
TDP120W per CPU (240W total)
Memory128GB DDR4 ECC RDIMM

OS Storage

ComponentSpecificationPurpose
Primary OS1TB USB SSDProxmox VE OS, VM disk images
Recovery64GB USB thumbdriveProxmox recovery ISO, emergency boot

No internal drive array. No Ceph OSDs. No contribution to storage cluster.

GPU Configuration

SlotGPUArchitectureVRAMMemory BusTDPStatus
1NVIDIA RTX 8000Turing (TU102)48GB GDDR6384-bit260W✅ Installed
2NVIDIA RTX 3090Ampere (GA102)24GB GDDR6X384-bit350W✅ Installed
3NVIDIA RTX 3090Ampere (GA102)24GB GDDR6X384-bit350W✅ Installed
4TBD📋 Reserved — will be selected post go-live

Total VRAM (initial): 96GB (48 + 24 + 24)

The 4th GPU will be selected based on observed workload demand after the farm goes live. Likely candidates: NVIDIA RTX 4090 (24GB), NVIDIA A5000 (24GB), or a second RTX 8000 (48GB) if large-model demand is high.

GPU Passthrough — Proxmox VM Layout

Each GPU is assigned to a dedicated Proxmox VM via PCIe passthrough (VT-d). This provides near-native GPU performance and complete isolation between workloads.

VMGPUvCPURAMPrimary Services
gpu-vm-01RTX 8000 (48GB)1648GBOllama (large models 70B+), vLLM
gpu-vm-02RTX 3090 #1 (24GB)840GBOllama (mid-size), Faster-Whisper ASR, TTS
gpu-vm-03RTX 3090 #2 (24GB)840GBFine-tuning (Axolotl/Unsloth), batch inference
gpu-vm-04Reserved (4th GPU)TBDTBDTBD post go-live

See Architecture Overview — GPU Farm for full VM architecture detail and AI services stack.

Network Interfaces

ComponentSpecification
NIC10Gtek 10Gb Dual SFP+ (Intel 82599ES) — to be installed
Port 1 (bond-net)→ TL-SX3008F-01 (network plane)
Port 2 (bond-stor)→ TL-SX3008F-02 (storage plane — NFS to QNAP / future Ceph)
ManagementiLO 4 (offline — reconnect before power-on)

Cooling

12 active fans in open-air rack frame. GPU TDP budget: RTX 8000 (260W) + 2× RTX 3090 (350W each) = 960W GPU-only. Combined with CPUs (~240W) and RAM, total draw approaches ~1,300–1,400W under full load. Verify PSU capacity and UPS assignment before powering on.


HP #5 — OpenStack Spare/Backup Node [PLANNED — NOT PURCHASED]

A fifth HP ProLiant ML350 Gen9 with specifications identical to HP #1–#3 will be purchased once the OpenStack cluster is fully live and stable. This node serves as:

  • OpenStack spare: Standby node for rolling maintenance (patch one node while keeping full capacity)
  • Capacity buffer: Absorbs workload if any of HP #2 or HP #3 fails unexpectedly
  • Future expansion: If workload demands a fourth compute node, HP #5 is ready to join
SpecificationValue (planned)
CPU2× Intel Xeon E5-2697 v4
Memory512GB DDR4-2133 ECC RDIMM
SSD8× 1TB enterprise SSD (Ceph SSD pool)
HDD16× 1TB enterprise HDD (Ceph HDD pool)
NICIntel I350 Quad Port 1GbE + 10Gtek dual SFP+

10G NIC Upgrade — All ML350 Servers

Four 10Gtek 10Gb PCI-E NIC cards (Intel 82599ES chipset, dual SFP+ ports) have been ordered — one per ML350 server (HP #1, #2, #3, #4). This NIC provides two independent 10G SFP+ ports, one per switch plane:

ServerPort 1 (bond-net)Port 2 (bond-stor)
HP #1 (pmx-01)→ SX3008F-01 (network 10G)→ SX3008F-02 (storage 10G, isolated)
HP #2 (OpenStack)→ SX3008F-01→ SX3008F-02
HP #3 (OpenStack)→ SX3008F-01→ SX3008F-02
HP #4 (GPU Farm)→ SX3008F-01→ SX3008F-02

DAC cables ordered: 8× H!Fiber 1.5m SFP+ DAC (4 servers × 2 planes). Additional 0.3m DACs needed for switch-to-router and cross-panel connections.


Aggregate Totals

Current State (HP #1 only)

ResourceValue
CPU Cores36 cores / 72 threads
RAM512GB
Proxmox VMs8+ VMs (6 K8s nodes + application VMs)
Storage (NFS)QNAP qnap-01 (10.10.96.31) — NFS for all K8s PVs

Future State — OpenStack Cluster (HP #1 + #2 + #3)

ResourceTotal
CPU Cores108 cores / 216 threads
RAM1,536GB (1.5TB)
SSD (Ceph raw)24TB (8 per node × 3 nodes)
SSD (Ceph usable, RF=2)12TB
HDD (Ceph raw)48TB (16 per node × 3 nodes)
HDD (Ceph usable, RF=2)24TB

Future State — GPU Farm (HP #4, initial 3 GPUs)

ResourceValue
CPU Cores32 cores / 64 threads
RAM128GB
VRAM96GB (48 + 24 + 24)
GPU Slots3 active / 1 reserved

Combined Future State (HP #1–#4, excluding HP #5)

ResourceTotal
CPU Cores140 cores / 280 threads
RAM1,664GB
VRAM96GB (expanding to TBD with 4th GPU)
Raw SSD24TB
Raw HDD48TB

Network Hardware

Current State (Juniper — Unmanaged)

DeviceModelIPStatusRole
Cisco Router(unknown model)10.10.96.1✅ ActiveInternet gateway, bridges lab + homestead (flat /20)
EX2200-24T #1Juniper EX2200-24T(unmanaged)✅ Active, unmanagedL2 switch — no VLANs
EX2200-24T #2Juniper EX2200-24T(unmanaged)✅ Active, unmanagedL2 switch — no VLANs
EX2200-24P #1Juniper EX2200-24P(unmanaged)✅ Active, unmanagedL2 switch — no VLANs
QNAP NAS #1QNAP (model TBD)10.10.96.31✅ ActiveNFS storage for K8s PVs
QNAP NAS #2QNAP (model TBD)10.10.101.142❌ Offline (bad drive)Decommissioned until repaired

Interim Configuration (Juniper — VLANs Pending)

DeviceHostnameModelPortsRole
Edge Firewallsrx320-01Juniper SRX3206× GbE + 2× SFPWAN, inter-VLAN routing, security zones
Network Switchex2200t-01Juniper EX2200-24T24× GbE + 4× SFPNetwork plane (VLANs 200, 500, 600, 610, 1000+)
Storage Switchex2200t-02Juniper EX2200-24T24× GbE + 4× SFPStorage plane (VLANs 300, 400) — isolated
OOB Switchex2200p-01Juniper EX2200-24P24× GbE PoE + 4× SFPManagement plane (VLAN 100) — isolated

SRX320 config is deployment-ready in sre-iac/network/interim/srx320-01.conf. EX2200 VLAN configs are in sre-iac/network/interim/.

DeviceHostnameModelPortsRole
Firewall Primaryopnsense-01Beelink EQ12 Pro + OPNsense CE2× 2.5GbE Intel i226-VWAN edge, IDS/IPS, VPN, CARP primary, HAProxy LB
Firewall Standbyopnsense-02Beelink EQ12 Pro + OPNsense CE2× 2.5GbE Intel i226-VCARP standby — auto-failover if primary fails
10G Network Switchsx3008-01TP-Link TL-SX3008F8× 10G SFP+Network plane (server bond-net uplinks + OPNsense LAN)
10G Storage Switchsx3008-02TP-Link TL-SX3008F8× 10G SFP+Storage plane (server bond-stor + QNAP) — isolated, no WAN uplink
1G Network Switchsg3428-01TP-Link TL-SG342824× GbE + 4× SFPNetwork plane 1G access (control VMs, workstations)
1G Storage Switchsg3428-02TP-Link TL-SG342824× GbE + 4× SFPStorage plane 1G fallback — isolated
OOB Switchex2200p-01Juniper EX2200-24P24× GbE PoE + 4× SFPManagement plane (VLAN 100) — repurposed from interim
Homestead Switchsg3452-01TP-Link TL-SG345248× GbE + 4× SFPResidential network (isolated from lab)
SDN Controlleroc200-01TP-Link Omada OC2001× GbEOmada SDN — manages all TP-Link switches (PoE from OOB switch)
No TP-Link ER Router

There is no TP-Link ER-series router in this design. OPNsense on the Beelink EQ12 Pro pair handles all WAN, firewall, IDS/IPS, VPN, and inter-VLAN routing. The Omada SDN controller manages only the switch fabric. This saves ~$399 (ER7412-M2 cost) and delivers a far superior security posture. See Network Edge Security for the full OPNsense architecture.


Other Devices

DeviceHostnameIPConnectionRole
BeeLink Mini PCblnk-fed420110.10.96.32× 1G (one per EX2200)Dev workstation, scan host, management access
QNAP NAS #1qnap-0110.10.96.311G to switchNFS storage for all K8s PVs
QNAP NAS #2(unnamed)10.10.101.142offlineBad drive — treat as decommissioned
UPS #1smt15c-apc01(TBD)1G to OOBPower management
UPS #2smt15c-apc02(TBD)1G to OOBPower management
UPS #3smt15c-apc03(TBD)1G to OOBPower management
UPS #4smt15c-apc04(TBD)1G to OOBPower management

Physical Specifications — HP ProLiant ML350 Gen9

MeasurementSpecification
Form Factor2U Tower (rack-mountable)
Height174.5 mm (6.87 in)
Width442.0 mm (17.4 in)
Depth660.0 mm (26.0 in)
Weight (loaded)~35 kg (77 lbs)
Operating Temperature10°C to 35°C
Humidity8% to 90% RH (non-condensing)

Note (HP #4 only): Physical form factor does not apply — outer chassis removed. Motherboard, PSUs, and cabling are mounted in an open-air rack frame with 12 fans. Environmental tolerances remain the same per component specs.


Document Control

RevDateAuthorDescription
1.02026-02-24Marc MercerInitial release
2.02026-03-22Anshin EngineeringAdded server status labels (active/pending/planned); corrected HP #4 as open-air ML350 (not standard tower); added HP #5 planned node; added 10G NIC upgrade specs; added GPU farm VM layout; corrected router spec (ER7412-M2 not ER8411); added current state network table