102608 - SRE
United States
Full Time
Senior Executive
102608 - SRE
Summary
The Purple Platform Engineer – SRE is a hybrid engineering role combining Site
Reliability Engineering excellence, cloudnative software engineering expertise,
and deep knowledge of our internal Purple Platform, HealthEquity’s cloudnative
application delivery ecosystem.
Responsibilities
- You will design, build, and operate highly reliable systems while enabling product teams to selfserve, deploy, and operate applications securely and efficiently—aligned with the platform’s core tenets: GitOps integration, and cloudnative operational excellence.
- This role requires an engineer who thrives in modern DevOps environments, understands distributed systems deeply, writes high-quality code, and can translate platform guardrails and policies into a world class developer experience.
Required Expert Skills & Technical Experience
CloudNative Ecosystem
- Strong Kubernetes expertise (workloads, scaling, networking, operators, CRDs).
- Advanced containerization practices (Docker multi-stage, security hardening).
- Hands On Experience implementing service mesh (ISTIO) and API gateways
- (Kong).
- Infrastructure-as-Code (Terraform).
- Understanding and ability to configure and troubleshoot MongoDB collections,
- Redis Cache, Azure Service Bus, Azure Document Storage etc.
Software Engineering Core
- Strong background in C#, Python and/or Node.js.
- Ability to build highly reliable distributed applications and automation tools.
- Building CI/CD pipelines.
- Experience with AI Assisted development to improve quality and productivity
GitOps & Platform Delivery
- Deep understanding of declarative deployment workflows (Argo CD, Flux).
- Expertise in Helm, Kustomize, deployment manifests, and environment modeling.
- Experience integrating automated tests, scans, and policy controls into Git workflows—supporting the platform’s “shift-left feedback and shift-right enforcement” model.
Observability & Monitoring
- Strong experience with configuring and using Dynatrace for observability, setting up OpenTelemetry integrations, App Insights
- Competence using Kusto (KQL), analyzing logs, distributed traces, and performance metrics.
- Incident response leadership, postmortem writing, error budget management.
Security & Governance
- Familiarity with container scanning, supply chain security, SBOM tools.
- Experience applying and troubleshooting policies for security and using secure
- secret management (Vault/KMS).
- Configuring and implementing Managed Identities for secure authentication
- Understanding of compliance frameworks relevant to healthcare systems.
Developer Tooling & Automation
- Building internal tools, CLIs, templates, plug-ins that improve velocity.
- Knowledge of Backstage or internal developer portals is a plus.
- Strong scripting skills (Bash, PowerShell, Python, Go utilities).
Preferred Qualifications
- 3+ Years Experience in large-scale, enterprise-grade cloud native platforms.
- Previous work in SRE, Platform Engineering, DevOps, or Production Engineering roles.
- Experience with self-service portals and cloud resource orchestration.
- Familiarity with classification-driven policy models and governance automation.
Apply for this position
Required*