Site Reliability Engineer (SRE)

Radiance Technologies · Huntsville, Alabama, United States

Posted 21 days ago

Apply in seconds with Jobply
One-click apply on Workday, Greenhouse, Lever & 50+ ATS systems
Apply with Jobply →

Skills

Incident ResponseToil ReductionReliability EvaluationsPlatform EnablementSystems ThinkingObservability FundamentalsBasic Software EngineeringLinuxNetworkingKubernetesGitOpsAutomationPythonGoPrometheusGrafana

Job description

Salary Range: $75,000 - $100,000

At Radiance our SREs own the reliability of systems they don't write - defining what "reliable enough" means from the user’s perspective, instrumenting and measuring against those targets, and building the tooling and runbooks that make failure recoverable. They partner with dev teams pushing operational quality upstream before code ships, and they lead the resolution in production when things go wrong. SREs are comfortable debugging distributed systems, resolving incidents, and translating findings into lasting reliability improvements.  Day to day responsibilities fall into four categories: Incident Response, Toil Reduction, Reliability Evaluations, Platform Enablement

Required Qualifications

  • 1+ years of experience in Operations, Sys Admin, DevOps, or Software engineering
  • Bachelor’s Degree in CS, Computer Engineering, or related technical field
  • US Citizenship & must have or be able to obtain a Top Secret Clearence
  • Systems thinking – understanding how systems fail together, blast radius, and more
  • Observability Fundamentals – not just the 3 signals, but knowing why and how to use telemetry to optimize services and engineering quality of life
  • Basic software engineering – building automation & non-trivial APIs, git workflows, effectively engaging in code reviews
  • Linux/networking fundamentals
  • Strong Communication, Collaboration, and Organizational Skills

Specialty Skills: (1 or more)

  • Platform & Infrastructure - Kubernetes, ArgoCD/GitOps, disaster recovery, capacity planning
  • Observability - OTel standards, Grafana/Perses, Tempo, Clickhouse, VictoriaMetrics
  • Automation & Toil Reduction - scripting, CI/CD, runbook automation, “DevOps”
  • Developer Enablement - instrumentation SDKs, SRE practice onboarding
  • Data & Alerting - dashboard quality, alert design, anomaly detection

Desired Qualifications

  • SRE Certifications from The DevOps Institute, AWS Solution Architect, or similar
  • Hands-on experience with: Python, Go, Kubernetes, Argo CD, GitLab/GitHub, Jenkins, Docker, Locust/Gatling, Prometheus, Grafana/Perses

Radiance Technologies is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.

Stop filling out the same form 100 times.

Install the free Jobply Chrome extension and auto-apply to Site Reliability Engineer (SRE) and 300,000+ other live jobs across Workday, Greenhouse, Lever, and 50+ other ATS systems.

Apply with Jobply — Free
✓ Free forever✓ No credit card✓ 4.8★ from 12k+ users