Staff Site Reliability Engineer Australia

2 weeks ago

Copper Coast Council, Austria Aerospike, Inc. Full-time

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases. Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases.

At Aerospike, we dream big and deliver even bigger. Our mission is to unleash the power of the world’s real-time data with a database built for infinite scale, speed, and sustainability. If you're ready to shape the future of data, join us.

Staff Site Reliability Engineer

As a Staff Site Reliability Engineer at Aerospike, you’ll be a technical leader within our global SRE organization, helping drive reliability, performance, and scalability across our hybrid and multi-cloud environments. You’ll bring deep operational experience and lead by example: mentoring others, designing resilient systems, and championing modern SRE practices across new and legacy platforms.

You’ll play a key role in shaping the direction of our infrastructure initiatives, from Kubernetes-based platforms like AKS and the Aerospike Kubernetes Operator to existing services in AWS and GCP. Your impact will span teams and systems as you solve complex problems, influence architecture, and foster a culture of ownership, resilience, and continuous improvement.

Key Responsibilities

Provide technical leadership across multiple systems and environments, proactively identifying risks, shaping architecture decisions, and improving reliability and performance at scale.
Lead key infrastructure efforts, including Kubernetes platform expansion (AKS, AKO) and application of SRE principles to legacy systems and new cloud offerings.
Define, measure, and enforce reliability standards through SLIs/SLOs, observability tooling, and incident response frameworks.
Mentor and guide other SREs by leading design sessions, conducting technical deep dives, and reviewing code, configurations, and infrastructure decisions.
Partner with product, engineering, and cloud teams to align reliability goals with delivery objectives.
Lead root cause analyses and implement systemic fixes for issues spanning multiple platforms or services.
Drive automation-first approaches using IaC, CI/CD pipelines, and scripting to reduce toil and increase deployment confidence.
Influence cross-functional roadmaps, identifying areas for innovation, technical debt reduction, and long-term scalability.
Participate in the global on-call rotation, bringing senior-level calm and clarity during incidents and escalations.

Required Experience

8+ years of experience in SRE, DevOps, or infrastructure engineering, including significant time operating production systems at scale.
Deep hands-on experience with at least one major public cloud (AWS, GCP, Azure), and working knowledge of the others; Azure experience is a plus.
Production experience with Kubernetes, including operating clusters, Helm, operators, and supporting microservices in real-world environments.
Strong proficiency in infrastructure-as-code tools such as Terraform and CI/CD automation platforms.
Expertise in observability tools and practices (Datadog, Prometheus, Grafana, ELK, etc.) and in using them to define SLIs and SLOs; Datadog experience is a plus.
Programming and scripting ability in one or more languages (Python, Go, Bash, etc.).
Experience with large-scale incident response and post-incident review practices.
Proven ability to mentor other engineers and influence technical strategy across multiple teams.
Strong communication skills to articulate complex concepts to technical and non-technical stakeholders.

Preferred Skills and Qualifications

Hands-on experience managing and optimizing database deployments and services in production environments, ensuring high availability and performance.
Familiarity with Aerospike or other distributed databases is a plus.
Kubernetes or cloud certifications (CKA, CKS, AWS/GCP DevOps/Architect) are a plus but not required.
Track record of influencing architectural decisions across teams or domains.

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.

Staff Site Reliability Engineer Australia

1 week ago

Copper Coast Council, Austria Aerospike, Inc. Full-time

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases. Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS
Senior Site Reliability Engineer

4 weeks ago

Council of the City of Sydney, Austria Atlassian Full-time

Join to apply for the Senior Site Reliability Engineer role at Atlassian. Working at Atlassian: Atlassians can choose where they work – whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, personal goals, and...
Site Reliability Engineer

2 weeks ago

Council of the City of Sydney, Austria Luminance Full-time

Luminance – Site Reliability Engineer Location: Millers Point, New South Wales, Australia. Our Site Reliability team tackles complex infrastructure challenges, ensuring 24/7 high‑availability for our unique software applications. The team thrives on automation, scalability, security, and continuous improvement, working closely with development and...
Site Reliability Engineer

1 week ago

Council of the City of Sydney, Austria N2S.Global Full-time

Delivery Lead - Recruitment at Net2Source Inc. Sydney, New South Wales, Australia. A$150,000.00-A$170,000.00. We are looking for a Site Reliability Engineer (SRE) to join our team and ensure the reliability, scalability, and performance of our software systems. This role bridges the gap between software development and IT operations, focusing on automation,...
Site Reliability Engineer – Elastic

4 weeks ago

Council of the City of Sydney, Austria Ethos BeathChapman Full-time

Site Reliability Engineer – Elastic & Linux. This role is provided by Ethos Beath Chapman. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more. Base pay range: A$150,000.00/yr - A$160,000.00/yr. Must be an Australian Citizen or Permanent Resident. Title: Observability Engineer / Site Reliability Engineer –...
Lead Site Reliability Engineer: Drive Global Reliability

7 days ago

Council of the City of Sydney, Austria Teg Pty Ltd Full-time

A leading entertainment technology company in Australia is looking for a Lead Site Reliability Engineer to oversee a team focused on the performance and reliability of their global ticketing platforms. The ideal candidate will have a strong background in AWS system design and automation, and possess excellent communication and leadership skills....
Site Reliability Engineer

3 weeks ago

Council of the City of Sydney, Austria CareCone Group Full-time

Site Reliability Engineer - Cloud Infrastructure. Proficiency in software development and coding. Experience with systems administration, cloud infrastructure, and networks. Strong problem-solving and analytical skills. Familiarity with monitoring, automation, and CI/CD tools. Interested candidates can share their resume at or call me on Seniority level...
DevOps Engineer

4 weeks ago

Council of the City of Sydney, Austria Freelancer.com Full-time

Overview: Join to apply for the DevOps Engineer / Site Reliability Engineer role at Freelancer.com. This role sits in the Systems Engineering team, partnering with software engineers to design and deliver mission-critical services and systems. You will work with infrastructure and services at scale across the Freelancer.com marketplace, deployed in Amazon...
Site Reliability Engineer

3 weeks ago

Council of the City of Sydney, Austria BAH Partners Full-time

Quantitative Headhunter @ BAH Partners | Tech Recruitment Specialist. A leading high-frequency proprietary trading firm is looking for a Trading Systems Reliability Engineer to join its core engineering team in Sydney. This is a high-impact, business-critical role with strong front-office alignment, working directly on the systems that underpin global...
Site Reliability Engineer

4 weeks ago

Council of the City of Sydney, Austria CareCone Group Full-time

Site Reliability Engineer. Location: Sydney, NSW. Employment Type: Permanent. Must have full working rights. No sponsorship available. Observability: this includes designing and implementing SLIs/SLOs aligned to key customer journeys. Strong knowledge of observability concepts: logs, metrics, traces, SLIs/SLOs. Integrating observability tools like Dynatrace,...
