Site Reliability Engineer Job at Ursus, Menlo Park, CA

  • Ursus
  • Menlo Park, CA

Job Description

JOB TITLE: Site Reliability Engineer
LOCATION: Pleasanton, CA
DURATION: 4-6 week contract to hire
RATE RANGE: $90+ per hour

POSITION SUMMARY:

The Senior Site Reliability Engineer (SRE) plays a vital role in ensuring the reliability, scalability, and performance of our enterprise software platform. This is a senior-level position that requires deep technical expertise, strong problem-solving skills, and the ability to collaborate effectively in a fast-paced, demanding environment. Our customers, the largest enterprises in the world, expect 24/7 platform availability and top-tier performance. The ideal candidate has strong expertise in AWS cloud technologies, a deep understanding of serverless architectures (AWS Lambda), and a passion for building resilient systems that enhance the customer experience.

RESPONSIBILITIES:

Platform Reliability:
  • Design, implement, and manage highly available and scalable systems to meet customer expectations for 24/7 uptime.
  • Monitor, troubleshoot, and resolve platform incidents using tools such as Sentry, New Relic, and custom monitoring frameworks.
  • Lead post-incident reviews to ensure root cause analysis and preventative measures are in place.

Automation and Optimization:
  • Develop and maintain automation for infrastructure management, monitoring, and incident response.
  • Optimize platform performance and scalability, proactively identifying and addressing bottlenecks.
  • Contribute to the development of CI/CD pipelines to improve deployment reliability and speed.

Collaboration:
  • Partner with L2 engineers to resolve complex customer issues, providing guidance and technical expertise as needed.
  • Work closely with product engineering to ensure platform improvements align with customer needs.
  • Actively contribute to the documentation and sharing of best practices to improve team performance and customer outcomes.

Leadership:
  • Mentor junior engineers and provide technical leadership in reliability engineering.
  • Drive cross-functional initiatives to improve platform stability and customer satisfaction.

QUALIFICATIONS:

  • Bachelor's degree in Computer Science or related discipline.
  • 8+ years in a Site Reliability Engineering or DevOps role, with experience supporting enterprise-grade software platforms.
  • 3+ years of experience in cloud services, in particular AWS.
  • Experience building observability systems on New Relic, CloudWatch, or similar.
  • Experience implementing rate limiting, API gateways, and load balancing for highly available systems.
  • Exposure to security best practices and compliance frameworks (e.g., SOC 2, ISO 27001).
  • Proficient in infrastructure as code (IaC) using tools such as Terraform or CloudFormation.
  • Hands-on experience with scripting and programming languages like Python, Go, or Bash.
  • Strong troubleshooting and debugging skills.
  • Excellent communication and collaboration skills.
  • Experience with incident management and post-mortem practices.

Soft Skills:
  • Exceptional problem-solving and critical thinking abilities.
  • Strong verbal and written communication skills, with the ability to navigate ambiguity and provide clarity.
  • Ability to work collaboratively in cross-functional teams under pressure.

Job Tags

Hourly pay, Contract work
