Company Logo

Software Engineer

Netflix - 1d ago

Company Logo

Senior Software Engineer

Reddit - 4d ago

Reliability Engineer

OpenAI - San Francisco, United States

Requirements:

  • Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent work experience)
  • Proven experience as a reliability engineer or similar role in a fast-paced, rapidly scaling company
  • Strong proficiency in cloud infrastructure
  • Proficiency in programming/scripting languages
  • Experience with containerization technologies and container orchestration platforms like Kubernetes
  • Knowledge of Infrastructure as Code tools such as Terraform or CloudFormation
  • Excellent problem-solving and troubleshooting skills
  • Strong communication and collaboration skills
  • Experience with observability tools such as DataDog, Prometheus, Grafana, Splunk, and ELK stack
  • Experience with microservices architecture and service mesh technologies
  • Knowledge of security best practices in cloud environments

Nice to Haves:

  • Enjoy seeking out and addressing bottlenecks and areas for performance improvement in systems
  • Utilize Infrastructure as Code (IaC) principles to automate infrastructure provisioning and configuration management
  • Experienced in collaborating with cross-functional teams for reliability and scalability
  • Track record of accelerating engineering reliability through tooling
  • Create a diverse, equitable, and inclusive culture
  • Humble attitude, willingness to help colleagues, and commitment to team success
  • Ownership of problems end-to-end and willingness to learn for success

What You'll Be Doing:

  • Design and implement solutions for infrastructure scalability
  • Collaborate with development teams to enhance system reliability
  • Implement and manage monitoring systems to identify issues proactively
  • Develop and maintain service level objectives and indicators
  • Implement fault-tolerant design patterns
  • Build automation tools for system reliability
  • Work with cross-functional teams to bring new features and research capabilities
  • Participate in on-call rotation for critical incident response

Perks and Benefits:

  • Exclusive San Francisco HQ location with relocation assistance
Experience: Senior
Posted: March 25, 2024

Get notified about new job opportunities

Subscribe