Software Engineer - Site Reliability Engineer at Affirm

San Francisco, CA, United States

Posted on Mar 17 2015 (over 1 year ago)

Visas:

H-1B
TN1
E3

ABOUT US

At Affirm we believe the financial industry is fundamentally broken. Not only is the core infrastructure built with technology from the 1970s, but there are a dwindling number of people who say "I trust my bank to look out for me". It doesn’t have to be this way, and it’s our mission to fix this problem. We are using technology to re-imagine and re-build core parts of financial infrastructure to enable cheaper, friendlier and more transparent financial products and services that improve lives. We are based in San Francisco; founded by Max Levchin (founding CTO PayPal), Jeff Kaditz (CDO DeNA/ngmoco), and Nathan Gettings (founding CTO of Palantir); and looking for exceptionally talented and passionate people to join us on our mission.

This role combines software development, networking, and systems administration expertise to help design, build, and run our proprietary distributed, fault-tolerant, financial platform and infrastructure, which is central to our long term company mission. The primary goals of SRE are reliability, scalability, monitoring, and performance of the entire system. We are looking for technology professionals who love being in the center of the action, and are able to routinely address complex software and systems issues ranging from distributed change propagation on live serving systems, to designing and deploying complex multi-stage data pipelines. We hire people from both systems and software backgrounds. Strong candidates will have experience with both.

RESPONSIBILITIES

  • Design, build, and deliver software that will enhance the scalability, availability, and efficiency of the Affirm platform and products.
  • Work proactively across the company to ensure the Affirm infrastructure is never a constraint for the engineering team or any aspect of the company.
  • Automate as much of your work as possible, using a language of your choice.
  • Design & review design, architecture, and methods for operating services and systems.
  • Participate in software and system performance analysis and tuning, service capacity planning and demand forecasting.
  • Maintain 99.99% service uptime.
  • Participation in a 24x7 on-call rotation.

REQUIRED

  • Experience in one or more of: Python, C, C++, Go, or an equivalent language.
  • Experience with system monitoring & alerting for availability and performance
  • Experience working with Unix/Linux systems.
  • Understanding of network protocols (TCP/IP, UDP, ICMP, etc), MAC addresses, IP packets, DNS, OSI layers, and load balancing.
  • Excellent analytical skills, coupled with a strong sense of ownership, urgency and drive.
  • Experience with configuration management software like Salt, Chef, Puppet, etc.

BONUS

  • BS/MS in Computer Science or 3+ years of equivalent technical experience in a high-volume or critical production service environment
  • Experience with AWS or other PaaS frameworks.
  • Experience with Docker.
  • SQL and/or MySQL experience.
  • Proven technical troubleshooting and performance tuning experience.

Questions?

Do you have any question or comment for Affirm about their position Software Engineer - Site Reliability Engineer?

You

Please log in to ask a question

Get noticed by being the first to ask Affirm a question.
No question right now? Subscribe to this job post to be notified when other applicants ask something.