Senior Site Reliability Engineer

Kraków, Małopolskie, Poland

Remitly is on a mission to transform the lives of immigrants and their families by providing the most trusted financial products and services on the planet. Since 2011, we have been tirelessly delivering on our promises to immigrants sending their hard earned money home. Today, we are incredibly proud to have served millions of customers globally with Remitly and our newly launched Passbook app to provide immigrants access to banking. We strive daily to meet our promise to our customers by building peace of mind into everything we do. Join over 2,200 employees across 8 offices who are growing their careers while having a positive impact on people globally.

About the Role:

Come be a founding member of Remitly’s Site Reliability Engineering team! In this role you will be working across Remitly’s engineering organization driving programs that improve reliability and availability. You’ll develop repeatable reliability patterns teams can use to solve common challenges such as rate limiting, circuit breaking, traffic management, and game days.  You will own Remitly’s observability tooling to ensure production impacting events are detected and escalated, and use that data to propose, design, and lead programs to address the availability risks that are found. 

What You’ll Do:

  • Implement a company wide strategy for Observability tooling. Define best practices, KPIs, and metrics. Work with engineering teams to implement.
  • Own the operation and design of Observability tools such as New Relic and Victoria Metrics / Prometheus  
  • Evaluate and integrate/build SRE systems in domains such as observability, rate limiting, traffic control, incident management, and problem management
  • Provide technical mentorship to Remitly software teams on managing availability and performance of their systems. 
  • Identify company wide opportunities to improve availability, and implement automation and tooling to address. 
  • Participate in an on-call rotation for production issues.
  • Provide mentorship to junior team members

Who You Are:

  • 5+ years as a Site Reliability Engineer, Software Development Engineer, or related role
  • 2+ years of experience with Kubernetes and cloud infrastructure such as AWS
  • Proficient in at least one high-level programming language such as Java, Go, Python and Typescript
  • 2+ years experience driving collaborative projects from conception to delivery using Agile/Kanban methodology
  • Experience implementing Observability tooling at scale (Prometheus, CloudWatch, New Relic, Dynatrace, etc)
  • Deep understanding and passion for SRE concepts, patterns and tooling

Our Benefits

  • Stock grant package (RSUs)
  • Unlimited paid time off
  • Hybrid or fully remote work model
  • Office in a Kraków city center (High5ive)
  • Health/Dental coverage - LUX MED VIP
  • Life Insurance
  • Multisport or Sodexo lunch card
  • Education budget
  • Equipment of your choice
  • Standing desks


We are committed to nondiscrimination across our global organization and in all of our business operations. Employment is determined based upon personal capabilities and qualifications without discrimination on the basis of race, creed, color, religion, sex, gender identification and expression, marital status, military status or status as an honorably discharge/veteran, pregnancy (including a woman's potential to get pregnant, pregnancy-related conditions, and childbearing), sexual orientation, age (40 and over), national origin, ancestry, citizenship or immigration status, physical, mental, or sensory disability (including the use of a trained dog guide or service animal), HIV/AIDS or hepatitis C status, genetic information, status as an actual or perceived victim of domestic violence, sexual assault, or stalking, or any other protected class as established by law.