Posted on May 12, 2020

Site Reliability Engineer

Software Development | Vancouver, Canada | Full-time

We’re looking for an experienced Site Reliability Engineer to join our SRE team at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. 

We depend on the SRE team's ability to architect, build, and iterate on resilient, scalable systems, and to guide the rest of the engineering team.

You’ll join our highly motivated team and help us launch reliable products and services with your experience and skills. You’ll play a key role in improving our in-house systems and you’ll work closely with Engineering as a whole to research and apply the latest and greatest technology to our stack. You’ll be empowered to fully apply your experience, lessons learned, and leadership skills in an environment with little tech debt, no on-prem servers, and a strong foundation based on GKE. Every day, you’ll collaborate with a world-class team in our Vancouver office.

Every one of us shares a common vision: to create the future we want to live in. We need the right people to help us realize that vision.

What we’ll accomplish together:

  • Develop effective infrastructure for our projects to deploy onto, ensuring projects are scalable, resilient, and reliable in support of growing products.
  • Iterate on processes to improve our ability to ship fast while maintaining high-quality systems that we can depend on.
  • Enhance tools and automation to fill the gaps in our current systems as well as build entirely new ones as we face bigger and more complex challenges.
  • Respond to infrastructure incidents and support the larger Engineering team with their product incident response strategy. 
  • Perform postmortems and in-depth root cause analysis to ensure we are always improving.

A little about you:

  • You have experience working with orchestration systems like Kubernetes.
  • You have experience collecting and processing metrics with tools such as Prometheus, Datadog, or New Relic, and can walk teams through setting up SLO and SLI targets.
  • You have experience building and working on deployment systems.
  • You are comfortable responding to production incidents and can fight fires with a calm and level head, leveraging postmortems to apply lessons learned.
  • You have experience coding and developing applications. Bonus points for Go experience.
  • You have experience working with Infrastructure as Code systems like Terraform or CloudFormation.
  • You are comfortable diving into an unfamiliar system and finding your way around.
  • You have a strong ability to collaborate with cross-functional teams and build solid working relationships with everyone in the organization, from individual contributors to the CEO.
  • While you believe in processes and the power of planning, you understand that you will often have to roll with the punches and prioritize the most impactful tasks on the fly.

More about Dapper Labs:

At Dapper Labs we recruit the best and foster an environment that empowers our team. That means a workplace that is diverse, inclusive, and open-minded. We welcome applicants of all backgrounds, regardless of race, colour, religion, sexual orientation, gender identity, national origin, or disability.   

Because we care about our team, we work hard to provide perks that make their lives better by offering:

  • Flexible vacation & remote work policy - most team members take between 15-20 days off per year, but we have no hard limit (for our co-ops & interns, vacation is instead paid out on each pay)
  • Diverse opportunities for learning and development
  • On-site gym and fitness reimbursements
  • Dog friendly office!