Skip to main content

Senior Site Reliability Engineer

Department:

GeoPhy - Engineering

We are Walker & Dunlop. We are one of the largest providers of capital to the commercial real estate industry, enabling real estate owners and operators to bring their visions of communities — where people live, work, shop, and play — to life. We are committed to creating meaningful social, environmental, and economic change in our communities. We believe seeking diverse talent and promoting the inclusion of all perspectives are more than moral imperatives – they are critical to our success and ability to innovate and grow.

Department Overview  

WDTech Engineering builds solutions that impact not only our products but also the people and processes across our organization. A commitment to innovation and a passion for disrupting the old-fashioned real estate industry are our highest priorities.

The Impact You Will Have  

We are looking for an experienced DevOps/Site Reliability Engineer to help transform our infrastructure and operational practices. You will be responsible for the design, automation, and operation of Geophy’s Cloud Platform. You will build secure, scalable infrastructure and work alongside our development organization to enable better, faster, and more reliable deployments. You will join the stellar Geophy SRE team, where you will be able to do all of the above while having a lot of fun with colleagues from different parts of the world and have room to learn and grow as a world class Site Reliability Engineer.

Some of the deliverables we are working on:

  • Setting up and deploying AWS infrastructure based on Landing Zone recommendations (including AWS SSO) via Terraform
  • Setting up and deploying Kubernetes clusters using Helm charts through Terraform
  • Configuring applications in Kubernetes using the horizontal pod autoscaler and external metrics
  • Helping software engineers to develop and deploy CICD pipelines
  • Designing and deploying immutable infrastructure
  • Using metrics to scale infrastructure and reduce cloud spending
  • Primary Responsibilities

  • Design and build secure, scalable, and efficient infrastructure using Infrastructure as Code and CICD tools.
  • Collaborate with software and data science engineers to help them understand cloud best practices and utilize our infrastructure effectively.
  • Evaluate new and emerging technologies to determine which ones can help solve problems and then prototype them.
  • Integrate performance and cost monitoring into all our systems.
  • Look for opportunities to improve our AWS and Kubernetes infrastructure.
  • Automate all the things!
  • Mentor junior team members and help them become more productive.
  • Contribute to the SRE/DevOps team rituals, best practices, knowledge sharing, and cross-functional initiatives.
  • Be a team player and constantly coordinate with other disciplines to deliver excellent products and stellar customer service
  • Infrastructure as Code (IaC):

  • Lead the development and maintenance of Infrastructure as Code (IaC) using GitOps practices and tools such as Terraform, FluxCD, Gitlab, Atlantis, and Helm.
  • Architect and implement scalable and automated solutions for infrastructure provisioning and management.
  • Containerization and Orchestration:

  • Expertise in containerization technologies (Docker) and container orchestration platforms (Kubernetes).
  • Design and manage containerized applications for scalability, security, and efficiency.
  • Continuous Integration/Continuous Deployment (CI/CD):

  • Work with teams to design, implement, and maintain robust CI/CD pipelines for automating the deployment of applications across various environments.
  • Ensure the integration of automated testing and code quality checks in CI/CD workflows.
  • Observability:

  • Design, implement, and maintain robust observability solutions using Prometheus and Grafana stacks to ensure the health and performance of applications and infrastructure.
  • Proactively identify and resolve issues to maintain high availability.
  • Education and Experience  

  • 5 plus years’ experience in AWS (Esp. Strong IAM knowledge).
  • 3 plus years’ experience in Kubernetes.
  • 3 plus years’ experience in Terraform.
  • Knowledge, Skills, and Abilities  

  • Solid understanding of information security concepts and concerns.
  • Experience with monitoring and tracing (e.g. Prometheus, Grafana, etc.)
  • Amazing communicator who can convey the importance of their work to both peers and non-technical individuals.
  • Demonstrable experience of influencing and driving Engineering strategy.
  • Excellent communication skills in verbal and written English.
  • Ability to show ownership of your work, take on challenges and acknowledge growth 
    opportunities, and demonstrate patience when learning new processes 
  • Courtesy, respect, and thoughtfulness in teaming with colleagues and other stakeholders
  • Bonus Points for:

  • Hands-on experience with large database systems such as Amazon Redshift.
  • Experience with Information Security Certifications such as ISO27001 or SOC2.
  • Experience with networking and mesh components such as Istio, Linkerd, ingress, Load Balancers, etc.
  • Knowledge of programming languages such as Python and Go.
  • #LI-JC1

    #LI-Remote

    What We Offer

  • You will have the opportunity to accelerate our rapidly growing organization. 
  • We’re a lean team, so your impact will be felt immediately. 
  • Agile working environment with flexible working hours and location, career advancement, and competitive compensation package. 
  • We are a family friendly company. 
  • We arrange social activities to help our employees and families become familiar with each other and our culture. 
  • Diverse, unique colleagues from every corner of the world.
  • EEO Statement

    We are committed to equity in all steps of the recruitment and employment experience. We believe in equal access to opportunities in our workplace. We do not tolerate discrimination, including harassment, based on any characteristic protected by applicable law, such as race, color, national origin, religion, gender identity, sexual orientation, sex, age, disability, veteran or military status, and genetic information. We strive to be a safe place to ask questions, build professional relationships, and develop careers.

    SPAM
    Please be wary of recruitment scams.

    Senior Site Reliability Engineer

    Bedrijf:
    Walker & Dunlop
    Gemeente:
    Zwolle
    Contracttype: 
    Vast contract, Voltijds
    Categorieën: 
    DevOps Engineer
    Opleidingsniveau: 
    Master
    Carriereniveau: 
    Senior
    Gepubliceerd:
    15.02.2024
    Deel nu: