Back to jobs

DevOps / SRE Technical Lead

Dublin, Ireland
Full Time Senior Development & Product Management

Sustainability that means business

 

Who we are:

Sustainability software specialist, AMCS, is headquartered in Ireland, with offices in Europe, the USA, and Australasia. With over 1,300 highly-skilled employees across 22 countries, we specialize in delivering technology solutions to facilitate a carbon neutral future.

 

What we do:

Our innovative SaaS solutions increase efficiency and boost sustainability in resource-intensive industries. Over 5,000 customers across 23 countries already benefit from our Performance Sustainability software, ensuring we deliver practical solutions for improved profitability and environmental resilience across the globe.


Our people

AMCS offers team members more than just a job, but an opportunity to map out a career with a company that is growing, evolving and setting out new ways of working that are having a positive impact on the world around us. AMCS was established in Ireland and holds onto those local roots and ‘start-up’ mentality with a culture of connection. Connection to our work, our customers, our colleagues and our community that creates a working environment that fosters openness, collaboration and creativity.


Job Description:

We are seeking a highly skilled and motivated DevOps/SRE Tech Lead to join our dynamic engineering team. The ideal candidate will have a deep understanding of cloud technologies, a strong technical background and a passion for driving operational excellence. As a Tech Lead, you will not only mentor and guide our DevOps engineers but also participate in architectural and key decision-making forums regarding our infrastructure and application development processes ensuring a focus is always on the reliability of our systems and centred on positive customer experience. You will collaborate with cross-functional teams to ensure the reliability, scalability, and security of our systems and infrastructure.

Key Responsibilities:

  • Cloud Technologies Expertise: Possess a deep understanding of cloud platforms (e.g., Azure, AWS, GCP) to design, implement, and manage cloud infrastructure.

  • Architectural Oversight: Participate in architectural design and decision-making processes, ensuring that design choices align with organizational goals and best practices.

  • Technical Issue Leadership: Lead the investigation and resolution of complex infrastructure issues within our cloud environment, acting as a liaison between product development and operations. Foster a culture of ownership among development teams while guiding them in identifying and addressing performance or reliability concerns. Ensure that issues are transparently surfaced and discussed, promoting continuous improvement and learning across teams.

  • Containerization Technologies: Utilize containerization technologies such as Docker and Kubernetes to facilitate application deployment and scaling.

  • Team Collaboration on Service Metrics: Work closely with development, QA, and business teams to define Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) that align with business goals and customer expectations.

  • Monitoring and Logging: Implement and manage monitoring and logging tools (e.g., Prometheus, Grafana, Mimir, Loki, Tempo & OpenTelemetry) to ensure system health and performance.

  • Automation Tools: Use automation frameworks (e.g. Ansible, Terraform) to streamline configuration management and deployment processes.

  • Mentorship and Guidance: Mentor and guide DevOps engineers and other team members, fostering a culture of learning and professional development.

  • Cross-Functional Collaboration: Collaborate effectively with development, QA, and operations teams to ensure seamless software releases and improve operational workflows.

  • Effective Communication: Communicate technical information clearly and effectively to both technical and non-technical audiences.

  • System Reliability and Security: Ensure the reliability, scalability, and security of systems and infrastructure by implementing best practices and industry standards.

  • Performance Optimization: Identify and address performance bottlenecks, providing solutions to enhance system efficiency.

  • Monitoring and Alerting Solutions: Implement robust monitoring and alerting solutions to proactively detect and address system anomalies.

Qualifications:

  • Education: Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).

  • Experience: 5+ years of experience in DevOps, Site Reliability Engineering (SRE), or related fields, with at least 2 years in a leadership or mentoring role.

  • Cloud Technologies: Deep understanding of cloud providers (Azure, AWS, GCP) and hands-on experience with cloud architecture.

  • Architectural Design: Proven experience in providing architectural oversight, with a strong ability to make informed decisions that drive system performance and scalability.

  • Containerization: Proven experience with container orchestration platforms, particularly Kubernetes.

  • Scripting: Proficiency in scripting languages such as PowerShell and Bash.

  • Monitoring and Logging: Familiarity with monitoring and logging tools like Prometheus, Grafana, and the Grafana stack.

  • Automation Tools: Experience with automation tools such as Ansible, Terraform, or Chef.

  • Soft Skills: Strong leadership qualities, excellent communication skills, and a collaborative mindset.

Preferred Qualifications:

  • Experience with CI/CD pipelines and relevant tools (Jenkins, GitLab CI, CircleCI, etc.).

  • Kubernetes certification (CKA, CKAD) and/or cloud certifications (Azure, AWS, GCP) are highly desirable.

  • Knowledge of security best practices and compliance standards in cloud environments.

  • Familiarity with Agile methodologies and project management tools.



Please complete all required fields
Please complete all required fields