DE Jobs

Search from over 2 Million Available Jobs, No Extra Steps, No Extra Forms, Just DirectEmployers

Job Information

PamTen, Inc. DevOps SRE in United States

Role: DevOps SRE Notice Period : Immediate joiner Experience: 5+ years Requirements: ● Bachelor's or Master's degree in Computer Science, Information Technology, or a related field. Advanced degrees or relevant certifications are a plus. ● Proven experience in setting up and managing Observability tools like Prometheus, Grafana, Alert Manager, and Loki. ● Strong background in designing and implementing CI/CD pipelines using Azure DevOps. ● Proficiency in Infrastructure as Code (IAC) using Terraform for managing deployments in AWS. ● Extensive knowledge of the AWS ecosystem and its services. ● Experience in incident management, PagerDuty integration, change management, and release management processes. Responsibilities As a DevOps Site Reliability Engineer, your primary responsibilities will include: ● Observability Tools Management:Demonstrate a solid understanding of setting up and managing key Observability tools such as Prometheus, Grafana, Alert Manager from Grafana, and Loki for efficient monitoring and logging. ● CI/CD Pipeline Expertise:Showcase proficiency in designing and implementing Continuous Integration and Continuous Deployment pipelines using Azure DevOps to automate software delivery processes. ● Infrastructure as Code Implementation: ● Utilize Terraform for Infrastructure as Code (IAC) to efficiently manage deployments in AWS, ensuring scalability and reliability in infrastructure setups. ● AWS Ecosystem Proficiency:Be well-versed in the AWS ecosystem, demonstrating a comprehensive understanding of AWS services and best practices for effective infrastructure deployment and management. ● Incident Management and Monitoring:Handle incident management efficiently, ensuring prompt resolution of issues, and integrate PagerDuty into the incident response process for streamlined communication and incident handling. ● Change and Release Management:Implement robust change management and release management processes to ensure smooth transitions and updates in the system while maintaining high availability and reliability.

DirectEmployers