Platform Engineer Role
In this role, you will be responsible for designing, implementing, deploying, and maintaining the Ezra Platforms, which encompass the containerized platforms and the microservices. You will play a crucial role in managing the infrastructure and driving DevOps practices.
Key Responsibilities
Architecture and Design
Design and implement scalable, resilient, and secure platform solutions
Develop and maintain infrastructure-as-code using tools like Terraform, CloudFormation, and Ansible
Architect cloud-native solutions leveraging containerization and microservices
Implement disaster recovery and business continuity strategies as required
Infrastructure Management
Manage and optimize our public cloud infrastructure (AWS, Alibaba, or GCP)
Manage and optimize private cloud infrastructure on partner premises
Implement best practices for cloud security, compliance, and cost optimization
Design and implement multi-region and multi-cloud strategies
Design and maintain containerized application environments using Docker
Architect, deploy, and manage Kubernetes clusters for container orchestration
Automation and DevOps
Develop automation scripts and tools to streamline operations and reduce manual tasks
Ensure standardized QA and production environments
Configure and manage load balancers (e.g., NGINX, HAProxy, cloud-native solutions)
Implement and manage service mesh technologies (e.g., Istio, Linkerd, Traefik) for microservices architectures
Performance Optimization
Analyze and optimize system performance, identifying and resolving bottlenecks
Conduct capacity planning and implement auto-scaling solutions
Optimize container resource allocation and performance
Security and Compliance
Implement and maintain security best practices across our platform
Develop strategies for container security, including image scanning and runtime protection
Ensure compliance with industry standards and regulations (e.g., SOC 2, GDPR, ISO 27001:2013)
Assist the infosec team in conducting regular security audits and vulnerability assessments
Manage TLS certificates in new market deployments
Documentation and Knowledge Sharing
Maintain comprehensive technical documentation for platform components and processes
Contribute to internal knowledge bases and conduct knowledge-sharing sessions
L2 Support and Escalation Management
Serve as the secondary escalation point for complex technical issues from the operations team
Provide expert-level troubleshooting and resolution for critical platform and infrastructure problems
Collaborate with the operations team to improve support processes and knowledge transfer
Required Qualifications
Bachelor's degree in Computer Science, Engineering, or a related field
4+ years of experience in platform engineering, DevOps, or similar roles
Strong proficiency in at least one cloud platform (AWS, Azure, or GCP)
Expert-level knowledge of containerization technologies (Docker, Kubernetes)
Extensive experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation, Pulumi)
Proficiency in scripting languages (e.g., Bash)
Strong understanding of networking concepts, load balancing, and CDNs
Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack)
Excellent problem-solving skills and ability to troubleshoot complex systems
Experience with multi-cloud architectures
Knowledge of service mesh technologies (e.g., Istio, Linkerd)
Familiarity with serverless computing platforms
Relevant certifications (e.g., AWS Certified Solutions Architect, CKAD, CKA)