Senior Cloud Infrastructure Engineer
Praha, CZ
Alpiq IT
General context:
The Cloud Infrastructure & Site Reliability team at Alpiq Group IT provides various internal services, including 24x7 support of business-critical systems, database services (e.g., fine-tuning, SQL query optimization), the operation and management of applications, and the integration of new cloud applications/services. The team is also responsible for the operation and continuous improvement of Amazon Web Services (AWS) environments provided by the Alpiq AWS Framework.
We are looking for a Senior Cloud Infrastructure Engineer based in Prague who is eager to take responsibility, learn, and continuously improve services.
About the position:
As a Senior Cloud Infrastructure Engineer, you will be part of the infrastructure team that maintains and enhances an AWS multi-account organization framework with over 140 accounts and projects. You will work closely with architects to enforce security requirements and cost best practices, and to automate operational tasks that keep the evolving framework maintainable. Additionally, you will be responsible for managing networking components such as AWS VPCs, Transit Gateway (TGW), and VPN configurations as part of the overall infrastructure.
Your responsibilities:
- Support the ramp-up of AWS usage by implementing automation that reduces manual operational effort
- Conduct improvements in operational practices, tools, and processes
- Perform day-to-day operational tasks (monitoring performance, availability, and utilization; planning and executing changes)
- Handle incidents (analyze issues, provide workarounds) in cooperation with service providers
- Participate in 24x7 on-call duty on a rotating basis
- Drive root cause analysis (RCA) and problem management
- Manage and optimize application services
- Deploy and manage AWS networking infrastructure, including VPCs, Transit Gateway (TGW), VPNs, and associated security configurations
Your qualifications:
- Advanced knowledge of AWS services and best practices, including AWS Organizations, AWS Lambda, AWS VPC, Transit Gateway (TGW), VPN, and others (Solutions Architect certification preferred)
- Advanced Terraform coding (critical)
- Experience with migrating applications from on-premises to cloud (important)
- Scripting skills in at least one or two of the following languages: Python, Bash, PowerShell (very important)
- Familiarity with Git and CI/CD best practices
- Experience with networking (DNS, certificates, routing, and VPNs) is preferred
- SQL and basic database knowledge is a plus
- Experience with Datadog or similar log monitoring tools is a plus
- Web app experience (front-end + back-end) is a plus
- Familiarity with serverless architecture is a plus