Key Responsibilities:
Cloud Infrastructure Management: Design, deploy, and manage cloud infrastructure Azure, Google Cloud to support applications and services.
OS infra troubleshooting and Issue Resolution: Setting up and configuring operating systems (Linux, Windows, etc.). Diagnosing and fixing issues related to OS, network, and system performance. This includes log file analysis and root cause analysis.
Automation & CI/CD Pipelines: Develop and manage automated CI/CD pipelines for application deployment and infrastructure provisioning using tools like Azure DevOps, GitLab CI, or Github Actions.
Containerization & Orchestration: Manage containerized applications using Docker, Kubernetes, and other container orchestration tools to ensure scalability and reliability.
Monitoring & Performance Optimization: Implement monitoring, alerting, and logging solutions (e.g., Splunk, Prometheus, Grafana, ELK stack) to ensure optimal system performance and availability.
Collaboration: Work closely with Client, Developer team, and Operations teams to understand requirements and provide cloud-based solutions that meet performance, security, and compliance needs.
Security & Compliance: Knowledge of security best practices in cloud infrastructure, ensuring compliance with industry standards.
Disaster Recovery & Backup: Knowledge of disaster recovery and backup strategies to ensure business continuity in the event of outages or failures.
Troubleshooting & Support: Provide technical support for cloud-based applications and infrastructure, troubleshooting issues and implementing solutions to prevent future occurrences.
Documentation & Reporting: Document infrastructure configurations, processes, and procedures to ensure best practices and knowledge sharing across teams application.