Key Responsibilities
- Manage and maintain large-scale Linux server infrastructure (physical, virtual, and cloud).
- Plan and execute OS upgrades, patching, and kernel updates with minimal downtime.
- Perform capacity planning, performance tuning, and system optimization.
- Design and implement automation scripts and Ansible playbooks for configuration management and deployments.
- Configure and secure services such as Apache, Nginx, MySQL/PostgreSQL, Tomcat, JBoss, WebLogic, FTP, NFS, and Samba.
- Troubleshoot complex system, network, and application issues across multiple environments.
- Set up and manage firewall rules, SELinux, auditing, and system hardening measures.
- Monitor and manage logs, system health, and alerts using tools such as Zabbix, Prometheus, Grafana, or Nagios.
- Manage centralized backup systems, disaster recovery plans, and storage (LVM, RAID, SAN/NAS).
- Participate in incident, change, and problem management processes.
- Support DevOps processes and CI/CD pipeline integrations.
Required Skills
- Expertise in Linux (RHEL/CentOS/Ubuntu) system administration and troubleshooting.
- Strong scripting experience in Bash, Python, or Perl.
- Proficiency in server hardening, auditing, and securing Linux systems.
- Hands-on experience with virtualization technologies (KVM, VMware, Proxmox).
- Basic knowledge of database administration (MySQL, MariaDB, PostgreSQL).
- Familiarity with container technologies (Docker) and basic Kubernetes.
- Solid understanding of networking fundamentals: TCP/IP, DNS, DHCP, LDAP, VPN.
- Experience with Git, Jenkins, and CI/CD pipelines.