Senior System Administrator
Experience: 10–15+ years
About the Opportunity
We are seeking a highly experienced Senior System Administrator to join our
Infrastructure / Site Reliability Engineering team.
This role is critical to ensuring the stability, security, and operational excellence of
enterprise infrastructure spanning cloud platforms, Windows and Linux systems, identity
services, networking, and security tooling.
The ideal candidate is a hands-on senior individual contributor who has deep operational
experience, can independently own infrastructure domains, lead complex maintenance
activities, and act as an escalation point during production incidents.
Role Overview
As a Senior System Administrator, you will:
● Own day-to-day reliability and lifecycle management of enterprise infrastructure
● Lead patching, maintenance, and security remediation initiatives
● Act as a senior escalation point for incidents and operational issues
● Ensure compliance with security, audit, and change management processes
● Partner closely with SRE, DevOps, Security, Database, and Application teams
● Drive operational discipline, documentation, and best practices
You will operate across cloud and hybrid environments, balancing stability with continuous
improvement.
Key Responsibilities
1. Infrastructure Maintenance & Lifecycle Management
● Plan and execute regular infrastructure maintenance and patching cycles
● Coordinate system updates, service restarts, and reboots with minimal impact
● Perform pre- and post-maintenance validation and reporting
● Support infrastructure upgrades, migrations, and decommissioning
● Participate in capacity planning and resource optimization
2. Security, Patching & Vulnerability Remediation
● Own remediation of infrastructure vulnerabilities identified by scanning tools
● Prioritize and resolve Critical, High, and Medium severity findings within SLAs
● Apply security patches to operating systems and core infrastructure components
● Partner with security teams on compliance, audits, and remediation tracking
● Implement hardening best practices across servers and services
3. Cloud & Core Infrastructure Administration
● Administer cloud-based compute, networking, storage, and IAM services
● Manage firewall rules, routing, and secure connectivity
● Configure and troubleshoot storage and file system integrations
● Build, configure, and maintain virtual machines and system services
● Support hybrid environments spanning on-premise and cloud systems
4. Identity, Access & Certificate Management
● Manage enterprise identity and access systems (directory services, SSO, MFA)
● Handle access provisioning, role management, and authentication issues
● Own certificate lifecycle management (generation, renewal, deployment)
● Validate certificate chains, keystores, and encryption standards
● Prevent certificate-related outages through proactive monitoring
5. Incident Response, Troubleshooting & Reliability
● Respond to infrastructure alerts and production incidents
● Perform deep troubleshooting across OS, network, storage, and services
● Lead root cause analysis and implement preventive measures
● Improve monitoring, alerting, and operational runbooks
● Act as a senior escalation point during major incidents
Required Skills & Qualifications
Experience
● 10–15+ years of experience in System Administration, Infrastructure Engineering, or
SRE roles
● Proven experience supporting enterprise, production-critical environments
● Strong background in operations, reliability, and incident management
Core Technical Skills
● Operating Systems
○ Deep expertise in Windows Server administration
○ Strong hands-on experience with Linux (RHEL/CentOS or equivalent)
● Cloud Infrastructure
○ Experience administering cloud compute, networking, storage, and IAM
○ Strong understanding of cloud security and operational best practices
● Security & Compliance
○ Experience with vulnerability management and remediation workflows
○ Understanding of infrastructure hardening and compliance standards
● Certificates & Identity
○ SSL/TLS certificate lifecycle management
○ Enterprise directory services, SSO, and MFA systems
● Automation & Scripting
○ Proficiency in PowerShell, Bash, and/or Python for automation
Process & Operational Discipline
● Strong experience with ITIL-based change management
● Comfortable authoring change plans, rollback procedures, and documentation
● Experience working with ticketing and workflow tools
● Familiarity with monitoring, alerting, and incident tracking systems

