Reflections Info Systems (P) Ltd

9A2, Carnival Technopark, Kariyavattom P.O, Thiruvananthapuram, Kerala, India , 695581

http://www.reflectionsglobal.com

Senior Infrastructure Engineer

Closing Date:19,June 2026

Job Published: 13,June 2026

Contact Email: Careers@reflectionsinfos.com

Brief Description

Introduction

We are looking for 8+years experienced candidates for this role.

Responsibilities include:

Lead deployment and configuration of on-prem Linux-based platforms for AI workloads.
Install, configure, and harden approved Linux OS on GPU and non-GPU servers.
Design and configure KVM virtualization
Provision and manage VMs for application, database, observability, and platform components.
Architect and implement GPU-enabled environments for production LLM inference.
Deploy and operate containerized LLM serving stacks in production.
Configure NVIDIA GPU drivers, CUDA runtime, GPU monitoring, and GPU health validation.
Design and validate GPU utilization, isolation, workload placement patterns and health monitoring.
Configure server networking, VLANs, Linux bridges, bonding, storage pools, and access controls.
Apply security hardening, RBAC, encryption, and audit-ready configurations.
Design HA and DR strategies and prepare systems for future scale-out.
Coordinate with infrastructure, network, security, and hardware vendor teams during implementation.
Lead troubleshooting, performance tuning, and stabilization.
Produce architecture documentation, runbooks, and handover materials.

Preferred Skills

Primary Skills :

8+ years of experience in Infrastructure Engineering, Linux Engineering, Platform Engineering, or Data Center Engineering roles.
Strong Linux system administration skills, including networking, storage, performance tuning, and security hardening.
Hands-on experience deploying and operating LLMs in production on on-premises environments.
Proven experience managing GPU infrastructure using NVIDIA GPUs (H100, A100, H200 or equivalent).
Hands-on experience installing, configuring, and troubleshooting CUDA drivers and GPU runtimes.
Hands-on experience with LLM serving frameworks such as vLLM, TensorRT-LLM, or Triton Inference Server.
Experience designing and supporting GPU utilization, isolation, and performance monitoring.
Strong hands-on experience with KVM virtualization in production.
Experience working with modern AI application stacks, including backend APIs, PostgreSQL, vector databases (e.g., Qdrant), and observability tools.
Strong experience using Infrastructure as Code and automation tools, including Ansible.
Hands-on experience with Kubernetes on-premises setup.
Experience designing or supporting high availability and disaster recovery strategies, including backup, restore, and failover concepts.
Strong experience working in on-prem, air-gapped, or regulated environments.
Good documentation skills for architecture diagrams, build procedures, operational runbooks, and handover documents.

Interested candidates may forward their detailed resumes to Careers@reflectionsinfos.com along with their notice period, current and expected CTC details. This is to notify jobseekers that some fraudsters are promising jobs with Reflections Info Systems for a fee. Please note that no payment is ever sought for jobs in Reflections. We contact our candidates only through our official website or LinkedIn and all employment related mails are sent through the official HR email id. Please contact careers@reflectionsinfos.com for any clarification/ alerts on this subject.