Soujith Sikinderpurkar

Tech Lead - Linux & Cloud, Systems Engineering
Karlsruhe

Summary

8 years of experience in infrastructure management, virtualization, provisioning, automation, cloud management and DevOps solutions, including StackStorm and ChatOps automation through Ansible and Puppet, Jenkins, and Kubernetes. Role-based access management using LDAP and sudo. Automation, code version control, CI/CD, test automation and containerization. Management of Kubernetes master and worker nodes and deployment of applications in Kubernetes containers. Project management and build deployment using Jenkins. Exposure to storage activities such as LUN masking, zoning, and adding datastores / LUNs to VMware ESXi hosts and HBAs for physical servers. Strong exposure to managing servers, blades and clusters (VMware ESXi and Veritas Volume Manager) running Red Hat Linux / CentOS (5, 6 and 7), Solaris 10 and 11, and VMware ESXi 6 and 6.5, on HP BL360s and DL380s, Dell PowerEdge, and Tegile storage units (T4100, HD2040-100 and T3700). Strong grasp of container concepts such as Docker and Solaris global and local zones. Hands-on experience with Veritas NetBackup backup solutions (adding nodes to backup rotations and restoring them from the master NetBackup server). Experience with object-based storage solutions and allocations using Ceph. Experience managing servers remotely and supporting them on-site in the data center.

Overview

15 years of professional experience
2005 years of post-secondary education
4 Certifications
3 Languages

Work History

Tech Lead - Linux & Cloud, Systems Engineering

Trapoc GmbH / Copart
03.2020 - Current

Linux Systems Management, Automation, Virtualization, Cloud Infrastructure, Kubernetes & Containerization, Networking & Hardware Infrastructure, Monitoring and Observability, Team Leadership & Collaboration, Disaster Recovery & Backup, Documentation & Compliance, Scaling & Optimization

  • Administration of 100+ Linux Servers: Expertise in deploying, configuring, and managing a large-scale Linux environment (e.g., Red Hat, CentOS, Ubuntu).
  • Performance Tuning: Proficient in monitoring system resources (CPU, memory, disk usage, etc.) and optimizing Linux performance for optimal resource utilization.
  • Security: Implementing hardening techniques, patch management, and security auditing to ensure the systems are secure.
  • Troubleshooting: Resolving server issues (hardware/software), log analysis, kernel panics, and troubleshooting complex system failures.
  • Infrastructure as Code (IaC): Experience with tools like Terraform, Ansible, Chef, or Puppet to automate server provisioning, configuration, and application deployment.
  • Scripting: Writing custom scripts (in Bash, Python, or Perl) for automating tasks such as log rotation, system monitoring, backups, and service restarts.
  • CI/CD Integration: Implementing automated workflows for continuous integration and continuous deployment using tools like Jenkins, GitLab CI, or CircleCI.
  • VM Management: Managing virtual machines using technologies such as VMware, KVM, or Xen.
  • Resource Allocation: Ensuring optimal allocation of resources to VMs (e.g., CPU, memory, storage) for high availability and efficient performance.
  • Virtual Network Setup: Configuring virtual networks, virtual switches, and interconnecting virtual machines securely.
  • Cloud Platforms: Hands-on experience with cloud services (AWS, Azure, Google Cloud) for provisioning and managing compute, storage, and networking resources.
  • Cost Optimization: Ability to monitor and manage cloud costs, optimize resource usage, and implement auto-scaling.
  • Cloud Security: Implementing cloud security best practices such as IAM roles, VPCs, and encryption for both data at rest and in transit.
  • Hybrid/Multicloud Architecture: Designing systems that operate seamlessly across multiple cloud platforms and on-premise infrastructures.
  • Kubernetes Management: Deep knowledge in deploying, managing, and scaling containerized applications using Kubernetes in production environments.
  • Container Orchestration: Expertise in managing containers via Docker, Kubernetes, and Helm, including creating and managing Pods, Deployments, and StatefulSets.
  • Service Mesh: Experience with service meshes (e.g., Istio, Linkerd) for managing microservices communication in Kubernetes environments.
  • CI/CD with Kubernetes: Integration of CI/CD pipelines with Kubernetes clusters, enabling automated deployments and rollbacks.
  • Networking: Understanding of virtualized network topologies, DNS, DHCP, VPNs, and firewalls within cloud and on-prem environments.
  • Hardware Maintenance: Managing and troubleshooting physical hardware servers, including RAID configuration, disk management, and hardware lifecycle management.
  • Redundancy & High Availability: Designing and implementing highly available network and server architectures, load balancers, and backup strategies to ensure continuous uptime.
  • System Monitoring: Implementing centralized monitoring using tools such as Prometheus, Grafana, Nagios, or Zabbix to track system health, performance, and resource usage.
  • Log Management: Using tools like ELK stack (Elasticsearch, Logstash, Kibana) or Splunk for centralized logging and monitoring.
  • Alerting: Configuring proactive alerting systems to ensure timely identification and remediation of issues.
  • Leadership: Leading and mentoring a team of systems engineers to improve processes and support systems at scale.
  • Cross-team Collaboration: Working closely with developers, security teams, and other stakeholders to define and meet technical requirements.
  • Best Practices: Establishing and enforcing best practices for configuration management, cloud usage, and security compliance.
  • Backup Solutions: Implementing automated backup systems for critical data and configurations (e.g., using Veeam, Bacula).
  • Disaster Recovery: Creating and testing disaster recovery plans (RTO/RPO) for both on-premise and cloud-based infrastructures to ensure business continuity.
  • Documentation: Ensuring that all systems, processes, and architectures are well-documented, including infrastructure diagrams and runbooks.
  • Compliance: Managing systems and data in compliance with industry standards and regulations such as GDPR.
  • Load Balancing: Utilizing software or hardware load balancers (e.g., HAProxy, NGINX) for scaling applications horizontally.
  • Auto-scaling: Configuring cloud auto-scaling groups and Kubernetes Horizontal Pod Autoscaler to ensure optimal resource allocation as demand changes.
  • Capacity Planning: Analyzing system growth, forecasting future infrastructure needs, and scaling the architecture accordingly.


Day-to-day tasks:

  • Provisioning, patching, automating jobs, managing Kubernetes master and worker nodes, and designing and managing Linux / software-based storage solutions, in addition to handling 3000+ machines
  • Provisioning VMs through Ansible and Salt modules, maintaining the infrastructure, and automating tedious jobs using Ansible
  • Upgrading VMware ESXi blades and managing HP C700, BL460 and DL380 machines
  • Automating daily tasks using Ansible, shell scripting, Python and Puppet; creating builds using Jenkins and deploying applications
  • Providing Unix support to the group, supporting 3000+ servers that host the application and database of the production site and other applications
  • Ensuring high availability of the entire environment
  • Hardware support with vendor coordination
  • Kernel patching, OS upgrades, and patching of Unix and Windows servers
  • Building ESXi servers, upgrading ESXi on schedule, assigning LUNs to ESXi hosts and creating datastores

Senior Systems Engineer

Copart India Technology Limited
11.2017 - 02.2020

1. Linux Systems Administration and Management
2. Infrastructure Automation and Orchestration
3. Virtualization & Cloud Technologies
4. Kubernetes and Containerization
5. High Availability and Disaster Recovery
6. Network Infrastructure & Hardware Management
7. Monitoring, Logging, and Observability
8. Leadership, Collaboration, and Continuous Improvement
9. Documentation and Compliance

Responsible for overseeing the architecture, deployment, management, and optimization of large-scale IT infrastructures, ensuring seamless operations and high availability across cloud environments and on-premise data centers. With extensive experience in managing 100+ servers, automation, virtualization, hardware management, and cloud services, I play a crucial role in enabling operational excellence, driving innovation, and ensuring robust performance in critical systems.

  • Large-Scale Linux Environment Management: Oversee the administration of more than 100 Linux servers (RHEL, CentOS, Ubuntu) in production, ensuring optimal performance, security, and uptime.
  • System Performance Tuning: Expertise in monitoring and fine-tuning system resources such as CPU, memory, and storage to ensure that all systems operate at peak efficiency.
  • Security Hardening and Compliance: Implement industry-standard security practices to secure servers, including firewalls, SELinux, and regular security patching. Knowledge of compliance frameworks like PCI-DSS, HIPAA, and SOC 2.
  • Troubleshooting and Root Cause Analysis: Skilled in diagnosing hardware and software issues, resolving complex problems, and performing in-depth log analysis to prevent recurring incidents.
  • Automation of System Provisioning: Leverage tools such as Terraform, Ansible, Chef, and Puppet to automate infrastructure provisioning, configuration management, and software deployment, ensuring consistency and reducing manual overhead.
  • Custom Scripting for Operational Efficiency: Develop custom Bash, Python, or Shell scripts for routine tasks like system monitoring, backups, user management, and service orchestration.
  • CI/CD Pipeline Integration: Drive continuous integration and deployment using Jenkins, GitLab CI, and other CI/CD tools, ensuring smooth, automated rollouts of system updates, patches, and application releases.
  • VMware & KVM Management: Expertise in managing virtualized environments, including creating, provisioning, and maintaining virtual machines (VMs) using VMware and KVM technologies, ensuring resource optimization and scalability.
  • Cloud Infrastructure Expertise: Extensive experience working with leading cloud providers, including AWS, Azure, and Google Cloud, to design and manage scalable, highly available cloud infrastructure.
  • Hybrid Cloud Integration: Architect hybrid cloud environments that seamlessly integrate on-premise infrastructure with public cloud resources for enhanced flexibility and scalability.
  • Cost Optimization in the Cloud: Continuously monitor and optimize cloud costs, utilizing features such as auto-scaling, spot instances, and rightsizing to ensure efficient resource utilization.
  • Container Orchestration with Kubernetes: Lead the deployment and management of containerized applications using Kubernetes, facilitating seamless scaling, management, and monitoring of microservices in production environments.
  • CI/CD for Containers: Implement and manage end-to-end CI/CD pipelines for containerized applications, ensuring smooth deployments and automated rollbacks in Kubernetes clusters.
  • Container Security and Compliance: Enforce security best practices for containers, including image scanning, least-privilege access controls, and network segmentation within Kubernetes clusters.
  • Service Mesh Management: Configure and manage service meshes such as Istio or Linkerd to improve microservices communication and observability within Kubernetes environments.
  • Infrastructure Redundancy: Design and implement high availability (HA) systems that ensure redundancy across servers, databases, and services, minimizing downtime and improving resilience.
  • Disaster Recovery Planning: Develop and test disaster recovery strategies (RTO/RPO) across both on-premise and cloud infrastructures, ensuring the business can quickly recover from critical failures or outages.
  • Backup Strategies: Implement reliable backup solutions using technologies such as Veeam, Bacula, and cloud-based backup solutions to ensure data integrity and availability.
  • Network Configuration and Management: Design and maintain robust network topologies, configure firewalls, VPNs, DNS, and ensure secure and efficient communication between on-premise and cloud infrastructures.
  • Hardware Lifecycle Management: Oversee the lifecycle of physical hardware, including provisioning, maintenance, troubleshooting, and eventual decommissioning, ensuring systems operate at optimal capacity.
  • Storage Management: Administer storage solutions, both virtual and physical, including SAN/NAS and cloud storage, ensuring sufficient capacity and optimal performance.
  • Comprehensive System Monitoring: Utilize tools such as Prometheus, Grafana, and Nagios to monitor the health, performance, and resource usage of Linux servers, virtualized environments, and cloud infrastructure.
  • Centralized Logging: Implement centralized logging solutions such as ELK Stack or Splunk to aggregate logs from multiple sources, enabling rapid troubleshooting and system auditing.
  • Alerting & Incident Response: Configure proactive alerting systems to detect and respond to potential failures, performance degradation, or security incidents, ensuring quick remediation to minimize impact.
  • Team Leadership: Lead a team of skilled engineers, providing mentorship, training, and technical direction while ensuring high standards of quality and efficiency across all systems and processes.
  • Cross-Functional Collaboration: Work closely with development, security, and product teams to align infrastructure with business needs, deliver scalable solutions, and ensure high standards of system reliability.
  • Process Optimization: Continuously identify and implement improvements in workflows, tools, and best practices to increase efficiency, reduce costs, and enhance system reliability.
  • Comprehensive Documentation: Maintain clear and thorough documentation for system architectures, operational procedures, and disaster recovery plans to ensure continuity and knowledge sharing.
  • Regulatory Compliance: Ensure systems adhere to industry-specific regulatory standards (e.g., GDPR, SOC 2, HIPAA), conducting regular audits and implementing necessary controls for compliance.
  • Strong experience with Linux administration, including system hardening, performance tuning, and security.
  • Expertise in cloud infrastructure, virtualization, and container orchestration using Kubernetes.
  • Proven leadership abilities in managing teams and coordinating with cross-functional departments.
  • Deep knowledge of automation and orchestration tools such as Terraform, Ansible, and Jenkins.
  • Ability to design and maintain scalable, highly available, and cost-effective infrastructures across both cloud and on-prem environments.

Senior Cloud System Engineer

DXC Technologies India Solutions (formerly known as CSC)
03.2016 - 11.2017
  • Managing a team of senior cloud techs and providing solutions for critical upgrade projects and planned work
  • Participating in disaster recovery, migration and VMware ESXi upgrade projects
  • Implementing DevOps, Docker, container and OpenStack projects, and automation through Python programming
  • Designing, installing, administering and optimizing company servers and related components to achieve high results
  • Service delivery on Remote Infrastructure Management (RIM) and IT consolidation projects
  • Ongoing management of the server environment
  • Where applicable, assisting in overseeing the physical security, integrity, and safety of the server farm

System Administrator

Wells Fargo India Solutions
08.2009 - 06.2015
  • Installing, building and maintaining physical, ESXi and virtual machines
  • Resolving end-user issues related to hardware, software and operating systems
  • Responsible for troubleshooting LAN issues and backups
  • Providing Unix support to the Internet Services Group, supporting 1500+ servers that host the applications and databases of the wellsfargo.com site and other Wells Fargo applications
  • Ensuring high availability of the entire environment
  • Hardware support with vendor coordination
  • Kernel patching, OS upgrades, and patching of Unix and Windows servers
  • Coordinating and collaborating with the network engineering, business application, and database administration functions to ensure availability, reliability, and scalability of corporate servers to meet business demands
  • Performing file system configuration and management; defining and performing server backup and recovery procedures

Education

Bachelor Of Science - Computer Science

Pragathi Degree College
Hyderabad, India

Skills

  • Linux
  • Git
  • VMware
  • Python
  • Configuration Management using Ansible/Python/SaltStack
  • DevOps
  • Nginx
  • AWS
  • GCP
  • Web Development / Full stack
  • Django / WebSockets
  • PostgreSQL
  • Jenkins

Certification

AWS SysOps Administrator, Amazon Web Services, 2017

Personal Information

  • Date of Birth: 10/03/84
  • Gender: Male
  • Marital Status: Married

Awards

  • Spot Award, Copart: Automating file-system-level expansion through StackStorm / ChatOps, and automatic addition of backup clients to the master backup server through StackStorm.
  • Cloud Certification of Appreciation, DXC Technologies India Ltd: Automating VM builds and managing software-based storage solutions and resources.
  • Gold Coin Award, Wells Fargo India Solutions: Automating the setup of centralized home directories for end users logging in to Unix machines.

Accomplishments

The Infraportal project exemplifies the power of Kubernetes as the backbone for modern, cloud-native applications. By integrating NGINX for traffic management, Vault for secrets management, WebSockets for real-time updates, and Grafana/Prometheus for observability, Infraportal delivers a secure, scalable, and highly available platform for infrastructure management and monitoring.


Infrastructure portal deployment on Kubernetes. Infraportal is a one-stop shop for managing our applications: a centralized platform designed to manage, monitor, and secure infrastructure services while providing real-time updates and actionable insights. Built entirely on Kubernetes, it leverages modern technologies like NGINX, Vault, WebSockets, Grafana, and Prometheus to deliver a scalable, secure, and observable system.

Key Components and Kubernetes Integration

1. Kubernetes

Kubernetes serves as the orchestration layer for Infraportal, providing scalability, reliability, and automation for deploying and managing services. The deployment utilizes:

  • Namespaces: Isolates services such as nginx, vault, websockets, grafana, and prometheus for better organization and security.
  • Deployments: Ensures high availability and fault tolerance for all microservices.
  • ConfigMaps and Secrets: Manages configurations and sensitive data dynamically, reducing hardcoded dependencies.
  • Persistent Volumes (PVs): Used for storing Vault data, Prometheus metrics, and Grafana dashboards persistently.
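
As an illustration of the namespace, Deployment and ConfigMap pattern listed above, the following is a minimal sketch rather than the actual Infraportal manifests. The image reference, replica count and ConfigMap name are hypothetical placeholders; only the manifest structure follows the standard apps/v1 Deployment schema.

```python
import json


def build_deployment(name, namespace, image, replicas=2, config_map=None):
    """Return a Kubernetes apps/v1 Deployment manifest as a plain dict."""
    container = {"name": name, "image": image}
    if config_map:
        # Expose every key of the ConfigMap as an environment variable,
        # so configuration is not hardcoded in the image.
        container["envFrom"] = [{"configMapRef": {"name": config_map}}]
    labels = {"app": name}
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": name, "namespace": namespace, "labels": labels},
        "spec": {
            "replicas": replicas,
            # The selector must match the pod template labels.
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {"containers": [container]},
            },
        },
    }


# Hypothetical values: the image, replica count and ConfigMap name are assumptions.
manifest = build_deployment(
    "websockets",
    "websockets",
    "registry.example.com/infraportal/websockets:1.0",
    replicas=3,
    config_map="websockets-config",
)
print(json.dumps(manifest, indent=2))
```

The printed JSON could be saved to a file and applied with `kubectl apply -f`, since kubectl accepts JSON as well as YAML manifests.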
2. NGINX

NGINX is deployed as an Ingress Controller on Kubernetes, managing all external traffic and routing it to internal services.

  • Ingress Resources: Configured for routing requests to the WebSocket backend, Vault APIs, Grafana, and Prometheus endpoints.
  • SSL Termination: Integrated with Vault via Kubernetes Secrets for managing TLS certificates dynamically.
  • Horizontal Pod Autoscaling (HPA): Ensures NGINX scales based on traffic load.
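
The routing behaviour of Ingress resources can be sketched as follows. This is a simplified model of the Kubernetes `Prefix` path type (element-by-element matching, longest matching path wins), not the NGINX Ingress Controller's implementation. The paths and the `default-backend` service name are hypothetical; the other service names are taken from the list above.

```python
def path_elements(path):
    """Split a URL path into its non-empty elements."""
    return [part for part in path.split("/") if part]


def matches_prefix(rule_path, request_path):
    """Prefix match on whole path elements: /ws matches /ws/events but not /wsx."""
    rule = path_elements(rule_path)
    return path_elements(request_path)[: len(rule)] == rule


def route(rules, request_path):
    """Return the backend of the longest matching rule, or None if nothing matches."""
    candidates = [
        (len(path_elements(path)), backend)
        for path, backend in rules.items()
        if matches_prefix(path, request_path)
    ]
    if not candidates:
        return None
    return max(candidates)[1]


# Hypothetical routing table; the real Infraportal paths are not given in this document.
rules = {
    "/": "default-backend",
    "/ws": "websockets",
    "/vault": "vault",
    "/grafana": "grafana",
    "/prometheus": "prometheus",
}

print(route(rules, "/ws/events"))
print(route(rules, "/wsx"))
```

The second lookup falls through to the catch-all rule because `/wsx` is a different path element from `/ws`.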
3. Vault

Vault is deployed as a stateful application on Kubernetes, with high availability enabled via the Raft storage backend.

  • Dynamic Secrets: Vault dynamically generates secrets for Kubernetes services, avoiding the need for static tokens.
  • TLS Certificates: Vault’s PKI backend automates the issuance and renewal of certificates used by NGINX and other services.
  • Access Control: Vault integrates with Kubernetes Service Accounts for role-based access.
4. WebSockets

WebSocket services are containerized and deployed as a Kubernetes Deployment, ensuring seamless real-time communication.

  • Persistent Connections: Maintains low-latency connections between clients and the server for real-time updates.
  • Load Balancing: Handled by NGINX Ingress, ensuring efficient distribution of WebSocket connections.
  • Scaling: Kubernetes HPA dynamically adjusts the number of WebSocket pods based on CPU and memory usage.
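
The scaling decision made by the Horizontal Pod Autoscaler follows the formula documented by Kubernetes: desired replicas = ceil(current replicas × current metric / target metric), skipped when the ratio is within a tolerance of 1.0. The sketch below reproduces that arithmetic only. The replica bounds and metric values are illustrative assumptions, not the Infraportal settings, and the real controller applies further rules (stabilization windows, unready pods) that are omitted here.

```python
import math


def desired_replicas(current_replicas, current_value, target_value,
                     min_replicas=1, max_replicas=10, tolerance=0.1):
    """Simplified HPA calculation for a single metric."""
    ratio = current_value / target_value
    # No scaling when the metric is close enough to its target.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp to the configured bounds.
    return max(min_replicas, min(max_replicas, desired))


# Illustrative numbers: 4 pods at 90% average CPU against a 60% target.
print(desired_replicas(4, 90, 60))
```

With these numbers the ratio is 1.5, so the sketch asks for 6 pods. At 62% against the same target the ratio stays inside the tolerance and the count remains 4.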
5. Grafana

Grafana is deployed on Kubernetes as a stateful application, with dashboards configured to visualize infrastructure metrics.

  • Data Sources: Pulls metrics directly from Prometheus deployed within the same cluster.
  • Persistent Volumes: Stores custom dashboards and configurations.
  • Role-Based Access Control (RBAC): Ensures only authorized users can view or edit Grafana dashboards.
6. Prometheus

Prometheus operates as a central monitoring system within the Kubernetes cluster.

  • Scraping Targets: Configured to scrape metrics from all services, including NGINX, WebSocket pods, and the Vault cluster.
  • Alerting Rules: Defined to notify Grafana for anomalies and threshold breaches.
  • Persistent Storage: Uses Kubernetes PVs to store historical data.
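
For Prometheus to scrape a service, the service exposes its metrics over HTTP in the Prometheus text exposition format. The sketch below renders a gauge in that format using only the standard library. The metric name, help text, pod labels and values are hypothetical, and a production service would more likely use an official Prometheus client library than hand-rolled formatting.

```python
def escape_label(value):
    """Escape backslash, double quote and newline as the text format requires."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def render_gauge(name, help_text, samples):
    """Render one gauge metric; samples is a list of (labels dict, value) pairs."""
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} gauge"]
    for labels, value in samples:
        label_str = ",".join(
            f'{key}="{escape_label(val)}"' for key, val in sorted(labels.items())
        )
        if label_str:
            lines.append(f"{name}{{{label_str}}} {value}")
        else:
            lines.append(f"{name} {value}")
    return "\n".join(lines) + "\n"


# Hypothetical metric for the WebSocket pods.
body = render_gauge(
    "infraportal_ws_connections",
    "Open WebSocket connections",
    [({"pod": "ws-0"}, 12), ({"pod": "ws-1"}, 7)],
)
print(body, end="")
```

Serving this text from a `/metrics` endpoint is the usual convention, after which the pod can be added as a scrape target.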
Architecture Highlights
  • Kubernetes-Native: The entire system is containerized and orchestrated on Kubernetes, enabling seamless scaling and portability.
  • Secure by Design: Vault manages secrets and certificates, ensuring all services communicate securely.
  • Observability: Prometheus and Grafana provide real-time insights into system performance and user interactions.
  • Real-Time Performance: WebSockets deliver instantaneous updates to the frontend, critical for infrastructure monitoring.
  • Automation: Kubernetes automates scaling, failover, and self-healing for all microservices.
Deployment Insights
  • NGINX and Vault: TLS certificates for NGINX are dynamically fetched from Vault and stored in Kubernetes Secrets.
  • Prometheus Operator: Simplifies the deployment and management of Prometheus and Alertmanager within the cluster.
  • Grafana Helm Chart: Used for deploying Grafana with predefined dashboards and data sources.
  • Horizontal Pod Autoscalers: Configured for NGINX, WebSocket services, and other high-traffic components.
  • Kubernetes RBAC: Ensures granular access control across namespaces and services.

