Home

Nidhin - Sr Linux /Devops/Cloud Infrastructure Specialist
surya.k@logicplanet.com
Location: Palo Alto, California, USA
Relocation:
Visa: H1B
Resume file: Nidhin resume_1743600382334.docx
Please check the file(s) for viruses. Files are checked manually and then made available for download.
Sr Linux Engineer | DevOps | Cloud Infrastructure Specialist

PROFESSIONAL SUMMARY

Senior Linux Administrator| DevOps Engineer with over 13 years of IT experience, including 10 years specializing in AWS Cloud, Bare Metal Virtualization and CI/CD pipeline development. Proven expertise in cloud automation, containerization (Docker, Kubernetes), and multi-cloud operations. Skilled at Linux server optimization, scalable CI/CD implementation, and supporting ML/AI workflows using Kubeflow, MLflow and AWS SageMaker.

Installing, configuring, and maintaining Linux operating systems, along with managing physical and virtual server infrastructures.
Proficient in managing multi-cloud infrastructure using Terraform, CloudFormation and puppet, Ansible ensuring consistency and repeatability in deployments.
Containerized Deployments: Automated the deployment of containerized applications using Docker and orchestrated services with Kubernetes ensuring scalability fault tolerance and minimal downtime.
Adept at optimizing Linux server performance through kernel tuning resource management and advanced monitoring tools like Nagios, Prometheus and Grafana.
Implemented role-based access control (RBAC) and sudo policies for least privilege access.
Diagnosed and resolved boot failures, kernel panics, and system crashes using logs and debug tools.
Proficient in deploying and managing virtualized environments using VMware and Proxmox virtualization platforms.
Performed routine system maintenance, including updates, patches, and security enhancements.
Conducted root cause analysis and documented troubleshooting procedures for system issues.
Hands-on experience with Bash scripting and puppet, Ansible Playbook yaml for server automation, package management and security patching.
Collaborate with cross-functional teams to ensure cloud solutions align with business goals
Deploy, configure, and maintain Pacemaker & Corosync clusters, implementing failover mechanisms, STONITH fencing, and resource management for critical workloads.
Expertise in maintaining secure server-client communication using OpenSSL and Certbot, WinAcme SSL/TLS certificates for certificate management and Key Management Service (KMS).
Implemented a fully automated CI/CD pipeline using Jenkins to build Dockerized applications, perform static code analysis with SonarQube, and publish artifacts to Nexus Repository. Leveraged Kubernetes (EKS) for orchestrating containerized workloads.
Skilled in deploying Java, PHP and Node.js applications with a focus on performance optimization and security enhancements across web servers such as HTTPD, Apache, Tomcat and Nginx in Centos, Rocky Linux and Ubuntu.
Managed a large-scale Red Hat Enterprise Linux (RHEL) infrastructure spanning on-premise and cloud environments.
Architect, design, deploy, and manage Linux-based infrastructure environments, ensuring high availability, security, and performance.
Deployed and maintained services utilizing HTTP/HTTPS, DNS, LDAP, and FTP/SFTP. Configured DNS name resolution, reverse proxy setups.
Trained and mentored L1, L2, and L3 support teams enhancing their advanced troubleshooting capabilities.
Troubleshoot and resolve issues related to Linux servers and Puppet automation.


CERTIFICATIONS
RHCE: RedHat Certified Engineer
MCSA: Microsoft Certified Solution Associate
MS-700: Microsoft Certified Teams Administrator Associate
Ribbon SBC Edge Technical: SBCE11
AWS Certified Solution Architect
AWS Certified Machine Learning Engineer-Associate

TECHNICAL SKILLS

Cloud Platforms AWS (EC2, VPC, RDS, IAM, CloudFormation, S3, ELB, Auto Scaling, CloudFront, Route 53, CloudWatch, AWS SageMaker, EBS), Azure, OVH (Dedicated CloudHosting), VMware, GCP
DevOps Tools | CI/CD Jenkins, Ansible, Terraform, GitHub, Azure Pipelines, AWS Code Pipeline, Docker,
Kubernetes, SonarQube, Nexus, Python, Bash Shell Scripting, Git, GitHub, GitLab,
Bitbucket, Azure Repos, Openshift, Jira, Power shell, puppet
Monitoring and Security Nagios, Prometheus, Grafana, Splunk, OSSEC, Lynis, Wireshark, Solar-Winds, Nessus, ELK, chkrootkit, Rkhunter, SELinux, Pacemaker & Corosync, Dynatrace
Databases MySQL, PostgreSQL, RDS, Cassandra, DynamoDB, Redis, Redshift
Operating Systems Linux (RHEL, Ubuntu, Suse, Debian, CentOS), Windows Server 2008R2,2012,2016
Network Protocols SSH, DNS, DHCP, HTTP/HTTPS, FTP, LDAP, Samba, NFS

PROFESSIONAL EXPEREINCE

WPP, San Francisco, CA August 2024 Present
Role: Sr. Linux Engineer | DevOps
Description: My role was pivotal in automating continuous integration/continuous Deployment pipelines, managing hybrid cloud infrastructures, optimizing containerized microservices, and implementing advanced monitoring and security solutions. By driving infrastructure automation and enabling seamless deployments collaborating with Data Science team, I have reduced deployment times by 40%, minimized downtime to 99.99% availability, and improved operational efficiency delivering measurable cost savings of $3 million annually.

Responsibilities
Using essential Linux tools, including monitoring (htop, sar), networking (tcpdump, nmap), security (iptables, fail2ban), automation (Ansible, Terraform), virtualization (KVM, Docker, Kubernetes), and package management (YUM, DNF, APT) to ensure optimal system performance, security, and efficiency.
Engineered fully automated CI/CD pipelines using Jenkins, SonarQube and Nexus, reducing deployment times by 40%.
Managed Red Hat Enterprise Linux (RHEL) environments, including installation, upgrades, and performance optimization.
Implemented a CI/CD pipeline using AWS CodePipeline, CodeBuild, and CodeDeploy, automating the software delivery lifecycle for containerized applications running on Amazon EKS.
Architected a highly available and auto-scaling web application by leveraging Amazon EC2 Auto Scaling, Elastic Load Balancer, and RDS Multi-AZ, ensuring optimal performance and fault tolerance.
Manage LVM, RAID, NFS, iSCSI, integrating Pacemaker for storage resource failover, ensuring high availability of file systems and automated backup solutions for DR planning.
Automated VM provisioning and configuration in Proxmox using Ansible, ensuring consistent environments for data scientists and developers.
Configured LVM, RAID, and filesystem management in Red Hat Linux to optimize storage efficiency.
Resolves system problems, ensures quality through research, manages firmware updates, provides customer service, reviews code written, and supports project management for efficient IT operations
Integrated automated testing and deployment workflows to ensure seamless application delivery and high-quality releases.
Automate Pacemaker cluster deployments using Terraform, Ansible, and Bash/Python scripting, optimizing cluster configuration, failover policies, and reducing manual intervention.
Deployed and orchestrated containerized applications using Docker and Kubernetes, implementing dynamic scaling, rolling updates, and auto-healing capabilities to ensure high availability and reliability of services.
Manage and optimize virtualized environments using Proxmox, VMware vSphere, and KVM/QEMU, ensuring resource allocation aligns with organizational needs.
Diagnoses and resolves system issues by identifying and fixing bugs, leveraging monitoring tools and log analysis to ensure optimal system performance and stability.
Designed, implemented, and manage automation workflows using Puppet and develop, maintain, and implement Puppet manifests for infrastructure automation.

Environment/Tools: Git, GitHub, Jenkins, Maven, SonarQube, Nexus, Kubernetes, EKS, AWS Fargate, AWS ECS, Prometheus, Grafana, ELK Stack, Terraform, Solar winds, Wireshark, Nagios

COMCAST, Philadelphia (Remote) Aug 2022 June 2024
Role: Linux Engineer | DevOps
Responsibilities
Designs, installs, configures, tests, maintains, upgrades, administers, monitors, and troubleshoots Linux operating systems and related software to ensure optimal performance and reliability
Integrated GitOps with monitoring tools like Prometheus and Grafana for real-time deployment tracking.
Configured IAM roles, accounts and policies to ensure secure and controlled access across AWS resource.
Managed Docker images in Nexus Artifactory for efficient container lifecycle management and deployment workflows.
Ensures system security by conducting regular security audits, managing user accounts, configuring file permissions, and enforcing access controls to protect data integrity and system resources
Implemented AWS solutions using EC2, S3, RDS, IAM, LAMBDA, Elastic Load Balancers (ELB), Auto Scaling Groups and Route 53 for DNS management, ensuring scalable, secure and highly available Infrastructure.
Built custom Ansible playbooks that automatically create YAML objects in Git repositories, allowing GitOps pipelines to pull configurations and apply them to OpenShift clusters.
Developed Ansible playbooks to automate the deployment and configuration of multi-tier applications across 100+ servers. Set up OCI Monitoring and Alarms for proactive incident management and resource optimization
Configured automated database snapshots using AWS Snapshots and Percona XtraBackup, ensuring consistent backups and migration without affecting performance.
Configured and administered Red Hat Satellite Server to automate RHEL patching, repository management, and subscription allocation across Linux servers.
Designed Terraform templates to provision AWS infrastructure-as-code including networking, compute, storage, security, and monitoring and
Developed and maintained end-to-end CI/CD pipelines using Jenkins, automating code integration, testing, and deployment processes.
Environment/Tools: Linux, AWS, Docker, Kubernetes, Jenkins, Nexus Artifactory, Nagios, ELK stack, Ansible, GitLab, Route53, IAM, CloudWatch, S3, EC2, Security Groups, TCP/IP, DNS, Shell/Bash Scripting

HTC Global, India Aug 2020 Aug 2022
Role: Sr Linux Administrator | Ansible
Responsibilities
Engineered comprehensive monitoring and observability solutions utilizing Prometheus, Grafana, and Datadog; established alerts that led to a 40% decrease in incident response times and improved system reliability metrics by 25%.
Integrated monitoring tools with incident management systems such as PagerDuty, reducing mean time to resolution (MTTR) by streamlining alert and escalation workflows.
Developed and maintained Ansible playbook yaml to automate server provisioning and configuration processes, ensuring consistent and efficient deployments across environments. Integrated Rundeck for task orchestration and self-service automation.
Actively participated in post-incident reviews and root cause analysis (RCA), documenting findings and implementing preventive measures to improve system reliability.
Automated incident response and ticket resolution processes using custom scripts, improving operational efficiency and reducing manual intervention.
Migrated legacy systems to Red Hat OpenShift Container Platform for containerized application deployment.
Orchestrated end-to-end data pipelines, integrating Kafka with Splunk streaming, to facilitate seamless data flow from producers to consumers.
Set up monitoring dashboards using tools like Grafana and Kafka Manager, providing real-time insights into stream health, throughput, and latency.
Environment/Tools: Prometheus, Grafana, Datadog, PagerDuty, New Relic, AppDynamics, Dynatrace, ELK Stack, Splunk, JMeter, Gatling, Locust, Apache Bench, CloudWatch, Ansible, RCA documentation.
GBS PLUS PVT LTD, India Apr 2018 Jul 2020
Role: Linux Administrator
Responsibilities
Installed, upgraded, and managed packages on Red Hat Linux and Debian servers using YUM, RPM and APT tools.
Linux server hardening strategies by configuring Fail2ban to prevent brute-force attacks, enforcing access controls with SELinux, AppArmor and securing network traffic using Firewalled and iptables.
Installed, configured, and maintained application servers like WebSphere and WebLogic, as well as web servers including Apache, HTTPD, and Tomcat on Linux and UNIX platforms.
Designed an event-driven automation system using Red Hat Ansible Automation Platform (AAP) to detect and remediate security incidents.
Led the implementation of Red Hat Ansible Automation Platform (AAP) to standardize IT operations and improve deployment efficiency.
Automated OS deployments using puppet, Ansible playbooks, ensuring consistency in server builds, patching, and security hardening.
Conduct vulnerability assessments using Nessus, Qualys and Lynis and mitigate risks by applying patches, upgrades.
Manage storage solutions using LVM, HDFS, Ceph, GlusterFS, and distributed storage for scalable data management.
Created and maintained VM images for VMware vSphere, RedHat OpenShift, VMware Workstation.
Perform advanced kernel tuning, resource allocation, and performance optimization for large-scale deployments.
Automated backups using Crontab based on client-specific requirements
Set up password-less authentication and agent forwarding for secure server access using ssh-keygen.
Expertise in Linux system security hardening, vulnerability assessments, and ensuring compliance with industry standards like PCI DSS and GDPR.
Environment/Tools: Windows Server 2012, CentOS 8.x, Red Hat Enterprise Linux 9, YUM, RPM, Bash, Shell, HTML, Firewall, Apache, Tomcat, LDAP, NFS, Samba, SSH, DHCP, DNS, Kickstart, TCP/IP, WebSphere, WebLogic, Nagios

Techvantage Systems, India Nov 2012 Mar 2018
Role: Sr. System Administrator
Responsibilities
Installed, configured, and maintained Windows Server environments, ensuring stable and secure operations.
Performed server patching and updates using Windows Server Update Services (WSUS) and automated tools to maintain compliance and reduce vulnerabilities.
Managed Active Directory (AD), including creating and managing users, groups, and organizational units (OUs) to maintain efficient directory structures
Configured role-based access control (RBAC) to enforce least privilege access policies across user accounts, enhancing security.
Implemented and maintained Group Policy Objects (GPOs) to enforce organizational security and compliance requirements, such as password policies and application restrictions
Deployed and maintained System Center Configuration Manager (SCCM) for operating system imaging, application deployment, and patch management, streamlining endpoint management.
Designed and managed Active Directory (AD) infrastructure, including domains, forests, and trust relationships, ensuring seamless authentication and resource access across the organization.
Created and maintained standardized system images using Sysprep for consistent deployment across diverse hardware and virtual environments.
Administering Office 365 applications, including email (Exchange Online), SharePoint, OneDrive, and Teams.

Environment/Tools: Windows Server 2008 R2, Windows Server 2012, CentOS, Ubuntu Server, VMware ESXi, Task Manager, Resource Monitor, Performance Monitor, YUM, Hyper-V.
Education
Bachelor of Science in Computer Science University of Kerala, Kerala 2006 2009
Master of Science in Computer Applications University of Kerala, Kerala 2009 2012
Keywords: continuous integration continuous deployment artificial intelligence machine learning javascript sthree active directory information technology microsoft Arizona California

732-512-0009 EXT 831
Keywords: continuous integration continuous deployment artificial intelligence machine learning javascript sthree active directory information technology microsoft Arizona California

To remove this resume please click here or send an email from surya.k@logicplanet.com to usjobs@nvoids.com with subject as "delete" (without inverted commas)
surya.k@logicplanet.com;5129
Enter the captcha code and we will send and email at surya.k@logicplanet.com
with a link to edit / delete this resume
Captcha Image: