shiva - cloud engineer |
[email protected] |
Location: Austin, Texas, USA |
Relocation: |
Visa: |
Objective:
AWS Certified SysOps Associate and Certified Kubernetes Administrator with over 7 years of experience in the IT industry comprising optimizing mission-critical deployments in AWS, Azure, GCP, EKS, Kubernetes, Containers Orchestration, Terraform, Ansible configuration management, CI/CD, Network Interfaces, Container Security, test and deployment automation, and many DevOps processes. Exceptional at analysis and evaluation of an issue to form a judgment. Summary of Qualifications: Worked as a Sr.Aws SysOps Engineer involving extensive work towards code compilation, automation, packaging, building, debugging, managing, tuning, and deploying code across multiple Aws environments. In-depth understanding of the principles and best practices of Software Configuration Management (SCM) in Agile, SCRUM, and Waterfall methodologies. Wrote Ansible playbooks to manage configurations and automate installation processes. Used Ansible and Ansible Tower as a Configuration management tool, to automate repetitive tasks, quickly deploys critical applications, and proactively manage change. Managing the configurations of multiple servers using Ansible & Chef. Managed environments DEV, QA, UAT, and PROD for various releases and designed instance strategies. Managed multi flavors of Linux and Windows virtual servers with Ansible & Chef using Git. Extensively worked on Jenkins and TeamCity for continuous integration (CI) and for End-to-End automation for all build and deployments. Performed Automation and Scaling of applications using Kubernetes. Experience with Ansible Tower to manage Multiple Nodes and Manage Inventory for different Environments. Used Ansible to orchestrate software updates and verify functionality. Terraforming and Awsformation to code all infrastructures into AWS and GCP. Organized different infrastructure resources like physical machines, VMs, and even containers using Terraform. Used AWS Beanstalk for deploying and scaling web applications and services developed with Java, and Node.js on familiar servers such as Apache. Worked on Terraform to set up the AWS infrastructures such as launching the EC2 instances, S3 buckets, VPC, and Subnets, and created the module-driven AWS Infrastructure with Terraform. strong practical experience Experience developing Terraform Modules for several cloud services, such as K8s, API Gateway, S3, and Lambda a thorough understanding of cloud networking and security including network interfaces, load balancers, IAM policies, and container security. Good Docker and Helm Charts working knowledge for Kubernetes implementation. Experience with Kubernetes for workload management and security with API Gateways, Load Balancers, Pod security, PVCs, and Fluentbit/FluentD API deployment. Worked on Docker container snapshots, attaching to a running container, removing images, managing directory structures, and managing containers. Experience working on several Docker components like Docker Engine, Hub, Machine, Compose, and Docker Registry. Experience working on docker hub, creating docker images, and handling multiple images primarily for middleware installations and domain configurations. I used Docker for configuration management automation. Set up Elastic Load Balancer for distributing traffic among multiple Web Logic servers and was involved in deploying the content cloud platform on Amazon Web Services using EC2, S3, and EBS. Creating the AWS Security groups to enable the firewall. Automated AWS volumes snapshot backups for enterprises using Lambda. Created functions and assigned roles in AWS Lambda to run scripts. Administration and maintenance of Docker runtime environment, Versioning and lifecycle management of Docker images, Experienced in Docker orchestration framework. Automated Compute Engine and Docker Image Build with Jenkins and Kubernetes. Having good implementation experience with installation and configuration of Kubernetes, clustering them, and managing local deployments in Kubernetes. Technical Proficiency: Operating Systems: UNIX, Linux, Ubuntu Windows Server DevOps Tools: Docker, Kubernetes, Jenkins, Ansible, and Git Cloud Platforms: AWS, Google Cloud Platform SCM Tools: Subversion, GIT, Perforce, Clear case, Nexus, Artifactory Build Tool: Ant, Maven, Gradle, MSBuild CI tools: Jenkins, TeamCity, XDeploy. Bug tracker & Testing: JIRA, Bugzilla, Redmine, Junit, TestFlight, TestRail, TeamCity Automation Tools: Ansible & Chef Web Service Tools: Nginx, Apache Tomcat, Web sphere, IIS Languages/Utilities: Shell Script, Batch Script, JavaScript. Networking: TCP/IP, NIS, NFS, DNS, SAN, SMTP, FTP/TFTP. Cloud & Networking: AWS, Azure, Apache Cloud-Stack, VMware-Exsi. Virtualization & Micro Services: Docker, Kubernetes KVM, LXD, VMware Workstation, Virtual-Box. Scripting and Automation: Bash, PowerShell Monitoring and Logging: GCP, Splunk Strong Troubleshooting and Problem-solving skills Excellent Communication and Collaboration abilities. Certifications: AWS Certified SysOps Administrator - Associate (Verification Code: JT87B5T182QEQB94) Certified kubernetes Administrator (LF-udeulhbhur) Professional Experience: Fusion, Dallas, TX 06/26/2023 - Present Sr. Cloud SysOps Consultant Managed the deployment, monitoring, and maintenance of AWS infrastructure. Implemented automation and orchestration using AWS CloudFormation, Lambda, and other relevant tools. Worked on migration, and successfully moved on-premise to AWS cloud environments. Collaborated with DevOps teams to ensure seamless CI/CD pipelines and efficient release management. Resolved complex technical issues, providing timely solutions to ensure minimal service disruptions. Implemented Infrastructure as Code (IaC) using CloudFormation and Terraform, reducing deployment time by 50% and increasing consistency across environments. Managed and optimized AWS resources, including EC2 instances, RDS databases, and S3 buckets, resulting in improved performance and cost savings of 25%. Implemented automated monitoring and alerting using CloudWatch, ensuring prompt identification and resolution of system issues. Mentored junior team members and conducted technical training sessions to enhance the team's AWS expertise. Led the design and implementation of cloud-native solutions, leveraging Kubernetes and microservices architecture. Developed Terraform modules for Kubernetes clusters, API Gateway, S3, Lambda functions, and other cloud resources, improving infrastructure as code (IaC) practices. Designed and implemented robust cloud networking solutions, including VPCs, subnets, and security groups, enhancing the overall security posture. Implemented IAM policies following the principle of least privilege, ensuring secure access control across the cloud environment. Collaborated with development teams to containerize applications using Docker and streamline deployment using Helm Charts. Achieved seamless workload management by deploying and scaling applications within Kubernetes pods, Load enhancing availability and performance. Ensured container security by configuring pod security policies, managing persistent volume claims (PVCs), and integrating Fluentbit/FluentD for efficient log management. Successfully deployed APIs on Kubernetes, implementing advanced load balancers and securing endpoints with industry best practices. Environment: Terraform, AWS, S3, EBS, ELB, Auto Scaling groups, VPC, IAM, Cloud Watch, Glacier, DynamoDB, OpenStack, shell/Python scripts, GIT, Jenkins, Maven, Nexus, Ansible, Fluentbit, Docker, Kubernetes, Helm Charts, Argo CD, JIRA, Splunk, Nagios, Python, Nginx. CVS, Remote 10/03/2022 - 06/19/2023 Sr.Cloud SysOps Consultant Designed and implemented Docker-based containerization for Episode Analytics Apps (UI, API, and Database) using Docker-compose. Created Kubernetes clusters for the EA app. Managed Kubernetes clusters, including provisioning, scaling, and monitoring, utilizing Minikube for local development and testing. Collaborated with development teams to optimize application performance and troubleshoot production issues, ensuring high availability and minimal downtime. Managing and setting up logging alerts in GCP based on a threshold. Implemented centralized logging and monitoring solutions using Splunk and GCP, enabling real-time visibility into system health and proactive issue detection. Conducted training sessions and workshops to educate team members on Docker, Minikube, and Kubernetes best practices and usage. Clone repositories to their local machines and create branches for new features or bug fixes. Commit changes to branches and push branches to the central repository. Pull the latest changes from the central repository. Resolve conflicts during merges. Review and test code changes by other contributors. Collaborate with other team members via pull requests or code reviews. Worked closely with cross-functional teams, including developers, system administrators, and QA engineers, to ensure smooth software delivery and continuous improvement. Creating documents and uploading them to the fuse channel and updating the documentation to ensure it remains accurate and helpful to users. Knowledge of developing Helm charts for application definition, packaging, and deployment on Kubernetes. By setting up pod security policies, controlling persistent volume claims (PVCs), and integrating Fluentbit/FluentD for effective log management, it was possible to guarantee container security. The ability to share and distribute Helm charts via Helm repositories. Understanding of combining other GCP services and apps with GCP Secret Manager. Knowledge of using GCP Secret Manager to create and manage secrets. Knowledge of setting up and configuring ArgoCD for Kubernetes clusters' continuous application deployment. Defining infrastructure resources on cloud platforms like AWS, GCP, or Azure by creating Terraform configurations. Environment: Terraform, AWS, S3, EBS, Elastic Load balancer (ELB), Auto Scaling groups, VPC, IAM, Cloud Watch, Glacier, Fluentbit, DynamoDB, shell/Python scripts, GIT, Sondarqube, Helm Charts, Argo CD, Jenkins, Maven, Nexus, Ansible, Docker, Kubernetes, Confluence, JIRA, Splunk, Nagios, Python, Nginx. Medtronics, Minnesota 07/06/2020 - 09/30/2022 Cloud SysOps Engineer Deploying infrastructure on AWS utilizing services such as EC2, RDS, VPC and Managed Network and Security, Route 53, Direct Connect, IAM policies, Cloud Formation, AWS Ops Works (Automate operations), Elastic Beanstalk, AWS S3, Glacier, (Storage in the cloud) and Cloud Watch Monitoring Management. Handling and maintaining Azure DevOps Pipeline with ADO and Terraform. Maintaining Terraform for Infrastructure as Code for Azure and GCP. Experienced in creating AWS IAM and Security Group in Public and Private Subnets in VPC. Created AWS Route53 to route traffic between different regions. Launched AWS and OpenStack instances (SUSE/Ubuntu/CentOS) and Configured launched instances for specific applications. Implementing AWS Lambda functions to run scripts in response to events in Amazon Dynamo DB table, S3 buckets, and HTTP requests using Amazon API Gateway. Involved in creating, and configuring AWS VPC services, installed EC2 instances for the new development team, and used AWS Route 53 to maintain the traffic and create the DNS name. Create a custom image of an Azure VM with Azure PowerShell, create a VM scale set, and deploy a highly available app on Windows with Azure PowerShell. Experience in load balances, Windows VM in Azure to create highly available applications with Azure PowerShell and manage Azure Virtual Network for Windows VM with Azure PowerShell. Experience in Backup and restore files for Windows VM in Azure cloud. Transformed data among servers using tools SQL Server Integration Services and BCP. Configured and Administered Net backup policies, which include Windows and Linux agents, VMware SAN backups, Oracle and SQL agents, and Net backup Accelerator for servers up to 380 TB in size. Written Python scripts and PowerShell for setting up baselines, branching, merging, and automation processes across the environments using SCM tools like GIT on Linux and Windows platforms. Implementing VPC, Auto Scaling, S3, EBS, ELB, Cloud Formation templates, and CloudWatch services from AWS. Used Maven dependency management system to deploy snapshots and release artifacts to Nexus to share artifacts across projects and environments. Built applications using Maven, and Gradle scripts. Responsible for Installing, administering, repository management, and User management in Nexus. Involved in checking the quality of the code by using SonarQube. Working knowledge/exposure in TOMCAT APACHE, WEB LOGIC & WEB SPHERE, formulated and executed designing standards for DNS servers. Deploy application code using CI/CD pipeline with Azure DevOps in Azure cloud, scale VM build automation using Azure DevOps in Azure VM agent plug-ins. Worked on Performance Monitoring, resolving network issues & tuning the system using tools. Develop a plan for problem and incident response & management process based on Information Technology Infrastructure Library (ITIL). Improved team performance and accelerated release cycle to accelerate delivery of new features and increase overall product qualities. Maintaining and developing Docker images for a tech stack including Cassandra, Kafka, Apache, and several in-house written in both OpenStack and AWS cloud on Kubernetes. Used Docker and Kubernetes to manage microservices deployment. Used Kubernetes to create Pods, Config Maps, and deployments into the cluster and provide pod security. Hands-on experience on Kubernetes to automate the deployment, scaling, and operations of application containers across clusters of hosts. Worked with containerization tool Docker and running containers and services and utilizing docker swarm. API to provide lightweight containers that run processes isolation and worked on the creation of customized Docker container images, tagged and pushed the images to the Docker Hub repository. Built and deployed an Ansible Server in Azure & AWS for infrastructure automation. Creating Chef Cookbooks and Recipes to maintain and automate various parts of the infrastructure. Responsible for configuration changes, code building, code staging, build automation, and deployment using tools like Octopus which also includes scripting in the command line, Python, PowerShell, and Perl. Configuration on WebSphere which includes writing Linux scripts- Shell scripting. Continuous Integration using Maven, Jenkins, and Hudson. Involved in migration of Artifactory & GIT server. Created an S3 bucket to host a static website with logging and versioning and customizing the template as per the client's requirement. DB administration and scripting in SQL, MySQL, and Oracle databases for report generation and data management. Created lambda and wrote lambda function in Python script to stream the data from the S3 bucket to Elasticsearch. Customized the log monitoring for a better and more convenient reading of the logs by creating the indices and passing them on to Elasticsearch which provides a URL for Kibana access. Responsible for designing & deploying new ELK clusters (Elasticsearch, Logstash, Kibana, Beats, Kafka, zookeeper. Responsible for merging, tagging, and conflict resolution in GIT and administered version control systems to create daily backups and checkpoint files. Responsible for User Management, Plugin Management, and new job design and setup in Jenkins. Good experience in converting build.xml into pom.xml to build applications using Maven. Experience in installing, configuring, and administration of VMware-based virtual servers using ESX/ ESXi servers, vSphere and Virtual Center. Used Javascript to create a workout timer dynamic overlay, which updates the database via saved timestamps. Workaround was required to save the timestamp on the tab out of the mobile browser. Adept at Angular front-end development, and have made native mobile apps in my spare time. Established communication between VPCs using VPC Peering and Transit Gateway. Configured lifecycle configuration rules on S3 buckets to move data between storage classes for easy accessibility, archiving, and or cost optimization. Environment: Terraform, AWS, S3, EBS, Elastic Load balancer (ELB), Auto Scaling groups, VPC, IAM, Cloud Watch, Glacier, DynamoDB, OpenStack, shell/Python scripts, GIT, Jenkins, Maven, Nexus, Ansible, Docker, Kubernetes, JIRA, Splunk, Nagios, Python, Nginx, Tomcat, Windows and Linux environment. Medsmitra, India 06/03/2019 - 06/30/2020 SysOps/Automation Consultant Designed AWS Cloud Formation templates to create custom-sized VPC, subnets, and NAT to ensure successful deployment of Web applications and database templates. Set up and built AWS infrastructure for various resources, VPC EC2, S3, IAM, EBS, Security Group, Auto Scaling, and RDS in Cloud Formation JSON templates. Working in the implementation team to build and engineer servers for Linux & AIX operating systems. Working on AWS cloud to provision new instances. S3 Storage Services. AWS EC2 & Cloud watch services. CI/CD pipeline management through Jenkins. Automated Compute Engine and Docker Image Build with Jenkins and Kubernetes. Having good implementation experience with installation and configuration of Kubernetes, clustering them, and managing local deployments in Kubernetes. Provided consistent environment using Kubernetes for deployment scaling and load balancing to the application from development through production, easing the code development and deployment pipeline by implementing Docker containerization. Focused on containerization and immutable infrastructure. Docker has been core to this experience, along with AWS ECS and Kubernetes. Integrated Kubernetes with network, storage, and security to provide comprehensive infrastructure and orchestrated containers across multiple hosts. Maintained persistent volume claims (PVCs), configured pod security policies, and integrated Fluentbit/FluentD for effective log management. Created Docker images using a Docker file. Worked on Docker container snapshots, removing images and managing Docker volumes, and experience with Docker container service. Wrote Ansible playbooks to manage configurations and automate the installation process. Used Ansible and Ansible Tower as a Configuration management tool, to automate repetitive tasks, quickly deploys critical applications, and proactively manage change. Managing the configurations of multiple servers using Ansible. Managed the Artifactory repository for the current project created a new repository and provisioned it. Wrote PowerShell scripts to automatically restart the uDeploy agents on LINUX machines Performed Automation and Scaling of applications using Kubernetes. Experience with Ansible Tower to manage Multiple Nodes and Manage Inventory for different Environments. Used Ansible to orchestrate software updates and verify functionality. Used Ansible to document all infrastructures into version control. Used GIT as Version Control System for two applications. Managed development streams and Integration streams. Used AWS Bean Stalk for deploying and scaling web applications and services developed with Java, Node.js, Python, and Ruby on familiar servers such as Apache, and IIS. Worked on Docker container snapshots, attaching to a running container, removing images, managing directory structures, and managing containers. Experience working on several Docker components like Docker Engine, Hub, Machine, Compose, and Docker Registry. Experience working on docker hub, creating docker images, and handling multiple images primarily for middleware installations and domain configurations. Used Docker for configuration management automation Set up Elastic Load Balancer for distributing traffic among multiple Web Logic servers and was involved in deploying the content cloud platform on Amazon Web Services using EC2, S3, and EBS. Creating the AWS Security groups to enable firewalls. Automated AWS volumes snapshot backups for an enterprise using Lambda. Created functions and assigned roles in AWS Lambda to run Python scripts. Administration and maintenance of Docker runtime environment, Versioning and lifecycle management of Docker images, Experienced in Docker orchestration framework. Environment: Linux(RHEL) Redhat, AWS, Puppet, Chef, Amazon EC2, Apache/ Web Server, WebSphere Application Servers Subversion, Maven, Jenkins, Apache Tomcat, JIRA, MS SharePoint XML, UNIX/LINUX, Windows, Perl Scripts, Shell-Scripts, Java/J2EE, Kubernetes, Docker, HortonWorks, Hadoop. ClusterIT, India 07/02/2018 - 05/31/2019 SysOps/Cloud Engineer AWS, Large scale Infra environments consisting of over 400+ Linux & Windows Instances running multiple technologies including Apache, Nginx, Memcached, MySQL, Tomcat, Kafka, AWS, Dockers, Nexus, Jenkins, Zookeeper, Storm, Hadoop, H-Base, Data dog, Log-watch, Postfix, ELK-Stack & EFK-Stack. OS and Kernel Layer Tuning and Complying for Better Performance. Private Cloud with Cloud-Stack and VMware EXSI consisting of 80-100+ Linux Environment For Hadoop and Other Development Tools Implementation and Configuration of ELK Stack and Configuring Dashboard with FluentD and Logly. Implementation and Configuration of Identity Provider or SAML (Single Sign On) Server. BlackBorad and Canvas Configuration and Setup with Automation tools. Designed and implemented Ansible with Vagrant and Docker Containers to configure, install, manage, and deploy releases across all environments. Docker Containers Monitoring with Custom Script and Sensu Monitoring Tool. Managed and maintained ongoing changes to configuration with Ansible and Custom Scripts. Template Design and Configuration for Apache Cloud-stack and AWS Cloud-Formation. Hadoop multiple multi-nodes fully distributed cluster and testing Environment Setup with simple scripts and Ansible Playbook. Regular maintenance of clusters with Hadoop balancer and monitoring of clusters with Sensu and Python Flask Monitoring Tool. Troubleshoot the build issue during the Jenkins build process. Experience in designing microservices deployment using Docker. Developed build and deployment scripts using ANT and MAVEN as build tools in Jenkins to move from one environment to another environment. Troubleshooting and Recovering Hadoop Data-Node and Name-Node Failure or Recovery. Installing multi-nodes fully distributed and highly available more than 300 nodes storm cluster. Installing and configuring highly available HBase clusters in fully-distributed mode. Installing and configuring Cloudera-Hadoop with MySQL database backend. Configured, performed troubleshooting and tuning for all layers of the stack Created and implemented multi-environment Nagios & Sensu monitoring and managed with Python scripts for adding and removing hosts Provided timely support as needed to staging & production releases Configured, performed troubleshooting and tuning for all layers of the stack Created and implemented multi environments. S3 Custom Backup script and RDS Management. AWS RDS running MySQL with replication. Several automated Shell and Python Script for daily and new implementation work. Nginx Load balancer and Webserver configuration. Nagios & Sensu monitoring and managed auto-reactivation of application service with Nagios reactor and Sensu remediation (Tomcat, Nginx, Memcached, MongoDB, MySQL lots more). Deploying Self-Made Plugins for Sensu or Nagios with Python or Shell Script. Implementation of Docker with different applications & Tools. Publishing Docker images to Docker Hub & Nexus Repo. Environment: GIT, Jenkins, ANT, Maven, Ansible, JIRA, Python Scripting, Open stack, Artifactory, Docker, SQL, Ruby, Nagios, Kibana, Swagger, Shell, Tomcat, SVN, LDAP, Postfix. SVVG, India 06/13/2016 - 06/29/2018 Cloud Engineer Cloud Migration & EBS Disk Migration. Manage infra in AWS, Azure, and VMWare. Ansible with Vagrant for new Deployment of Instance or server Caching server with Redis-Sentinal or Memcached. Experience in deploying Hosted Chef-Server on AWS for managing AWS EC2 instances. Server Provided Systems Administration, monitoring, maintenance, deployments, troubleshooting, and automation services. Responsible for CI/CD process implementation using Jenkins along with Shell scripts to automate routine jobs. Setup MySQL & MongoDB (NoSQL) with failover and Replication Monitoring using Nagios Configured, performed troubleshooting and tuning for all layers of the stack Created and implemented multi-environment Nagios & Sensu monitoring and managed with Python scripts for adding and removing host. Documentation of how deployment works, installation, and setup process of the application running inside Dockers and Monitoring. Configuring Webserver Apache and Nginx. S3 Custom Backup script and RDS Management. AWS RDS running MySQL with replication. Several automated Shell and Python Script for daily and new implementation work. Coordination with multiple hardware and operating system software vendors to resolve the critical production issue. Automatic script for Log Management and query alert. Performed weekly and on-call deployments of application codes to production environments. Created Maven, Perl & UNIX scripts for build and Release activities in QA, Staging, and Production environments. Worked on creating the Docker containers and Docker consoles for managing the application life cycle. Working experience on AGILE/SCRUM Methodologies. Maintained and expanded usage of TeamCity and Gitlab. Created and deployed a tool to automate branch and project creation in subversion using Perl and Chef and Puppet scripts. Also, help set up some of the AWS accounts peered with Shared Services so some users can use their CORP login creeds for logging into AWS accounts. Work along with QA Lead to complete test planning tasks and executions for Statoil. (HAL Insite, WITSML, Apache Cassandra DB, Storm UI, Google Postman, WAPI entitlements, and Zookeeper Applications). Environment: Web Sphere, Nginx Server, Log4j, JavaScript, J2EE Patterns Search, SQL, MySQL, LDAP. Education: Bachelor of Science in Computer Science Amrita University, Bangalore, India Keywords: continuous integration continuous deployment quality analyst user interface javascript sthree database information technology microsoft Texas |