Sravan - looking for Devops Engineer |
[email protected] |
Location: Santa Rosa Valley, California, USA |
Relocation: yes |
Visa: Green Card |
SRAVAN KUMAR REDDY
Sr. Cloud DevOps Engineer PROFESSIONAL SUMMARY: Overall 12+ years of experience in the IT Industry where DevOps Engineer with 6+ years of comprehensive experience in Cloud with a major focus on Microsoft Azure and AWS. Acclaimed excellent knowledge in DevOps, Configuration Management, Infrastructure Automation, Build and Deployment Automation, Designing, and Development on Microsoft Azure. Relevant 4 years as QA Test Engineer. Hands-on Experience Administrating Microsoft Azure Services like Azure App Services, Azure SQL Database, Azure AD, Azure Blob storage, Azure Functions, Virtual machines, Azure Fabric controller, Azure Data Factory, Azure web applications, Azure Service Bus, and Notification hub. Expertise in utilizing AWS services like AWS Code Deploy, Route53, EBS, S3, EC2, Cloud Formation, ECS, SNS, Lambda, SQS, Cloud Front, and Auto Scaling for deploying Java and .net applications to the cloud. Developed PowerShell scripts and ARM templates to automate the provisioning and deployment process. Seasoned in Configuring Azure Automation Desired State Configuration (DSC) configuration management to assign permissions through Role-Based Access Control (RBAC), assign nodes to proper automation accounts and DSC configurations, to get alerted on any changes made to nodes and their configuration. Expertise in Designing and implementing Continuous Integration (CI) throughout many environments utilizing Azure DevOps tools to provide an agile development process that is automated and repeatable, allowing teams to safely deploy code several times per day while ensuring Azure Kubernetes Services (AKS) are supported. Well-versed in using Azure Infrastructure as a Service (IaaS), Provisioning VMs, Virtual Networks, Deploying Web Apps, Microsoft SQL Server, using ARM Templates, and Azure DevOps CI/CD pipelines. Expertise in configuring the Monitoring and Alerting tools according to the requirement like Prometheus and Grafana, setting up alerts, and deploying multiple dashboards for individual applications in Kubernetes. Experience in Configuring VNET Peering using Terraform Modules and configuring NSGs for two tiers and three- tier applications were set up to filter network traffic, to facilitate connectivity between resources across various Virtual Networks. Expertise in automating deployment processes using Azure DevOps to various environments (e.g., development, staging, production). Developed and designed Azure DevOps pipeline to manage the resources across multiple subscriptions in Azure. Highly involved in Setting up and configuring Jenkins pipelines for automated builds, tests, and deployments. Experienced building Jenkins pipelines to drive all microservices builds out to the Docker registry and then deployed to Kubernetes, Created Pods, and managed using Kubernetes. Designing and implementing Ansible playbooks and roles to automate infrastructure configuration and management. Experience using Ansible Tower dashboard, role-based access control, and access to Ansible for deployments and worked with Ansible Playbooks to automate various deployment tasks and working knowledge on Ansible Roles, Ansible inventory files, and Ansible Galaxy. Well-versed in creating reusable modules and templates for infrastructure provisioning with Terraform in different cloud platforms. Developing and maintaining Splunk queries and search queries for log analysis and troubleshooting. Expertise in Deploying and managing Kubernetes clusters on on-premises on cloud environments. Configuring and optimizing cluster resources, including nodes, networking, and storage Implemented and managed AWS container orchestration platforms such as ECS and EKS to automate the deployment and management of containerized applications. Proficient in creating Docker images using Docker File, worked on Docker container snapshots, removing images, and managing Docker volumes and implementing. Excellent reputation for resolving production issues very quickly with an analytical bent of mind. Excellent communication skills with extensive experience in organizing meetings and gathering project requirements from multiple teams in large multi-functional organizations. Good exposure in managing various LINUX servers clustered environments. Excellent troubleshooting and time management skills, ability to work independently while coordinating efforts with engineers, vendors, and customer support resources. Proficient in using AWS CDK(Cloud Development Kit) with Typescript for infrastructure provisioning. Expertise in GitHub Administrative activities. Configured GIT with Jenkins and schedule jobs using POLL SCM/Webhooks option and integrated to automate the code checkout process. experience in customizing and extending Argo CD. include creating custom resource definitions (CRDs), building plugins, or developing custom workflows to meet specific requirements. Experienced in integrating Argo CD into CI/CD pipelines like Jenkins, azure pipeline, or other continuous integration and delivery platforms to trigger Argo CD deployments and enable end-to-end automation. Experienced in the practical application of the GitOps methodology with Argo CD. and integrated Git as the single source of truth for application configurations and utilized Argo CD to reconcile the desired state of applications with the state in the Git repository. Deployed and configured Elastic search ELK, Logstash, and Kibana (ELK) for log analytics, full-text search, and application monitoring. EDUCATION: Computer Science Engineering TECHNICAL SKILLS: WORK EXPERIENCE: Client: Starbucks, Remote| July 2024 - September 2024 Sr. Cloud DevOps Engineer/SRE Responsibilities: Designed, implemented, and managed cloud infrastructure using Terraform and Azure, creating reusable modules to standardize deployments across environments, ensuring scalability, high availability, and cost-efficiency. Automated provisioning of resources such as virtual networks, virtual machines, and storage accounts, enabling consistent and repeatable infrastructure deployment. Developed and maintained Terraform state files, implementing state locking and management to ensure consistency during concurrent deployments, reducing configuration drift risks. Integrated Terraform workflows with Azure DevOps (ADO) pipelines, automating infrastructure deployment as part of CI/CD processes, significantly reducing manual provisioning efforts and deployment times. Managed code repositories using Git, implementing branching strategies, code reviews, and version control best practices for infrastructure as code (IaC) and application development. Conducted code reviews for Terraform scripts to ensure adherence to best practices and security guidelines, fostering a culture of clean and maintainable infrastructure code. Implemented and managed Terraform workspaces to manage multiple environments (development, staging, and production), ensuring consistency across all stages of the software development lifecycle. Optimized Terraform configurations to reduce deployment times and minimize infrastructure costs, achieving a 20% reduction in cloud spend through efficient resource management. Supported and optimized SQL databases, ensuring high performance, regular backups, and disaster recovery readiness. Implemented monitoring solutions to ensure uptime and reliability, responding to incidents promptly to minimize downtime and conducting post-incident reviews. Collaborated with security teams to integrate automated security controls into Terraform, ensuring adherence to compliance standards and best practices like IAM role creation, network security groups, and encryption. Developed detailed documentation for Terraform modules, best practices, and troubleshooting guides to enhance knowledge sharing and onboarding for new team members. Created scripts and automation tools to reduce manual efforts, improving operational efficiency in infrastructure provisioning, scaling, and backup processes. Client: Broadridge Financial, Coppell, TX USA. | April 2021 June 2024 Role: Sr. Cloud DevOps Engineer Responsibilities: Involved in creating Azure Key Vaults to hold Certificates, Keys, and Secrets. Designing Inbound and Outbound traffic rules and linking them with Subnets and Network Interfaces to filter traffic to and from Azure Resources. Well-versed in automating Infrastructure using Azure CLI, monitoring and troubleshooting Azure resources with Azure App Insights, and accessing subscriptions with PowerShell. Created and deployed Python Azure FunctionApp and integrated it with Azure API management. Configured and maintained Azure Storage Firewalls and Virtual Networks, which use virtual Network Service Endpoints to allow administrators to define network rules that only allow traffic from specific V-Nets and subnets. Planning and implementation of data and storage account solutions for Azure files, Queue storage, and blob storage. Implemented terraform modules for automatic provisioning in the Azure environment. Worked on Azure DevOps services such as Azure Repos, Azure Boards, and Azure Test Plans to plan work and collaborate on code development, built and deployed applications. Designed and implemented database schema, created tables and indexes, and configured security settings for Azure SQL DB. Configuring database-level security settings, creating users and roles, and implementing access controls to ensure that only authorized users have access to the correct schema. Designed and implemented Terraform scripts to automate SQL DB Instances creation. Managing and monitoring Azure SQL DB instances troubleshooting issues and optimizing the performance of SQL Managed Instances. Worked with Terraform for automating VNET, NSG, AKS, ACR, VMs, and Storage accounts to replace the rest of our infrastructure. collaboratively with cross-functional teams, including developers, system administrators, and other stakeholders. Effective communication and teamwork skills are crucial in an OpenShift environment. worked on Deploying and managing applications on the OpenShift platform. configured containers, manage deployments, and perform scaling and updates. Experienced in OpenShift is built on top of Kubernetes and leverages Docker for containerization. Worked on the creation of various custom modules in Terraform for various environments. Managed servers on the Microsoft Azure Platform Azure Virtual Machines instances using Ansible Configuration Management and created Ansible Playbooks, tasks, and roles to automate system operations. Designed and implemented CI/CD pipeline for Azure app services and was responsible for monitoring the performance and availability of Azure App Services and troubleshooting any issues that arise.Developed and maintained Continuous Integration (CI) using tools in Azure DevOps (VSTS) spanning multiple environments, enabling teams to safely deploy code in Azure Kubernetes Services (AKS) using YAML scripts and HELM charts. Worked on deployment automation of all the microservices to pull images from the private docker registry and deploy to the Azure Kubernetes cluster (AKS) using Ansible. Installed and configured the configuration management tool Ansible for Cloud Automation. Worked on creating, configuring, and managing AKS clusters in Azure, including managing node pools, configuring networking, and setting up load balancing. Managing the Azure Kubernetes Services (AKS) policies, providing access to different Azure resources, and developing and improving the workflows that govern access. Configured and installed Prometheus and Grafana using HELM charts within an AKS cluster for monitoring and visualization of Kubernetes workloads. Integrated Azure Storage Accounts with Azure Functions, Azure Databricks, and AKS cluster. Developed continuous integration and deployment pipelines that automated builds and deployments to many environments using VSTS/TFS in the Azure DevOps Project. Configured private endpoints for Azure services, such as Azure Storage and Azure SQL Database, to restrict access to resources from within the virtual network only. Configured applications that run multi-container Docker applications by utilizing the Docker-Compose tool which uses a file configured in YAML format. Used Kubernetes to manage containerized applications using its nodes, Config-Maps, selector, Services, and deployed application containers as Pods. Used Chef/Puppet to automate repetitive tasks, quickly deploy critical applications and manage changes. Designed, implemented, and managed containerized application platforms using OpenShift, including cluster setup, deployment, and maintenance. Configured and managed the New Relic monitoring agents, setting up custom dashboards and alerts, and monitoring application performance metrics to ensure application availability. Managed and maintained multiple GitHub repositories and was responsible for managing Pull requests. Developed and implemented access controls in GitHub to ensure that only authorized users have access to sensitive data and repositories. Extensive involvement in Azure Backup, Azure Policies, and Azure Key Vault. Developed and implemented processes and procedures for Linux OS patching. Client: US Bank, Richfield, MN USA. | Sep 2018 - Mar 2021 Role: Sr. DevOps Engineer Responsibilities: Automated Build and Deployment process-setup Continuous Integration and Continuous Deployment of applications onto different environments like Dev, QA, and Production using Jenkins. Designing, deploying, and maintaining the application servers on AWS infrastructure, using services like AWS EC2, S3, Glacier, VPC, Lambda, Route53, SQS, IAM, Cloud Front, and RDS Worked with the different instances of AWS EC2, AWS AMI s creation, managing the volumes, and configuring the security groups. Created AWS VPC network for the installed instances and configured AWS Security groups, NACL, and Elastic IPs installing load balancers and VPC with public and private subnets. Responsible for user access and permissions for AWS services and resources using IAM roles, groups, and policies. Integrated IAM policies with AWS services such as EC2, S3, and RDS to achieve secure access controls. Created Chef Cookbooks for automation purposes like file copy, permission changes, configuration changes, path- specific folder creation, etc. Wrote Cookbooks for provisioning, orchestration, packages, services, configuration, and deployments. Involved in writing terraform modules for automation and the Creation of different environments. Created private cloud using Kubernetes and managed using Helm charts. Implemented a server less architecture using ARI Gateway, Lambda, and Dynamo DB and deployed AWS Lambda code from Amazon S3 buckets. Responsible for setting up databases in AWS using RDS, storage using S3 buckets, and configuring instance backups to S3 buckets. Worked on creating AMI (AWS Machine Images) that utilizes ELB (Elastic Load Balancer) and Auto Scaling. Designed and deployed AWS cloud architecture using the AWS CDK (Cloud Development Kit) with Typescript, ensuring robust and efficient solutions aligned with business requirements. Integrated Jenkins with Nexus, SonarQube, and Chef and used CI/CD within Jenkins on a Docker container environment, utilizing Docker for the runtime environment for the CI/CD system to build, test and deploy in AWS EKS clusters. Created custom dashboards on Grafana on top of Prometheus for the monitoring of the Kubernetes cluster. Developed and configured Jenkins pipeline for EKS cluster deployment for various environments DEV, QA, UAT, and PROD. Responsible for ensuring security and compliance of the EKS clusters, by enforcing security policies, performing vulnerability scans, and configuring Kubernetes RBAC. Created custom alerts and notifications in Elasticsearch and Kibana-based CPU or memory usage thresholds, error rates, and latency for AWS EKS clusters. Administered and Engineered Jenkins for managing weekly Build, Test, and Deploy chain as a CI/CD process, GIT with Development/Test/Prod Branching Model for weekly releases. Implemented automation of AWS infrastructure via Terraform. Used Kubernetes to orchestrate the deployment, scaling, and management of Docker Containers. Worked on the Cloud Watch to monitor the performance environment instances for operational and performance metrics during load testing. Worked with the AWS S3 services in creating the buckets and configuring them with logging, tagging, and versioning. Used Ansible Tower, which provides an easy-to-use dashboard and role-based access control which makes it easier to allow individual teams access to use Ansible for their deployments. Configured Cloud Watch metrics and alarms to monitor S3 bucket performance and usage. Created the trigger points and alarms in Cloud Watch based on thresholds and monitored logs via metric filters. Used Nexus Artifactory to store and maintain the artifacts in the binary repositories and push new artifacts by configuring the Jenkins project. Involved in designing, deploying, and managing AWS RDS database instances for various applications and environments. Worked with Cloud Formation Templates, terraform along with ansible to render templates. Troubleshooting and debugging performance issues with Elastic Beanstalk environments, including diagnosing issues related to application code, infrastructure, and networking. Extensively worked on GitHub for creating repositories, branching, merging, and resolving merge conflicts, and setting up GitHub webhooks to trigger Jenkins builds automatically. Wrote shell scripts to archive old logs. Configured and maintained GitHub organizations, and teams, to enable effective collaboration between multiple teams while ensuring security and compliance. Conducted knowledge-sharing sessions and provided training to team members on AWS best practices and CDK development. Client: 7-Eleven INC, Hackberry, Irving, USA. | Jun 2015 - Sep 2018 Role: DevOps Engineer/SRE Responsibilities: Collaborated with software development teams to design and implement highly available and fault-tolerant architectures for web applications. Implemented robust monitoring and alerting systems. Ensured to detect the performance issues and proactive resolution at an early stage. Developed custom modules in Terraform. Led incident response efforts and coordinated with cross-functional teams to restore services quickly and minimize customer impact. Worked on many different concepts of Chef like Roles, Environments, Data Bags, Knife, and Chef Server Admin/Organizations. Written Chef Recipes to automate the build/deployment process and data bags in Chef for better environmental management. Tested Chef Cookbook modifications on cloud instances in AWS and using Test Kitchen and Chef Spec Written Chef Cookbooks for various DB configurations to modularize and optimize product configuration, converting production support scripts to Chef Recipes and AWS server provisioning using Chef Recipes. Developed AWS-based applications using Typescript and AWS SDKs ensuring seamless integration with various AWS services. Implemented a Continuous Delivery pipeline with Docker, Jenkins, and GitHub. Responsible for installation & configuration of Jenkins to support various Java builds and Jenkins plugins to automate continuous builds and publishing Docker Images to the Nexus Repository. Extensively involved in setting up Kubernetes (k8s) clusters for running microservices and implemented a Kubernetes Container Orchestration solution within OpenStack allows for easy management. Managed local Deployments in Kubernetes using the Open Shift platform, creating local clusters, and deploying application containers and used Open Shift to manage microservices for the development of CI/CD. Used SonarQube for continuous inspection of code quality and to perform automatic reviews of code to detect bugs. Automated Nagios alerts and email notifications using Python script and executed them through Chef. Installed, configured, and maintained web servers like Apache Web Server and WebSphere Application Server on Red Hat Linux. Experience with Red Hat Linux kernel, memory upgrades, and swaps area. Red Hat Linux Kickstart. Installation Sun Solaris Jumpstart Installation. Configuring DNS, DHCP, NIS, and NFS in Sun Solaris 8/9 & other Network Services. Used multiple EC2 instances at the same time and provided highly durable and available data by using S3 data store, versioning, lifecycle policies, & create AMIs for mission-critical production server s backup. Provided highly durable and available data by using S3 data store, versioning, and lifecycle policies. Created terraform templates to create resources in the cloud and pipelines in Jenkins to trigger terraform. Performed the automation deployments using AWS by creating the IAMs and used the code pipeline plugin to integrate Jenkins with AWS and created the EC2 instances to provide the virtual servers. Worked with Knife command line tool for creating Recipes, Cookbooks and worked with Chef supermarket. Implemented Docker-Maven plugin and Maven POM to build Docker Images for all microservices and later used Docker file to build the Docker Images from the java jar files. Used Git for source code version control and integrated with Jenkins for CI/CD pipeline, code quality tracking, and user management with build tools Maven and Ant and written Maven pom.xml build script. Installed, Configured, and Managed Monitoring Tools such as Nagios for Resource Monitoring/ Network Monitoring. Client: Barclays, New York, USA| Nov 2013 - May 2015 Role: Build Release Engineer Responsibilities: Participated in daily scrum/standup meetings and provided updates to the scrum master about the task assigned to me. Managed Git repository, code merging, and production deployments. Coordinate/assist developers with establishing, and applying appropriate branching, labeling/naming conventions using Subversion, and Git source control. Maintained build-related scripts developed in shell for Maven builds. Created and modified build configuration files including POM.xml. Defined dependencies and plugins in Maven pom.xml for various activities and integrated Maven with GIT to manage and deploy project-related tags. Worked recently on tools migration from old tools like PVCS (version control system) to SVN, Tracker to Jira, and finally CI tool Hudson to Jenkins. Partially worked as a build tools administrator (SVN, Jira, and Jenkins). Worked on management tools like Ansible, Nagios, Git, GitHub, Bitbucket, and Jenkins for CI-CD. Set up full CI/CD pipelines so that each commits a developer makes will go through the standard process of the software lifecycle and is tested well enough before it can make it to production. Implemented Ansible playbooks to install configure and maintain web servers like Apache Web Server, and nginx. Wrote Ansible playbooks, the entry point of Ansible provisioning, where the automation is defined in tasks using YAML. Ran Ansible scripts to provision Dev servers. Handle Load balancing and monitoring with Nagios. Used Ansible inventories to define groups of hosts. Configured different Environments to specified states using Ansible Playbooks and Ansible Vault in Playbooks to protect sensitive data. Worked and helped to improve the Continuous Delivery framework to streamline releases and reliability. Deployed the Java applications into Apache Tomcat Application Servers. Writing Maven pom.xml and Ant build.xml for build scripts. Executed user administration and maintenance tasks including creating users and groups. Utilized WAR and JAR files for the deployment of enterprise apps. Managed Nexus Maven repositories to download the artifacts during the build. Created and maintained the Shell deployment scripts for Web Logic Web Applications servers. Monitoring Live Traffic, logs, Memory utilization, Disk utilization, and various other factors, which are important for deployment. Client: Qikwell Technologies India Pvt Ltd| Jan 2012 - Sep 2013 Role: Linux System Administrator Responsibilities: Maintained customized user environments using SUDO for system administrator access on test and production servers. Experienced in tasks like managing User Accounts and Groups, managing Disks and File systems. Installing and upgrading virtual machine hardware version using VMware tools. Provisioning virtual machines and Deploying virtual machines from Templates by cloning. Creating and deleting of VM's snapshots. Assigning LUNs to ESX/ ESXi servers and creating data stores. Adding virtual machine memory and CPU. Use VMware vSphere vCenter Update Manager to apply patches to virtual machines and also Virtual Private Networking (VPN) tools and techniques to maintain these VM's. Installing and configure Redhat Linux Servers with KickStart, LDAP, DNS, and DHCP. and applying security policies for hardening the server based on the company's policies. Working on volume/File system management using Logical Volume Manager (LVM), Veritas Volume Manager, and Solstice Disk Suite (Solaris Volume Manager). Implemented, configured and deployed new patches, upgrades, bug fixes on both physical and virtual Red Hat Linux servers using satellite server and YUM server. Experience in installing, configuring and maintaining application Server like WebSphere and Web Servers like Web Logic, Apache HTTP and Tomcat on UNIX and Linux. Have tested various Linux Environments in the Development/Test Stages to help create a better standardized environment and resolutions of any problems if any occurred. Created and wrote Shell scripting and PowerShell for automating tasks. Keywords: continuous integration continuous deployment quality analyst sthree database active directory information technology golang Minnesota Texas |