bharath chandra,yaswanth,keerthana - DevOps Engineer,Cloud enignner,SRE,Azure devops ,AWS devops |
[email protected] |
Location: Remote, Remote, USA |
Relocation: yes |
Visa: GC |
Certified Cloud DevOps Engineer with 10 years of IT professional experience in all the areas of Software development life cycle which are Included but not limited to automation, Implementation of CI/CD, System Administration, Cloud computing, Software Configuration Management (SCM), Containerization, orchestration, Scripting for various environments.
Proficient in monitoring and troubleshooting OpenShift environments, utilizing built-in monitoring tools and logging mechanisms to identify and resolve performance issues and ensure optimal application performance. Experience in working with a wide range of Amazon Web Services (AWS) cloud services, including Autoscaling, AWS Storage, EC2, ELB, EBS, VPC, CloudFormation, Security Groups, ACL, Route 53, RDS, S3, IAM, and CloudWatch. Hands-on Experience Administering Microsoft Azure Services like Azure App Services, Azure SQL Database, Azure Active Directory, Azure DataBricks, Azure Blob storage, Azure Functions, Virtual machines, Azure Fabric controller, Azure Data Factory, Azure web applications, Azure Service Bus, and Notification hub. Experience in designing Azure Resource Manager (ARM) Template to deploy multiple resources as well as in designing custom build steps using PowerShell and Developed PowerShell scripts and ARM templates to automate the provisioning and deployment process. Led the design and implementation of Ansible Tower to automate infrastructure provisioning, configuration management, and application deployments. Designed and implemented scalable and secure Kafka clusters for real-time data processing and event-driven architectures. Utilized SageMaker's automated model tuning and hyperparameter optimization features to improve model performance. Used SageMaker's model explainability and interpretability features to gain insights into model behavior and improve transparency. Experienced in practical application of the GitOps methodology with Argo CD and integrated Git as the single source of truth for application configurations and utilized Argo CD to reconcile the desired state of applications with the state in the Git repository. Created infrastructure as code using Terraform which includes defining the infrastructure components, modules, and dependencies needed to create a complete environment. Managed Kubernetes using Helm charts. Created reproducible builds of the Kubernetes applications, templatize Kubernetes manifests, and provided a set of configuration parameters to customize the deployment and managed releases of Helm packages. Strong understanding of TCP/IP and its role in facilitating communication on the internet. Experience with DNS and its role in resolving domain names to IP addresses. Expertise lies in Microsoft Azure Integration, including Azure API Management, Azure LogicApp, Web API, Azure Functions, Service Bus, Event Grid, Event Hub, and Azure Kubernetes Service on Azure. Knowledge of the features and benefits of each VM SKU series, including the Compute, memory, and storage capabilities ,Networking and storage performance and also Cost and pricing considerations. Containerized Java, .NET and Python Applications by writing Dockerfiles and setting up containers in JFrog Artifactory and running containers in AKS and EKS. Well versed in using Azure Infrastructure as a Service (IaaS), Provisioning VMs, Virtual Networks, Deploying Web Apps, Microsoft SQL Server, using ARM Templates, and Azure DevOps CI/CD pipelines. Expertise in configuring the Monitoring and Alerting tools according to the requirements like Prometheus and Grafana, setting up alerts, and deploying multiple dashboards for individual applications in Kubernetes. Developed and deployed Kafka producers and consumers using Python and Apache Spark, ensuring data integrity and encryption. Hands-on experience in developing complex models in Deep Learning for Computer Vision using TensorFlow on cloud infrastructure. Proficient in creating, developing, and testing environments for different applications by provisioning Kubernetes clusters on AWS using Docker, Ansible, and Terraform. Strong proficiency in Azure AD and Azure AD Connect and extensive experience with Exchange administration and management. Collaborated with development teams to integrate OpenStack with Continuous Integration/Continuous Deployment (CI/CD) pipelines, leveraging heat for infrastructure as code. Worked with OpenShift On-Prem Private Cloud Clusters deploying containers and setting up using OC CLI. Experience with developing .NET applications using Azure-based services such as Azure Functions, Azure App Service, and Azure Kubernetes Service (AKS). Implemented Redis clustering and sharding to achieve horizontal scaling and high availability. Developed Redis-based solutions for caching, leaderboards, and real-time data processing. Expertise in various aspects of Jenkins, including Plugin Management, Securing Jenkins, Performance Optimization, Analytics, Scaling Jenkins, and integrating Code Analysis and Test Phases within Jenkins for complete CI/CD pipelines. Hands-on experience in developing and maintaining CI/CD pipelines using Jenkins and AWS cloud-native services such as CodeCommit, CodeBuild, CodePipeline, and CodeDeploy. Proficient in setting up Kubernetes (k8s) clusters for running microservices and automating Kubernetes cluster management using Ansible playbooks. Developed and maintained large-scale GoLang projects, leveraging its concurrency features to handle high volumes of concurrent requests. Proficient in version control tools like Git, Subversion (SVN), and TFS, and experienced in branching, tagging, and maintaining versions across different platforms and SCM client tools like GitLab, GitHub, and Bitbucket. Skilled in scripting using Python, Ruby, and Shell Scripts for configuration management and maintaining enterprise applications. Well-versed in using JIRA, Bugzilla and ServiceNow for ticketing and defect tracking, configuring workflows, customizations, and integrating Jenkins with JIRA, GitHub, and Maven. Worked on Production Support Troubleshooting Infra and Application Issues and monitor Application Performance Metrics using ELK Stack, Dynatrace. Certified in Microsoft Azure Administrator Associate. Certified in Kubernetes Administrator. Certified in AWS Developer. Master s in Computer Technology, Eastern Illinois University , 2013 Infrastructure as a service Amazon Web Services, Azure Version Control Tools and SCM GIT, SVN, GitHub, Azure Repos, Bitbucket, Gitlab Infrastructure as Code Terraform, AWS Cloud Formation, Arm Templates Containerization Docker, Kubernetes, Helm, Open-Shift, Docker Swarm Build Tools Gradle, Ant, Maven, MSBuild, SBT Continuous Integration Tools Jenkins/Hudson, GitHub Actions, Azure DevOps Pipelines, BuildBot, Bamboo, Sonar Web technologies HTML, JAVA Script, jQuery, React, GraphQL Configuration Management Tools Ansible, Puppet Tracking Tools Jira, Bugzilla, ServiceNow, Atlassian Databases Amazon RDS, PostgreSQL, DynamoDB, SQL Server, CosmosDB, My SQL DB, Oracle, Cassandra Monitoring Tools Nagios, Splunk, Grafana, Prometheus, Cloud Watch, Cloud Trail, Azure monitor, Kibana, Datadog, New relic, ELK Stack, Dynatrace Programming and Scripting Languages Groovy, Ruby, SHELL, POWERSHELL, Python, YAML, JSON, .NET SDLC Methodologies Agile/Scrum, Kanban, Waterfall Web/ Application Servers Apache Tomcat, JBoss 4.x/5.x, WebSphere 8.5, WebLogic 10 Artifactories Nexus, Artifactory, Docker Hub Virtualization Docker, Kubernetes, VMware Operating Systems Linux (Red Hat, CentOS, Ubuntu), Windows Code Scan/Testing and ticketing tools SonarQube, Trivy, HashiCorp Vault, Veracode, Twist lock, WhiteSource, Black Duck, Selenium, JIRA, Service Now, Remedy, Connect Wise. Logging CloudWatch, CloudTrail, Azure App Insights, Azure Monitor. Client: AXA XL, Stamford, CT. SRE / Sr. DevOps Engineer Jun 2022 Current Participated in the migration of on-premises cloud storage to Microsoft Azure using Azure Site Recovery and Azure Backup, as well as deploying Azure IaaS virtual machines (VMs) and PaaS role instances into secure VNETs and subnets, and configured Azure Storage Monitor, snapshot, Load balancer etc. Administered Microsoft Azure as part of a DevOps team, focusing on automation of internal projects and configuration management. This involved configuring virtual machines, storage accounts, and resource groups. Setup Azure Active Directory Roles for Authentication and assigned roles for all resources in Azure and Authentication for Applications using Service Principal and Managed Identity. Experience with creating and managing ARM template repositories using Git and version control systems. Knowledge of ARM template validation and testing tools, such as Azure Resource Manager Validate and ARM Template Test. Proficient in writing vanilla JavaScript, without relying on frameworks or libraries. Strong understanding of JavaScript fundamentals, including prototypes, closures, and the DOM Ability to integrate ARM templates with Azure DevOps and Azure Pipelines for automated deployment and continuous integration/continuous deployment (CI/CD). Spearheaded the implementation of UCD methodologies, leveraging tools like IBM UrbanCode Deploy, to automate configuration deployments across diverse environments, driving agility and consistency in software delivery. Engineered reusable deployment templates and scripts, optimizing deployment workflows and minimizing errors, resulting in accelerated deployment cycles and enhanced reliability. Familiarity with ARM template security best practices and secure coding guidelines. Knowledge of ServiceNow's workflow and automation capabilities, including Scripted REST APIs and Orchestration. Ability to create and customize ServiceNow reports and dashboards using Advanced Reporting and Analytics. Implemented security best practices for AI/ML platforms, ensuring data privacy and compliance with industry standards. Deployed and managed A-series VMs for dev/test environments, optimizing resource utilization and reducing costs and also designed and implemented D-series VMs for mission-critical applications. Implemented HTTPS Ingress controller and use TLS certificate on AKS to provide reverse proxy, configurable traffic routing for individual Kubernetes services. Responsible to setup Self Hosted Build Agents for Azure DevOps Pipelines in Azure VNET, do upgradation. Installed and configured Prometheus and Grafana for AKS cluster monitoring and configured metric alerts. Knowledge of DevOps practices and tools like Azure Pipelines and Azure App Service to streamline performance testing and deployment. Utilized New Relic to monitor and analyze application performance, identifying bottlenecks and optimizing code for improved user experience. Used New Relic's alerting and notification capabilities to quickly respond to performance issues and outages. Implemented ServiceNow IT Service Management (ITSM) module in the Azure cloud environment to enhance incident, problem, and change management practices. Experience in working with End-to-End Test cases of Terraform using Terratest and Terrascan for detecting vulnerabilities. Proficient in using Azure Performance Analysis and Azure Monitor to troubleshoot and optimize performance issues. Migrated Containers running on OpenShift (3.9) On-Prem Clusters to AKS Public Cloud and have implemented CI/CD pipelines using OpenShift's integrated Jenkins, Git, and automation tools, streamlining the software delivery process and enabling rapid and frequent deployments. Knowledge of Azure-specific .NET frameworks such as Azure .NET SDK and Azure .NET Core. Familiarity with deploying and managing .NET applications in Azure using tools like Azure DevOps, Azure Pipelines, and Azure App Service Environment Used ELK for monitoring purposes for LOG analytics and Application monitoring and integrated this with Azure for security purpose we use search guard to protect the logs. Developed and implemented comprehensive monitoring and alerting systems for AI/ML platforms, ensuring proactive detection and resolution of issues. Leveraged Python's extensive libraries, including Pandas and NumPy, for data manipulation and analysis tasks. Experience with building reusable, composable React components. Strong understanding of React fundamentals, including state management, props, and lifecycle methods. Utilized Azure Storage and Azure Cosmos DB for data storage and management in .NET applications. Worked with RedHat OpenShift Container Platform for Docker and Kubernetes, used Kubernetes to manage containerized applications using its nodes, ConfigMaps, node-selector, Services and deployed application containers as Pods. Optimized OpenStack performance and troubleshot issues using monitoring tools like Prometheus and Grafana. Designed and configured custom DataDog dashboards and alerts tailored to project requirements, enabling proactive identification of critical issues, and ensuring prompt incident response. Utilized Nagios for monitoring log files, network monitoring, and hard drive status. Implemented Nagios event handlers for automatic restart of failed applications and services. Client: Atlanticus, Atlanta, GA. SRE / Azure Cloud DevOps Engineer Jan 2020 May 2022 Created templates and jobs in Ansible Tower for deploying Java applications to servers in different environments (Dev, QA, Stage, and Production) and automated them for periodic releases. Implemented Ansible scripts for deploying applications to remote servers, ensuring zero-downtime releases across multiple backend systems. Led the successful migration of on-premises databases to PostgreSQL, utilizing custom migration scripts and automation tools. This migration project resulted in improved database performance, reduced operational costs, and enhanced scalability for the organization. Developed and maintained automated CI/CD pipeline for code deployment using Jenkins on AWS Lambda. Created CI/CD Pipelines using Jenkins and GitHub Actions for Application deployments and Infrastructure Deployments. Experience with using Datadog for monitoring and troubleshooting applications and infrastructure in AWS. Understanding of Datadog alerting and notification capabilities, including integrations with AWS services like SNS and SQS. Used SageMaker's model explainability and interpretability features to gain insights into model behavior and improve transparency. Integrated SageMaker with other AWS services, such as AWS Recognition and AWS Comprehend, to build comprehensive machine learning solutions. Ensured data privacy and compliance with industry standards for AI/ML platforms, mitigating potential security risks and maintaining customer trust. Integrated Ansible Tower with GitHub and Vault for source code management and secret management. Integrated AWS Secrets Manager with Helm charts to manage sensitive data securely, such as API keys and database credentials. Automated processes with PowerShell scripts. DNS changes, DC builds User management. Utilized Kubernetes for container orchestration, deploying Docker containerized applications on a cluster of EC2 instances in a Linux environment. Migrated Containers running at On-Prem OpenShift cluster to EKS Cluster by writing HELM Charts and Integrated with Jenkins and GitHub actions. Collaborated with development and security teams to integrate Kafka with existing security tools and processes, enhancing overall data security posture Developed Ansible playbooks for provisioning virtual and physical instances, configuration management, patching, and software deployment on AWS environments and integrated Ansible Playbook using Terraform Provisioners and configured Terraform Statefile and Locking using DynamoDB. Configured Ansible to manage AWS environments and automate the build process for core Amazon Machine Images (AMIs) used by all application deployments, including Auto Scaling and CloudFormation scripts. Optimized Kafka performance and throughput, achieving high availability and low latency for real-time data pipelines Developed and maintained complex CloudFormation templates to deploy and manage large-scale applications and infrastructure. Implemented best practices for CloudFormation template development, including modularity, reuse, and testing. Utilized AWS CloudFormation templates for infrastructure-as-code (IaC) to create the necessary AWS resources for EKS and the Helm deployment. Worked with AWS CloudFormation templates to automate the provisioning and management of the PostgreSQL database infrastructure, facilitating infrastructure as code practices. Implemented DNS, Route 53, and AWS Elastic Load Balancer (ELB), and assigned AWS SSL Certificates to servers. Setup and build AWS infrastructure using various resources, VPC EC2, RDB, S3, IAM roles, Elastic Load balancer, EBS, Security Group, Auto Scaling, SES, SQS, SNS and RDS in Cloud Formation templates, Route53, Lambda. Worked with Terraform to manage infrastructure through terminal sessions, execute scripts, and create alarms and notifications for EC2 instances using CloudWatch. Configured Nagios plugins for EC2, RDS, and ELB and utilized Nagios for comprehensive monitoring of AWS resources, including auto-scaling. Worked with AWS Key Management Service (KMS) to enable encryption-at-rest for the PostgreSQL database, safeguarding sensitive data from unauthorized access. Designed and deployed ELK stack for centralized log management and real-time monitoring. Integrated ELK with AWS Lambda for serverless analytics. Leveraged New Relic's dashboards and visualizations to provide real-time visibility into application health and performance metrics. Configured Ansible to integrate with F5 BIG-IP load balancers for rolling deployments. Utilized Git as the version control system, creating repositories, merging branches, integrating with Jenkins for poll SCM, and integrating with Jira for issue tracking and other tasks like tagging.x Implemented Micro Services framework with Spring Boot, Node.js and OpenShift containerization platform (OCP). Implemented Microservices on OpenShift based on Docker to achieve Continuous Delivery. Client: Pragmatics Inc., Arlington, VA Role: SRE/DevOps Engineer Aug 2017 Dec 2019 Worked on different Azure Service like App Services, Key vault, function app, Blob storage, Azure Active Directory (Azure AD), Service Bus, Azure Container Registry (ACR), Azure Kubernetes service (AKS), Azure SQLDB. Responsible for creating, deploying, and managing Azure App Services, including Web Apps, API Apps. Implemented best practices for security, compliance, and governance in GCP, utilizing IAM, VPCs, Firewall rules, and encryption mechanisms to protect sensitive data and ensure regulatory compliance. Setting up CI/CD pipeline for Build and deployment process for Azure app services with Slots. Terraform Modules for entire 3-Tier Architecture resource deployment into Azure. Utilized Ansible Tower to orchestrate the deployment of multi-tier applications across development, testing, and production environments. Developed playbooks and roles to ensure consistent and reliable application deployments with minimal downtime. Proficient in writing PowerShell scripts to provision and configure Azure infrastructure, including ARM templates and Azure CLI Ability to create and manage ServiceNow workflows and business processes using Flow Designer. Experience with using ServiceNow's machine learning and artificial intelligence capabilities, such as Predictive Analytics and Intelligent Automation. Monitoring and troubleshooting Azure App Services using Azure Monitor / Application Insights tool. Created Terraform templates for provisioning Virtual Networks, Subnets, VM Scale sets, load balancers, App services, AKS cluster. Experience with integrating PowerShell with other Azure tools and services, such as Azure DevOps and Azure App Service Ability to create reusable and modular ARM templates using nested templates and linked templates. Familiarity with ARM template best practices and common pitfalls. Experience with parameterizing ARM templates for flexible deployment and reuse. Skill in using ARM templates to deploy and manage Azure resources such as virtual machines, storage accounts, and networks. Ability to debug and troubleshoot ARM template deployment issues. Continuously monitored and optimized GCP infrastructure, leveraging monitoring and logging tools like Stackdriver or Cloud Monitoring, and making recommendations for cost optimization, performance enhancement, and reliability. Deployed .NET, Python and Java Applications to AKS Cluster and deployed stateful applications by setting up persistent volumes and deployed Redis Cache Implemented Azure-based monitoring and logging for .NET applications using Azure Monitor and Azure Log Analytics. Worked with Shared Templates in Azure DevOps Pipelines and configured multiple CI/CD Pipelines from scratch using YAML. Proficient in writing .NET code that leverages Azure services such as Azure Storage, Azure Cosmos DB, and Azure Active Directory (AAD) Ability to optimize .NET application performance and troubleshoot issues in an Azure environment Azure VM SKU pricing and cost optimization strategies, including reserved instances and spot instances. Familiarity with Azure VM SKU limitations and constraints, such as maximum CPU and memory limits.. Containerized applications by writing Dockerfiles and write CI/CD Pipeline and HELM Charts for Deployments. Responsible for Managing and Provisioning Kafka Cluster, Namespaces and Topics into Azure VM s Creating and maintaining containerized micro services and configuring/maintaining Azure container registry (ACR). Utilized NC-series VMs for high-performance computing workloads. Worked Azure Kubernetes Service (AKS) to deploy a managed Kubernetes cluster in Azure and created an AKS cluster using terraform. Utilized Python to integrate with monitoring systems and APIs, enabling real-time monitoring and alerting functionalities and developed custom monitoring scripts and used Python libraries like Prometheus and Grafana to visualize and analyze system metrics. Experience with using Azure VM SKU features like autoscale, load balancing, and application gateways to optimize application performance. Written Ansible playbooks to deploy services in remote servers by Integrating with Jenkins Worked with containers, Deployment, Services, Replication Controller, Replica sets, Auto-scaling, rolling update and blue/green deployment. 24/7 on call production support. Implemented ELK-based monitoring and analytics for Azure Kubernetes Service (AKS) and Azure App Service Worked on Production Support Troubleshooting Infra and Application Issues and monitor Application Performance Metrics using ELK Stack. BMO Harris Bank, Chicago, IL. Role: Build & Release Engineer. Sept 2015 - July 2017 Responsible for Build and Deployment of the java applications on different environments like Dev/QA & Production. Involved in Installing Jenkins on a Linux machine and created a master and slave configuration to implement multiple parallel builds through a build farm. Used Jenkins, Build forge for Continuous Integration and deployment into Tomcat Application Server. Installing, configuring, and administering Jenkins CI tool on Linux machines. Application installation and troubleshooting for Solaris, Red Hat Linux, and CentOS. Designing, Administration and troubleshooting database servers like MySQL 5.7, MySQL 5.6, MySQL 5.5 Installed and Configured the Nexus repository manager for sharing the artifacts within the company. Worked on upgrading the Splunk Indexers, Search heads and forwarders. Experience to Integrate Release with Jira for Requirements Management, Bamboo/Deploy for Build & Deployment and Service Now for Change Management process via plugins. Wrote puppet modules for the Tomcat/Apache/Splunk forwarder services in our distributed infrastructure. Administered Linux Servers Linux (Red hat RHEL 3/4/5) for several functions including managing Apache/Tomcat server, mail server, MySQL database and firewalls in both development and production. Created monitors, alarms, and notifications for EC2 hosts using CloudWatch. Worked on Cloud automation using AWS Cloud Formation templates. Used OpenStack and AWS cloud services to launch Linux and windows machines, created security groups and written basic PowerShell scripts to take backups and mount network shared drives. Responsible for Installation, Configuration Management, Maintenance and Systems Development of Red Hat Linux, Oracle Enterprise Linux / UNIX Systems. Designed Puppet Modules to manage configurations and automate installation process. Implementing a Continuous Delivery framework using Jenkins, Puppet, Maven & Nexus in Linux environment. Assisted the development team in integrating DynamoDB with the application code by utilizing serverless architecture with AWS Lambda and API Gateway. Used JIRA custom workflows to track issues and JBoss application server to deploy the application. Maintained all the Red hat servers Linux entitled with the latest subscriptions and also created Instances on OpenStack. Performing server s health checks on daily basis, by logging into Unix system and monitoring logs, and processes. Client: FISERV, Atlanta, GA. Role: Linux System Administrator July 2013 Aug 2015 Installed, Configurator and Managed RedHat Enterprise Linux 5.x. Involved in Installing, configuring, and maintaining application Servers like WebSphere and Web servers like Web Logic, Apache, HTTP and Tomcat on UNIX and Linux. Managed users, groups, and permissions configured LDAP and given SUDO Access in test and development servers and allowed passwordless logins for authorized users on servers. Installed and upgraded packages on Red Hat Linux servers through YUM and RPM tools. Responsible for Oracle database security and patching, Oracle database capacity analysis and planning. Installing Red Hat Linux using kick-start and applying security policies for hardening the server based on the company policies. Hands-on experience in server builds, migrations such as OS migrations, Application migrations, and Storage migrations. Involved in developing dynamic web-based user interfaces using HTML and JavaScript. Installing and configuring SSH (Secure Shell) encryption to access securely on Ubuntu and Red hat Linux. Keywords: continuous integration continuous deployment quality analyst artificial intelligence machine learning javascript sthree database active directory information technology ffive Connecticut Georgia Illinois North Carolina Virginia |