Anvesh Are - Data Engineer |
[email protected] |
Location: Herndon, Virginia, USA |
Relocation: Yes |
Visa: H1B |
Anvesh
GCP Data Engineer
Cell: 219-209-2540
Email ID: [email protected]

PROFESSIONAL SUMMARY
7 years of experience in Data Analysis and Data Engineering across Commercial, Retail and Health domains, covering Data Wrangling, Data Scrubbing, ETL implementation and orchestration, Data Visualization to draw meaningful insights, Big Data Technologies, Data Warehousing and Cloud.
Extensive experience in analyzing, developing, managing and implementing various standalone and client-server enterprise applications using Python and Django, and in mapping requirements to systems.
Well versed in Agile with SCRUM, the Waterfall Model and Test-Driven Development (TDD) methodologies.
3+ years of experience in Azure Cloud, Azure Data Factory, Azure Data Lake Storage, Azure Synapse Analytics, Azure Analytical services, Azure Cosmos DB (NoSQL) and Databricks.
Experience in developing web applications using Python, Django, C++, XML, CSS, HTML, JavaScript and jQuery.
Implemented Splunk solutions in highly available, redundant, distributed computing environments.
Involved in product releases and worked closely with DevOps teams during production deployments.
Involved in the Continuous Integration/Continuous Deployment (CI/CD) process.
Experience in analyzing data using Python, R, SQL, Microsoft Excel, Hive, PySpark and Spark SQL for Data Mining, Data Cleansing, Data Munging and Machine Learning.
Extensive experience in applying Data Mining solutions to various business problems and generating data visualizations using Tableau, Power BI, Birst and Alteryx.
Extensive experience in IT data analytics projects, with hands-on experience in migrating on-premises ETLs to Google Cloud Platform (GCP) using cloud-native tools such as BigQuery, Cloud Dataproc, Google Cloud Storage and Composer.
Hands-on experience with GCP services: BigQuery, GCS buckets, Cloud Functions, Cloud Dataflow, Dataproc and Stackdriver.
Designed and implemented Splunk-based best-practice solutions.
Experience working on Healthcare data, developing data pre-processing pipelines for data such as DICOM and non-DICOM images of X-rays, CT scans, etc.
Excellent knowledge of Machine Learning, Mathematical Modeling and Operations Research.
Comfortable with R, Python, SAS, Weka, MATLAB and relational databases.
Deep understanding of and exposure to the Big Data ecosystem.
Good experience in developing web applications implementing Model-View-Controller (MVC) architecture using Django, Flask, Pyramid and other Python web application frameworks.
Experience in working with a number of public and private cloud platforms such as Amazon Web Services (AWS) and Microsoft Azure.
Experience with Hadoop distribution platforms (Hortonworks, IBM BigInsights and Cloudera) and with the cloud platforms GCP and AWS.
Helped manage the strategy of the Splunk Business Unit within the company.
Experience in migrating SQL databases to Azure Data Lake, Azure Synapse, Azure Data Lake Analytics, Azure SQL Database, Databricks and Azure SQL Data Warehouse; controlling and granting database access; and migrating on-premises databases to Azure Data Lake Store using Azure Data Factory.
Practical knowledge of Databricks' Unified Data Analytics, the Databricks Workspace User Interface, Databricks Notebook Management, Delta Lake with Python and Delta Lake with Spark SQL.
Expert knowledge of creating, updating, maintaining, scheduling (through calendars) and troubleshooting Control-M jobs and batch flows.
Extensive experience in Amazon Web Services (Amazon EC2, Amazon S3, Amazon SimpleDB, Amazon RDS, Elastic Load Balancing, Elasticsearch, Amazon MQ, AWS Lambda, Amazon SQS, AWS Identity and Access Management, Amazon CloudWatch, Amazon EBS and AWS CloudFormation).
Basic understanding of data modelling concepts: OLTP, OLAP, and Star and Snowflake data models.
Expertise in Software Testing Life Cycle (STLC) and QA methodologies under multiple operating systems.
Experienced in API, front-end, back-end and black-box testing.
Experienced in using version control tools and repositories such as SVN and Git.
Expertise in software engineering methodologies such as Waterfall, Agile and SCRUM.