Home

Afreen najam - Data egineer
[email protected]
Location: Kansas City, Kansas, USA
Relocation: no
Visa: H1B
Dear Recruiters,

Hope you are doing great. We have the following experienced quality consultants for Sr. Big Data Engineer available for any c2c positions, and please let me know if you have any open C2C positions.

Please send us C2C positions at [email protected] or reach me at 8325484963

Our consultant- having genuine experience.


Lavanya Data Engineer
BACKGROUND SUMMARY:
Big Data Engineer/ Data Engineer with over 7+ years of overall experience as software developer in
design, development, deploying and large scale supporting large scale distributed systems.
In depth understanding/knowledge of Hadoop Architecture and various components such as HDFS, MR,
Hadoop GEN2 Federation, High Availability and YARN architecture and good understanding of
workload management, scalability and distributed platform architectures.
Assisted Deployment team in setting up Hadoop cluster and services.
Having good knowledge in Benchmarking & Performance Tuning of cluster.
Designed and implemented a product search service using Apache Solr.
Good experience in Oozie Framework and Automating daily import jobs.
Implemented various algorithms for analytics using Cassandra with PySpark and Scala.
Extensive implemented ETL design and development using Python, PySpark on Foundry
(Palantir)
Create data transformations in PySpark and Python code in the Palantir Foundry platform that is
implemented into the data pipeline or used to create data analytics reports
Have experience in using Contour and Data Preparation in Palantir Foundry platform
Experienced of building Data Warehouse in Azure platform using Azure data bricks and data factory.
Experienced in managing Hadoop clusters and services using Cloudera Manager.
Experience in working with the Hive data warehouse tool-creating tables, data distribution by
implementing partitioning and bucketing, writing and optimizing the HiveQL queries
Expertise in using various Hadoop infrastructures such as Map Reduce, Pig, Hive, Zookeeper, Hbase,
Sqoop, Oozie, Flume, Drill and PySpark for data storage and analysis.
Experience in developing a data pipeline through Kafka-PySpark API.
Proficient in data processing like collecting, aggregating, moving from various sources using Apache
Flume and Kafka.
Experience in developing custom UDFs for Pig and Hive to incorporate methods and functionality of
Python/Java into Pig Latin and HQL (HiveQL) and Used UDFs from Piggybank UDF Repository.
Experienced in running query - using Impala and used BI tools to run ad-hoc queries directly on Hadoop.
Collected logs data from various sources and integrated in to HDFS using Flume.
Experience using Job scheduling tools like Cron, Tivoli and Automic.
Experienced in troubleshooting errors in Hbase Shell/API, Pig, Hive and MapReduce.
Highly experienced in importing and exporting data between HDFS and Relational Database
Management systems using Sqoop.
Extensive experience as Hadoop and PySpark engineer and Big Data analyst.
Excellent understanding of Hadoop architecture and underlying framework including storage
management.
Hands-on experience with Amazon EC2, Amazon S3, Amazon RDS, VPC, IAM, Amazon Elastic Load
Balancing, Auto Scaling, Cloud Front, CloudWatch, SNS, SES, SQS and other services of the AWS
family.
Selecting appropriate AWS services to design and deploy an application based on given requirements.
Have experience in installing, configuring and administering Hadoop clusters for major Hadoop
distributions like CDH4, and CDH5.
Good understanding of NoSQL Databases and hands-on work experience in writing applications on No
SQL databases like Cassandra and MongoDB.
Experienced in Creating Viz boards for data visualization in Plat fora for real - time dashboard on Hadoop.
Good knowledge in querying data from Cassandra for searching, grouping and sorting.
Good Knowledge in Amazon AWS concepts like EMR and EC2 web services which provides fast and
efficient processing of Big Data.
Keywords: business intelligence sthree active directory

To remove this resume please click here or send an email from [email protected] to [email protected] with subject as "delete" (without inverted commas)
[email protected];1055
Enter the captcha code and we will send and email at [email protected]
with a link to edit / delete this resume
Captcha Image: