Lead Big Data Developer at Remote, Remote, USA |
Email: [email protected] |
From: saloni chaurasia, tekinspirations [email protected] Reply to: [email protected] Hi, I Hope you are doing great. Please find below position if you have any matching candidate as per requirement. Please send me updated resume with candidate information. Lead Big Data Developer Location: Remote in DMV and NY Metro Duration: 6+ months Need only EST Zone Candidates Must have: Need candidates who have worked at Microsoft or Amazon in Past 10+ years of candidate profile Overview : We are searching for a Lead Big Data Developer to support our client, a large financial regulator, to be a part of a multi-year initiative to build a new system that assists with the collection and processing of up to 100 million records to support financial regulations. The Lead Big Data Developer will be responsible for designing, ingesting, storing, validating, and disseminating data in a consumable format for intelligence teams to gain insights. Requirements : BS degree in computer science or related field 7+ years of experience in programming language Java or Scala 7+ years of experience in ETL projects 5+ years of experience in Big Data projects 3+ years of experience with API development (REST API's) Strong experience in Java or Scala Strong experience in big data technologies like AWS EMR, AWS EKS, Apache Spark Strong experience with serverless technologies like AWS Dynamo DB, AWS Lambda Strong experience in processing with JSON and csv files Must be able to write complex SQL queries Experience in performance tuning and optimization Familiar with columnar storage formats (ORC, Parquet) and various compression techniques Experience in writing Unix shell scripts Unit testing using JUnit or ScalaTest Git/Maven/Gradle Code Reviews Experience with CI/CD pipelines BPM/ AWS Step Functions Python scripting Performance testing tools like Gatling or JMeter Responsibilities : Understand complex business requirements Design and develop ETL pipeline for collecting, validating and transforming data according to the specification Develop automated unit tests, functional tests and performance tests. Maintain optimal data pipeline architecture Design ETL jobs for optimal execution in AWS cloud environment Reduce processing time and cost of ETL workloads Lead peer reviews and design/code review meetings Provide support for production support operations team Implement data quality checks. Identify areas where machine learning can be used to identify data anomalies Regards, Saloni Chaurasia { Technical Recruiter } TEK Inspirations LLC Pvt. Ltd. | 13573 Tabasco Cat Trail, Frisco, TX 75035, United States E-Mail: [email protected] Keywords: continuous integration continuous deployment database New York Texas |
[email protected] View all |
Thu Mar 14 20:12:00 UTC 2024 |