10+ ONLY - Java Developer + Apache PySpark Engineer @ REMOTE at Apache, Oklahoma, USA |
Email: [email protected] |
Hello, Hope you are doing well, We have the below requirement open. Please send me your candidates updated Resume to [email protected] Role: Java Developer + Apache PySpark Engineer Location: REMOTE Job Roles / Responsibilities: NOTE : Need Passport number and LinkedIn ID and please do mention Current Location and Visa of candidate while sending the Profile We are looking for a skilled Java + Apache PySpark Engineer to join our team. The ideal candidate will have strong expertise in Java programming and hands-on experience with Apache PySpark for developing scalable and-high-performance data processing applications. As a Java + Apache PySpark Engineer, you will be responsible fr designing, implementing, and optimizing data pipelines, ETL processes, and analytical solutions to support our business needs effectively. **Responsibilities:** Design, develop, and maintain data processing applications using Java and Apache PySpark. Collaborate with data scientists, analysts, and stakeholders to understand requirements and translate them into technical solutions. Develop and optimize ETL pipelines for ingesting, transforming, and loading large volumes of data from diverse sources. Implement data cleansing, aggregation, and enrichment techniques to ensure data quality and integrity. Perform performance tuning and optimization of PySpark jobs for maximum efficiency and scalability. Integrate data processing applications with external systems, databases, and APis as needed Develop and maintain unit tests, integration tests, and documentation for the codebase. Stay updated with the latest technologies, frameworks, and best practices in big data processing and analytics. Participate in code reviews, design discussions, and knowledge sharing sessions within the team. **Requirements:** Bachelor's degree in Computer Science, Information Technology, or related field. Proven experience as a Java developer with strong proficiency in core Java concepts and frameworks. Hands-on experience with Apache PySpark for distributed data processing and analytics. Familiarity with big data technologies such as Apache Hadoop, Apache Kafka, and Apache Hive. Experience with cloud platforms like AWS, Azure, or GCP is a plus. Strong understanding of data structures, algorithms, and database concepts. Excellent problem-solving, analytical, and communication skills. Ability to work effectively in a collaborative team environment. Experience with version control systems like Git. Knowledge of Python programming is a plus. Experience with containerization and orchestration tools like Docker and Kubernetes is a plus. Thanks & Regards Mohd Azhar Uddin Email: [email protected] LinkedIn: https://www.linkedin.com/in/azhar-uddin-13b7b7173/ -- Keywords: information technology Idaho 10+ ONLY - Java Developer + Apache PySpark Engineer @ REMOTE [email protected] |
[email protected] View all |
Thu May 09 00:38:00 UTC 2024 |