Data Engineer (Distributed Systems) 100% Remote Only Druid Experience needed at Remote, Remote, USA |
Email: [email protected] |
From: Anu Bandi, w3global Inc [email protected] Reply to: [email protected] Hi Data Engineer (Distributed Systems) Remote TX Design and Development: Design and implement data ingestion pipelines to load data into Apache Druid. Develop and optimize Druid schemas and data models for efficient querying and performance. Design and implement data aggregations against event-level data in distributed database (preferably in Druid) Migration from existing data stores to Druid Optimization: Implement best practices for data ingestion, storage, and querying to improve system performance. Optimize query performance and resource utilization. Key Qualifications 9+ years of experience in data engineering or analytics in distributed data systems, i.e., Druid, Snowflake, Redshift, CockroachDb, or Pinot (Druid experience preferred). Strong understanding of distributed data store architecture, data ingestion methods, scaling mechanisms and querying capabilities. Proficiency in SQL and experience with data modeling and ETL processes. Hands-on application coding experience in Java, Python or others. A fast learner of new technologies. Experience with performance tuning and optimization of data systems. Excellent problem-solving skills and the ability to work independently as well as in a team. Familiarity with other big data technologies (e.g., Hadoop, Spark, Kafka) is a plus. Highly Beneficial: Familiarity with cloud platforms (e.g., AWS, GCP, Azure) and containerization (e.g., Docker, Kubernetes). Knowledge of programming languages such as Scala. Experience generating reports against distributed data systems with SQL and visualization tools. Keywords: Texas Data Engineer (Distributed Systems) 100% Remote Only Druid Experience needed [email protected] |
[email protected] View all |
Thu Aug 22 01:01:00 UTC 2024 |