Data Architect (Databricks) || Manhattan, NY
Email: [email protected]
Hello, this is Priyanka Tyagi from Technocraft Solutions.

I have an urgent requirement with my client for a Data Architect (Databricks).

Please let me know if you are comfortable with this role and location. If you have any referrals for this position, please let me know; I'll really appreciate it.

Please see the detailed job description below.

Role: Data Architect (Databricks)

Location: Manhattan, NY

Job Description:

As a Senior Architect in the Data Engineering & Analytics team, you will design and develop Databricks data and analytics solutions that sit atop vast datasets gathered by retail stores, restaurants, banks, and other consumer-focused companies.
The challenge will be to create high-performance solutions, cutting-edge analytical techniques (including machine learning and artificial intelligence), and intuitive workflows that allow our users to derive insights from big data that in turn drive their businesses.
You will have the opportunity to create high-performance analytic solutions based on data sets measured in the billions of transactions, along with front-end visualizations, to unleash the value of big data.
You will also architect innovative, data-driven analytical solutions using Databricks, identify opportunities to support business and client needs in a quantitative manner, and facilitate informed recommendations and decisions through activities such as building ML models and automated data pipelines, designing data architecture and schemas, and running jobs on big data clusters using different execution engines and programming languages such as Python, Spark, and Scala.

Responsibilities:

Hands-on developer who writes good quality, secure code that is modular, functional, and testable.
Drive the evolution of Data & Services products/platforms, with an impact-focused approach to data science and engineering.
Design and implement scalable data architecture and data pipelines.
Solve complex problems with multi-layered data sets, and optimize existing machine learning libraries and frameworks.
Provide support for deployed data applications and analytical models by being a trusted advisor to Data Scientists and other data consumers, identifying data problems and guiding issue resolution with partner Data Engineers and source data providers.
Ensure proper data governance policies are followed by implementing or validating Data Lineage, Quality checks, classification, etc.
Discover, ingest, and incorporate new sources of real-time, streaming, batch, and API-based data into our platform to enhance the insights we get from running tests and expand the ways and properties on which we can test.
Experiment with new tools to streamline the development, testing, deployment, and running of our data pipelines.
Participate in the development of data and analytic infrastructure for product development

Continuously innovate and determine new approaches, tools, techniques & technologies to solve business problems and generate business insights & recommendations

Partner with roles across the organization including consultants, engineering, and sales to determine the highest priority problems to solve
Evaluate trade-offs between many possible analytics solutions to a problem, taking into account usability, technical feasibility, timelines, and differing stakeholder opinions to make a decision

Break large solutions into smaller, releasable milestones to collect data and feedback from product managers, clients, and other stakeholders.

Evangelize releases to users, incorporate feedback, and track usage to inform future development

Work with small, cross-functional teams to define the vision and establish team culture and processes
Consistently focus on key drivers of organization value and prioritize operational activities accordingly

Escalate technical errors or bugs detected in project work
Maintain awareness of relevant technical and product trends through self-learning/study, training classes, and job shadowing.
Support the building of scaled machine learning production systems by designing pipelines and engineering infrastructure.

Mandatory Skills Description:

3+ years of strong architectural experience with Databricks on a cloud platform such as AWS or Azure.
Excellent understanding of Databricks security, clusters, user management, deployment and performance tuning.
Working proficiency in using Python/Scala, Spark (tuning jobs), SQL, Hadoop platforms to build Big Data products & platforms.
Experience in working with CI/CD of Databricks Solutions.
Experience in working with SQL databases such as Postgres, MS SQL Server, Oracle, Snowflake, etc.
Hands-on experience with Hadoop big data tools (Hive, Impala, Spark) preferred
Good troubleshooting and debugging skills.
Proficient in standard software development, such as version control, testing, and deployment
Demonstrated basic knowledge of statistical analytical techniques, coding, and data engineering
Ability to quickly learn and implement new technologies
Ability to solve complex problems with multi-layered data sets
Ability to innovate and determine new approaches & technologies to solve business problems and generate business insights & recommendations.
Ability to multi-task, with strong attention to detail
Flexibility to work as a member of matrix-based, diverse, and geographically distributed project teams
Good communication skills, both verbal and written, and strong relationship, collaboration, and organizational skills

Nice-to-Have Skills:

Experience with performance tuning of database schemas, databases, SQL, ETL jobs, and related scripts
Experience participating in complex engineering projects in an Agile setting, e.g., Scrum

Thanks and Regards,

Priyanka Tyagi

Associate Recruiter

Technocraft Solutions LLC

Email: [email protected]

www.technocraftsol.com

Keywords: continuous integration continuous deployment machine learning information technology microsoft New York
Thu Oct 12 20:29:00 UTC 2023