Python Data Engineer Raleigh, North Carolina-Hybrid at North, Virginia, USA
Email: [email protected]
From:

Gulshan,

Stellent IT

[email protected]

Reply to:   [email protected]

Python Data Engineer

Raleigh, North Carolina-Hybrid

Phone + Skype

24+ months

Job Description

Must Have:

5+ years of experience as a data engineer or Python engineer

Strong knowledge of Python libraries (pandas specifically) and lambda

Experience with SQL, Spark SQL, and EMR

A deep understanding of performance tuning

AWS Cloud experience

Plus:

Experience in the finance industry

Experience working with vendors such as JMP, Morningstar, JPMC, or other large financial data vendors

Day to Day:

UAP is an initiative to create a common, unified acquisition platform that provides acquisition as a service through a low-code, configuration-driven architecture for Fidelity needs.

Business Capabilities:

1) Self-service capability to search for and subscribe to or request existing datasets

2) Self-service capability to request a new vendor dataset

3) Integration and automation with MDD to accelerate new feed registration

4) Accelerators to acquire and onboard new feeds

5) Accelerators and capabilities to switch vendors

6) Drop zone to manage data distribution based on licensing/subscription

7) Registry and inventory to manage the list of vendor feeds and their consumption patterns

8) FinOps model for consumption

9) FinOps model to derive ROI

10) Multi-tenant capability for applications to coexist on the same infrastructure with the right level of abstraction based on authorization.

11) Capability to provide reports comparing vendors.

a) Data overlap and differences

b) Coverage

c) SLA and past performance in meeting the SLA

Technology Enablers:

Integration and self-service with MDD, vendor management/procurement, and governance

Configuration-driven feed acquisition

Configuration-driven transformation of vendor data into a canonical format

Capability to compare data across vendors and generate gap reports

Registry to maintain feed metadata, contacts, SLAs, and owners

Lineage to track usage (runtime information on who consumes what data on a daily basis)

Configuration-driven distribution

Self-service capabilities for onboarding new feeds

Ingestion adaptors for known file types

Enable data for analytics and exploration use cases

Reports (Quality, Coverage, Data Gaps, Data Catalogs, Feed metrics...)

Thu Jan 05 20:51:00 UTC 2023