
GCP Vertex Engineer (Onsite) at Richardson, Texas, USA
Email: [email protected]
From:

Naveen,

Softcom Systems Inc

[email protected]

Reply to:   [email protected]

Details :

Client : Infosys

Role: GCP Data Engineer

Location : Onsite, Richardson, TX

Type : Contract

Required:

            Python, Big Data, Hadoop, Hive, SQL

            GCP, Vertex AI

            Strong experience as a data analyst using SQL and Python scripting

Recommended:

            CI/CD ML pipelines, AutoML

            Tableau, Minitab, Databricks, Power BI

Job Description:

            10+ years of overall experience in SQL, Hive, Python, Big Data and Hadoop, and data mining with large sets of structured and unstructured data.

            Experience implementing data preparation solutions that handle data transformation, as well as handling user stories.

            Experience developing and testing data ingestion, preparation, and dispatch jobs.

            Hands-on experience retrieving and uploading data and creating customized reports from Microsoft SQL Server databases.

            Hands-on experience in Python programming for complex data transformations and ETL/ELT processes.

            Experience creating and running Python/Spark scripts that extract large data sets.

            Proficient in Big Data, with a deep understanding of the Hadoop Distributed File System and ecosystem (HDFS, MapReduce, Hive, Sqoop, Oozie, ZooKeeper, HBase, Flume, Pig, Apache Kafka) across industries such as retail and communications.

            Experience in data acquisition, data validation, predictive modelling, and data visualization.

            Experienced in working with various methodologies, including SDLC, Agile, and Waterfall, to ensure efficient project execution and data analysis.

            Proficient in creating Apache Spark RDD transformations on data sets in the Hadoop data lake. Experience using Apache Oozie to combine multiple MapReduce, Hive, Pig, and Sqoop jobs into one logical unit of work.

            Proficient in Python libraries such as NumPy, pandas, Matplotlib, Seaborn, SciPy, and PyTorch, and in R's ggplot2, for comprehensive data analysis and visualization.

            Expert in data visualization tools like Tableau, Power BI, and MS Excel for creating compelling, insightful reports and dashboards.

            Skilled in working with GCP Vertex AI and AWS, including Amazon EMR (Elastic MapReduce), to use cloud resources for scalable, efficient data analysis.

            Capable of handling various database systems, such as MySQL, SQL Server, and NoSQL databases, for data storage and retrieval.

            Experience with Agile processes; knowledge of Jira for project management and GitHub for code repositories.

            Proficiency in data cleaning and data wrangling techniques to ensure data quality and accuracy.

Tue Dec 19 22:31:00 UTC 2023


Location: Richardson, Texas