GCP Vertex Engineer ------ Richardson, TX (Onsite) at Richardson, Texas, USA |
Email: [email protected] |
From: Naveen, Softcom Systems Inc [email protected] Reply to: [email protected] Details : Client : Infosys Role: GCP Data Engineer Location : Onsite, Richardson, TX . Type : Contract Required: Python, Bigdata Hadoop, Hive, SQL GCP, Vertex AI Strong Experience as Data analyst using SQL, Python Scripting Recommended: CI/CD ML Pipelines, Auto ML Tableau, Minitab, Databricks, Power BI Job Description: Overall 10+ yrs. of experience in SQL, Hive, Python Bigdata and Hadoop, Data mining with large data sets of Structured and Unstructured data Experience in implementing the solution for data preparation which is responsible for data transformation as wells as handling user stories. Developing and testing data Ingestion/Preparation/Dispatch jobs. Hands on Experience in Data Retrieval and uploading and creating customized reports from Microsoft SQL server database. Hands on Experience in python programming for performing complex data transformations and ETL/ELT processes. Experience in Creating and ran python/spark scripts that require extraction of large data sets. Proficient in Big Data with deep understanding of the Hadoop Distributed File System and Eco System (HDFS, Map Reduce, Hive, Sqoop, Oozie, Zookeeper, HBase, Flume, PIG, Apache Kafka) in a range of industries such as Retail and Communication sectors Data Acquisition, Data Validation, Predictive Modelling, and Data Visualization. Experienced in working with various methodologies, including SDLC, Agile, and Waterfall, to ensure efficient project execution and data analysis. Proficient in creating Apache Spark RDD transformations on Data sets in the Hadoop data lake. Used Apache Oozie to combine multiple jobs for Map Reduce, Hive, Pig, Sqoop into one logical unit of work Proficient in Python Libraries like NumPy, Pandas, Matplotlib, Seaborn, Scipy, ggplot2, and Pytorch for comprehensive data analysis and visualization. Expert in data visualization tools like Tableau, Power BI, and MS Excel for creating compelling, insightful reports and dashboards. Skilled in working with GCP Vertex AI & AWS to leverage cloud resources for data analysis, including EMR (Elastic MapReduce), ensuring scalability and efficiency. Capable of handling various database systems, such as MYSQL, SQL Server, and NoSQL, for data storage and retrieval. Experience in Agile process, Knowledge of using Jira for project management and GitHub for code repositioning. Proficiency in data cleaning and data wrangling techniques to ensure data quality and accuracy Keywords: continuous integration continuous deployment artificial intelligence machine learning business intelligence microsoft Texas |
[email protected] View all |
Tue Dec 19 22:31:00 UTC 2023 |