Previous Job
Previous
Data Engineer 3
Ref No.: 18-13231
Location: San Jose, California
Design, build, and manage complex analytics data models in Hive/Hadoop for GTM Analytics team across all customer journey from Acquisition, Engagement, and Retention. The analytics data marts will be used by data analysts in GTM Analytics and other team to do deep dive analysis, build analytics dashboard, or other data science project.
Design, build, deploy, and maintain new data models ETL pipeline with SQL query, Python, Oozie, and other script language and create/maintain workflow using Oozie.
Ensure overall data quality.

- Querying and manipulating large data sets for analytical purposes using SQL-like languages (Hive is strongly preferred)
- Experience with Hadoop/big data environments to synthesize and analyze data.
- professional experience in the data warehouse space
- Good attention to detail and ability to QA multiple data sources
- Experience working on building scalable ETL pipelines, data warehousing and schema modeling
- Experience working with Oozie Workflow
- Experience with script language such as Python
- 2 - 3 years of relevant experience

Skills:
Required
DATA WAREHOUSING
ENGINEER
ETL
HIVE
SQL
Additional
DATA SOURCES
OOZIE
QA
WORKFLOW
APACHE HADOOP OOZIE
DATA MODELS
DATA QUALITY
DATA SCIENCE
DATA WAREHOUSE
DATABASE
DATABASES
HADOOP
PYTHON