Database Engineering
Ref No.: 18-03117
Location: Plano, Texas
Primary Skillset:
Data Engineering Tools/Languages - Spark/Scala, Pig, Hive, HDFS, Java, etc.
DevOps and CI/CD Tools - Jira, Jenkins, Chef, Puppet, Ansible, Code Cloud or other cloud repos like Git, etc.
Preferred working knowledge:
AWS Tools - EC2, EMR, Athena, Hadoop/Hortonworks on EC2, S3, DynamoDB, Lambda, etc.
Knowledge of Tivoli Workload Scheduler (TWS) is preferred
Good domain knowledge of TV content data such as viewership, channel/program metadata, etc.
• Expert-level competency in building scalable, high-performing, and robust applications using Scala/Spark
• Excellent knowledge of Hadoop query languages, including Pig and Hive
• Advanced Linux shell scripting skills are essential
• Experience with source code management tools such as SVN or GitHub
• Strong knowledge of agile methodologies
• Extensive experience in technical data analysis, functional design, modeling, solution development processes, software testing, and deployment
• Should be able to work independently with data scientists and business clients to understand data requirements, analyze the various data sets in the data lake, and curate and store them in a format that data scientists can consume to create critical insights
• Should be able to work on multiple projects in parallel and also provide solutions and support for production issues
• Should be able to quickly come up to speed on and adopt new technologies
• Ability to mentor and develop others in big data technologies