Previous Job
Data Engineer
Ref No.: 18-07724
Location: Mountain View, California
Job description
As a Data Engineer at Customer, your core responsibility will be to maintain and scale our infrastructure for analytics as our data volume and needs continue to grow at a rapid pace. This is a high impact role, where you will be driving initiatives affecting teams and decisions across the company. You'll be a great fit if you thrive when given ownership, as you would be the key decision maker in the realm of architecture and implementation.

• Architect systems and end-to-end solutions that provide fast, efficient and reliable interfaces to heterogeneous data, meta data for internal users of the analytics infrastructure.
• Automate existing processes and create systems that favor self-service data consumption.
• Own the quality of our analytics data. Implement a robust monitoring & logging framework that guarantees the traceability of inevitable incidents.
• Evaluate whether the best solution for each problem at hand is to build, buy or contract the work.
• Interface with data scientists, analysts, product managers and all other customers of the analytics infrastructure to understand their needs and expand the infrastructure as we grow.

Qualifications and Skills
• 5-10+ years of experience in data, web or mobile services.
• Data architecture skills.
• Experience in SQL or similar languages and development experience in at least one scripting language (Python, Perl, etc.).
• Experience with Redshift and other AWS technologies
• 8+ years of experience working with large data sets and distributed computing tools (Map/Reduce, Hadoop, Hive, Spark etc.)
• 8+ years utilizing Object Oriented design and programming with Scala/Python skills to design, develop, and maintain large-scale web applications.
• Experience with data sets, Hive, and data visualization tools is a plus.
• BA/BS in Computer Science, Math, Physics, or other technical field.

Additional bonus:
• Deep knowledge of machine learning, information retrieval, data mining, statistics, NLP or related field.
• years of experience building machine learning networks to scale problems at scale with Knowledge using sci-kit learn, spark MLlib etc.