Previous Job
Data Wrangling and Blending
Ref No.: 17-02137
Location: Jersey City, New Jersey
Position Type:Contract
Start Date: 06/28/2017
Title: Data Wrangling (Paxata, Alteryx)
Location: Jersey City, NJ
Education: Bachelor's Degree
Skills: Paxata, Alteryx, Data Wrangling
Job Description:
Experience between 7 to 12 years in pure IT. Of that around recent past 3 years in performing Data Wrangling function and should be able to perform consultative as well as individual contribution activities in the Data wrangling, blending and preparation activities in order, to support the Data Science Team. Should have experience working with and presenting to business and IT management.
Major Responsibilities:
  • Data Wrangling, Blending and Data Preparation activities
  • Apply business analysis techniques and data management best practices to solve challenging business scenarios
  • Reinforce the concept of "data as an asset” and provide consultative services to the stakeholders on how this can be achieved for the Big Data Ecosystem and Big Data Reservoir that the client is building.
  • Assist data scientists within the organization with data asset lifecycle management activities such as:
  • Taking down and understanding data requirements for model building
  • Analyzing all possible data sources from where the data can be sourced for the problem analysis and model building
  • Blend the data from various sources to create a dataset for analysis
  • Use tools such as Paxata, Alteryx, Trifacta, etc. for dynamic data blending and integration
  • Use R or Python packages to perform tests on data
  • Use Tableau or Qlikview like tools to visualize the data. Alternatively some experience in data visualization using R or Python packages is also preferred
  • Create data aggregations at required levels and perform reconsolidation checks to make sure the data is correct
  • Standardize, sanitize/cleanse and validate data elements
  • Create metadata for the datasets being produced
  • Document entire workflows with detailing each step in creating the datasets
  • Create data integration workflows for each data set
  • Operationalize / automate recurring workflows to generate datasets in future
  • Provide datasets to Data Scientists as per requirements
  • Alter datasets as per the model needs
  • Bring the external working knowledge from other assignments and industry best practices to the table as appropriate.
  • Should be well versed with the predictive model building, machine learning and statistical data analysis concepts.

Required Skills and Technical expertise:
Good working experience with any of the following data blending tools:
  • Paxata
  • Alteryx
  • Trifacta
  • Talend Data Prep
  • Good working experience with any of the following data blending tools:
  • Tableau
  • Qlikview
  • Spotfire
  • Power BI
  • Solid proficiency in MS Excel
  • Analyze data requirements and identify data elements and functionality to support the Data Science activities
  • Bachelor's degree and/or equivalent combination of education and work experience in related field.
  • Demonstrated effectiveness working in dynamic and changing environment.
  • Good knowledge of Microsoft Word, Power Point and Access.
  • Proficiency with SQL is required.
  • Knowledge of any Data base systems such as Oracle, SQL Server, MySQL, Cassandra, Hive, etc. is required.
  • Solid understanding in Data Governance, Data Quality, and Data Lineage concepts is required.
  • Knowledge of Data warehouse concepts is preferred.
  • Working knowledge of project management concepts is a plus