Previous Job
Big Data Scientist
Ref No.: 18-07564
Location: Oaks, Pennsylvania
Job Duties
Extract data from a variety of relational databases, manipulate, explore data using quantitative, statistical and visualization tools.
Work on Distributed file system and Big Data technologies like Hadoop, Advanced and Statistical Analytics, Real Time analytics.
Conduct statistical analysis using appropriate tools and advanced techniques.
Work on the acquisition, management, and documentation of data (including geo-spatial data).
Work on NTTDATA's Big Data accelerators and Visualization Framework.
Data wrangling of heterogeneous data to explore and discover new insights.
Participate in proposal writing, client deliverables.
Assist business development teams with pre-sales activities and RFPs.
Identify appropriate analytic and statistical methodology; develop predictive models and document process and results.
Work on NTTDATA's Big Data accelerators and Machine Learning Framework.
Process unstructured data into a form suitable for analysis.
Gather and process raw data at scale (including writing scripts, web scraping, calling APIs write SQL queries, etc.).
Work with business and cross-functional teams to thoroughly document reporting processes and systems.
Work on Distributed file system, DSaaS, SaaS, PaaS and cloud computing services like AWS.
Work on emerging big data technologies and reporting requirements.
Create visualizations from data / GIS data analysis.
5-7 years of experience manipulating data sets and building statistical models.
3+ years experience with distributed data/computing tools: Map/Reduce, Hadoop, Hive, Spark, Gurobi, MySQL, etc.
3+ years experience visualizing/presenting data for stakeholders using: Periscope, Business Objects, D3, ggplot, etc.
Strong problem solving skills with an emphasis on product development.
Experience working with and creating data architectures.
Excellent written and verbal communication skills for coordinating across teams.
A drive to learn and master new technologies and techniques.
Coding knowledge and experience with several languages: C, C++, Java,JavaScript, etc.
Knowledge and experience in statistical and data mining techniques: GLM/Regression, Random Forest, Boosting, Trees, text mining, social network analysis, etc.
Experience creating and using advanced machine learning algorithms and statistics: regression, simulation, scenario analysis, modeling, clustering, decision trees, neural networks, etc.

NTT DATA Services is a leading IT services provider and global innovation partner with 130,000 professionals based in over 50 countries. NTT DATA recently acquired Dell Services. NTT DATA Services emphasizes long-term commitment and combines global reach and local intimacy to provide premier professional services, including consulting, application services, business process, IT outsourcing, and cloud-based solutions. We are a part of NTT Group, one of the world's largest technology services companies, generating more than $100 billion in annual revenues and partner to 80% of the Fortune 100. Visit to learn how our consultants, projects, managed services, and outsourcing engagements deliver value for a wide range of businesses and government agencies.

The Company is an equal opportunity employer and makes employment decisions on the basis of merit and business needs. The Company will consider all qualified applicants for employment without regard to race, color, religious creed, citizenship, national origin, ancestry, age, sex, sexual orientation, genetic information, physical or mental disability, veteran or marital status, or any other class protected by law. To comply with applicable laws ensuring equal employment opportunities to qualified individuals with a disability, the Company will make reasonable accommodations for the known physical or mental limitations of an otherwise qualified individual with a disability who is an applicant or an employee unless undue hardship to the Company would result.