Scientific Data Curator
Ref No.: 17-02270
Location: Cambridge, Massachusetts
Start Date / End Date: 06/26/2017 to 06/29/2018
The Scientific Data Curator is responsible for identifying the data needs of drug discovery scientists, informaticians and data scientists, and for obtaining, organizing and integrating the data needed to answer complex research questions. The role places specific emphasis on the integration, validation and organization of genomics datasets and sample data. The Scientific Data Curator will work closely with a core genomics team as well as internal bench scientist stakeholders to facilitate data submission, including use of curation tools and workflows. The Scientific Data Curator will also identify needs for the improvement of data curation workflows, and will propose and implement solutions.
Job Dimensions
The Scientific Data Curator will be a member of the TMS/Knowledge Engineering group, and will join the NX genomics warehouse team. Based in Cambridge, MA, this role will work with scientists, informaticians and IT professionals in the global organization.
Major Accountabilities / Key Performance Indicators
Work collaboratively with programmers, bioinformaticians, project managers, and research scientists to enable high-quality, timely preparation of datasets for loading into a genomics warehouse.
Work with the knowledge engineering team to develop and test curation workflow tools, and provide user support for stakeholders to promote curation at source.
Use ontologies and terminologies to assist in the disambiguation, semantic integration and organization of a wide variety of biomedical data sets.
Independently identify and collaboratively solve complex data modeling and integration problems.
Able and willing to independently learn and adopt new technologies and be hands-on.

Ideal Background / Capabilities
3 or more years of experience in drug discovery informatics or drug discovery research. A strong preference will be given to candidates with industry experience.
Detailed knowledge of ontologies relevant to molecular biology, medicinal chemistry, genomics and/or drug discovery.
Demonstrated experience in the curation and integration of data relating to one or more of biochemistry, medicinal chemistry, molecular biology or genomics.
Excellent knowledge of genetics, genomics, and biomedical research required; in-depth knowledge of a disease-related research area such as cancer genetics or immunology is a plus.
Excellent knowledge of SQL and relational databases.
Knowledge of Triple Store databases, RDF, and Linked Data approaches would be a strong advantage.
Experience in using text mining software APIs, or in automated interaction with one or more text mining applications, would be an advantage.
Experience with automated selection of corpora, biomedical entity-recognition and disambiguation, and the automated detection of entity relationships would be an advantage.
Strong communication and interpersonal skills, and demonstrated ability to work as a member of a data curation/integration team.
Graduate degree in a biomedical discipline or in computer science, or 3 or more years of equivalent experience in biomedical research; familiarity with genomics technologies and analysis methods, as well as facility with scripting or automated tools to prepare large data sets.