Data Scientist/Architect - Frederick

Recruiter
Frederick National Laboratory for Cancer Research
Location
Frederick, Maryland
Posted
Sep 11, 2019
Closes
Sep 23, 2019
Ref
037e8787ce8f
Organization Type
Corporate
The Advanced Biomedical and Computing Sciences (ABCS) is part of the Biomedical Informatics and Data Science directorate at Frederick National Laboratory for Cancer Research. The ABCS supports the biomedical mission of the NCI and NIH by providing technology development, scientific consultation and collaboration, scientific application development, data analysis and management, training, and other research services to NCI and NIH scientists and staff. The Data Solutions and Systems Biology (DSSB) group in ABCS is an interdisciplinary team that provides innovative solutions for the NCI/NIH community in the management and use of biological information collected across different sources and formats. A key goal is to integrate diverse data sources and make them easily accessible and understandable, enabling researchers to explore public and institutional data through disease-agnostic access and analyses, genomic annotation, data normalization frameworks, and the translation of basic research data to the clinical setting, as envisioned in precision medicine.

Key Roles/Responsibilities

Senior Scientist/Architect
  • Lead data architecture and modeling efforts, and build databases and integration services using a variety of tools and technologies such as RDBMS, NoSQL, graph models, and distributed architecture, in accordance with the functional and nonfunctional requirements of other teams and scientists
  • Develop advanced analytic models and machine learning algorithms that discover clinical insights
  • Develop custom data querying/mining pipelines for mining and integrating information from clinical and genomic data, multi-level biological annotations and information from other knowledge mining applications
  • Coordinate with the other technical and nontechnical teams in building public-facing applications and services (APIs) for a variety of users in the cancer research field
  • Work closely with other FNL and NCI teams to coordinate activities and develop collaborative projects

Basic Qualifications:

This is a dual-level requisition and can be filled as either a Data Scientist III or a Data Scientist IV

Data Scientist III
  • Possession of a Bachelor's degree in Data/Computer Science or a related field from an accredited college or university according to the Council for Higher Education Accreditation. (Additional qualifying experience and certifications may be substituted for the required education).
  • Five (5) or more years of experience in data science, computational science, or quantitative science, etc.

Data Scientist IV
  • Possession of a Bachelor's degree in Data/Computer Science or a related field from an accredited college or university according to the Council for Higher Education Accreditation. (Additional qualifying experience and certifications may be substituted for the required education).
  • Eight (8) or more years of experience in data science, computational science, or quantitative science, etc.

Both positions require:
  • Deep knowledge of data structures, data modeling, and architecture for high throughput and scalability
  • Significant expertise in RDBMSs such as Oracle, MySQL, and PostgreSQL
  • Strong knowledge of various query languages and query performance tuning, with a demonstrated ability to write complex queries
  • Experience in building robust data pipelines and services
  • Knowledge of scripting languages and Unix/Linux OS scripting
  • Strong knowledge of algorithms and one or more programming languages such as Python, Java, or C++
  • Strong drive and initiative to explore new territories in data and knowledge mining
  • Demonstrated ability to learn and apply new technologies according to the changing needs of the project and organization.
  • Able to work on initiatives independently and in a highly collaborative environment
  • Strong work ethic; organized, detail-oriented, and focused on results
  • Strong verbal and written communication skills
  • Must be able to obtain and maintain a security clearance

Preferred Qualifications:

Candidates with these desired skills will be given preferential consideration:
  • Master's or Ph.D. in Data/Computational Science or a related field
  • Expertise in NoSQL data architectures such as key-value, column, document, and graph models
  • Knowledge of agile web development, experience in developing data integration and visualization tools
  • Knowledge of the biomedical domain
  • Knowledge of NLP applications
  • Leadership characteristics: able to mentor and direct more junior developers on technical issues