This job has expired

NLP Data Scientist

Pivotal Life Sciences
California, KY
Closing date
Apr 4, 2023

View more

Science, Mathematics and Statistics
Organization Type

If WHO you work for & with and WHY your company exists is at the top of your list of criteria for choosing your next opportunity, we encourage you to take a look at the roles we are hiring for at Pivotal Life Sciences (PLS) and join our growing team of data scientists/engineers working together toward a common goal-the health and care of the patient. We're looking for people to not only join us-but to be a big part of the solution. To do more and be more. To do well and do good. If you have the ability to see the bigger picture and bring it to life, let's connect!

Pivotal Life Sciences ("PLS"), a part of the Nan Fung Group, is a global investment platform focusing on the life sciences. Leveraging on Nan Fung Group's strong capital base and long-term commitment to the space, the company aims to become the ideal partner for scientists, entrepreneurs, corporations and investors in the life science space. Through direct investments via Pivotal bioVenture Partners funds (both in US and China) and fund investments covering the full spectrum of the industry (including therapeutics, medical devices and diagnostics) and across different development stages, Nan Fung Life Sciences has significant presence in both US and Greater China. Learn more at

Locations: San Francisco, CA, USA, Shanghai, China

The Vision

PLS envisions becoming the life sciences investment industry's best tech-enabled investment platform. The AI team aims to provide best-in-class intelligence support across all steps of the investment process, from deal sourcing to exit. The system will function as an additional team member and help augment the investment team to make better investments and build better companies for unmet therapeutic needs. Ultimately, PLS will become a scientifically driven investor across the life science ecosystem from academic spinouts to venture rounds and ultimately to exit. Come be part of our vision!

The Role

As a PLS Data Scientist, you will be a member of our new global AI and Data Intelligence team. This team's goal is to build state-of-the-art data and AI technology with strong research fundamentals for our life sciences investment arm. You will utilize your expertise in the data sciences to investigate and work with relevant data sources and build AI products that support the investment process such as disease mapping, portfolio management predictions, financial planning and management operations optimizations. This is a great opportunity to work on a range of AI powered projects in a growing team with exposure to the best life science companies today.

  • Work with researchers as well as software engineers and other data scientists to develop an AI and analytics platform which can support our investments team
  • Mapping and analyzing multiple data sets with emphasis on text-based data. For example: competitive landscape analysis for a drug target or company, tracking of companies in a sector, talent mapping and analysis, etc.
  • Develop NLP based AI models for due diligence using biological and financial data sources including but not limited to PubMed, Patents, SEC filing, other biological data sets, financial data sets, and others
  • Work closely with the investment team to understand the business needs, and provide the investment team with insights on scientific and financial due diligence of new investment opportunities
  • Design and implement AI solutions working within a Software Engineering Life Cycle (SDLC)
  • Maintain awareness of and utilize where appropriate, state-of-the-art AI including GANs, Transformers, Diffusion based models.
  • Collaborate closely with different functions to evangelize an AI supported investment process
  • Text extraction and intelligence/insights development from biological, financial, and operations data sets
  • Optimization of operational work such as talent management, deal structuring, portfolio management with data solutions and AI models using extracted CRM data, legal, financial, SEC filings, etc.
  • Benchmark, evaluate and document model performance, and provide recommendations for continuous improvement of models

  • Master's degree or PhD in computer science, artificial intelligence, applied mathematics, statistics, machine learning or related discipline
  • 5+ years of post-grad/industry, applied experience in machine learning, deep learning methods, statistical data analysis and complex data visualization; experience in life science industry would be a plus
  • Deep experience with Python
  • Experience or strong interest in working with cloud computing systems (preferably AWS)
  • Experience with Docker and building containerized data workflows to handle large scale data streams
  • Experience with AI platforms such as SageMaker, MLFlow, others preferred
  • Experience with building machine/deep learning models with at least one common framework such as PyTorch, Tensorflow, Keras, Scikit learn etc.
  • Knowledge of relational database architecture and data management with expertise in SQL
  • Familiarity with software development practices such as unit testing, code reviews, and version control
  • Excellent analytical skills and presentation skills
  • Strong verbal and written communication skills and ability to work independently and cooperatively
  • Proficiency in English
  • US Work Visa
  • Hybrid work schedule: Able to be in San Francisco office, in-person at least 3 days per week, option to work from home 2 days per week

Get job alerts

Create a job alert and receive personalized job recommendations straight to your inbox.

Create alert