We are seeking a Sr. Data Scientist with Machine Learning Engineering experience to join our digital strategy and enablement team. This role involves working on technology solutions that advance data science practices and contribute to meaningful applications in the field. As part of a diverse and collaborative team, you will work on developing structured data solutions with a focus on Large Language Model (LLM) applications, supporting projects that drive impactful outcomes.
Key Responsibilities:
- Collaborate with data scientists, ML engineers, and cross-functional teams to create innovative solutions using modern NLP technologies, particularly LLMs.
- Design and implement data pipelines and deployment pipelines for machine learning models.
- Develop and tune ML models to meet business and functional requirements.
- Deploy models and optimize them for enhanced performance.
- Document and communicate the design and implementation details clearly.
- Contribute to technical decisions within the AI team.
- Partner with various departments to deploy scalable, maintainable solutions.
- Act as a technical point of contact for enterprise-wide technology solutions and lead troubleshooting efforts.
Qualifications:
- Experience with LLM applications development, including tools such as RAG solutions and code interpreters.
- Background in LLM fine-tuning is a plus.
- Proficiency in building data and deployment pipelines for LLM applications.
- Familiarity with ML/AI toolkits, including AWS Sagemaker, with experience in others such as PyTorch, TensorFlow, or Keras being beneficial.
- Experience with MLOps technologies like Sagemaker, Vertex AI, or Kubeflow.
- Working knowledge of cloud platforms (AWS, Azure, GCP) and containerization (e.g., Docker).
- Strong skills in scripting, automation, and tools like Git, Bash, Linux, CI/CD tools (Jenkins, GitLab CI).
- Programming proficiency in Python and R.
- Understanding of test-driven development and software lifecycle best practices.
- Problem-solving and decision-making capabilities, along with strong interpersonal skills.
- Focus on customer delivery and the ability to collaborate with diverse team members.
- Experience with deploying scalable applications is a plus.
- Familiarity with clinical study data is an advantage.
Education & Experience:
- Master’s degree in a quantitative field (mathematics, statistics, computer science, etc.) or a related area, with 5+ years of experience in data science. A PhD is a plus.
- 2+ years of experience in data engineering, ML engineering, MLOps, or related fields.
- 3+ years of software engineering experience.
Top Qualifications:
- Recent experience in LLM application development, especially RAG applications.
- Strong software development skills.
- Ability to collaborate effectively in a diverse team.
Work Location:
- Preference for on-site work at our South San Francisco campus, with a hybrid option available.
This role is ideal for a detail-oriented professional passionate about leveraging data science and machine learning to drive impactful outcomes.