Description
YOUR ROLE
You will lead the data science projects and mentor the junior team members. You need to have extensive knowledge of advanced Large Language Models (LLMs), including Retrieval-Augmented Generation (RAG) and fine-tuning techniques. This role requires a hands-on leader who is eager to drive projects to completion, support peers and junior data scientists, and resolve production issues promptly whenever needed.
WHAT YOU'LL DO
Project Ownership: Take full ownership of data science projects from inception to deployment. Ensure timely and successful project delivery by overseeing the entire data science pipeline.
Mentorship: Mentor and guide junior data scientists, fostering a collaborative and learning environment. Provide technical and professional development support to team members.
Advanced LLM Knowledge: Apply advanced knowledge of Large Language Models (LLMs), including Retrieval-Augmented Generation (RAG) and fine-tuning. Stay updated with the latest advancements in LLM technologies and implement them in projects.
Proactivity and Execution: Demonstrate a proactive attitude and eagerness to get things done efficiently. Drive project initiatives and ensure continuous progress.
Peer and Junior Work Review: Review the work of peers and junior data scientists, providing constructive feedback and support. Ensure high-quality deliverables through thorough evaluation and guidance.
Production Issue Resolution: Take responsibility for fixing data science-related production issues promptly. Implement robust solutions to minimize downtime and ensure smooth operations.
Data Cleaning and Analysis: Clean and preprocess training data to ensure quality and accuracy. Perform exploratory data analysis to uncover data patterns and insights.
Training and Retraining Models: Develop and train text classifiers for various applications. Identify and mitigate biases in models through retraining and continuous improvement.
Machine Learning and Statistics: Apply machine learning algorithms and statistical techniques to solve complex problems. Evaluate model performance and validate results using statistical methods.
AWS SageMaker and Bedrock Familiarity: Utilize AWS SageMaker for building, training, and deploying machine learning models. Leverage AWS Bedrock for model deployment, data management and integration tasks.
Other duties and responsibilities as assigned.
Requirements
This is a fully remote role, with the exception of onboarding and optional in-office events.
SKILLS AND QUALIFICATIONS
Bachelor's degree, Master’s degree or PhD in Computer Science, Data Science, Statistics, or a related field.
5+ years proven experience as a Senior Data Scientist, with expertise in advanced LLMs like RAG and fine-tuning.
Strong programming skills in Python or R.
Hands-on experience with text classification and model retraining.
Solid understanding of machine learning algorithms and statistical analysis.
Familiarity with AWS SageMaker and Bedrock is highly desirable.
Excellent problem-solving skills and attention to detail.
Strong leadership, communication, and collaboration skills.
Ability to mentor and guide junior team members effectively.
Proactive and results-driven mindset.
PHYSICAL REQUIREMENTS
Must be able to sit/stand/walk for prolonged periods of time, (up to 8 hours per day) at a desk working on a computer.
Must be able to use standard office equipment for extended periods of time, including but not limited to, a mouse, keyboard, phone and video conferencing
Summary
Bold Penguin is the premier digital broker for commercial insurance. Their technology expands top-of-funnel demand and increases yield for enterprise partners. Through digital distribution, wholesale capabilities, and AI-driven insights, they transform the lifecycle of both simple and complex risks, from prospecting to placement.
Benefits
- Medical, Dental, and Vision
- Flexible PTO Policy
- 401(k) with a company match
- Employee Assistance Program
- Parental Leave
- Disability and Life Benefits