Lead Data Scientist
-
Location:
41 South High StreetColumbus,OH
- Reference Number: R0059303
Description
AI CoE team is seeking a highly skilled Lead Data Scientist with a focus on Large Language Models (LLMs) to join our team. The ideal candidate will leverage their expertise in machine learning, natural language processing, and artificial intelligence to develop and implement advanced models that drive strategic decision-making and innovation. AI CoE team is part of Enterprise Data and Analytics team, and we drive innovation and financial success by delivering results on a project-by-project basis. The Lead Data Scientist role involves leading data science projects from conception to deployment and collaborating with cross-functional teams to deliver impactful insights.
Duties and Responsibilities:
- Own the entire model development process for both on premises and AWS cloud projects, from identifying the business requirements, data sourcing, model fitting, presenting results, and production scoring
- Act as the go-to resource for machine learning across a range of business needs, especially in developing and deploying LLM use cases.
- Apply automation techniques to manual processes to create efficiency, improve accuracy, and build upon successes that create a suite of self-service assets
- Stay current with emerging Machine learning and cloud trends and technologies, driving innovation though their application
Basic Qualifications:
- Master’s degree in computer science, statistics, economics or related field
- 5+ years related work experience using statistics and machine learning to solve complex business problems, experience conducting statistical analysis with advanced statistical software, scripting languages, and packages, including experience with big data analysis tools and techniques, and building and deploying predictive models, web scraping, and scalable data pipelines.
- 3+ years related work experience in AWS cloud machine learning and cloud computing, familiar with AWS Sagemaker, Lambda, S3, Athena, etc.
- 3+ years experience with Python, R, RSTudio, Spark, SQL, NoSL
- 1+ years experience with transformers & other advanced NLP techniques; proficiency with Scikit-learn, NLTK, TensorFlow and PyTorch
- 1+ years experience with Large Language Models (Llama, Claude, Titan), including frameworks such as Langchain, Embedding and Vector databases, Prompt engineering
- 1+ years experience with ETL pipelines and MLOps
Preferred Qualifications:
- PhD in computer science, statistics, economics or related fields
- Willingness to step out of comfort zone and solve problems not strictly related to modeling and data science in order to reach team goals.
- Working knowledge of finetuning LLMs
- Advanced techniques in RAG such as MMR, Multi-vector retrieval, RAG fusion, HYDE, self-RAG, etc
- Knowledge of NoSQL databases (e.g., DynamoDB)
- Advanced AWS services work experience with Textract, JumpStart, Bedrock, etc. is preferred
- A/B testing and performance analysis for LLM model evaluation
- Excellent written and verbal communication skills, with a proven ability to interact effectively across all organizational levels
- Progressive thinking and problem solving, with a strong ability to manage ambiguity/complexity.
#LI-Hybrid
#LI-MH1
Exempt Status: (Yes = not eligible for overtime pay) (No = eligible for overtime pay)
Workplace Type:
HybridHuntington is an equal opportunity and affirmative action employer and is committed to providing equal employment opportunities for all regardless of race, color, religion, sex, national origin, age, disability, sexual orientation, veteran status, gender identity and expression, genetic information, or any other basis protected by local, state, or federal law.
Tobacco-Free Hiring Practice: Visit Huntington's Career Web Site for more details.
Agency Statement: Huntington does not accept solicitation from Third Party Recruiters for any position