Data Science Manager jobs in Connecticut

Data Science Manager manages teams tasked with identifying trends, patterns, and anomalies found in big data sets and used to develop insights by performing extensive data analysis. Oversees the interpretation of results from multiple sources using a variety of techniques, ranging from simple data aggregation via statistical analysis to complex data mining. Being a Data Science Manager manages the design and implementation of big data solutions for the organization. Uses extensive knowledge and research into big data tools to guide data scientists' adoption and use of new and existing tools. Additionally, Data Science Manager typically requires a master's degree in computer science, mathematics, engineering or equivalent. Typically reports to senior management. The Data Science Manager manages subordinate staff in the day-to-day performance of their jobs. True first level manager. Ensures that project/department milestones/goals are met and adhering to approved budgets. Has full authority for personnel actions. To be a Data Science Manager typically requires 5 years experience in the related area as an individual contributor. 1 - 3 years supervisory experience may be required. Extensive knowledge of the function and department processes. (Copyright 2024 Salary.com)

C
Data Engineer
  • Catalytic Data Science
  • Westport, CT FULL_TIME
  • Data Engineer III (Large Language Models)

     

    About Catalytic Data Science (CDS):

    Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate volumes of scientific resources, data, and analytic tools while providing the ability to network with colleagues in one secure and scalable environment.   By enabling R&D teams to work more collaboratively and improving productivity company-wide, the Catalytic platform helps teams achieve key R&D milestones faster and with greater accuracy.  Our customers are passionate about making the world a better place, and we are inspired by the opportunity to help them.


    The Role

    You are a Data Engineer with experience in processing terabytes of data and working with large language models (LLMs). You have experience in creating and automating scalable, fault-tolerant, and reproducible data pipelines for natural language processing (NLP) using Amazon AWS technologies. You will design and implement data ingestion, processing, and storage solutions that can handle massive amounts of text data from various sources. You are interested in helping to create a platform completely built on top of AWS. You are eager to join a team of Life Scientists and Software Engineers that believe the brightest minds in research should have the best tools to drive innovation. 

    What You’ll Do

    • Build, test, and operate automated Extract, Transform, and Load (ETL) pipelines that process terabytes of text data nightly
    • Develop service frontends around our various backend data stores (AWS Aurora, MySQL, Elasticsearch, S3)
    • Rapidly protype, test, and deploy data pipelines for LLMs using AWS.
    • Collaborate with data scientists and NLP engineers to understand the data requirements and specifications for LLMs and related tasks such as text summarization, translation, and question answering.
    • Optimize the performance, reliability, and scalability of the data pipelines and LLMs by applying best practices and techniques such as data partitioning, caching, compression, and monitoring.
    • Ensure the quality, integrity, and security of the data by implementing data validation, cleaning, and governance policies and procedures.
    • Research and evaluate new technologies and methods for data engineering and LLMs and stay updated with the latest trends and developments in the field.
    • Participate in data architecture and engineering decisions, bringing your strong experience and knowledge to bear.

    Qualifications

    • Bachelor's degree or higher in computer science, engineering, or a related field.
    • 3 years of experience in data engineering, preferably with large-scale text data and LLMs and 6 years of any software engineering experience (including data engineering).
    • Proficient in Python 3 or Java, preferably both.
    • Experience with data modeling, ETL, and data warehouse design and implementation.
    • Expertise with ETL schedulers such as Airflow, Prefect or similar frameworks.
    • Familiar with LLMs and NLP concepts and frameworks such as Transformers, BERT, GPT, PaLM, and LLaMA.
    • Day-to-day experience using AWS technologies such as Lambda, ECS Fargate, SQS, & SNS
    • Experience extracting, processing, storing, and querying of petabyte-scale datasets
    • Familiarity with building and using containers
    • Familiarity with event-based microservices
    • Strong communication, collaboration, and problem-solving skills.

     

    Core Skills:

    1. ETL Processes
    2. Data Modeling and Database Design
    3. Proficiency in Large Language Models
    4. Data Pipeline Optimization
    5. Cross-functional Collaboration
    6. Problem-solving and Analytical Skills 

    Nice-to-Haves

    • Prior experience with Elasticsearch (custom development and/or administration) is a huge plus
    • Knowledge of Graph databases


    What Do We Love in Team Members? 

    Your specialization is less important than your ability to learn fast and adapt to shifting technologies. We’re especially fond of people who:

    • Focus on customer’s needs and our company’s goals, not just writing code
    • Iterate until customers love what you’ve built
    • Self-start and initiate
    • Self-organize
    • Strive to grow personally and professionally, beyond just expanding technical abilities
    • Love to experiment with new technology and share knowledge with the team



    In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

  • 2 Months Ago

C
Data and Machine Learning Scientist
  • Catalytic Data Science
  • Westport, CT FULL_TIME
  • Position Title: Data and Machine Learning Scientist About Catalytic Data Science (CDS): Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate the volumes of scientific re...
  • 2 Months Ago

L
Manager, Data Science
  • Launch Potato
  • Hartford, CT FULL_TIME
  • YOUR ROLE You will be developing deep personalization models for our users and complex optimization algorithms to bridge our customer experiences with new products/services. You will be pivotal to the...
  • 25 Days Ago

L
Manager, Data Science
  • Launch Potato
  • New Haven, CT FULL_TIME
  • YOUR ROLE You will be developing deep personalization models for our users and complex optimization algorithms to bridge our customer experiences with new products/services. You will be pivotal to the...
  • 25 Days Ago

L
Manager, Data Science
  • Launch Potato
  • Stamford, CT FULL_TIME
  • YOUR ROLE You will be developing deep personalization models for our users and complex optimization algorithms to bridge our customer experiences with new products/services. You will be pivotal to the...
  • 22 Days Ago

D
Production Manager
  • Data-Mail, Inc.
  • Newington, CT FULL_TIME
  • Location: On site, 240 Hartford Avenue, Newington, CT 06111Work Schedule: 2nd shift hours; Monday - Friday; 3:30 pm - 11:30 pm SUMMARYProvide direction and facilitate, through supervisory staff, mail ...
  • 10 Days Ago

P
Data Science Manager
  • Plymouth Rock Assurance
  • Woodbridge, NJ
  • The Data Science Manager will manage a team of statisticians in research and development of new predictive models and ne...
  • 6/1/2024 12:00:00 AM

P
Data Science Manager
  • Plymouth Rock Assurance
  • Charlotte, NC
  • *Please note: This role will need to be physically located in our Woodbridge, NJ location. The Data Science Manager will...
  • 6/1/2024 12:00:00 AM

P
Data Science Manager
  • Plymouth Rock Assurance
  • Cleveland, OH
  • *Please note: This role will need to be physically located in our Woodbridge, NJ location. The Data Science Manager will...
  • 6/1/2024 12:00:00 AM

C
Data Science Manager
  • City of New York
  • New York, NY
  • Job Description Hours: Full-Time Position 35 Hours Work Location: 30-30 Thomson Avenue, LIC, NY 11101 The NYC Department...
  • 6/1/2024 12:00:00 AM

A
Data Science Manager
  • Abbott
  • Alameda, CA
  • Abbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-chan...
  • 6/1/2024 12:00:00 AM

B
Data Science Manager
  • bioMerieux Inc.
  • Chicago, IL
  • A world leader in the field of in vitro diagnostics for 60 years, bioMerieux provides diagnostic solutions intended for ...
  • 5/29/2024 12:00:00 AM

F
Data science, Manager
  • Facebook
  • Burlingame, CA
  • Summary: Facebook is seeking a data center Critical Facility Engineer to join our Data Center Facility Operations team. ...
  • 5/29/2024 12:00:00 AM

I
Data Science Manager
  • Idaho State Job Bank
  • Boise, ID
  • Data Science Manager at Meta in Boise, Idaho, United States Job Description Summary: Meta Platforms, Inc. (Meta), former...
  • 5/29/2024 12:00:00 AM

Connecticut is bordered on the south by Long Island Sound, on the west by New York, on the north by Massachusetts, and on the east by Rhode Island. The state capital and fourth largest city is Hartford, and other major cities and towns (by population) include Bridgeport, New Haven, Stamford, Waterbury, Norwalk, Danbury, New Britain, Greenwich, and Bristol. Connecticut is slightly larger than the country of Montenegro. There are 169 incorporated towns in Connecticut.The highest peak in Connecticut is Bear Mountain in Salisbury in the northwest corner of the state. The highest point is just east...
Source: Wikipedia (as of 04/11/2019). Read more from Wikipedia
Income Estimation for Data Science Manager jobs
$168,217 to $215,682

Data Science Manager in Santa Barbara, CA
They work closely with stakeholders across the business to ensure that our data driven insights do not collect dust.
February 02, 2020
Data Science Manager in Tucson, AZ
�         Assist in growing data science practice by meeting business goals through client prospecting, responding to proposals, identifying and closing opportunities within identified client accounts.
February 11, 2020
Data Science Manager in Pittsfield, MA
�         Participate in client discussions, interact with CxOs at client organization to articulate the value of data science approaches, different service offerings and guide them on implementation of the same.
February 24, 2020