Data Scientist
Data Scientist
Remote
6+ Months CTH
SUMMARY OF ESSENTIAL DUTIES AND RESPONSIBILITIES:
We are seeking a highly skilled and motivated Data Scientist with expertise in Epic EMR (Electronic Medical Records) data to join our team. The successful candidate will play a critical role in leveraging Snowflake's data platform to accelerate the adoption of AI in health and hospital projects. This role involves implementing rules-based logic and creating structured data models for Retrieval-Augmented Generation (RAG) to support decision-making and improve operational efficiency and clinical outcomes.
Key Responsibilities:
Data Management and Analysis:
- Extract, clean, and analyze large datasets from Epic EMR and other healthcare data sources.
- Develop and maintain data pipelines to ensure the accurate and timely flow of data.
- Perform data validation and ensure data quality and integrity.
Implementation of Rules-Based Logic:
- Develop and implement rules-based logic to support various healthcare use cases, including One Stop Benefits, Charge Capture Automation, and Denials Optimization.
- Create structured data models that facilitate the application of rules-based logic.
RAG (Retrieval-Augmented Generation) Implementation:
- Design and implement structured data models for RAG to enhance data retrieval and generation processes.
- Develop dashboards and visualizations to present RAG insights and other key performance indicators to stakeholders.
- Utilize AI and ML techniques to enhance the accuracy and predictive capabilities of the RAG models.
Collaboration and Communication:
- Work closely with cross-functional teams, including data engineers, developers, and healthcare professionals, to understand project requirements and deliver data-driven solutions.
- Communicate complex analytical results and insights to non-technical stakeholders in a clear and concise manner.
Project Execution:
- Participate in agile development processes and contribute to sprint planning, reviews, and retrospectives.
- Ensure timely delivery of project milestones and adhere to project timelines.
Preferred Skills
- Design and implement predictive models using various machine learning techniques, including both supervised and unserved algorithms.
- Utilize deep statistical analysis to understand and model complex public health data.
- Develop and deploy Large Language Models (LLMs) for creating chatbots and extracting insights, enhancing user engagement and information dissemination.
- Analyze clinical data to derive insights that improve patient care and clinical workflows.
Knowledge, Skills, Abilities and other Requirements:
- Proficiency in programming languages such as Python, R, and SQL.
- Experience with data visualization tools such as Tableau, Power BI, or similar.
- Familiarity with Snowflake or other cloud-based data platforms.
- Knowledge of ETL processes and tools.
- Experience with AI and ML techniques to enhance data analysis and RAG implementation but not limited to SQL, Excel and/or SAAS
- Proficient in various data catalog and visualizations tools, including but not limited to Tableau, PowerBI, Informatica, and/or Snowflake
- Knowledgeable in electronic medical records, preferably EPIC
Years of Experience:
- Minimum of 5 years of experience in data science, with a focus on healthcare analytics.
- Proven experience working with Epic EMR data, including extraction, transformation, and analysis.
- Strong background in implementing rules-based logic and RAG in a healthcare setting.
Educational Level:
Bachelor's or Master’s degree in Data Science, Computer Science, Statistics, or a related field. A Ph.D. is a plus.