CoreTek Labs

Data Scientist

📍

Washington, DC

🧪

Mid-Senior level

Description

As a Machine Learning Engineer on our core Software AIML team, you will be at the forefront of designing and developing cutting-edge GenAI applications.
You will be working closely with business stakeholders and data engineers to communicate AI recommendations to senior management.

Key Responsibilities:

Engage deeply with business teams to identify opportunities and translate the needs into innovative and practical AIML solutions
Design, build, and deploy state of the art AIML models to solve complex business problems
Understand key performance levers and metrics to highlight AIML solution operational issues and drive improvement
Foster close collaboration with engineers and infrastructure partners to implement robust and scalable solutions
Communicate the results and insights effectively to partners and senior leaders, providing clear and actionable recommendations
Stay updated with the latest trends, technologies, and best practices in AI and GenAI and data engineering
Regularly research and present new ideas to improve the team's technical capabilities

Minimum Qualifications:

Industry experience 10+ years BSCS, 5+ years MS/PhD in Computer Science, Statistics, Applied Math, or a related field

Preferred Qualifications:

Expert in AIML modeling, Python, AIML Infrastructure, model deployment and MLOps
Experience working in Supply Chain, Operations, or a related field
End to end GenAI application experience
Knowledge in RAG, Prompt Engineering, LoRA and other GenAI techniques
Ability to operate independently and lead without authority
Superb communication and interpersonal skills

AI SW Engineer Skills/Experience:

Experience in AI System Development foundational model Lifecycle training, retrieval of Augmented Systems, and Model Evaluation.

Proficient in Large Language Models (LLMs) Architecture, creating targeted Instruction Datasets, and implementing Quantization and Inference Optimizations.

Capable of covering Problem Definition, Data Acquisition, and Full-cycle Deployment, utilizing tools like MLflow, Cloud Platforms like AWS, Google Cloud, Azure.

Proprietary Models like GPT-4 and Claude, Open-source Models like Llama and Mistral, able to deliver high-performing, scalable, and cost-efficient AI solutions.

NLP libraries like NLTK, spaCy, GPT-3, BERT, and Hugging Face's Transformers to create sophisticated natural language processing solutions.

TECHNICAL SKILLS:

Methodologies: Spiral, Agile, Waterfall, Lean

Programming Language: C++, Python, Shell, SQL, R

IDE Tools: PyCharm, RStudio, Visual Studio Code, Jupyter Notebook, Google Collab, Navicat

ML Frameworks: Transformers, Scikit-Learn, Keras, TensorFlow, PyTorch, ONNX, NLTK, OpenAI, langchain,

llama-index,kore.ai

DL Architectures: LLM, ANN, CNN, R-CNN, RNN, GRU, LSTM, Transformers, Attention Mechanism, Tokenisers,

BERT, T5, Sentence Transformers, Foundational models

Packages: Pandas, NumPy, Spark, Matplotlib, SciPy

Cloud Technologies: AWS (EC2, S3, RDS, ECS, Lambda), Azure(VM, Functions, ACR, AKS), Digital Ocean, GCP, Paperspace

Database: MySQL, PostgreSQL, MongoDB, ChromaDB, Pinecone, SQLite

Web Frameworks and Web Servers/ Deployment:

Fast API, Flask, Nginx, Gunicorn, Uvicorn, Docker, Kubernetes

Miscellaneous Tools and Technologies:

JIRA, Auto Gluon, GitHub Actions, Github Issues, Datadog, MS Power BI, Postman, Locust, Stream lit, CUDA, cuDNN, TensorRT

Version Control: Git, GitHub, DVC (Data Version Control)

Operating Systems: Windows, Linux, Mac

Apply Now

Key informations

🧳

Contract

📅

Posted a year ago

Report a problem

Don’t miss out on new
Data & AI Jobs

Get curated job alerts weekly.

Create talent profile

Share job

Other jobs at CoreTek Labs

CoreTek Labs does not currently have any open job positions in Data & Ai.

Data Scientist

Key informations

Don’t miss out on new Data & AI Jobs

Share job

Other jobs at CoreTek Labs

Don’t miss out on new
Data & AI Jobs