CoreTek Labs Company logo on Dataaxy
CoreTek Labs

Data Scientist

πŸ“
Washington, DC
πŸ§ͺ
Mid-Senior level

Description

  • As a Machine Learning Engineer on our core Software AIML team, you will be at the forefront of designing and developing cutting-edge GenAI applications.
  • You will be working closely with business stakeholders and data engineers to communicate AI recommendations to senior management.

Key Responsibilities:

  • Engage deeply with business teams to identify opportunities and translate the needs into innovative and practical AIML solutions
  • Design, build, and deploy state of the art AIML models to solve complex business problems
  • Understand key performance levers and metrics to highlight AIML solution operational issues and drive improvement
  • Foster close collaboration with engineers and infrastructure partners to implement robust and scalable solutions
  • Communicate the results and insights effectively to partners and senior leaders, providing clear and actionable recommendations
  • Stay updated with the latest trends, technologies, and best practices in AI and GenAI and data engineering
  • Regularly research and present new ideas to improve the team's technical capabilities

Minimum Qualifications:

  • Industry experience 10+ years BSCS, 5+ years MS/PhD in Computer Science, Statistics, Applied Math, or a related field

Preferred Qualifications:

  • Expert in AIML modeling, Python, AIML Infrastructure, model deployment and MLOps
  • Experience working in Supply Chain, Operations, or a related field
  • End to end GenAI application experience
  • Knowledge in RAG, Prompt Engineering, LoRA and other GenAI techniques
  • Ability to operate independently and lead without authority
  • Superb communication and interpersonal skills

AI SW Engineer Skills/Experience:

Experience in AI System Development foundational model Lifecycle training, retrieval of Augmented Systems, and Model Evaluation.

Proficient in Large Language Models (LLMs) Architecture, creating targeted Instruction Datasets, and implementing Quantization and Inference Optimizations.

Capable of covering Problem Definition, Data Acquisition, and Full-cycle Deployment, utilizing tools like MLflow, Cloud Platforms like AWS, Google Cloud, Azure.

Proprietary Models like GPT-4 and Claude, Open-source Models like Llama and Mistral, able to deliver high-performing, scalable, and cost-efficient AI solutions.

NLP libraries like NLTK, spaCy, GPT-3, BERT, and Hugging Face's Transformers to create sophisticated natural language processing solutions.

TECHNICAL SKILLS:

Methodologies: Spiral, Agile, Waterfall, Lean

Programming Language: C++, Python, Shell, SQL, R

IDE Tools: PyCharm, RStudio, Visual Studio Code, Jupyter Notebook, Google Collab, Navicat

ML Frameworks: Transformers, Scikit-Learn, Keras, TensorFlow, PyTorch, ONNX, NLTK, OpenAI, langchain,

llama-index,kore.ai

DL Architectures: LLM, ANN, CNN, R-CNN, RNN, GRU, LSTM, Transformers, Attention Mechanism, Tokenisers,

BERT, T5, Sentence Transformers, Foundational models

Packages: Pandas, NumPy, Spark, Matplotlib, SciPy

Cloud Technologies: AWS (EC2, S3, RDS, ECS, Lambda), Azure(VM, Functions, ACR, AKS), Digital Ocean, GCP, Paperspace

Database: MySQL, PostgreSQL, MongoDB, ChromaDB, Pinecone, SQLite

Web Frameworks and Web Servers/ Deployment:

Fast API, Flask, Nginx, Gunicorn, Uvicorn, Docker, Kubernetes

Miscellaneous Tools and Technologies:

JIRA, Auto Gluon, GitHub Actions, Github Issues, Datadog, MS Power BI, Postman, Locust, Stream lit, CUDA, cuDNN, TensorRT

Version Control: Git, GitHub, DVC (Data Version Control)

Operating Systems: Windows, Linux, Mac

Key informations

🧳
Contract
πŸ“…
Posted a month ago

Don’t miss out on new
Data & AI Jobs

Get curated job alerts weekly.

Other jobs at CoreTek Labs

CoreTek Labs does not currently have any open job positions in Data & Ai.
© 2023 | All Rights Reserved | Built with 🀍 in MontrealAll our data is gathered from publicly available sources or contributed by users