Data Engineer / Data Scientist (IRAD Program)
Position Type: Part-time, W-2
SRG is seeking a versatile and mission-driven Data Engineer/Data Scientist to lead backend and infrastructure development for two development tools:
Project 1— A visualization tool.
Project 2 –A secure, air-gapped platform integrating local LLMs (e.g., LLaMA, Dolphin).
This position involves direct collaboration with SRG leadership. The ideal candidate thrives in an agile, mission-driven development environment, has strong Python expertise, and brings practical experience integrating structured data, APIs, and LLMs in secure architectures.
Remote/Hybrid work with occasional in-person attendance.
Washington, D.C. Metro Area (Remote/Hybrid)
For Project 1:
- Build data pipelines for real-time and batch data ingestion from various sources.
- Configure and manage data storage using Azure SQL Database and Blob Storage.
- Develop and maintain backend logic (Python-based) supporting analytic workflows and LLM integration (Azure OpenAI).
- Secure, test, and optimize API calls and interface logic for performance and scalability.
- Support real-time analysis and output via Azure LLM and Cognitive Services.
For Project 2:
- Architect and deploy a secure, air-gapped data infrastructure with encrypted local storage.
- Implement GPU-enabled server configurations and integrate local LLMs (LLaMA/Dolphin) with Hugging Face Transformers.
- Manage ingestion, preprocessing, and validation of structured and unstructured data from >20 secure and open APIs.
- Design data processing logic in Python (Pandas, NumPy).
- Coordinate secure data transfer workflows (manual and API-based) across disconnected systems.
- 5+ years of experience in data engineering, machine learning, or applied data science roles.
- Strong Python programming skills; experience with Flask, FastAPI, and Jupyter.
- Familiarity with API integration and data security protocols (OAuth2, SSL/TLS).
- Experience with Azure services (SQL Database, Cognitive Services, Key Vault).
- Knowledge of secure system architectures and experience working in restricted or air-gapped environments.
- Ability to work collaboratively in agile teams with iterative development goals.
- Hands-on experience deploying LLMs locally (e.g., LLaMA, Dolphin) with GPU support.
- Familiarity with data governance frameworks and federal compliance standards.
Apply Now
Position:
First Name:
Last Name:
Email:
Phone Number:
Currently Located In:
Available Start Date:
Resume:
Cover Letter:
The questions below are designed for demographic data collection purposes only. They are entirely optional and will not be taken into account when evaluating candidates for the position. Your responses to these questions will be kept confidential and solely used for statistical analysis.
Your application has been submitted