Data Scientist – (IRAD Program – Prototype)
Position Type: Part-Time, W-2
Clearance: U.S. Citizenship required
SRG is seeking a Data Scientist to support several development projects. This position will work under the Lead Data Engineer and serve as a key integrator of data sources, APIs, and language model processing pipelines. You will contribute to the server-side logic, perform data transformation and analysis, and optimize backend systems that drive AI-informed decision support.
Remote/Hybrid work with occasional in-person attendance.
Washington, D.C. Metro Area (Remote/Hybrid)
- Collaborate with the Lead Data Engineer to integrate structured and unstructured data sources (APIs, manual datasets, local files) into scalable processing workflows.
- Develop backend logic for secure data ingestion, feature engineering, and transformation using Python, Pandas, NumPy, and SQL.
- Implement and optimize API connectivity with federal datasets (e.g., Census, EPA, OpenWeather, BLS, etc.).
- Contribute to LLM data preparation and tuning pipelines, including integration with OpenAI, LlaMA, Dolphin, or Hugging Face Transformer models (local or cloud).
- Work within secure environments (including air-gapped systems) to preprocess data and support analytics workflows.
- Participate in technical team syncs to validate data flow logic, troubleshoot model outputs, and improve end-user data accessibility.
- Ensure adherence to data privacy, encryption, and access control protocols.
- 3+ years of experience in applied data science or backend-focused data engineering roles.
- Proficiency in Python 3.x, especially for data transformation, cleaning, and pipeline construction.
- Experience with SQL databases and structured query development (Azure SQL preferred).
- Familiarity with secure API integrations and working with RESTful endpoints.
- Strong background in data validation, feature engineering, and analytic preprocessing.
- Comfortable working in secure environments with strict access control and version discipline.
- U.S. Citizenship is required.
- Familiarity with local or secure deployment of LLMs (e.g., LlaMA, Dolphin) and ML inference at scale.
- Knowledge of air-gapped computing, encryption protocols (TLS/SSL), and secure file transfers.
- Experience using Flask, FastAPI, or equivalent for backend logic integration.
Apply Now
Position:
First Name:
Last Name:
Email:
Phone Number:
Currently Located In:
Available Start Date:
Resume:
Cover Letter:
The questions below are designed for demographic data collection purposes only. They are entirely optional and will not be taken into account when evaluating candidates for the position. Your responses to these questions will be kept confidential and solely used for statistical analysis.
Your application has been submitted