Commit Graph

9 Commits

Author SHA1 Message Date
59c808cb3a feat: add S3 model upload support
- Add upload_to_s3 function to handler
- Save trained BERT models to S3 when credentials provided
- Save LoRA adapters to S3 when credentials provided
- Input params: s3_bucket, s3_prefix, aws_access_key_id, aws_secret_access_key, aws_region
2026-02-03 15:13:21 +00:00
45b96e2094 feat: Switch to Llama 3.1 8B (Bedrock-compatible)
- Default model now meta-llama/Llama-3.1-8B-Instruct
- Added multi-model chat format support:
  - Llama 3 format
  - Mistral/Mixtral format
  - Qwen format
  - Gemma format
- Trained model can be imported to AWS Bedrock
2026-02-03 04:38:54 +00:00
67a1095100 feat: Add 176K real DrugBank DDI samples with drug names
- Downloaded 191K DDI pairs from TDC DrugBank
- Fetched 1,634 drug names from PubChem API (96% hit rate)
- Created complete training dataset with:
  - Real drug names (not just IDs)
  - 86 interaction type descriptions
  - Severity labels (minor/moderate/major/contraindicated)
- Bundled 34MB data file in Docker image
- Handler loads real data instead of curated samples
2026-02-03 04:34:54 +00:00
39922e8d2e feat: Add Gemma 3 12B with QLoRA fine-tuning
- Added PEFT, bitsandbytes, TRL for LoRA training
- 4-bit QLoRA quantization for 48GB GPU fit
- Instruction-tuning format for Gemma chat template
- Auto-detect model type (BERT vs LLM)
- Updated GPU tier to ADA_24/AMPERE_48
2026-02-03 03:58:25 +00:00
4ff491f847 feat: Use self-hosted runner + curated DDI dataset
- Switch to self-hosted runner on compute-01 for faster builds
- Replace PyTDC with curated DDI dataset (no heavy deps)
- 60+ real drug interaction patterns based on clinical guidelines
- Generates up to 10K training samples with text variations
- Maintains 5-level severity classification
2026-02-03 03:27:10 +00:00
afc8fc6690 feat: Add real DrugBank DDI dataset support via TDC
- Added PyTDC dependency for DrugBank access
- Implemented DDI type -> severity label mapping (0-4)
- Added train/eval split with stratification
- Added accuracy and F1 metrics for evaluation
- Default: 50K samples from DrugBank DDI
- Supports both real data and custom inline data
2026-02-03 02:48:31 +00:00
0f4858d22f fix: disable checkpoint saving to avoid tensor contiguity error 2026-02-03 02:38:15 +00:00
2680ad5502 fix: remove MinIO dependency, use inline training data 2026-02-03 02:27:49 +00:00
9ca3d6c195 Add DDI training pipeline with RunPod serverless GPU support 2026-02-02 23:56:05 +00:00