Greg Hendrickson 67a1095100 feat: Add 176K real DrugBank DDI samples with drug names
- Downloaded 191K DDI pairs from TDC DrugBank
- Fetched 1,634 drug names from PubChem API (96% hit rate)
- Created complete training dataset with:
  - Real drug names (not just IDs)
  - 86 interaction type descriptions
  - Severity labels (minor/moderate/major/contraindicated)
- Bundled 34MB data file in Docker image
- Handler loads real data instead of curated samples
2026-02-03 04:34:54 +00:00
2026-02-03 00:45:27 +00:00

Kubeflow Pipelines - GitOps Repository

This repository contains ML pipeline definitions managed via ArgoCD.

Structure

.
├── pipelines/           # Pipeline Python definitions
│   └── examples/        # Example pipelines
├── components/          # Reusable pipeline components
├── experiments/         # Experiment configurations
├── runs/               # Scheduled/triggered runs
└── manifests/          # K8s manifests for ArgoCD

Usage

  1. Add a pipeline: Create a Python file in pipelines/
  2. Push to main: ArgoCD auto-deploys
  3. Monitor: Check Kubeflow UI at <KUBEFLOW_URL>

Quick Start

from kfp import dsl

@dsl.component
def hello_world() -> str:
    return "Hello from Kubeflow!"

@dsl.pipeline(name="hello-pipeline")
def hello_pipeline():
    hello_world()

Environment

  • Kubeflow: <KUBEFLOW_URL>
  • MinIO: <MINIO_URL>
  • ArgoCD: <ARGOCD_URL>
Description
Languages
Python 99%
Dockerfile 1%