mirror of https://github.com/ghndrx/kubeflow-pipelines.git synced 2026-02-10 06:45:13 +00:00

Greg Hendrickson 67a1095100 feat: Add 176K real DrugBank DDI samples with drug names

- Downloaded 191K DDI pairs from TDC DrugBank
- Fetched 1,634 drug names from PubChem API (96% hit rate)
- Created complete training dataset with:
  - Real drug names (not just IDs)
  - 86 interaction type descriptions
  - Severity labels (minor/moderate/major/contraindicated)
- Bundled 34MB data file in Docker image
- Handler loads real data instead of curated samples

2026-02-03 04:34:54 +00:00

.github/workflows

feat: Use self-hosted runner + curated DDI dataset

2026-02-03 03:27:10 +00:00

components/runpod_trainer

feat: Add 176K real DrugBank DDI samples with drug names

2026-02-03 04:34:54 +00:00

manifests

Initial Kubeflow GitOps setup with example pipelines

2026-02-02 23:37:53 +00:00

pipelines

Fix Python/transformers version compatibility

2026-02-03 01:11:20 +00:00

.gitignore

Add .gitignore to prevent credential leaks

2026-02-02 23:39:06 +00:00

ddi_data_prep_ts.yaml

Remove internal domains from README

2026-02-03 00:45:27 +00:00

ddi_data_prep.yaml

Remove internal domains from README

2026-02-03 00:45:27 +00:00

ddi_training_runpod.yaml

Fix MinIO endpoint to use internal cluster service

2026-02-03 00:15:26 +00:00

hello_world.yaml

Add GHA workflow for pipeline compilation

2026-02-02 23:41:49 +00:00

README.md

Remove internal domains from README

2026-02-03 00:45:27 +00:00

README.md

Kubeflow Pipelines - GitOps Repository

This repository contains ML pipeline definitions managed via ArgoCD.

Structure

.
├── pipelines/           # Pipeline Python definitions
│   └── examples/        # Example pipelines
├── components/          # Reusable pipeline components
├── experiments/         # Experiment configurations
├── runs/                # Scheduled/triggered runs
└── manifests/           # K8s manifests for ArgoCD

Usage

1. Add a pipeline: create a Python file in pipelines/.
2. Push to main: ArgoCD deploys the change automatically.
3. Monitor: check the Kubeflow UI at <KUBEFLOW_URL>.

Quick Start

from kfp import dsl

@dsl.component
def hello_world() -> str:
    return "Hello from Kubeflow!"

@dsl.pipeline(name="hello-pipeline")
def hello_pipeline():
    hello_world()
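
The pipeline above is only a Python definition. The repository also carries compiled files such as hello_world.yaml, so a compile step along the following lines is probably involved. This is a minimal sketch, not the repository's actual workflow: the helper names `compiled_name` and `compile_pipeline` and the source path `pipelines/examples/hello_world.py` are hypothetical, and the naming convention (source file stem plus `.yaml`) is an assumption. The only KFP call used is `kfp.compiler.Compiler().compile`, from the KFP v2 SDK.

```python
from pathlib import Path


def compiled_name(pipeline_file: str) -> str:
    """Map a pipeline source file to a YAML file name.

    Assumed convention (not confirmed by the repository):
    pipelines/examples/hello_world.py -> hello_world.yaml
    """
    return Path(pipeline_file).with_suffix(".yaml").name


def compile_pipeline(pipeline_func, package_path: str) -> str:
    """Compile a @dsl.pipeline function into a YAML package at package_path."""
    # Imported lazily so this module can be imported without the KFP SDK installed.
    from kfp import compiler

    compiler.Compiler().compile(
        pipeline_func=pipeline_func,
        package_path=package_path,
    )
    return package_path
```

With the Quick Start definitions in scope, `compile_pipeline(hello_pipeline, compiled_name("pipelines/examples/hello_world.py"))` would write hello_world.yaml to the current directory.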

Environment

Kubeflow: <KUBEFLOW_URL>
MinIO: <MINIO_URL>
ArgoCD: <ARGOCD_URL>
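
Because the endpoints above are placeholders, scripts that talk to these services need the real URLs from somewhere outside the repository. One option is to read them from environment variables. The sketch below assumes variables named after the placeholders (`KUBEFLOW_URL`, `MINIO_URL`, `ARGOCD_URL`); those names, the `Endpoints` class, and `load_endpoints` are illustrative and not part of this repository.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Assumed variable names, mirroring the placeholders in the Environment section.
REQUIRED_VARS = ("KUBEFLOW_URL", "MINIO_URL", "ARGOCD_URL")


@dataclass(frozen=True)
class Endpoints:
    kubeflow: str
    minio: str
    argocd: str


def load_endpoints(env: Optional[Mapping[str, str]] = None) -> Endpoints:
    """Read service URLs from env (defaults to os.environ), failing fast if any is unset."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise ValueError("missing environment variables: " + ", ".join(missing))
    return Endpoints(
        kubeflow=env["KUBEFLOW_URL"],
        minio=env["MINIO_URL"],
        argocd=env["ARGOCD_URL"],
    )
```

Keeping the URLs out of the code fits the repository's choice to keep internal domains out of the README.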