mirror of https://github.com/ghndrx/kubeflow-pipelines.git synced 2026-02-10 06:45:13 +00:00


Greg Hendrickson afc8fc6690 feat: Add real DrugBank DDI dataset support via TDC

- Added PyTDC dependency for DrugBank access
- Implemented DDI type -> severity label mapping (0-4)
- Added train/eval split with stratification
- Added accuracy and F1 metrics for evaluation
- Default: 50K samples from DrugBank DDI
- Supports both real data and custom inline data

2026-02-03 02:48:31 +00:00
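
The commit above mentions a severity label mapping (0-4), a stratified train/eval split, and accuracy and F1 metrics. A minimal standard-library sketch of those three ideas might look like the following. The mapping values, the function names, the 20% eval fraction, and the choice of macro-averaged F1 are all assumptions for illustration; the trainer's actual code is not shown on this page and may differ.

```python
import random
from collections import defaultdict

# Hypothetical mapping from interaction-type IDs to severity labels 0-4.
# The real mapping lives in the trainer component and is not reproduced here.
TYPE_TO_SEVERITY = {1: 0, 2: 1, 3: 2, 4: 3, 5: 4}


def stratified_split(labels, eval_fraction=0.2, seed=0):
    """Return (train_idx, eval_idx) so each label keeps roughly the same share."""
    by_label = defaultdict(list)
    for idx, label in enumerate(labels):
        by_label[label].append(idx)
    rng = random.Random(seed)
    train_idx, eval_idx = [], []
    for label in sorted(by_label):
        idxs = by_label[label]
        rng.shuffle(idxs)
        # At least one eval example per label, so every class is evaluated.
        n_eval = max(1, round(len(idxs) * eval_fraction))
        eval_idx.extend(idxs[:n_eval])
        train_idx.extend(idxs[n_eval:])
    return sorted(train_idx), sorted(eval_idx)


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores."""
    scores = []
    for c in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

Splitting per label rather than over the whole dataset keeps rare severity classes represented in the eval set, which is the usual reason to stratify when class frequencies are uneven.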

.github/workflows

Add GHA workflow to build DDI trainer image

2026-02-03 00:28:09 +00:00

components/runpod_trainer

feat: Add real DrugBank DDI dataset support via TDC

2026-02-03 02:48:31 +00:00

manifests

Initial Kubeflow GitOps setup with example pipelines

2026-02-02 23:37:53 +00:00

pipelines

Fix Python/transformers version compatibility

2026-02-03 01:11:20 +00:00

.gitignore

Add .gitignore to prevent credential leaks

2026-02-02 23:39:06 +00:00

ddi_data_prep_ts.yaml

Remove internal domains from README

2026-02-03 00:45:27 +00:00

ddi_data_prep.yaml

Remove internal domains from README

2026-02-03 00:45:27 +00:00

ddi_training_runpod.yaml

Fix MinIO endpoint to use internal cluster service

2026-02-03 00:15:26 +00:00

hello_world.yaml

Add GHA workflow for pipeline compilation

2026-02-02 23:41:49 +00:00

README.md

Remove internal domains from README

2026-02-03 00:45:27 +00:00

README.md

Kubeflow Pipelines - GitOps Repository

This repository contains ML pipeline definitions managed via ArgoCD.

Structure

.
├── pipelines/           # Pipeline Python definitions
│   └── examples/        # Example pipelines
├── components/          # Reusable pipeline components
├── experiments/         # Experiment configurations
├── runs/                # Scheduled/triggered runs
└── manifests/           # K8s manifests for ArgoCD

Usage

1. Add a pipeline: create a Python file in pipelines/.
2. Push to main: ArgoCD picks up the change and deploys it automatically.
3. Monitor: check the Kubeflow UI at <KUBEFLOW_URL>.

Quick Start

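from kfp import dsl

# A lightweight Python component: KFP runs the function body in its own container.
@dsl.component
def hello_world() -> str:
    return "Hello from Kubeflow!"

# The pipeline wires components into a graph; this one has a single step.
@dsl.pipeline(name="hello-pipeline")
def hello_pipeline():
    hello_world()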
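
The repository root holds compiled YAML files such as hello_world.yaml, so a pipeline like the one above is presumably compiled before deployment. A rough sketch of that step with the KFP v2 SDK compiler follows. The helper names `package_path_for` and `compile_pipeline`, and the convention of deriving the file name from the pipeline name, are hypothetical and not taken from this repository.

```python
from pathlib import Path


def package_path_for(pipeline_name, out_dir="."):
    """Map a pipeline name such as "hello-pipeline" to <out_dir>/hello_pipeline.yaml."""
    # Hypothetical naming convention: hyphens become underscores.
    return Path(out_dir) / (pipeline_name.replace("-", "_") + ".yaml")


def compile_pipeline(pipeline_func, pipeline_name, out_dir="."):
    """Compile a KFP v2 pipeline function to a YAML package and return its path."""
    # Imported lazily so the path helper above works where kfp is not installed.
    from kfp import compiler

    package_path = package_path_for(pipeline_name, out_dir)
    compiler.Compiler().compile(
        pipeline_func=pipeline_func,
        package_path=str(package_path),
    )
    return package_path
```

With the Quick Start definitions in scope, calling `compile_pipeline(hello_pipeline, "hello-pipeline")` would write hello_pipeline.yaml to the current directory under these assumptions.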

Environment

Kubeflow: <KUBEFLOW_URL>
MinIO: <MINIO_URL>
ArgoCD: <ARGOCD_URL>
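
Since the README leaves the three endpoints as placeholders, one way to supply them without committing real domains is to read them from environment variables. The sketch below assumes variables named KUBEFLOW_URL, MINIO_URL, and ARGOCD_URL (mirroring the placeholders); the `Endpoints` class and `load_endpoints` function are hypothetical and not part of this repository.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class Endpoints:
    kubeflow: str
    minio: str
    argocd: str


def load_endpoints(env=os.environ):
    """Read the three service URLs, failing loudly when any is unset or empty."""
    names = {
        "kubeflow": "KUBEFLOW_URL",
        "minio": "MINIO_URL",
        "argocd": "ARGOCD_URL",
    }
    missing = [var for var in names.values() if not env.get(var)]
    if missing:
        raise KeyError("missing environment variables: " + ", ".join(missing))
    return Endpoints(**{field: env[var] for field, var in names.items()})
```

Passing a plain dict as `env` makes the loader easy to exercise in tests without touching the real process environment.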