StreamDFP

This repository is based on the open-source StreamDFP project and extends it with an LLM-enhanced workflow for root-cause extraction, rule fusion, and model-level policy evaluation in disk failure prediction.

The codebase keeps both the upstream Python + Java prediction pipeline and the extension work added on top of it. It is organized for research reproduction rather than as a minimal library package. Source code, experiment scripts, and result notes are kept together, while large datasets, logs, generated caches, and local demo bundles stay outside normal versioned source paths.

Overview

Classic StreamDFP pipeline for HDD/SSD failure prediction with Python preprocessing and Java simulation.
LLM-enhanced framework_v1 pipeline for Phase1 window summarization, Phase2 root-cause extraction, and Phase3 policy evaluation.
New-model calibration branch for pilot20k admission testing before a disk model is added to the per-model LLM policy registry.
Local workbench UI plus normalized workflows/ wrappers so common tasks do not depend on memorizing historical script names.
Default local base model: Qwen3-4B-Instruct-2507. API-side comparison runs such as Qwen3.5-Plus remain optional comparison branches rather than the repository default.

Pipeline Overview

flowchart TB
    A[SMART CSV data]

    subgraph U[Upstream StreamDFP baseline]
        A --> B[pyloader/run.py<br/>preprocessing, labeling, sample generation]
        B --> C[simulate.Simulate + MOA<br/>training and simulation]
        C --> D[parse.py<br/>baseline metrics]
    end

    subgraph L[LLM extension on top of StreamDFP]
        B --> E[Phase1: window_to_text.py<br/>window_text and references]
        E --> F[Phase2: llm_offline_extract.py<br/>root-cause cache extraction]
        F --> G[Phase3: build_cache_variant.py + grid scripts<br/>cache variants and policy search]
        G --> H[run.py + simulate.Simulate<br/>re-evaluation with LLM signals]
        H --> I[merged reports and per-model policy<br/>llm_enabled vs fallback]
    end

    subgraph N[New-model calibration branch]
        B --> J[new model onboarding<br/>event mapping and feature contract]
        D --> K[no-LLM baseline reference]
        J --> L1[pilot20k Phase1 + Phase2 + Phase3]
        K --> M[guard check<br/>compare against no-LLM]
        L1 --> M
        M --> N1[policy registration<br/>llm_enabled or fallback]
        N1 --> I
    end

Upstream Attribution

This project builds on the open-source StreamDFP framework:

Upstream repository: https://github.com/shujiehan/StreamDFP

The work in this repository focuses on extending StreamDFP with an LLM-enhanced pipeline for semantic root-cause extraction, rule blending, fallback control, and model-level policy evaluation.

Quick Start

If you only need the main entrypoints:

Read docs/guides/PUBLIC_REPRODUCIBILITY.md for environment setup.
Start the local workbench with ./run_workbench.sh.
Use the workbench or the workflows/ wrappers to launch curated classic and LLM flows.

Main Entry Points

Task	Entry
Start the local UI	`./run_workbench.sh`
Browse normalized CLI wrappers	`workflows/`
Classic preprocessing and simulation	`workflows/classic/`
LLM Phase0/1/2/3 workflows	`workflows/llm/`
New model onboarding	`workflows/llm/new-model-onboarding-calibration.sh`
Single-model pilot20k calibration	`workflows/llm/pilot20k-single-model-calibration.sh`
Public environment and rerun steps	docs/guides/PUBLIC_REPRODUCIBILITY.md
Experiment/document index	docs/README.md

Repository Layout

StreamDFP/
├── pyloader/          # Python preprocessing, feature extraction, labeling, sample generation
├── simulate/          # Java simulation and prediction entry points
├── moa/               # MOA dependency source tree used by the Java pipeline
├── llm/               # LLM prompts, extraction logic, event mappings, contracts, tests
├── ui/                # Local Web UI, workflow registry, and static workbench assets
├── workflows/         # Canonical wrapper entrypoints with normalized names
├── scripts/           # Phase2/Phase3 orchestration, watchers, probes, reproducibility helpers
├── docs/              # Public guides, retained summaries/tables, and archived research notes
├── parse.py           # Parse simulation outputs into metric tables
├── run_workbench.sh   # Stable launcher for the local workbench UI
└── run_*.sh           # Legacy example launchers for baseline experiments

Detailed directory notes are in docs/guides/REPOSITORY_LAYOUT.md. Documentation entry points are indexed in docs/README.md. Historical experiment notes and academic-report materials are kept under docs/archive/README.md.

Workbench UI

The repository now includes a lightweight local Web UI that wraps the most important workflows behind normalized names and categories.

Start it from the repository root:

./run_workbench.sh

Default URL:

http://127.0.0.1:8765

The goal is to make the repository easier to operate without breaking existing script paths. The UI uses a curated workflow registry backed by canonical workflows/... wrappers and keeps the original script names as compatibility metadata.

More details are in docs/guides/WORKBENCH_UI.md. The normalized CLI alias layer is documented in docs/guides/WORKFLOW_ALIASES.md.

Runtime Paths and Onboarding

Classic StreamDFP Pipeline

Generate train/test samples with pyloader/run.py or the pyloader/run_*_loader.sh helpers.
Train and simulate with the Java entrypoint in simulate/ using simulate.Simulate.
Parse metrics with parse.py.

Relevant files:

LLM-Enhanced Framework (`framework_v1`)

Convert sliding windows into textual summaries with llm/window_to_text.py.
Run offline LLM extraction with llm/llm_offline_extract.py.
Build cache variants and evaluate them through the Phase3 grid scripts.
Merge per-model results into model-level policy decisions (llm_enabled vs fallback).

Relevant files:

New-Model Calibration Branch

Provide the raw DISK_MODEL name from the HDD CSV data.
Use the onboarding workflow to derive model_key, build the feature contract, generate the event mapping, and run the classic no-LLM baseline automatically.
Let the same onboarding flow run a pilot20k Phase1 + Phase2 + Phase3 calibration cycle for that model.
Compare the best LLM result against the no-LLM baseline with the policy guards, then review the suggested policy output and register the model as llm_enabled or fallback.

This branch is the intended admission workflow for a previously unseen disk model. A new model should not skip directly to the default runtime policy without this calibration step.

Relevant file:

Core Documents

docs/guides/PUBLIC_REPRODUCIBILITY.md: environment setup and end-to-end reproduction steps
docs/reports/cross_model_llm_framework_v1_final.md: main write-up for the LLM-enhanced pipeline
docs/reports/cross_model_policy_registry_v1_all12.md: merged all12 model-level policy table
docs/reports/llm_robust_eval_report_v4_merged_all12.md: merged policy guard report
docs/reports/llm_vs_nollm_metrics_all12_summary.md: retained model-level metric summary
docs/tables/README.md: public CSV tables retained with the repository
docs/archive/README.md: archived experiment notes and academic-report assets

Environment

Minimum runtime dependencies:

Python 3
numpy, pandas
Java JDK 8

Optional LLM runtime:

vllm for GPU-backed Phase2 extraction
Qwen-family model weights downloaded locally from HuggingFace or ModelScope

Public repo environment files:

The public reproducibility walkthrough is in docs/guides/PUBLIC_REPRODUCIBILITY.md.

Data and Models

This repository does not require committing raw datasets or downloaded model weights.

Public HDD data typically comes from Backblaze SMART records.
Public SSD experiments can use Alibaba SSD SMART datasets.
Local datasets under data/ are ignored by .gitignore.
Local model directories outside the repo are recommended for Qwen checkpoints.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
configs		configs
docs		docs
llm		llm
moa		moa
pyloader		pyloader
scripts		scripts
simulate		simulate
ui		ui
workflows		workflows
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment-public.yml		environment-public.yml
parse.py		parse.py
parse_reg.py		parse_reg.py
pom.xml		pom.xml
requirements-llm-public.txt		requirements-llm-public.txt
requirements-public.txt		requirements-public.txt
run_cross_model_llm_recall_controller.sh		run_cross_model_llm_recall_controller.sh
run_hi640_transfer.sh		run_hi640_transfer.sh
run_hi7.sh		run_hi7.sh
run_hi7_reg.sh		run_hi7_reg.sh
run_hi7_rnn.sh		run_hi7_rnn.sh
run_llm_feature_flow_mc1_qwen3_4b_2507.sh		run_llm_feature_flow_mc1_qwen3_4b_2507.sh
run_llm_feature_flow_qwen3_4b_2507.sh		run_llm_feature_flow_qwen3_4b_2507.sh
run_mc1_mlp.sh		run_mc1_mlp.sh
run_robust_eval_report_v2.sh		run_robust_eval_report_v2.sh
run_stage2_7models_fs_20140901_20141109.sh		run_stage2_7models_fs_20140901_20141109.sh
run_stage2_remaining5_fs_zs_then_shutdown.sh		run_stage2_remaining5_fs_zs_then_shutdown.sh
run_stage2_remaining5_resume_safe_then_shutdown.sh		run_stage2_remaining5_resume_safe_then_shutdown.sh
run_stage3_5_for_completed_map70_models.sh		run_stage3_5_for_completed_map70_models.sh
run_workbench.sh		run_workbench.sh
stop_after_model_fs_zs.sh		stop_after_model_fs_zs.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StreamDFP

Overview

Pipeline Overview

Upstream Attribution

Quick Start

Main Entry Points

Repository Layout

Workbench UI

Runtime Paths and Onboarding

Classic StreamDFP Pipeline

LLM-Enhanced Framework (`framework_v1`)

New-Model Calibration Branch

Core Documents

Environment

Data and Models

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

StreamDFP

Overview

Pipeline Overview

Upstream Attribution

Quick Start

Main Entry Points

Repository Layout

Workbench UI

Runtime Paths and Onboarding

Classic StreamDFP Pipeline

LLM-Enhanced Framework (framework_v1)

New-Model Calibration Branch

Core Documents

Environment

Data and Models

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

LLM-Enhanced Framework (`framework_v1`)

Packages