Skip to content

e-south/dnadesign

Repository files navigation

CI Codecov Python 3.12

dnadesign banner

dnadesign is a collection of modular bioinformatics tools for biological sequence design. It brings dataset management, sequence generation, assembly of designed or wild-type sequences into larger constructs, sequence-model inference, unsupervised analysis of learned sequence representations, optimization toward defined phenotypes, and cross-tool workflow orchestration into one repository.


Documentation

Use the docs index to choose a workflow, inspect existing work, or jump to a tool.


Available tools

Use the tool READMEs below for task-specific docs and CLI entrypoints.

Tool Description Coverage
usr Inspect and manage USR datasets and Parquet-backed records. usr coverage
ops Plan, submit, and inspect batch runbooks across tools. ops coverage
studies Hold study-family adapters that read checked-in study records and plug family-owned observation/preflight logic into Ops. studies coverage
notify Send webhook notifications for local runs and scheduler-backed jobs. notify coverage
densegen Generate DNA sequence libraries from workspace configs and inputs. CLI: uv run dense. densegen coverage
infer Run sequence-model inference and write derived outputs back to datasets. infer coverage
construct Build template-based DNA constructs and record placement metadata. construct coverage
opal Run active-learning campaigns over labeled sequence datasets. opal coverage
cluster Cluster feature tables, render UMAPs, and write analysis outputs. cluster coverage
billboard Measure regulatory diversity across DenseGen libraries. billboard coverage
libshuffle Subsample libraries iteratively and analyze each round with billboard. libshuffle coverage
nmf Apply NMF to sequence libraries and summarize recurring TFBS combinations. nmf coverage
latentdna Materialize workspace-owned latent views, samples, projections, and plots. latentdna coverage
cruncher Optimize sequences against PWM-driven objectives. cruncher coverage
tfkdanalysis Analyze TFKD libraries in PPTP-seq context. tfkdanalysis coverage
aligner Score global alignments with Biopython PairwiseAligner. aligner coverage
baserender Render sequences through a schema-driven adapter runtime. baserender coverage
contracts Publish shared cross-tool artifact contracts without importing tool internals. contracts coverage
permuter Generate sequence permutations and evaluate downstream outputs. permuter coverage

About

A collection of bioinformatic modules and data analysis pipelines, all related to DNA sequence design.

Resources

License

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors