Releases: srlearn/relational-datasets
Releases · srlearn/relational-datasets
Regression and Classification Set + Recommended Loading Scripts
What's Changed
Release Notes: https://srlearn.github.io/relational-datasets/changelog/#v040-2022-11-03
- ✨ Bump
datasetstov0.0.6, Addcalifornia_housingandroofworld20by @hayesall in #28 - 🚧 Bump codecov-action to
v3by @hayesall in #29 - ✨ Dataset loading recommendations by @hayesall in #30
- 🔧 Set
pythonpublish.ymlto usepypi-publishby @hayesall in #31 - 📝 Add changelog entry for
v0.4.0by @hayesall in #32
Full Changelog: v0.3.0...v0.4.0
Unique names assumption and avoiding type conflicts
Multiclass support
Software Changes:
- Add multiclass support to
convert.from_numpy
Documentation:
- Add notebook with overview on converting multiclass vector datasets
Testing:
- Add tests for
convert.from_numpy
Drug Interactions and Toy Machines
Software Changes
- Add
drug_interactionsandtoy_machinesdatasets - Add
v0.0.5as the latestsrlearn/datasetsrelease
v0.2.0
Software Changes:
- Add
convertmodule withfrom_numpyimplementation to convert binary
classification and regression datasets based on ordinal encodings. - Fix type annotations in
relational_datasets.request - Fix type annotations in
relational_datasets.types
Documentation:
- Add tutorial for converting vector/propositional datasets to relational
- Add
mkdocsdependency:pymdownx.tasklist - Add
binderandcolablaunch badges to Jupyter notebook tutorials
Testing:
- Add
lgtmbuild step + README badge - Add
codecovbuild step + README badge - Add
numpy>=1.20.0as an optional setup target, and test against it
(this is the earliest version ofnumpywhere type annotations formypy
seem to be consistently available)
v0.1.1 - Split loading and deserializing / Project Documentation
Software Changes:
- Bump default dataset version:
v0.0.3→v0.0.4. - Between
v0.0.3andv0.0.4ofsrlearn/datasets, all zipfiles now have the version number appended
(e.g.toy_cancer_v0.0.4.zip). Add logic to request the correct filename from GitHub. - Add
deserialize_zipfilefunction, split out code for pulling zipfile content from theloadmethod. - Add private
_make_file_pathfunction to handle where zipfiles are stored on a user's filesystem. - Move
RelationalDatasettype intorelational_datasets/types.py - Fix
hayesall/relational_datasets→srlearn/relational_datasetsinsetup.py - Clarify
typing.Optionalin function signatures where default file paths are allowed. - Add
__version__to the main__init__, soprint(relational_datasets.__version__)is valid.
Documentation:
- Add
mkdocsbuilds with each push to themainbranch. - Add
requirements_dev.txtwith requirements to build documentation. - Add
docs/build._docs.pyto build a Downloads page and an overview of each dataset pulled from thesrlearn/datasetsrepository. - Add
docs/notebooks/directory for literate tutorials- Add
00_loading_and_fetching.ipynb
- Add
- Pages for functions and types:
types.RelationalDatasetrequest.deserialize_zipfilerequest.fetchrequest.latest_versionrequest.load
🔖 Initial Release
Release basic specification:
load(name: str, version: str = "", fold: int = 1) -> Tuple[RelationalDataset, RelationalDataset]: Load examples and facts for a dataset/version/foldfetch(name: str, version: str = "") -> str: Download and cache an archive locallyget_data_home(data_home=None) -> str: Get the path to the cache directoryclear_data_home(data_home=None) -> None: Remove the cache directorylatest_version() -> str: Check latest version of datasets on GitHub