CrocoLake-Julia is a collection of Jupyter notebooks that shows how to interface with CrocoLake and Argo's parquet databases with Julia.
To install the necessary packages, run these commands:
git clone https://github.com/boom-lab/crocolake-julia
cd crocolake-julia
julia --project=. -e 'using Pkg; Pkg.instantiate()'- You can then launch the notebook via
jupyter notebook example.ipynb(or viajupyter lab). - Click on ⏩ to run all cells at once. The notebook will download small sample datasets to run the examples.
- Example 1 shows how to read, subset, and visaulize data from the Parquet folder.
The following databases are currently available:
- CrocoLake: contains the best available data from Argo, GLODAP, and Spray Gliders. More details here.
- Argo 'QC': contains the best available data, that is real time values are reported only when delayed values are not available. This version is the same used in CrocoLake, and here you can find more details on how it is generated.
- Argo 'ALL': contains all real time and adjusted variables as reported in the core ('<PLATFORM_NUMBER>_prof.nc') and synthetic ('<PLATFORM_NUMBER>_Sprof.nc') profile files, for the physical and biogeochemical versions respectively.
Each database comes in 'PHY' and 'BGC' versions.
- install Julia
- install Jupyter
- Install Jupyter kernel for Julia
For any questions, bugs, missing information, etc, open an issue or get in touch!