From 26258f96b34c16709a3de28c84b5af91fcd4d447 Mon Sep 17 00:00:00 2001 From: Jia Yu Date: Wed, 6 May 2026 16:42:13 -0700 Subject: [PATCH] [GH-2869] Update Glue docs for Sedona 1.8+ (Glue 5.0, Spark 3.5) The Maven Central link for sedona-spark-shaded-3.3_2.12 returns 404 for current Sedona versions because Spark 3.3 / Java 8 support was dropped in Sedona 1.8.0. Update the Glue tutorial to use Glue 5.0 (Spark 3.5.4, Java 17, Python 3.11) with the sedona-spark-shaded-3.5_2.12 artifact, and tell readers stuck on Glue 4.0 to pin Sedona <= 1.7.1. --- docs/setup/glue.md | 20 ++++++++++---------- 1 file changed, 10 insertions(+), 10 deletions(-) diff --git a/docs/setup/glue.md b/docs/setup/glue.md index ec6c6722a87..c31284ff2c2 100644 --- a/docs/setup/glue.md +++ b/docs/setup/glue.md @@ -22,23 +22,23 @@ This tutorial will cover how to configure both a glue notebook and a glue ETL jo have a working knowledge of AWS Glue jobs. In the tutorial, we use -Sedona {{ sedona.current_version }} and [Glue 4.0](https://docs.aws.amazon.com/glue/latest/dg/release-notes.html) which runs on Spark 3.3.0, Java 8, Scala 2.12, -and Python 3.10. We recommend Sedona-1.3.1-incubating and above for Glue. +Sedona {{ sedona.current_version }} and [Glue 5.0](https://docs.aws.amazon.com/glue/latest/dg/release-notes.html) which runs on Spark 3.5.4, Java 17, Scala 2.12, +and Python 3.11. We recommend Sedona 1.8.0 and above for Glue 5.0. !!!warning - **Important:** Since Sedona 1.8.0, Java 8 support is dropped and Spark 3.3 support is dropped. For Sedona 1.8.0+, you need to use Glue 5.0+ which supports Java 11 and Spark 3.4+. + **Important:** Since Sedona 1.8.0, Java 8 support is dropped and Spark 3.3 support is dropped. Sedona 1.8.0+ requires Glue 5.0+ which supports Java 17 and Spark 3.5+. If you must stay on Glue 4.0 (Spark 3.3, Java 8), use Sedona 1.7.1 or lower along with the `sedona-spark-shaded-3.3_2.12` artifact instead of the `sedona-spark-shaded-3.5_2.12` artifact shown below. ## Gather Maven Links -You will need to point your glue job to the Sedona and Geotools jars. We recommend using the jars available from maven. The links below are those intended for Glue 4.0 +You will need to point your glue job to the Sedona and Geotools jars. We recommend using the jars available from maven. The links below are those intended for Glue 5.0. -Sedona Jar: [Maven Central](https://repo1.maven.org/maven2/org/apache/sedona/sedona-spark-shaded-3.3_2.12/{{ sedona.current_version }}/sedona-spark-shaded-3.3_2.12-{{ sedona.current_version }}.jar) +Sedona Jar: [Maven Central](https://repo1.maven.org/maven2/org/apache/sedona/sedona-spark-shaded-3.5_2.12/{{ sedona.current_version }}/sedona-spark-shaded-3.5_2.12-{{ sedona.current_version }}.jar) Geotools Jar: [Maven Central](https://repo1.maven.org/maven2/org/datasyslab/geotools-wrapper/{{ sedona.current_geotools }}/geotools-wrapper-{{ sedona.current_geotools }}.jar) !!!note - Ensure you pick a version for Scala 2.12 and Spark 3.3. The Spark 3.4 and Scala - 2.13 jars are not compatible with Glue 4.0. + Ensure you pick a version for Scala 2.12 and Spark 3.5. The Scala + 2.13 jars are not compatible with Glue 5.0. ## Configure Glue Job @@ -55,7 +55,7 @@ and the second installs the Sedona Python package directly from pip. ```text # Sedona Config -%extra_jars https://repo1.maven.org/maven2/org/apache/sedona/sedona-spark-shaded-3.3_2.12/{{ sedona.current_version }}/sedona-spark-shaded-3.3_2.12-{{ sedona.current_version }}.jar, https://repo1.maven.org/maven2/org/datasyslab/geotools-wrapper/{{ sedona.current_geotools }}/geotools-wrapper-{{ sedona.current_geotools }}.jar +%extra_jars https://repo1.maven.org/maven2/org/apache/sedona/sedona-spark-shaded-3.5_2.12/{{ sedona.current_version }}/sedona-spark-shaded-3.5_2.12-{{ sedona.current_version }}.jar, https://repo1.maven.org/maven2/org/datasyslab/geotools-wrapper/{{ sedona.current_geotools }}/geotools-wrapper-{{ sedona.current_geotools }}.jar %additional_python_modules apache-sedona=={{ sedona.current_version }} ``` @@ -63,12 +63,12 @@ If you are using the example notebook from glue, the first cell should now look ```text %idle_timeout 2880 -%glue_version 4.0 +%glue_version 5.0 %worker_type G.1X %number_of_workers 5 # Sedona Config -%extra_jars https://repo1.maven.org/maven2/org/apache/sedona/sedona-spark-shaded-3.3_2.12/{{ sedona.current_version }}/sedona-spark-shaded-3.3_2.12-{{ sedona.current_version }}.jar, https://repo1.maven.org/maven2/org/datasyslab/geotools-wrapper/{{ sedona.current_geotools }}/geotools-wrapper-{{ sedona.current_geotools }}.jar +%extra_jars https://repo1.maven.org/maven2/org/apache/sedona/sedona-spark-shaded-3.5_2.12/{{ sedona.current_version }}/sedona-spark-shaded-3.5_2.12-{{ sedona.current_version }}.jar, https://repo1.maven.org/maven2/org/datasyslab/geotools-wrapper/{{ sedona.current_geotools }}/geotools-wrapper-{{ sedona.current_geotools }}.jar %additional_python_modules apache-sedona=={{ sedona.current_version }}