add RAT support to rasters. #25 by davemfish · Pull Request #122 · natcap/geometamaker

davemfish · 2026-03-20T20:30:22Z

Added read support for a GDAL Raster Attribute Table. If it exists, the
table will be included as band metadata under the 'raster_attribute_table'
key. It can be retrieved by the get_rat method of a RasterResource
instance.

Fixes #25

emilyanndavis

Great addition, @davemfish! I have a handful of suggestions, mainly around documentation.

emilyanndavis · 2026-03-24T21:44:40Z

src/geometamaker/models.py

+    type: str
+    """Datatype of the content of the column, see ``gdal.GFT_*``."""
+    usage: str
+    """The intended use of the column, see ``gdal.GFU_*``."""


Do you think it might be more helpful to refer to the GDAL enum names here? If nothing else, they're probably more reliable search terms for finding the relevant GDAL documentation.

e.g.,

Suggested change

-    type: str
-    """Datatype of the content of the column, see ``gdal.GFT_*``."""
-    usage: str
-    """The intended use of the column, see ``gdal.GFU_*``."""
+    type: str
+    """Datatype of the content of the column, see ``gdal.GDALRATFieldType``."""
+    usage: str
+    """The intended use of the column, see ``gdal.GDALRATFieldUsage``."""

or even

Suggested change

-    type: str
-    """Datatype of the content of the column, see ``gdal.GFT_*``."""
-    usage: str
-    """The intended use of the column, see ``gdal.GFU_*``."""
+    type: str
+    """Datatype of the content of the column. String representation of one of ``gdal.GDALRATFieldType``."""
+    usage: str
+    """The intended use of the column. String representation of one of ``gdal.GDALRATFieldUsage``."""

(though I'm not 100% sure I'm using "one of" properly there—I always get confused about the singular vs. plural nature of enums. 🙃 )

Just to be thorough, we could also cover the case where we're reading the RAT directly from a .vat.dbf file:

Suggested change

-    type: str
-    """Datatype of the content of the column, see ``gdal.GFT_*``."""
-    usage: str
-    """The intended use of the column, see ``gdal.GFU_*``."""
+    type: str
+    """Datatype of the content of the column. String representation of one of ``gdal.GDALRATFieldType``, or a string returned by ``osgeo.ogr.FieldDefn.GetTypeName``."""
+    usage: str
+    """The intended use of the column. String representation of one of ``gdal.GDALRATFieldUsage``."""

emilyanndavis · 2026-03-24T21:45:52Z

src/geometamaker/models.py

+    model_config = ConfigDict(frozen=True)
+
+    table_type: str
+    """Thematic or Athematic, see ``gdal.GRTT_*``."""


Suggested change

-    """Thematic or Athematic, see ``gdal.GRTT_*``."""
+    """Thematic or Athematic. String representation of one of ``gdal.GDALRATTableType``."""

Again, to be thorough, it might be worth mentioning 'Unknown' is a possibility here as well.

Suggested change

-    """Thematic or Athematic, see ``gdal.GRTT_*``."""
+    """Thematic, Athematic, or Unknown. String representation of one of ``gdal.GDALRATTableType``, or 'Unknown' if this information is unavailable."""

emilyanndavis · 2026-03-24T21:51:26Z

src/geometamaker/models.py

+        for i in range(rat.GetColumnCount()):
+            columns.append(RATColumnDefn(
+                name=rat.GetNameOfCol(i),
+                type=utils._GFT_INT_TO_STR[rat.GetTypeOfCol(i)],
+                usage=utils._GFU_INT_TO_STR[rat.GetUsageOfCol(i)]))
+        rows = []
+        for i in range(rat.GetRowCount()):
+            row = {}
+            for j in range(rat.GetColumnCount()):


There's probably very little overhead involved in the call to GetColumnCount, but it still might be worth defining a variable (e.g., num_cols) once, outside these two loops, rather than invoking the method once for every row.

Suggested change

-        for i in range(rat.GetColumnCount()):
-            columns.append(RATColumnDefn(
-                name=rat.GetNameOfCol(i),
-                type=utils._GFT_INT_TO_STR[rat.GetTypeOfCol(i)],
-                usage=utils._GFU_INT_TO_STR[rat.GetUsageOfCol(i)]))
-        rows = []
-        for i in range(rat.GetRowCount()):
-            row = {}
-            for j in range(rat.GetColumnCount()):
+        num_cols = rat.GetColumnCount()
+        for i in range(num_cols):
+            columns.append(RATColumnDefn(
+                name=rat.GetNameOfCol(i),
+                type=utils._GFT_INT_TO_STR[rat.GetTypeOfCol(i)],
+                usage=utils._GFU_INT_TO_STR[rat.GetUsageOfCol(i)]))
+        rows = []
+        for i in range(rat.GetRowCount()):
+            row = {}
+            for j in range(num_cols):
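
For illustration only, here is a minimal sketch of the effect of hoisting the column count. It does not use GDAL: `FakeRAT` is a hypothetical stand-in that mimics only the handful of RAT methods needed and counts calls to `GetColumnCount`, and `read_rows` is an invented helper, not the function in this PR.

```python
class FakeRAT:
    """Hypothetical stand-in for a GDAL RAT; counts GetColumnCount calls."""

    def __init__(self, names, rows):
        self._names = names
        self._rows = rows
        self.column_count_calls = 0

    def GetColumnCount(self):
        self.column_count_calls += 1
        return len(self._names)

    def GetRowCount(self):
        return len(self._rows)

    def GetNameOfCol(self, j):
        return self._names[j]

    def GetValueAsString(self, i, j):
        return str(self._rows[i][j])


def read_rows(rat):
    # Query the column count once and reuse it in both loops.
    num_cols = rat.GetColumnCount()
    names = [rat.GetNameOfCol(j) for j in range(num_cols)]
    rows = []
    for i in range(rat.GetRowCount()):
        rows.append(
            {names[j]: rat.GetValueAsString(i, j) for j in range(num_cols)})
    return rows


rat = FakeRAT(['VALUE', 'COUNT'], [[1, 10], [2, 20], [3, 30]])
rows = read_rows(rat)
```

With the count hoisted, `rat.column_count_calls` stays at 1 regardless of how many rows the table has.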

emilyanndavis · 2026-03-24T21:56:15Z

src/geometamaker/models.py

+                    case 'Integer':
+                        row[columns[j].name] = rat.GetValueAsInt(i, j)
+                    case 'String':
+                        row[columns[j].name] = rat.GetValueAsString(i, j)
+                    case 'Real':
+                        row[columns[j].name] = rat.GetValueAsDouble(i, j)
+                    case 'Boolean':
+                        row[columns[j].name] = rat.GetValueAsBoolean(i, j)
+                    case 'DateTime':
+                        row[columns[j].name] = rat.GetValueAsDateTime(i, j)
+                    case 'WKBGeometry':
+                        row[columns[j].name] = rat.GetValueAsWKBGeometry(i, j)
+                    case _:
+                        row[columns[j].name] = rat.GetValueAsString(i, j)


May as well use col in all these case statements, since col is already defined as a reference to columns[j].

Suggested change

-                    case 'Integer':
-                        row[columns[j].name] = rat.GetValueAsInt(i, j)
-                    case 'String':
-                        row[columns[j].name] = rat.GetValueAsString(i, j)
-                    case 'Real':
-                        row[columns[j].name] = rat.GetValueAsDouble(i, j)
-                    case 'Boolean':
-                        row[columns[j].name] = rat.GetValueAsBoolean(i, j)
-                    case 'DateTime':
-                        row[columns[j].name] = rat.GetValueAsDateTime(i, j)
-                    case 'WKBGeometry':
-                        row[columns[j].name] = rat.GetValueAsWKBGeometry(i, j)
-                    case _:
-                        row[columns[j].name] = rat.GetValueAsString(i, j)
+                    case 'Integer':
+                        row[col.name] = rat.GetValueAsInt(i, j)
+                    case 'String':
+                        row[col.name] = rat.GetValueAsString(i, j)
+                    case 'Real':
+                        row[col.name] = rat.GetValueAsDouble(i, j)
+                    case 'Boolean':
+                        row[col.name] = rat.GetValueAsBoolean(i, j)
+                    case 'DateTime':
+                        row[col.name] = rat.GetValueAsDateTime(i, j)
+                    case 'WKBGeometry':
+                        row[col.name] = rat.GetValueAsWKBGeometry(i, j)
+                    case _:
+                        row[col.name] = rat.GetValueAsString(i, j)

emilyanndavis · 2026-03-24T22:02:50Z

src/geometamaker/models.py

+            if name == 'VALUE':
+                usage = 'MinMax'
+            elif name == 'COUNT':
+                usage = 'PixelCount'
+            # I'm not sure how standard any other fields are, so just calling
+            # them all 'Generic' may be good enough.
+            else:
+                usage = 'Generic'


It might be worth referencing the _GFU_INT_TO_STR dictionary here to define these usage strings, just to make sure they don't get out of sync with that source of truth.
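
One possible shape for this, as a sketch under assumptions: the integers below stand in for `gdal.GFU_Generic`, `gdal.GFU_PixelCount`, and `gdal.GFU_MinMax`, the local `_GFU_INT_TO_STR` is a hypothetical three-entry excerpt rather than the real dictionary in `utils`, and `_DBF_NAME_TO_GFU` and `usage_for_dbf_field` are invented names for illustration.

```python
# Stand-ins for gdal.GFU_Generic, gdal.GFU_PixelCount, gdal.GFU_MinMax
# (assumed values; real code would use the gdal constants directly).
GFU_GENERIC = 0
GFU_PIXEL_COUNT = 1
GFU_MIN_MAX = 5

# Hypothetical excerpt of the int-to-string mapping kept in utils.
_GFU_INT_TO_STR = {
    GFU_GENERIC: 'Generic',
    GFU_PIXEL_COUNT: 'PixelCount',
    GFU_MIN_MAX: 'MinMax',
}

# Field names with a conventional meaning in a .vat.dbf table.
_DBF_NAME_TO_GFU = {
    'VALUE': GFU_MIN_MAX,
    'COUNT': GFU_PIXEL_COUNT,
}


def usage_for_dbf_field(name):
    """Look up the usage string via the shared mapping, default Generic."""
    return _GFU_INT_TO_STR[_DBF_NAME_TO_GFU.get(name, GFU_GENERIC)]
```

Because the strings come only from the shared mapping, renaming a usage label there would carry through to the DBF code path as well.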

emilyanndavis · 2026-03-24T22:08:28Z

src/geometamaker/models.py

    """Unit of measurement for the pixel values."""
    gdal_metadata: dict = {}
    """Metadata key:value pairs stored in the GDAL band object."""
+    raster_attribute_table: RasterAttributeTable | None = None


Probably worth adding a short docstring here, for consistency?

davemfish · 2026-03-25T14:49:44Z

Thanks, @emilyanndavis , these are all clear improvements. Back to you.

davemfish added 9 commits March 20, 2026 12:03

add RAT support to rasters. natcap#25

78a0847

add a more convenient get_rat method. natcap#25

143ac46

remove the to_dataframe method because we do not require pandas. natcap#25

65841d0

make RAT attributes immutable. natcap#25

d4488c1

a note for HISTORY. natcap#25

be3fec9

replace integers with gdalconst variables. natcap#25

1ce4a41

add support for reading DBF RAT with older gdal versions. natcap#25

6d36a05

added test and test data for reading rat from dbf. natcap#25

b772280

clear whitespace. natcap#25

b10ad75

davemfish marked this pull request as ready for review March 24, 2026 16:41

davemfish added 2 commits March 24, 2026 12:48

replace references to gdalconst with gdal module. natcap#25

8ef1366

fixing grammar in a comment. natcap#25

b2f504b

emilyanndavis requested changes Mar 24, 2026

davemfish requested a review from emilyanndavis March 25, 2026 14:49

improve attribute docstrings. natcap#25

02f5f0f

davemfish wants to merge 12 commits into natcap:main from davemfish:feature/25-raster-attribute-table
