Skip to content

Releases: HXLStandard/libhxl-python

libhxl-python v4.28

20 Mar 14:35

Choose a tag to compare

  • update requirements to allow latest versions of dependencies
  • don't fall back to CSV if we have a MIME type or file extension that's not in the allow list (which is fairly liberal)
  • change behaviour of ReplaceDataFilter so that with a map, it stops after the first successful replacement for each value
  • change to ReplaceDataFilter means that replacement maps can now have default values at the end
  • speed up ReplaceDataFilter by precompiling matching column indices for each pattern/replacement
  • simplify Excel handling to remove dependency on mmap and tmpfile
  • add patch from wynnw to fix crash reading some local files
  • fixed issue where libhxl can't read directly from sys.stdin

libhxl-python v4.27.3

12 Dec 11:21
5906e6e

Choose a tag to compare

Interim release to decrease log levels and thus reduce the amount of generated logs.

libhxl-python v4.27.2

25 Nov 16:09

Choose a tag to compare

Interim release, mostly around logging changes to support the HXL Proxy.

  • use structlog module
  • add hxl.util.logup function to include function name in log message
  • switch to logup in input module
  • add extra help text for hxlcount script

libhxl-python v4.27

30 Sep 16:46

Choose a tag to compare

  • raise a HXLHTMLException (subclass of HXLIOException) when HTML markup found
  • clean dates that are integers, assuming seconds or days since epoch (with cutouts for years and month days)
  • bypass RequestResponseIOWrapper, because it's causing grief; need to make sure content still gets uncompressed
  • fix input options for secondary datasets in scripts (e.g. an appended or joined dataset)
  • more debug logging about HXL hashtag detection
  • add a new logger to hxl.REMOTE_ACCESS specifically for external URL access
  • various command-line script updates and fixes

libhxl-python v4.26

05 Aug 17:31

Choose a tag to compare

  • use loglevel CRITICAL only for infrastructure failures
  • don't put tracebacks in logging (avoid logger.exception())

(4.25.2)

  • raise exception for HTML input (prevents tagger exploit in HXL Proxy)
  • make hxl.input.munge_url() public
  • remove default tags for hxlcount script so that it can just count lines of data

(4.25.1)

  • support optionally filling merged areas (XLS and XLSX only)
  • support optionally scanning CKAN datasets for the first HXLated resource (in hxl.data(), not make_input())
  • support Google Sheets interactive view URLs for XLSX files (but not tab identifiers)
  • make Excel workbook handling more efficient
  • add --encoding option to all command-line scripts (tested with CSV)
  • add --expand-merged option to all comand-line scripts
  • add --scan-ckan-resources option to all command-line scripts
  • add hxl.input.InputOption object to hold all input options
  • add info method to input objects to get general info about an XLSX file (including HXLated and merged areas in each sheet)
  • refactor input and model classes
  • upgrade to xlrd3 version 1.1.0

libhxl-python v4.25.2

15 Jul 15:39

Choose a tag to compare

  • raise exception for HTML input (prevents tagger exploit in HXL Proxy)
  • make hxl.input.munge_url() public
  • remove default tags for hxlcount script so that it can just count lines of data

libhxl-python v4.25.1

28 Jun 13:18

Choose a tag to compare

  • support optionally filling merged areas (XLS and XLSX only)
  • support optionally scanning CKAN datasets for the first HXLated resource (in hxl.data(), not make_input())
  • support Google Sheets interactive view URLs for XLSX files (but not tab identifiers)
  • make Excel workbook handling more efficient
  • add --encoding option to all command-line scripts (tested with CSV)
  • add --expand-merged option to all comand-line scripts
  • add --scan-ckan-resources option to all command-line scripts
  • add hxl.input.InputOption object to hold all input options
  • add info() method to input objects to get general info about an XLSX file (including HXLated and merged areas in each sheet)
  • refactor input and model classes
  • upgrade to xlrd3 version 1.1.0

libhxl-python v4.25

28 Mar 15:23

Choose a tag to compare

  • remove discontinued "encoding" keyword for json.load (Python 3.9)
  • rename the hxl.io module to hxl.input (to avoid problems in Python on Windows)

libhxl-python v4.24

23 Apr 13:55

Choose a tag to compare

  • fix bug that prevented tagger from adding tags if there was a blank row before the text headers

libhxl-python v4.23

08 Feb 23:29

Choose a tag to compare

  • handle encoding errors in CSV more gracefully (replace with "?")
  • fix bug in JSON recipes (correct "key" to "tags" in the sort filter)
  • fix bug in the sort filter when there is no key and untagged columns on the right side of the dataset