Releases: HXLStandard/libhxl-python
libhxl-python v4.28
- update requirements to allow latest versions of dependencies
- don't fall back to CSV if we have a MIME type or file extension that's not in the allow list (which is fairly liberal)
- change behaviour of ReplaceDataFilter so that with a map, it stops after the first successful replacement for each value
- change to ReplaceDataFilter means that replacement maps can now have default values at the end
- speed up ReplaceDataFilter by precompiling matching column indices for each pattern/replacement
- simplify Excel handling to remove dependency on mmap and tmpfile
- add patch from wynnw to fix crash when reading some local files
- fix issue where libhxl couldn't read directly from sys.stdin
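
The ReplaceDataFilter change in this release (stop after the first successful replacement, so a catch-all rule at the end acts as a default) can be illustrated with a minimal sketch. This is not the library's implementation: `replace_value` and the `rules` list are hypothetical names, and plain regular expressions stand in for whatever matching the filter actually uses.

```python
import re

def replace_value(value, replacement_map):
    """Return the replacement from the first matching rule, else the value unchanged.

    replacement_map is an ordered list of (pattern, replacement) pairs
    (a hypothetical structure used only for this illustration).
    """
    for pattern, replacement in replacement_map:
        if re.fullmatch(pattern, value):
            # Stop at the first successful replacement; later rules are skipped
            return replacement
    return value

# Specific rules first, catch-all default last
rules = [
    (r"(?i)m(ale)?", "M"),
    (r"(?i)f(emale)?", "F"),
    (r".*", "unknown"),
]

cleaned = [replace_value(v, rules) for v in ["male", "F", "n/a"]]
```

Because evaluation stops at the first match, the final `.*` rule only applies to values that no earlier rule handled. If every rule were applied in sequence instead, that last rule would overwrite every value.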
libhxl-python v4.27.3
Interim release to decrease log levels and thus reduce the amount of generated logs.
libhxl-python v4.27.2
Interim release, mostly around logging changes to support the HXL Proxy.
- use structlog module
- add hxl.util.logup function to include function name in log message
- switch to logup in input module
- add extra help text for hxlcount script
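
As a rough illustration of including the calling function's name in a log message, the sketch below uses the standard `logging` and `inspect` modules. It is not the actual `hxl.util.logup` code, and its signature is not taken from the library: `log_with_caller` and `open_input` are hypothetical names, and the release itself uses structlog rather than `logging`.

```python
import inspect
import logging

def log_with_caller(logger, message, level=logging.INFO):
    """Log a message prefixed with the name of the function that called us."""
    # stack()[0] is this function; stack()[1] is the immediate caller
    caller = inspect.stack()[1].function
    text = f"{caller}: {message}"
    logger.log(level, text)
    return text

def open_input():
    # Hypothetical caller, standing in for a function in an input module
    return log_with_caller(logging.getLogger("demo"), "opening input")

result = open_input()
```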
libhxl-python v4.27
- raise a HXLHTMLException (subclass of HXLIOException) when HTML markup found
- clean dates that are integers, assuming seconds or days since epoch (with cutouts for years and month days)
- bypass RequestResponseIOWrapper, because it's causing grief; need to make sure content still gets uncompressed
- fix input options for secondary datasets in scripts (e.g. an appended or joined dataset)
- more debug logging about HXL hashtag detection
- add a new logger to hxl.REMOTE_ACCESS specifically for external URL access
- various command-line script updates and fixes
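
The integer-date cleaning mentioned in this release could look something like the sketch below. Everything specific here is an assumption for illustration: the function name, the Unix epoch for both units, the 100,000 cut-off between days and seconds, and the exact ranges treated as month days or years. The library's real thresholds and epoch may differ.

```python
from datetime import datetime, timedelta, timezone

# Assumption: both units are counted from the Unix epoch
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)

def integer_to_iso_date(n):
    """Interpret an integer as days or seconds since the epoch.

    Returns an ISO date string, or None when the integer looks like a
    day of the month or a year and should be left alone (the "cutouts").
    """
    if 1 <= n <= 31 or 1000 <= n <= 2999:
        return None  # probably a month day or a year, not an offset
    if n < 100_000:
        moment = EPOCH + timedelta(days=n)     # small values: days
    else:
        moment = EPOCH + timedelta(seconds=n)  # large values: seconds
    return moment.date().isoformat()

from_days = integer_to_iso_date(19000)
from_seconds = integer_to_iso_date(1640995200)
```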
libhxl-python v4.26
- use loglevel CRITICAL only for infrastructure failures
- don't put tracebacks in logging (avoid logger.exception())
(4.25.2)
- raise exception for HTML input (prevents tagger exploit in HXL Proxy)
- make hxl.input.munge_url() public
- remove default tags for hxlcount script so that it can just count lines of data
(4.25.1)
- support optionally filling merged areas (XLS and XLSX only)
- support optionally scanning CKAN datasets for the first HXLated resource (in hxl.data(), not make_input())
- support Google Sheets interactive view URLs for XLSX files (but not tab identifiers)
- make Excel workbook handling more efficient
- add --encoding option to all command-line scripts (tested with CSV)
- add --expand-merged option to all command-line scripts
- add --scan-ckan-resources option to all command-line scripts
- add hxl.input.InputOption object to hold all input options
- add info method to input objects to get general info about an XLSX file (including HXLated and merged areas in each sheet)
- refactor input and model classes
- upgrade to xlrd3 version 1.1.0
libhxl-python v4.25.2
- raise exception for HTML input (prevents tagger exploit in HXL Proxy)
- make hxl.input.munge_url() public
- remove default tags for hxlcount script so that it can just count lines of data
libhxl-python v4.25.1
- support optionally filling merged areas (XLS and XLSX only)
- support optionally scanning CKAN datasets for the first HXLated resource (in hxl.data(), not make_input())
- support Google Sheets interactive view URLs for XLSX files (but not tab identifiers)
- make Excel workbook handling more efficient
- add --encoding option to all command-line scripts (tested with CSV)
- add --expand-merged option to all command-line scripts
- add --scan-ckan-resources option to all command-line scripts
- add hxl.input.InputOption object to hold all input options
- add info() method to input objects to get general info about an XLSX file (including HXLated and merged areas in each sheet)
- refactor input and model classes
- upgrade to xlrd3 version 1.1.0
libhxl-python v4.25
- remove discontinued "encoding" keyword for json.load (Python 3.9)
- rename the hxl.io module to hxl.input (to avoid problems in Python on Windows)
libhxl-python v4.24
- fix bug that prevented tagger from adding tags if there was a blank row before the text headers
libhxl-python v4.23
- handle encoding errors in CSV more gracefully (replace with "?")
- fix bug in JSON recipes (correct "key" to "tags" in the sort filter)
- fix bug in the sort filter when there is no key and untagged columns on the right side of the dataset