You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The OMERO 4.2.0 release in July 2010 included an alteration to the database schema to add name, path and repo columns to the Pixels table with a similar meaning as the columns in the OriginalFile table. This change was part of the initial work adding native file format support in OMERO via Bio-Formats also known as FS lite. For a subset of file formats (primarily single file and with large XY dimensions), the original file was uploaded to the binary repository and linked from the Pixels object. This allowed the server to perform certain operations including the generation of OMERO pyramids.
Full support for native file format support in OMERO, also known as OMERO.fs, was introduced in OMERO 5.0.0 in February 2014 with the introduction of the Fileset table linked to the Image. Each Fileset row is linked to an ordered set of FilesetEntry rows each of these being themselves associated with a single OriginalFile entry. This change effectively superseded the FS Lite concept allowing native support for single and multi-file formats as well as multi-image formats. In OMERO 5.1, the series column was also introduced to the Image table to store the mapping between an image and the underlying Bio-Formats series.
Current API
Despite OMERO 5 actually deprecating their usage, the Pixels.name, Pixels.path and Pixels.repo columns are still currently heavily used server-side as of OMERO 5.6.x:
The current logic is problematic for several reasons:
for each FS image, with an associated Fileset, identical metadata is maintained in several locations: the Pixels object as well as the OriginalFile linked to the FilesetEntry
the requirements to update the Pixels metadata in the importer can cause substantial DB operations especially in the case of high-content screening
while the API and the UIs have been updated to expose the Fileset/FilesetEntry, the Pixelsname, repo and path attributes are still hidden from an API perspective.
a client like Glencoe's image-region micro-service still requires direct access to the database and the omero.db configurations properly set in its configuration in order to execute the PostgresSQLAction
As an additional related complication, a historical bug has been reported in the image.sc forum where the OriginalFile are incorrectly linked to FilesetEntry for some multi-file filesets.
Proposal
Fix the ordering of the FilesetEntry so that it can be used as the single source of truth
a. Fix the mapping between OriginalFile and FilesetEntry at import time - see Fix ordering of OriginalFile objects registered in saveFileset() omero-blitz#148
b. Create an upgrade script allowing to fix all existing FilesetEntry/OriginalFile links in existing OMERO databases
c. Review the API and technical documentation of Bio-Formats and OMERO and if needed clarify and enforce that the first file in IFormatReader.getUsedFiles, the output of ImportCandidates and the first FilesetEntry is the file that should be passed to IFormatReader.setId
d. Optionally, create an upgrade script allowing to convert FS lite imports into Fileset
Remove all legacy FS lite API and use the Fileset API consistently
a. Create a new version of the OMERO database schema dropping the Pixels.name, Pixels.path and Pixels.repo columns
b. Update all the server APIs to use the OriginalFile from the first FilesetEntry as the source of truth
Background
The OMERO 4.2.0 release in July 2010 included an alteration to the database schema to add
name,pathandrepocolumns to thePixelstable with a similar meaning as the columns in theOriginalFiletable. This change was part of the initial work adding native file format support in OMERO via Bio-Formats also known as FS lite. For a subset of file formats (primarily single file and with large XY dimensions), the original file was uploaded to the binary repository and linked from the Pixels object. This allowed the server to perform certain operations including the generation of OMERO pyramids.Full support for native file format support in OMERO, also known as OMERO.fs, was introduced in OMERO 5.0.0 in February 2014 with the introduction of the
Filesettable linked to theImage. EachFilesetrow is linked to an ordered set ofFilesetEntryrows each of these being themselves associated with a singleOriginalFileentry. This change effectively superseded the FS Lite concept allowing native support for single and multi-file formats as well as multi-image formats. In OMERO 5.1, theseriescolumn was also introduced to theImagetable to store the mapping between an image and the underlying Bio-Formats series.Current API
Despite OMERO 5 actually deprecating their usage, the
Pixels.name,Pixels.pathandPixels.repocolumns are still currently heavily used server-side as of OMERO 5.6.x:Challenges
The current logic is problematic for several reasons:
Pixelsobject as well as theOriginalFilelinked to theFilesetEntryPixelsmetadata in the importer can cause substantial DB operations especially in the case of high-content screeningFileset/FilesetEntry, thePixelsname,repoandpathattributes are still hidden from an API perspective.omero.dbconfigurations properly set in its configuration in order to execute the PostgresSQLActionPixelsobject via SQL script - see Document NGFF Fileset replacement workflow IDR/idr-metadata#656As an additional related complication, a historical bug has been reported in the image.sc forum where the
OriginalFileare incorrectly linked toFilesetEntryfor some multi-file filesets.Proposal
a. Fix the mapping between OriginalFile and FilesetEntry at import time - see Fix ordering of OriginalFile objects registered in saveFileset() omero-blitz#148
b. Create an upgrade script allowing to fix all existing FilesetEntry/OriginalFile links in existing OMERO databases
c. Review the API and technical documentation of Bio-Formats and OMERO and if needed clarify and enforce that the first file in
IFormatReader.getUsedFiles, the output ofImportCandidatesand the firstFilesetEntryis the file that should be passed toIFormatReader.setIdd. Optionally, create an upgrade script allowing to convert FS lite imports into
Fileseta. Create a new version of the OMERO database schema dropping the
Pixels.name,Pixels.pathandPixels.repocolumnsb. Update all the server APIs to use the OriginalFile from the first FilesetEntry as the source of truth
/cc @joshmoore @jburel @kkoz @chris-allan @will-moore @dominikl @Tom-TBT