Merge pull request #1 from Tom-TBT/master

Merge changes from readthedocs tests
German-BioImaging · Mar 15, 2024 · c4e24f5 · c4e24f5
2 parents 7bf0a78 + 6ece345
commit c4e24f5
Show file tree

Hide file tree

Showing 20 changed files with 465 additions and 453 deletions.
diff --git a/.readthedocs.yaml b/.readthedocs.yaml
@@ -16,8 +16,8 @@ sphinx:
 # Optionally build your docs in additional formats such as PDF and ePub
 formats: all
 
-# Optionally set the version of Python and requirements required to build your docs
-python:
-  version: 3.7
-  install:
-    - requirements: requirements.txt
+
+build:
+  os: ubuntu-22.04
+  tools:
+    python: "3.11"
diff --git a/docs/errors.rst → docs/errors_explained.rst b/docs/errors.rst → docs/errors_explained.rst
@@ -24,11 +24,16 @@ This error shows either that one of the csv row has a different number of item f
 something wrong happened with the parsing of the csv. (make sure that your csv separator is not a comma and that \
 you use also a comma to have multiple value per cells, like in the case of tags).
 
-Failed to sniff CSV delimiter please specify the separator
-----------------------------------------------------------
+Error while reading the csv, convert your file to utf-8 encoding'utf-8'
+-----------------------------------------------------------------------
+The csv must be in a UTF-8 encoding. You can convert it with a text editor (e.g. Notepad++ for Windows, \
+TextEdit for Mac). We recommend to start from a csv generated by the "Export to csv" script.
+
+Failed to sniff CSV delimiter
+-----------------------------
 It happens that the sniffer to detect fails to understand what separator was \
-used inside you .csv file. Please indicate in the parameter the separator used \
-inside your .csv.
+used inside you .csv file. Try instead providing the separator used \
+inside your .csv as parameter of the script (instead of guessing).
 
 File annotation ID must be given when using Tag as source
 ---------------------------------------------------------
@@ -58,8 +63,8 @@ objects, in order for the script to understand which corresponds to which.
 
 Please confirm that you understood the risks
 --------------------------------------------
-It seems that you forgot to confirm that you understood the risk associated \
-with a batch deletion of annotations. :ref:`Reason here <why the checkbox for delete script>`
+It seems that you went a bit too fast and forgot to confirm that you understood \
+the risk associated with a batch deletion of annotations.
 
 Target objects identified by name have duplicate
 ------------------------------------------------
@@ -75,7 +80,6 @@ The ID of the file annotation seems incorrect. Make sure that you gave the \
 FileAnnotation ID and not the LinkAnnotation ID, as indicated in the image bellow.
 
 .. image:: images/expert_3_file_annotation_id.png
-   :scale: 100%
 
 The tag ID:{TAG_ID} is not in the permitted selection of tags
 -------------------------------------------------------------
@@ -103,3 +107,8 @@ The .csv contains duplicates {DUPL} which makes it impossible to correctly alloc
 When no ID is used to identify the objects to annotate, names are use instead. In that case, there can not \
 be two object with the same name in the csv (and in the selected object on OMERO), in which case it is \
 impossible to identify which object should be annotated.
+
+:Authors:
+    Tom Boissonnet
+
+:Version: 1.0 of 2024/03/01
diff --git a/docs/extended_description.rst b/docs/extended_description.rst
@@ -0,0 +1,189 @@
+====================
+Extended description
+====================
+
+Selecting High-Content-Screening
+--------------------------------
+Selecting HCS data follows the same selection parent/child logic as for \
+Projects/Datasets/Images, but with the HCS containers: Screen/Plate/Well/Run/Image
+
+One specificity though is about the **Run** (Acquisition). **Runs** and **Wells** are both \
+children objects of a **Plate**, and **Images** are attached to both a **Run** and a **Well**.
+
+  * Selecting **Images** from a **Well** will select all the **Images** inside
+    this **Well**.
+  * Selecting **Images** from a **Run** will select all the **Images** coming from that
+    **Run** (corresponds to a subset of **Images** from all combined **Well**).
+  * Selecting **Images** from a **Screen** or a **Plate** will follow the **Well** route selecting all **Images** inside all **Wells**.
+
+
+Features of KeyVal_from_csv
+---------------------------
+* Generation of Key-Value pairs
+* Automatic Generation of multiple Key-Value pairs with the same Key if multiple Values are separated by a defined character
+* Assignement of namespaces from the csv file (or one for all Key-Value pairs from the script parameter).
+* Automatic exclusion of Key-Value pairs with empty Values
+* Generation of Tags
+   * Supports Tag Sets
+   * Supports selection of Tags by Tag IDs
+   * Creation of new Tags and Tag Sets
+   * Allows restriction to personal Tags only
+
+Exclude empty values from import
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+If you utilise a .csv with empty values like
+
+.. csv-table::
+   :header: "name", "id", "manual analysis", "observation"
+   :widths: 20, 10, 20,20
+
+   "A.tif", "120", "Hit",""
+   "B.jpg", "121", "",""
+   "C.jpg","122","","missing"
+
+by default, the empty cells are skipped during Key-Value pairs creation.
+If you wish to create a key for empty cells, you can uncheck the box "*Exclude empty values*"
+
+This might be useful if you plan to fill it in manually later on. If you used an empty key value pair as a title \
+of a "key-value pair section", consider instead to assign a namesapce to your key-value pair subsets.
+
+Multiple Values per Key
+^^^^^^^^^^^^^^^^^^^^^^^
+If you utilise the Advanced parameter "*Split value on*" you can specify one character \
+for which multiple values get split. E.g. setting this to "!" in the following example, \
+it would lead to the creation of two Key-Value pairs ``key_1 : value_1`` and ``key_1 : value_2`` \
+for the Image *A.jpg*. Despite looking like a list of multiple values for Image *B.tif* it \
+would generate one Key-Value pair ``key_1 : value_1, value_2`` as the comma is not used \
+in our example as a separator for multiple Key-Value pairs.
+
+.. csv-table::
+   :header: "OBJECT_NAME", "OBJECT_ID", "key_1"
+   :widths: 20, 10, 20
+
+   "A.tif", "120", "value_1!value_2"
+   "B.jpg", "121", "value_1, value_2"
+   "C.jpg","122","value_1!"
+
+Be aware, that for the Image *C.jpg* two Key-Value pairs ``key_1 : value_1`` and ``key_1 :`` would be created (if the "*Exclude empty values*" parameter is unchecked).
+
+Namespaces
+^^^^^^^^^^
+The default namespace for OMERO Key-Value pairs generated in the web-interface and by our script is ``openmicroscopy.org/omero/client/mapAnnotation``.
+Only Key-Value pairs with this namespace can be edited in the web-interface later!
+
+Leaving the "*Namespace*" parameter blank will assign the default namespace to the new key-value pairs created, or \
+
+Additionally, you can choose to use custom namespaces, e.g. to utilize the `OMERO-Mapr <https://github.com/ome/omero-mapr>`_ functionalities, for each different Key.
+To do so just add a first row in the .csv with the first column containing "*namespace*". The corresponding namespace for each key is then put above the respective key. \
+If no namespace is given, the script will fall back to the default namespace.
+
++-------------+------------+-----------+------------------------+
+| namespace   |            | Custom_NS |REMBI_Biosample         |
++-------------+------------+-----------+------------------------+
+| OBJECT_NAME | OBJECT_ID  | key_1     |  Organism              |
++=============+============+===========+========================+
+| A.tif       | 120        | value_1   |Drosophila melanogaster |
++-------------+------------+-----------+------------------------+
+| B.jpg       | 121        | value_2   |Mus musculus            |
++-------------+------------+-----------+------------------------+
+| C.jpg       | 122        | value_1   |Saccharomyces cerevisiae|
++-------------+------------+-----------+------------------------+
+
+Tags
+^^^^
+Specified in dedicated columns in the csv, you can also annotate your objects with tags by batch.
+To do this simply put "*tag*" where you would normally put the Key name. Multiple \
+Tags can be specified as a comma separated list (or by the input parameter "*Split value on*" if given).
+
+If you want to select a Tag in a specific Tag-Set you just add the Tag-Set in square \
+brackets directly following the Tag. You can also specify the Tag by its Tag-Id in \
+square brackets directly following the Tag.
+By default the script will not create new Tags to avoid "Tag bloat", if you want new Tags created according \
+to the .csv values you have to check the "*Create new Tags*" checkbox under "*Advanced parameters*".
+
+Furthermore, by default the script will search through all available Tags of the group, \
+if you want to use only your own Tags you have to check the "*Use only personal Tags*" checkbox \
+under "*Advanced parameters*".
+
+.. csv-table::
+   :header: "OBJECT_NAME", "OBJECT_ID", "tag"
+   :widths: 20, 10, 20
+
+   "A.tif", "120", "Tag_1, Tag_2, Tag_3"
+   "B.jpg", "121", "Tag_1[*Tag-Set*]"
+   "C.jpg","122","Tag_1[*Tag-Id*]"
+
+
+Choosing the CSV separator
+--------------------------
+When importing annotations from a .csv file, the script tries by default \
+to detect the CSV separator automatically (one of ``,`` ``;`` ``TAB``).
+
+It is possible to specify directly which one is used (in the case the automatic \
+detection fails for example). As the text in the annotations may contain \
+commas or semi-column, it is recommended to use ``TAB`` as separators.
+
+Columns of parent names
+-----------------------
+To clarify the provenance of objects listed in the csv, you can choose to export \
+additional columns for the name of the parent objects.
+
+Those columns are by default excluded from the Key-value pairs using the import \
+script (<PARENT> value of the "Columns to exclude" parameter matches all parent \
+containers: PROJECT, DATASET, SCREEN, PLATE, WELL and RUN)
+
+Default Namespace
+-----------------
+Leaving the namespace parameter to blank always refers to the same namespace, \
+the "Client namespace", corresponding to the one given to new Key-Value pairs \
+created inside OMERO.web. This namespace \
+(``openmicroscopy.org/omero/client/mapAnnotation`` in full) is treated \
+differently by OMERO.web as it is the only one that can be edited in its \
+interface.
+
+Target ID, name and excluding column from Key-Value pairs
+---------------------------------------------------------
+The name of the columns for the exported objects IDs and names are always \
+the same: OBJECT_ID and OBJECT_NAME.
+
+This will be used by default by our import script, so we recommend to use it. \
+If however you wished to keep your own column naming for the IDs and name of objects, \
+you may specify those in the "*Target ID colname*" and "*Target name colname*" parameters \
+of the import script.
+
+While OBJECT_NAME are only used to identify the objects when OBJECT_ID is not there, \
+it remains important to have it inside the .csv to recognize the objects more \
+easily inside a spreadsheet editors. A good use case for the OBJECT_NAME column is to transfer \
+annotations across groups (the IDs of the objects will differ between groups).
+
+Note also that those two columns are excluded by defaults from the Key-Value \
+pairs, by the use of the following three parameters:
+
+.. image:: images/expert_1_exclude_import.png
+
+* Target ID colname: the name of the column in the .csv that contains the
+  objects IDs
+* Target name colname: the name of the column in the .csv that contains the
+  objects names
+* Columns to exclude: <ID> will exclude the column containing the objects IDs,
+  <NAME> will do the same for the objects names, and additional columns can
+  be excluded by indicating their name (e.g. to exclude parent objects
+  column name when used with the export script).
+
+
+Looking at the output log
+-------------------------
+When the execution of the script is over (also when it fails), you will \
+be able to look at the ouput of the script by clicking that button highlighed \
+in red in the picture bellow.
+
+.. image:: images/expert_2_script_output.png
+
+This output will help you understand what has been done/changed, and may help \
+you understand things when they don't work out the way you expected them.
+
+:Authors:
+    Tom Boissonnet
+    Jens Wendt
+
+:Version: 1.0 of 2024/03/01