Resolve "raw_data_path for ewoks should take into account processed datasets"
Closes #632 (closed)
Hi @denolf,
The implementation was more complex than I expected. The full picture for the example used below is here: https://data2.esrf.fr/investigation/1830146619/datasets?sampleId=1902358869
Scenario "Process a data collection"
This means you want to run autoproc integration over a data collection with one or more pipelines.
The raw_data_path you are interested in is the path of the dataset itself, because the dataset contains the images.
Example:
raw_data_path : /data/visitor/mx2649/id23eh2/20240906/RAW_DATA/1433/1433-MB013_MB028_ChREBP_G2_U/run_03_datacollection
Scenario "Reprocess a pipeline"
You might want to tune some parameters of an already processed dataset. In that case you pick a processed dataset, and the raw_data_path should still point to the folder where the images are.
The raw_data_path you are interested in is therefore the path of the raw dataset the processed dataset was derived from, not the path of the processed dataset itself.
Example:
raw_data_path : /data/visitor/mx2649/id23eh2/20240906/RAW_DATA/1433/1433-MB013_MB028_ChREBP_G2_U/run_03_datacollection
Scenario "Reprocess downstream"
You might want to reprocess further downstream, for instance phasing. In that case the input dataset for phasing is itself a processed dataset, which the workflow might (or might not) want to know about. The API will concatenate the input datasets as a comma-separated list, with the processed dataset first and the raw dataset after it.
Example:
raw_data_path : /data/visitor/mx2649/id23eh2/20240906/PROCESSED_DATA/1433/1433-MB013_MB028_ChREBP_G2_U/run_03_datacollection/autoprocessing/grenades_parallelproc/grenades_parallelproc_pointless_21_63.62_148.11_76.55_90_90_90,/data/visitor/mx2649/id23eh2/20240906/RAW_DATA/1433/1433-MB013_MB028_ChREBP_G2_U/run_03_datacollection
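To make the three scenarios easier to compare, here is a minimal sketch of the resolution logic in Python. It is not the code in this merge request: the `Dataset` class, the `include_processed` flag and the `/RAW_DATA/` substring check are all assumptions made for illustration, and the real API may decide between the "reprocess a pipeline" and "reprocess downstream" cases differently.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Dataset:
    """Hypothetical stand-in for a dataset record (not the real API model)."""

    path: str
    # Datasets this one was derived from; empty for a raw data collection.
    inputs: List["Dataset"] = field(default_factory=list)

    @property
    def is_raw(self) -> bool:
        # Assumption: raw datasets are recognised by their folder name.
        return "/RAW_DATA/" in self.path


def collect_raw_paths(dataset: Dataset) -> List[str]:
    """Walk up the input datasets and return the raw paths, without duplicates."""
    if dataset.is_raw:
        return [dataset.path]
    paths: List[str] = []
    for parent in dataset.inputs:
        for path in collect_raw_paths(parent):
            if path not in paths:
                paths.append(path)
    return paths


def resolve_raw_data_path(dataset: Dataset, include_processed: bool = False) -> str:
    """Build raw_data_path for a selected dataset.

    include_processed is a hypothetical switch for the downstream scenario,
    where the processed dataset is prepended to the raw one.
    """
    raw_paths = collect_raw_paths(dataset)
    if include_processed and not dataset.is_raw:
        return ",".join([dataset.path] + raw_paths)
    return ",".join(raw_paths)


# Paths taken from the examples above.
RAW = (
    "/data/visitor/mx2649/id23eh2/20240906/RAW_DATA/1433/"
    "1433-MB013_MB028_ChREBP_G2_U/run_03_datacollection"
)
PROCESSED = (
    "/data/visitor/mx2649/id23eh2/20240906/PROCESSED_DATA/1433/"
    "1433-MB013_MB028_ChREBP_G2_U/run_03_datacollection/autoprocessing/"
    "grenades_parallelproc/grenades_parallelproc_pointless_21_63.62_148.11_76.55_90_90_90"
)

raw = Dataset(path=RAW)
processed = Dataset(path=PROCESSED, inputs=[raw])

process_collection = resolve_raw_data_path(raw)
reprocess_pipeline = resolve_raw_data_path(processed)
reprocess_downstream = resolve_raw_data_path(processed, include_processed=True)
```

In this sketch the first two scenarios resolve to the same raw folder, and only the downstream scenario yields the comma-separated pair of processed and raw paths.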