Dataset sc lung #27

Kraftfahrzeughaftpflichtversicherung · 2025-03-19T17:56:32Z

I tried my best to integrate this dataset, and it even passed tests, failing only in the latest one
Unfortunatelly my computational recources said "bye" for me now, and I need to fix it, so it probably will take some time

Now I will add all that I have for now.

and I didn't change anything in common folder, why is it highlited here ??

rcannood · 2025-03-22T06:58:20Z

This PR removed or changed files that were very important for the project, including the _viash.yaml project config. I restored them for now -- please be careful not to remove essential project files :)

rcannood

Hi @Kraftfahrzeughaftpflichtversicherung ! Thanks for your contributions!

I proposed some changes to make this align better with existing components :)

rcannood · 2025-03-23T08:50:11Z

src/datasets/loaders/nsclc_sc_zuani/script.py

+FILE_PATHS = {"file": TMP_DIR / "cropped_sc.h5ad"}
+os.system(f'wget http://192.168.2.46:8000/file/cropped_sc.h5ad -P ./tmp/')
+adata = ad.read_h5ad( './tmp/cropped_sc.h5ad')


This code is pointing to local endpoints, which unfortunately won't work.

rcannood · 2025-03-23T08:53:11Z

src/datasets/workflows/process_nsclc_sc_zuani/config.vsh.yaml

+name: process_nsclc_sc_zuani
+namespace: datasets/workflows
+
+argument_groups:


Can we add a section for the input file?

Suggested change

argument_groups:

argument_groups:

- name: Inputs

arguments:

- type: file

name: --input

description: Path to the dataset

required: true

example: "https://ftp.ebi.ac.uk/biostudies/fire/E-MTAB-/526/E-MTAB-13526/Files/10X_Lung_Tumour_Annotated_v2.h5ad"

rcannood · 2025-03-23T08:55:02Z

src/datasets/loaders/nsclc_sc_zuani/script.py

+uns_info = { "dataset_id": "E-MTAB-13526" ,
+              "dataset_name":"E-MTAB-13526" , 
+              "dataset_url":"https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-13526" ,
+              "dataset_reference": "https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-13526",
+              "dataset_summary": 'none',
+              "dataset_description":'none',
+              "dataset_organism": 'Homo sapiens' 
+}
+
+for key in ["dataset_id", "dataset_name", "dataset_url", "dataset_reference", "dataset_summary", "dataset_description", "dataset_organism"]:
+    adata.uns[key] = uns_info[key]


values like 'dataset_id' and 'dataset_name' are arguments, but are also being hardcoded here. These values should be retrieved from the par and should be passed as part of the dataset script.

Suggested change

uns_info = { "dataset_id": "E-MTAB-13526" ,

"dataset_name":"E-MTAB-13526" ,

"dataset_url":"https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-13526" ,

"dataset_reference": "https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-13526",

"dataset_summary": 'none',

"dataset_description":'none',

"dataset_organism": 'Homo sapiens'

}

for key in ["dataset_id", "dataset_name", "dataset_url", "dataset_reference", "dataset_summary", "dataset_description", "dataset_organism"]:

adata.uns[key] = uns_info[key]

for key in ["dataset_id", "dataset_name", "dataset_url", "dataset_reference", "dataset_summary", "dataset_description", "dataset_organism"]:

adata.uns[key] = par[key]

make sure to add a script similar to this one: https://github.com/openproblems-bio/task_ist_preprocessing/blob/ea67087326ae00912e0006d1f643d990576ed414/scripts/create_resources/process_10x_xenium.sh

Kraftfahrzeughaftpflichtversicherung and others added 4 commits March 17, 2025 00:00

message

3c53e8c

comment some lines

94a85e8

Fix naming issues for nsclc_sc_zuani dataset workflow

ba0c4dd

Deleted unnecessary files and modified configurations

7a7f359

Kraftfahrzeughaftpflichtversicherung requested a review from LouisK92 March 19, 2025 17:56

Merge remote-tracking branch 'origin/main' into dataset_sc_lung

7390de3

rcannood requested changes Mar 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Dataset sc lung #27

Dataset sc lung #27

Uh oh!

Kraftfahrzeughaftpflichtversicherung commented Mar 19, 2025

Uh oh!

rcannood commented Mar 22, 2025 •

edited

Loading

Uh oh!

rcannood left a comment

Uh oh!

rcannood Mar 23, 2025

Uh oh!

rcannood Mar 23, 2025

Uh oh!

rcannood Mar 23, 2025

Uh oh!

Uh oh!

-argument_groups:
+argument_groups:
+  - name: Inputs
+    arguments:
+      - type: file
+        name: --input
+        description: Path to the dataset
+        required: true
+        example: "https://ftp.ebi.ac.uk/biostudies/fire/E-MTAB-/526/E-MTAB-13526/Files/10X_Lung_Tumour_Annotated_v2.h5ad"

Dataset sc lung #27

Are you sure you want to change the base?

Dataset sc lung #27

Uh oh!

Conversation

Kraftfahrzeughaftpflichtversicherung commented Mar 19, 2025

Uh oh!

rcannood commented Mar 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rcannood left a comment

Choose a reason for hiding this comment

Uh oh!

rcannood Mar 23, 2025

Choose a reason for hiding this comment

Uh oh!

rcannood Mar 23, 2025

Choose a reason for hiding this comment

Uh oh!

rcannood Mar 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rcannood commented Mar 22, 2025 •

edited

Loading