-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
the subject / sample question #9
Comments
In the openMINDS_core - full metadata model for a dataset slide Lyuba showed us, there was a tag named ProtocolExecution linked to study target. As far as I understand from ontological descriptions in general, names are defined by their relationships, A Parent has a child. A child can be anything even a parent. But a child is a child because it has a Parent. So a ProtocolExecution/Experimental Session has a study target /or Subject on which the experiment is performed. As an experimenter, I have no problem calling the subject of my experiment the subject, whether my subject is an animal, tissue/blood sample or whatever. But of course only in the context of an experiment and I think this is how we need to solve this problem about subjects or samples. I record samples from a subject in an experiment. The names depend on the relationship. |
@chrisvdt I would say that sometimes a protocol execution corresponds to an experimental session. However, in general several protocols might be carried out during a single session; conversely, a protocol could span multiple sessions (e.g. training a rat). @lzehl please could you edit the issue description to explain in more detail what the question is? |
A case study for needing both a subject and multiple samples at different levels would be paired patch-clamp recordings, in which an experimenter records from two neurons simultaneously in a brain slice in vitro, with two different electrodes. We will have metadata for each of the recorded cells (e.g. the electrode solution, access resistance), for the brain slice (e.g. the bath solution), and for the subject (sex, genotype, tracer injection prior to brain slicing, etc.) |
in dandi we are using the following hierarchy of information for the moment: subject > tissue sample > slice > cell/probe (everything is optional except subject and anything can have a one to many mapping) the one thing that's harder in the current schema is when a cell id, say in a volume image corresponds to multiple slices. |
Although I think this is a bit off-topic in this issue, I need to correct here a bit the interpretation of openMINDS (sorry that this did not became clear in the documentation; an update there will follow shortly). The ProtocolExecution is a schema that captures the individual conduction of an experimental or analysis process and therefore has "inputs", "outputs" and "study targets". Here an example: The brains of two Subjects, or actually the default mode network of two Subjects at specific SubjectStage(s), e.g. at age 40 years (sub-01) and at age 32 years (sub-02), were imaged in a resting-state fMRI experiment. In openMINDS we would register:
Possible "input" to a ProtocolExecution: Subject(Group)Stage, TissueSample(Collection)Stage, FileInstance, FileBundle A "study target" can be attached to a Protocol or a ProtocolExecution. Possible "study targets" (selection): a specific species, a specific cell type, a specific network, a specific receptor type, a specific brain region, etc. In BIDS this would mean the ProtocolExecution will be at least a session, but could also be more fine-grained than that (e.g. files that belong to one data-type). The Subject (or actually its Stage) would remain the input to that ProtocolExecution. The "study target" depends on the experiment (technique/task):
@chrisvdt : I hope I could clarify a bit more the openMINDS model part of this? Let me know if you have questions on my summary. |
@lzehl : Based on your last description I think I have interpreted study target correctly as an entity that depends on an experiment. Although not the subject or subject stage (because in your model they are inputs), it is something about that subject, in your example the "default network" that is studied. What if I give this "default network" an id that I can use as a reference to a subject stage or actual subject. @apdavison : mentioned the invitro slice recording with simultaneously recorded neural activity. (A similar What is a session in this context? In recording timeseries, my view of a session is simply one timeseries recording (which can include various behavioural modalities in separate files) using one stimulus or intervention sequence. So coming back to the questions posed at the beginning; How would the following tissue sample options fit in such a hierarchical structure? These questions are relevant from an archivist point of view, but not from an experimental view. I am stressing the experimental view, because BIDS and any alternative schema should be applied by experimentalists in the context of their research. Fitting this research in to MINDS is in my view a whole other level of conceptualization. |
for background, in BIDS for human cognitive neuroimaging I would say the hierarchy is: subject (expressed as directory) - session (expressed as directory) - run (expressed in multiple files) - trial (expressed in a single file and in events.tsv) |
Following our discussion during the April 8 2021 meeting of the INCF Working Group on neuroscience data structure, we have reached a consensus within the group to keep working with BIDS' current hierarchy and deal with the information that is not represented in the hierarchy itself (e.g: tissue, slice, cell etc.) by adding them in the metadata, rather than trying to add extra levels in the hierarchy. These extra info (tissue, slice, cell, probe, sample etc.) might be added as new BIDS entities (i.e in the filenames); we will open an issue on the BIDS github repo to discuss this further directly at the BIDS level (rather than discussing them separately in different BEPs). |
my suggestion would be whenever the new issue is posted to simply have a sample entity and then in the proposed samples.tsv to add a sample_type (block, slice, cell, etc.,.) however, this would require sample entities to be unique to disambiguate those levels. in dandi we add them in names to simplify filename based search, but i wanted to stick them in the metadata. (@yarikoptic made me take them out :) ) but in seriousness this goes more to a broader identifier discussion. |
for info, an issue has been opened directly at the BIDS level so that we deal with this in a common manner across BEPs (at the moment, the animal-ephys BEP and microscopy BEP )... so unless there are specific non-BIDS topics, I suggest we keep on exchanging overthere: bids-standard/bids-specification#779 |
Here is the promised write-up with an overview of the problem space, a potential model, and a review of the trade-offs that I see for BIDS based on my experience implementing and maintaining the SDS and its validation pipelines. I'm also dropping this in bids-standard/bids-specification#779. https://github.com/SciCrunch/sparc-curation/blob/master/docs/participants.org If you have targeted questions or comments you can leave them on this commit. |
summary of conclusions of the microscopy BEP (by @tgbugs):
Suggestion 2 is part of the microscopy BEP PR: bids-standard/bids-specification#812 |
I just read in the paper A collaborative resource platform for non-human primate neuroimaging on PRIME-RE
(emphasis mine) The specific experiences of those authors (probably all from an imaging rather than ephys background) might be relevant to follow up. |
GENERAL TOPIC:
While BIDS at the moment focuses on living (human) beings as a whole (in BIDS defined as subject or participant), neuroscience in general can be conducted on any living or dead (human or non-human) being as a whole ("subject") and any possible living or dead tissue sample extracted of that being ("tissue sample"). A being itself can reach from human to animal to a single-celled organism. How would a BIDS extension look that covers all these subject / tissue samples?
Other efforts to coordinate with considering this topic:
THE FOLDER HIERARCHY ISSUE:
The classical BIDS model foresees a hierarchical folder structure with an inheritance principle for metadata associated on each hierarchical level. For the raw data on living (or diseased) human beings (subjects) that means:
In order to be on the same page here the definitions from BIDS for some terms:
[Note that we do not have to stick to those definitions, but if we vary from them we should explicitly state it to avoid misunderstandings in the discussions.]
To trigger the discussions: Let us assume we generalize the definition of a "subject" being the "thing" that is studied (a whole species [living or dead] or any part of a species [living or dead]). If we strictly follow the inheritance principle of BIDS, the following structure could be assumed for several use cases:
Questions:
2.1) In how far do relevant metadata differ between a "subject" or a "tissue sample" (as defined in the general description)?
3.1) Could a subject-folder also be interpreted as a group / collection and still allowing the identification of a member of that group/collection?
4.1) How would that affect the metadata storage / concept of BIDS?
The text was updated successfully, but these errors were encountered: