-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: Support TD data #11064
base: main
Are you sure you want to change the base?
ENH: Support TD data #11064
Conversation
…a type as outlined in the SNIRF specification document
…s, moments and processed (hbo/hbr)
Thanks for jumping in to finish this off @larsoner
Agreed Ping me when you want a review, it doesn't need to be a final review, I am happy to take a look early if that helps. |
Feel free to look now @rob-luke to see if you're happy with the overall design, the rest of the work is "just" some remaining details (adding unit tests, playing with the data a little bit to set scale factors, etc.) that I think we can rely on CIs to check, or that we can iterate on in follow-ups (e.g., tweaking plot scale factors) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking good, it will be great to have support for TD in MNE! I will review again once there are tests.
* upstream/main: (264 commits) BUG: Fix deprecated API usage in example (mne-tools#11512) Deprecate 'kind' and 'path' in favor of 'fname' in the layout reader (mne-tools#11500) EGI/MFF events outside EEG recording should not break the code (mne-tools#11505) fixed annotations error on export (mne-tools#11435) DOC: Update installer links [skip azp] [skip actions] [skip cirrus] (mne-tools#11506) BUG: updates for MPL 3.7 compatibility (mne-tools#11409) Fix docstrings by replacing str with path-like and fix double backticks for formatting (mne-tools#11499) Use pathlib.Path instead of os.path to handle files and folders [circle deploy] (mne-tools#11473) MAINT: Fix Circle [circle deploy] (mne-tools#11497) MAINT: Use mamba in CIs (mne-tools#11471) Updating documentation to clarify full vs half-bandwidth and defaults in time_frequency.multitaper.py (mne-tools#11479) Fix typo in tutorial (mne-tools#11498) Typo fix and added colons before code (mne-tools#11496) [MRG] ENH/DOC: demo custom spectrum creation (mne-tools#11493) Accept only left-clicks for adding annotations (mne-tools#11491) [BUG, MRG] Fix pial surface loading, logging in locate_ieeg (mne-tools#11489) [ENH] Added unit_role to add non-breaking space between magnitude and units (mne-tools#11469) MAINT: Fix CircleCI build (mne-tools#11488) [DOC] Updated decoding.SSD documentation and internal variable naming (mne-tools#11475) Typo fix (mne-tools#11485) ...
* upstream/main: (824 commits) Add `psd_args` to `plot_ica_sources` and `ICA.plot_sources` (mne-tools#12912) Fix GDF NumPy >= 2 (mne-tools#12909) [pre-commit.ci] pre-commit autoupdate (mne-tools#12908) ENH: Improve report usability (mne-tools#12901) MAINT: Avoid problematic PySide6 (mne-tools#12902) Sync README dependencies with pyproject.toml (mne-tools#12890) remove trailing slash from pybv base URL [ci skip] (mne-tools#12892) Cast tuple of filenames to list to improve error handling (mne-tools#12891) Website (mne-tools#12885) [pre-commit.ci] pre-commit autoupdate (mne-tools#12888) BUG: Fix bugs with coreg (mne-tools#12884) Bump mamba-org/setup-micromamba from 1 to 2 in the actions group (mne-tools#12887) Update spacing for comments in pyproject.toml (mne-tools#12886) make HTML repr for Forward match others (mne-tools#12883) MAINT: Linkchecks [circle deploy] (mne-tools#12882) Update roadmap (mne-tools#12872) [MRG] Require good and bad channels when creating a SpectrumArray object (mne-tools#12877) [pre-commit.ci] pre-commit autoupdate (mne-tools#12879) MAINT: Update code credit (mne-tools#12880) BUG: Fix bug with Path casting (mne-tools#12878) ...
Co-authored-by: Robert Luke <[email protected]> Co-authored-by: John Griffiths <[email protected]> Co-authored-by: Julien Dubois <[email protected]>
@@ -137,6 +137,7 @@ | |||
.. _Joan Massich: https://github.com/massich | |||
.. _Johann Benerradi: https://github.com/HanBnrd | |||
.. _Johannes Niediek: https://github.com/jniediek | |||
.. _John Griffiths: https://www.grifflab.com |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@JohnGriffiths @julien-dubois-k @Zahra-M-Aghajan let me know if you want a different URL to link from your name in the changelog
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
grifflab.com is perfect, thanks
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Okay code has been updated. I suspect the scaling could be wrong as I mention in a comment above ☝️ below 👇 but in any case people can test it with some data!
It's possible/likely that in addition to the possible scaling bug, we'll want to adjust the default plot scaling limits.
It would also be great to have an example added (maybe using mne-misc-data
) showing that this data can at least be read!
fnirs_time_delays[bin_idx - 1] | ||
* fnirs_time_delay_widths[bin_idx - 1] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I modified this to index into fnirs_time_delay_widths
when it didn't before. It didn't error in either case, and it seems like one of them should have! So we still need to add a suitable SNIRF test file to mne-testing-data @JohnGriffiths (or others) if you have a tiny one you can share
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@julien-dubois-k this one is also for you ☝️
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
test file shared here, hopefully it helps with this case
data = raw.get_data("fnirs_td_moments_amplitude") | ||
assert data.shape == raw._data.shape | ||
norm = np.nanmedian(np.linalg.norm(data, axis=-1)) | ||
assert 1e5 < norm < 1e6 # TODO: 429256, is this reasonable Molars!?? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be in units of M (mol). Looking at the values in the test file:
>>> import h5py
>>> import numpy as np
>>> x = h5py.File("td_moments.snirf", "r")
>>> list(x["nirs/data1/measurementList1"]) # no dataUnit, so code assumes M not uM!
['dataType', 'dataTypeIndex', 'dataTypeLabel', 'detectorIndex', 'sourceIndex', 'wavelengthIndex']
>>> np.array(x["nirs/data1/measurementList1/dataTypeLabel"]).item()
b'Time Domain - Moments - Amplitude'
>> np.nanmedian(np.linalg.norm(np.array(x["nirs/data1/dataTimeSeries"]), axis=0))
np.float64(429256.98709975014)
should the code assume that if no dataUnit is present, the units are actually uM?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@julien-dubois-k this one is for you ☝️
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
b'Time Domain - Moments - Amplitude' so this is moments data.
The units will be:
unitless
for the "sum" moment (it's a photon count)- some multiple of
s
for the "mean" moment (it's a time of flight) - some multiple of
s**2
for the "var" moment (it's a variance of the time of flight)
The SNIRF spec doesn't require dataUnit
to be populated (here). In cedalion it looks like they just treat unspecified dataUnit as unitless or a.u. (here). I think vendors who generate SNIRF files should populate the dataUnit
field always if there is a unit. Kernel didn't originally (hence the missing dataUnit
in the current test files), but does now. I'll provide new test files shortly.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
test files shared here
* upstream/main: eegbci api: allow downloading multiple subjects (mne-tools#12918) DOC: Document Linux desktop workaround (mne-tools#12900) Allow exporting edf where a channel contains only constant values (mne-tools#12911) Autogenerate environment.yml file from pyproject.toml (mne-tools#12914)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
confirming that the example runs here, and also that it seems to load other data files properly. the latter point I will can make and share a separate NB demonstrating that for 'peace of mind', in lieu of an analysis example discussed on the issue thread. Also noting that given @julien-dubois-k 's comment re some .snirf
attributes not being read, so those steps may also need to be added either to this PR or (probably better) a subsequent one. I will be looking at this next.
Afraid I don't know the answer to the uM units Q off hand.
@larsoner thanks for this. Have gone through, left review above. Not quite sure if I missed something specific other than the units Qs and the I don't not know to these off the top of my head, but I imagine @julien-dubois-k does. I can take a look at relevant documentation if that is useful. Re: datasets LMK if I have got this right re next steps
|
thank you @larsoner and @JohnGriffiths for your work pushing this PR
forward. I’ll review tomorrow and answer any remaining questions that I can
answer!
…On Wed, Nov 6, 2024 at 17:56 John Griffiths ***@***.***> wrote:
@larsoner <https://github.com/larsoner> thanks for this. Have gone
through, left review above.
Not quite sure if I missed something specific other than the units Qs and
the time_delay_widths Qs that you needed an immediate answer to; LMK
I don't not know to these off the top of my head, but I imagine
@julien-dubois-k <https://github.com/julien-dubois-k> does. I can take a
look at relevant documentation if that is useful.
Re: datasets
LMK if I have got this right re next steps
- You have two small .snirf files in the example data that is loaded
here so those aren't needed now for this test function
- But you are suggesting to add some other, larger files to
mne-misc-data for an analysis example (?)
- So, would you like me to do a PR to that repo with a .snirf data
file a a more fulsome analysis example?
- If so yes I can get on that. What I would do for the example code is
use the colab notebook example @julien-dubois-k
<https://github.com/julien-dubois-k> linked to in his above comment
<#11064 (comment)>,
and the example file the one used in that nb.
- ( I can add other finger tapping .snirf files to this if warranted )
- As I work on that I will take a look at the comments from
@julien-dubois-k <https://github.com/julien-dubois-k> noted in my
review comment above about some .snirf attributes not being read from
the .hdf properly by the mne reader. Which may lead to another
.io.snirf PR before the analysis example PR.
- But I think it's prob best not to muddy the current PR with that Q
(unless you disagree?)
—
Reply to this email directly, view it on GitHub
<#11064 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AWIKIDKMADYHYTI6KHYECT3Z7LCGNAVCNFSM57FOV3OKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TENBWGEYTKNZRG44A>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Well related to the
Sure, then we modify an example here or more likely in
Sure if it adds something complementary from a documentation perspective. But if it's just additional subjects or a different paradigm from what we currently have, might not be worth it. (Just an aside, not sure if you've looked at mne-bids-pipeline but there is mne-tools/mne-bids-pipeline#365 that we could resurrect if you're interested in a BIDS pipeline for your data.)
I don't mind adding more stuff here, a +305/-85 PR is fairly small actually. So based on discussion I think the next steps would be:
|
Ok I'm on it.
Will do. What is the heuristic on acceptable size for a file in
So just temporarily change this requirements line to the commit url for the current PR, right?
That's great to know. thanks. We are actually currently in the process of getting an |
We try to keep our datasets to < 1GB if possible. Looks like
CircleCI is the CI that builds the doc so I would do it here actually |
@@ -60,6 +62,8 @@ | |||
fnirs_fd_ac_amplitude="V", | |||
fnirs_fd_phase="rad", | |||
fnirs_od="V", | |||
fnirs_td_gated_amplitude="M", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
these are photon counts, should they be "AU"?
@@ -117,6 +123,8 @@ | |||
fnirs_fd_ac_amplitude=1.0, | |||
fnirs_fd_phase=1.0, | |||
fnirs_od=1.0, | |||
fnirs_td_gated_amplitude=1e6, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I imagine this should be 1.0; same for fnirs_td_moments_amplitude
@@ -60,6 +62,8 @@ | |||
fnirs_fd_ac_amplitude="V", | |||
fnirs_fd_phase="rad", | |||
fnirs_od="V", | |||
fnirs_td_gated_amplitude="M", | |||
fnirs_td_moments_amplitude="M", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is a bit weird, the dataType Time domain – Moments (TD Moments) - Amplitude
can be any of the moments, which all have different units:
- sum or intensity is photon counts
- mean is s
- variance is s^2
@@ -88,6 +92,8 @@ | |||
fnirs_fd_ac_amplitude="V", | |||
fnirs_fd_phase="rad", | |||
fnirs_od="V", | |||
fnirs_td_gated_amplitude="µM", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
update
@@ -57,7 +57,9 @@ | |||
) | |||
|
|||
# Kernel | |||
kernel_hb = testing_path / "SNIRF" / "Kernel" / "Flow50" / "Portal_2021_11" / "hb.snirf" | |||
kernel_path = testing_path / "SNIRF" / "Kernel" / "Flow50" / "Portal_2021_11" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here are some files to add. The path could be updated if it matters, these are no longer Flow50/Flow1 but Flow2, and the version is latest (aka, Portal_2024-10-23).
- Type 201 (Time Domain Gated): c345d04_2.snirf (there is no current equivalent)
- Type 301 (Time Domain Moments): c345d04_3.snirf (replace
td_moments.snirf
) - Type 99999 (Time Domain Hb Moments): c345d04_5.snirf (replace
hb.snirf
)
Closes #9661
@rob-luke how about this plan:
.item()
instead of using_correct_shape
latest.inc
for this nice contribution (I'm just cleaning up really, you all did the hard work!)For now I'm inclined to make it so if you get a
SNIRF_PROCESSED
data type that we don't know how to handle, we should maybe raise an error in this case...? For now I've removed it as acoil_type
because this_PROCESSED
really seems to mean something else specific, and we should perhaps add those one by one as we find use cases for them.I have some collaborators who have seen Kernel help files telling them to use this very out of date branch, so it would be nice to get this in!