Add documentation to handle (physical) units of recording channels #3844

h-mayorquin · 2025-04-07T17:06:58Z

Ok guys, I will need your feedback. I am also adding another preprocessor wrapper that handles this for the user.

doc/how_to/physical_units.rst

src/spikeinterface/preprocessing/scale.py

chrishalcrow · 2025-04-08T13:16:45Z

Looks great - doc was super clear and easy to read!

Co-authored-by: Chris Halcrow <57948917+chrishalcrow@users.noreply.github.com>

zm711

Before being really careful my one concern with this doc as written is that for the average user we don't need to scale_to_uV. It happens with argument flags in various functions. The way this documentation works it sounds like I always have to create a scaled recording rather than keep my unscaled and scale at the function level. I think the doc about other physical units is great, but I'm worried that this might actually confuse users on how to use the library since the scaling for uV is baked in to other functions the class is for people that just need their recording scaled for other purposes right?

zm711 · 2025-04-16T15:22:43Z

doc/how_to/physical_units.rst

@@ -0,0 +1,96 @@
+Working with physical units in SpikeInterface recordings
+===============================================


rendering is impossible for the suggestion but this needs to go to the end of the heading.

Generated by copilot before.
Fixed.

zm711 · 2025-04-16T15:22:52Z

doc/how_to/physical_units.rst

+
+
+Converting to Physical Units
+-------------------------


same here to end of the heading.

Thanks, fixed.

h-mayorquin · 2025-04-16T16:16:52Z

but I'm worried that this might actually confuse users on how to use the library since the scaling for uV is baked in to other functions

what would be the confusion exactly?

for more information, see https://pre-commit.ci

zm711 · 2025-04-16T18:05:29Z

The confusion is do I need to do a scale_to_uV step in all of my pipelines? ie

recording = read_intan()
scaled_rec = scale_to_uV(recording)
sorting = run_sorter(xx)
# or spikeglx
recording = read_spikeglx()
scaled_rec= scale_to_uV(recording)
sorting=run_sorter()

The scaling is not required in a pipeline. But when I read these docs I feel like I have to explicitly run this step no matter what. I think it is helpful for people to know scaling is important but the main way the library has people interact with ephys scaling is with the return_scaled soon to be return_in_uV (or whatever we decided) as a function argument. Not as a preprocessing step. Does that make sense? I think the docs are good, but I'm worried people will not realize when they should scale outside of function calls vs inside. For another example. If I take a scale_to_uV recording what happens if I try to return_scaled? I shouldn't rescale again (I think you may have added an error, but then that mechanism seems weird. Because data should be scaled. Maybe this is worth an actual discussion because I'm worried I'm not being clear.

h-mayorquin · 2025-04-16T18:12:22Z

The scaling is not required in a pipeline. But when I read these docs I feel like I have to explicitly run this step no matter what.

Thanks.
We can add a comment that scaling is not needed in sorters.

I think the docs are good, but I'm worried people will not realize when they should scale outside of function calls vs inside.

This is not clear to me either. Is it to you?

but the main way the library has people interact with ephys scaling is with the return_scaled soon to be return_in_uV

I don't agree with this or want it.

Yes, I think we should discuss this on the meeting tomorrow. I am fine if you want add a list of pipes/algorithms where the units don't matter and an explanation on why they don't but I don't have that knowledge and I users are in a better position to understand if they want to run their algorithms on raw data or in units.

zm711 · 2025-04-16T18:28:22Z

@yger it seems like maybe some changes to SC2 broke testing?

h-mayorquin · 2025-04-21T16:53:19Z

Ok @zm711 I added some notes in the direction that you wanted. Check it out and let me know what you think. I still think that this is a good topic to discuss on the next meeting.

zm711

Few more comments.

doc/how_to/physical_units.rst

zm711 · 2025-04-22T12:45:42Z

doc/how_to/physical_units.rst

+SpikeInterface provides tools to handle both situations.
+
+It's important to note that **most spike sorters work fine on raw digital (ADC) units** and not scaling is needed.
+Many preprocessing tools are also linear transformations, and if the ADC is implemented as a linear transformation, the overall effect can be preserved.


we could say that is relatively common here to be a linear transformation (although you discuss gain/offset later on).

I added a comment on this direction.

zm711 · 2025-04-22T12:50:59Z

doc/how_to/physical_units.rst

+Therefore, **it is usually safe to work in raw ADC units unless a specific tool or analysis requires physical units**.
+If you are interested in visualizations, comparability across devices, or outputs with interpretable physical scales (e.g., microvolts), converting to physical units is recommended.
+Otherwise, remaining in raw units can simplify processing and preserve performance.
+


We might want Alessio or Sam to comment on some of the internal tooling. I think the scaling is automatic for some of our stuff so we should make it clear if/when we do that. I just don't remember off the top of my head. If Alessio doesn't have the time to look this over I can doublecheck in the code.

I think it is better if you check it.

I think this is only missing part to move this forward?

Yeah I agree @alejoe91 could you comment here when you have a moment :)

zm711 · 2025-04-22T12:51:49Z

doc/how_to/physical_units.rst

+
+    physical_value = raw_value * gain + offset
+
+


Maybe a note here saying that as we discussed above because this is a linear transformation we can do preprocessing etc without an issue.

Can you add a suggestion?

Yep will do. Have some experiments all day today, but my hope is tomorrow should be a little freer.

zm711 · 2025-04-22T12:55:07Z

doc/how_to/physical_units.rst

+    values = ["volts"] * num_channels
+    recording.set_property(key='physical_unit', value=values)
+
+    values = [0.001] * num_channels  # Convert from ADC to volts


Maybe we say

gain_values = [0.001] * num_channels

to be even more clear and then below we would say
offset_values

just so we aren't using the same variable being overwritten for both? This is definitely optional.

Yeah, makes sense.

zm711 · 2025-04-22T12:56:39Z

src/spikeinterface/preprocessing/scale.py

+
+    def __init__(self, recording):
+        if "gain_to_physical_unit" not in recording.get_property_keys():
+            raise ValueError("Recording must have 'gain_to_physical_unit' property to convert to physical units")


Any interest in adding the way to do this in the error? for example. adding a line like

please use the set_property function in order to set "gain_to_physical_unit"

Co-authored-by: Zach McKenzie <92116279+zm711@users.noreply.github.com>

chrishalcrow · 2025-05-09T08:52:15Z

src/spikeinterface/preprocessing/scale.py

+        self.set_channel_offsets(offsets=0.0)
+
+
+scale_to_physical_units = ScaleToPhysicalUnits


Hello, could you change this to

from spikeinterface.core.core_tools import define_function_handling_dict_from_class scale_to_physical_units = define_function_handling_dict_from_class(ScaleToPhysicalUnits, name="scale_to_physical_units")

then will works with dicts of recs

chrishalcrow · 2025-05-09T08:52:26Z

src/spikeinterface/preprocessing/scale.py

+        if "gain_to_physical_unit" not in recording.get_property_keys():
+            error_msg = (
+                "Recording must have 'gain_to_physical_unit' property to convert to physical units. \n"
+                "Set the gain using `recording.set_property(key='gain_to_physical_unit', value=values)`."


Suggested change

"Set the gain using `recording.set_property(key='gain_to_physical_unit', value=values)`."

"Set the gain using `recording.set_property(key='gain_to_physical_unit', values=values)`."

chrishalcrow · 2025-05-09T08:52:36Z

src/spikeinterface/preprocessing/scale.py

+        if "offset_to_physical_unit" not in recording.get_property_keys():
+            error_msg = (
+                "Recording must have 'offset_to_physical_unit' property to convert to physical units. \n"
+                "Set the offset using `recording.set_property(key='offset_to_physical_unit', value=values)`."


Suggested change

"Set the offset using `recording.set_property(key='offset_to_physical_unit', value=values)`."

"Set the offset using `recording.set_property(key='offset_to_physical_unit', values=values)`."

add documentation to handle units

0f61068

h-mayorquin added documentation Improvements or additions to documentation core Changes to core module preprocessing Related to preprocessing module labels Apr 7, 2025

h-mayorquin self-assigned this Apr 7, 2025

h-mayorquin added 2 commits April 7, 2025 11:28

fix scaling

ee6542b

add tests

3c81942

h-mayorquin mentioned this pull request Apr 7, 2025

Improve ElectricalSeries units in recording interfaces catalystneuro/neuroconv#1292

Merged