You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The problem is caused because zarr chooses the wrong default filters for O dtype arrays -- zarr chooses VlenBytes, when it should be choosing VlenUTF8. Since the structure of the default codecs is likely to change soon, the fix should probably be made in the context of #2463
this is still broken on main. I can "fix" it by associating the dtype string O with the VlenUTF8 codec instead of VlenBytes, but that shouldn't work in general for O dtype arrays (and it breaks some tests).
knowing little about how zarr encodes arbitrary python objects, I checked how this works in v2, and it seems that zarr.create(dtype='O') would error if an object_codec was not provided. Not sure we want to emulate that. The original bug report came from xarray's integration tests with zarr main; I can look into how those tests are calling zarr APIs exactly
this example does not work in
main
:The problem is caused because zarr chooses the wrong default filters for
O
dtype arrays -- zarr choosesVlenBytes
, when it should be choosingVlenUTF8
. Since the structure of the default codecs is likely to change soon, the fix should probably be made in the context of #2463cc @dcherian
The text was updated successfully, but these errors were encountered: