Consolidated Metadata serializes NaN incorrectly #2990

mpiannucci · 2025-04-16T19:54:58Z

Zarr version

v3.0.6

Numcodecs version

v0.16.0

Python Version

3.11

Operating System

Mac

Installation

uv

Description

NaN fill values are serialized incorrectly when using consolidated metadata, The output is

"fill_value": NaN,

when it should be

"fill_value": "NaN",

Steps to reproduce

import numpy as np
import zarr
from zarr.core.buffer import default_buffer_prototype

store = zarr.storage.MemoryStore()
root = zarr.group(store, zarr_format=2)
time = root.create_array("time", shape=(12,), dtype=np.float64, fill_value=np.nan)

# Check the metadata for the fill_value
array_buff = await store.get("time/.zarray", prototype=default_buffer_prototype())
print('fill_value: "NaN"', '"fill_value": "NaN"' in array_buff.to_bytes().decode())

# Consolidate the metadata
zarr.consolidate_metadata(store)

# Check the metadata for the fill_value
array_buff = await store.get(".zmetadata", prototype=default_buffer_prototype())
print('fill_value: "NaN"', '"fill_value": "NaN"' in array_buff.to_bytes().decode())
print('fill_value: NaN', '"fill_value": NaN' in array_buff.to_bytes().decode())

# Output:
# fill_value: "NaN" True
# fill_value: "NaN" False
# fill_value: NaN True

Additional output

No response

The text was updated successfully, but these errors were encountered:

mpiannucci · 2025-04-17T18:18:03Z

The key thing here I think is that V2 metadata has its own NaN handling, but the consolidated metadata encoder uses the v3 encoder. Which to the eye should do the same thing but in practice is does not

mpiannucci · 2025-04-17T18:24:02Z

import json
import zarr

o = {
    "a": 1,
    "b": np.nan,
}

json.dumps(o, cls=zarr.core.metadata.v3.V3JsonEncoder)

# '{\n  "a": 1,\n  "b": NaN\n}'

mpiannucci added the bug Potential issues with the zarr-python library label Apr 16, 2025

mpiannucci mentioned this issue Apr 16, 2025

Update to use zarr python >= 3.0 xpublish-community/xpublish#289

Merged

This was referenced Apr 17, 2025

Fix nan encoding in consolidated metadata #2996

Merged

Structured dtype serialization with consolidated metadata fails #2998

Closed

TomAugspurger closed this as completed in #2996 Apr 18, 2025

This was referenced May 1, 2025

Monthly issue metrics report #3030

Open

Monthly issue metrics report sanketverma1704/zarr-python#10

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consolidated Metadata serializes NaN incorrectly #2990

Consolidated Metadata serializes NaN incorrectly #2990

mpiannucci commented Apr 16, 2025

mpiannucci commented Apr 17, 2025

mpiannucci commented Apr 17, 2025 •

edited

Loading

Consolidated Metadata serializes NaN incorrectly #2990

Consolidated Metadata serializes NaN incorrectly #2990

Comments

mpiannucci commented Apr 16, 2025

Zarr version

Numcodecs version

Python Version

Operating System

Installation

Description

Steps to reproduce

Additional output

mpiannucci commented Apr 17, 2025

mpiannucci commented Apr 17, 2025 • edited Loading

mpiannucci commented Apr 17, 2025 •

edited

Loading