
[Bug]: get_default_backend_configuration: auto chunk not good for time series data #1099

Open
bendichter opened this issue Sep 24, 2024 · 0 comments

What happened?

When using get_default_backend_configuration for long time series, the recommended chunk shape mirrors the shape of the dataset itself, so a long, narrow dataset gets chunks that span hundreds of thousands of frames but only a few channels. Such chunks are sub-optimal for reading short windows of time across all channels, e.g. the way data is accessed in neurosift. A better chunking for time series would deviate from this shape-similarity convention and provide chunks that hold more (ideally all) channels.

Steps to Reproduce

import numpy as np
from pynwb.testing.mock.ecephys import mock_ElectricalSeries
from pynwb.testing.mock.file import mock_NWBFile
from neuroconv.tools.nwb_helpers import get_default_backend_configuration

# A long recording: 10 million frames x 128 channels
data = np.ones((10_000_000, 128))

nwbfile = mock_NWBFile()
ts = mock_ElectricalSeries(data=data, nwbfile=nwbfile)

backend_config = get_default_backend_configuration(nwbfile, backend="hdf5")
print(backend_config.dataset_configurations["acquisition/ElectricalSeries/data"].chunk_shape)

output: (312500, 4)
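
For comparison, here is a minimal sketch (not the neuroconv API) of a channel-spanning heuristic that keeps roughly the same per-chunk byte budget but spends it on the time axis instead. The helper name and the 10 MiB target are assumptions chosen only for illustration:

import numpy as np

def channel_spanning_chunk_shape(shape, dtype, target_bytes=10 * 1024**2):
    # Hypothetical helper: keep every channel in each chunk and size the
    # time axis so a chunk stays near the target byte budget.
    n_frames, n_channels = shape
    itemsize = np.dtype(dtype).itemsize
    frames_per_chunk = max(1, target_bytes // (n_channels * itemsize))
    return (min(n_frames, frames_per_chunk), n_channels)

print(channel_spanning_chunk_shape((10_000_000, 128), "float64"))
# -> (10240, 128): each ~10 MiB chunk holds all 128 channels for a ~10k-frame window

A chunk shape like (10240, 128) takes about the same space on disk as the auto-generated (312500, 4), but a viewer requesting a short window of time across all channels touches one or two chunks instead of 32.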

Traceback

No response

Operating System

macOS

Python Executable

Conda

Python Version

3.10

Package Versions

No response

Code of Conduct

@bendichter bendichter added the bug label Sep 24, 2024
@h-mayorquin h-mayorquin self-assigned this Sep 26, 2024