Add basic hooks for dask dataframe #30

Merged
merged 9 commits into from
Dec 18, 2023

martindurant
Member

@martindurant martindurant commented Apr 19, 2023

This makes some things work with an awkward series in a dask-dataframe.

The hard-coded guess at a two-element array works for simple operations. I played with attaching a real form to the dtype object, but it rapidly gets complicated and heads towards what dask-awkward already does. That raises the question of whether we should plumb in dask-awkward, at least when using the accessor, so that we know the true dtype of any output via the typetracer. This applies even more to IO: we should use dask-awkward for reading from parquet/JSON rather than have dask.dataframe make objects that we then turn back into awkward structures.

One rough edge I already found: if the operation being performed returns a different number of elements than the two supplied by the fake series, you get an error. How much do we need to push users to provide their own meta=? Can we make it for them while we have more information, before passing to make_array_nonempty?
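
To make the rough edge concrete, here is a toy model in plain Python. It is only a sketch of the failure mode described above, not the code in this PR and not how dask infers metadata internally: `infer_meta`, `apply_partitions`, and the string-valued `meta` are hypothetical names invented for illustration.

```python
# Toy model only: none of these names are dask or awkward-pandas APIs.

def infer_meta(func, fake=(0, 1)):
    """Guess the output type by running func on a two-element placeholder."""
    fake = list(fake)
    out = func(fake)
    if len(out) != len(fake):
        # Mirrors the rough edge: the placeholder has two elements, so an
        # operation that changes the element count cannot be checked this way.
        raise ValueError(
            f"expected {len(fake)} elements from the placeholder, got {len(out)}"
        )
    return type(out[0]).__name__


def apply_partitions(partitions, func, meta=None):
    """Apply func to every partition; infer meta only when it is not supplied."""
    if meta is None:
        meta = infer_meta(func)
    return meta, [func(list(p)) for p in partitions]


def total(xs):
    """A reduction: returns one element regardless of the input length."""
    return [sum(xs)]


partitions = [[1, 2, 3], [4, 5]]

# Elementwise operation: inference on the placeholder succeeds.
meta_ok, doubled = apply_partitions(partitions, lambda xs: [x * 2.0 for x in xs])

# Reduction: one element comes back for two inputs, so inference fails...
try:
    apply_partitions(partitions, total)
    inference_failed = False
except ValueError:
    inference_failed = True

# ...unless the caller states the output type up front.
meta_given, totals = apply_partitions(partitions, total, meta="int")
```

In this sketch the explicit `meta` skips the placeholder run entirely, which is the trade-off raised above: either the user supplies it, or the library has to build it from better information than a fixed two-element guess.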

@douglasdavis douglasdavis merged commit 0269282 into intake:main Dec 18, 2023
12 checks passed
2 participants