You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Using Dask shuffle to lazily partition a Dask GeoDataFrame correctly returns a dask_geopandas.core.GeoDataFrame. However, computing the task returns a pd.DataFrame not a gpd.GeoDataFrame. Below is a quick example;
@tastatham thanks for the clear example. Looking into it, now I remember that I previously ran into the same problem with set_index: #59. And the problem lies in serialization/deserialization step for which dask uses partd: dask/partd#52
The set_geometry is probably a good workaround for now.
@tastatham thanks for the clear example. Looking into it, now I remember that I previously ran into the same problem with set_index: #59. And the problem lies in serialization/deserialization step for which dask uses partd: dask/partd#52
The set_geometry is probably a good workaround for now.
ah I see. Since this is a Dask issue, I will use set_geometry as a workaround for the "spatial" shuffle for now and we can drop this later when this has been resolved.
Using
Dask shuffle
to lazily partition a Dask GeoDataFrame correctly returns a dask_geopandas.core.GeoDataFrame. However, computing the task returns a pd.DataFrame not a gpd.GeoDataFrame. Below is a quick example;I will have a look at the Dask shuffle function further and follow this up. It could well be an indexing/conversion to pd.DatraFrame issue #L358.
The text was updated successfully, but these errors were encountered: