not required column and un-ordered columns in `add_resource()` #254

Rafnuss · 2024-08-21T17:05:58Z

I'm not expecting an error message when adding a resource which has a column missing which is not required according to the schema.

Here [https://raw.githubusercontent.com/Rafnuss/GeoLocator-DP/main/measurements-table-schema.json] does not require the valid column, which is missing in my data.

library(frictionless)
create_package() |>
  add_resource(
    "measurements",
    data.frame(
      "tag_id" = "18LY",
      "sensor" = "pressure", 
      "datetime" = "2020-05-01", 
      "value" = 12
    ),
    schema = jsonlite::read_json("https://raw.githubusercontent.com/Rafnuss/GeoLocator-DP/main/measurements-table-schema.json"))
#> Error in `check_schema()`:
#> ! Field names in `schema` must match column names in `data`.
#> ℹ Field names: "tag_id", "sensor", "datetime", "value", and "valid".
#> ℹ Column names: "tag_id", "sensor", "datetime", and "value".

Also I am not sure why providing in the same order than in the schema is necessary. Is it no possible to re-order the data according to schema?

library(frictionless)
create_package() |>
  add_resource(
    "measurements",
    data.frame(
      "tag_id" = "18LY",
      "sensor" = "pressure", 
      "datetime" = "2020-05-01", 
      "valid" = F,
      "value" = 12
    ),
    schema = jsonlite::read_json("https://raw.githubusercontent.com/Rafnuss/GeoLocator-DP/main/measurements-table-schema.json"))
#> Error in `check_schema()`:
#> ! Field names in `schema` must match column names in `data`.
#> ℹ Field names: "tag_id", "sensor", "datetime", "value", and "valid".
#> ℹ Column names: "tag_id", "sensor", "datetime", "valid", and "value".

The text was updated successfully, but these errors were encountered:

Rafnuss · 2024-08-21T17:37:15Z

Actually reading more on this , I realised this dependant on fieldsMatch. Maybe a more complex solution is required?

peterdesmet · 2024-08-23T14:20:36Z

Hi @Rafnuss, you (and many others, including me) want optional and reordered fields.

This feature that is not supported in Data Package 1.0, which is the version that frictionless currently implements. So right now, you (annoyingly) need to add all columns in your data, even if those are empty. Or you will need to do some preprocessing on your schema before adding it to your resource.

The feature has indeed been added as fieldsMatch in Data Package 2.0. Frictionless currently doesn't support 2.0 yet, but we aim to do so (including fieldMatch). Fully supporting v2 is a daunting task though, so it won't be soon.

Rafnuss added a commit to Rafnuss/frictionless-r that referenced this issue Aug 21, 2024

Potential solution for frictionlessdata#254

677a953

peterdesmet closed this as completed Aug 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

not required column and un-ordered columns in `add_resource()` #254

not required column and un-ordered columns in `add_resource()` #254

Rafnuss commented Aug 21, 2024

Rafnuss commented Aug 21, 2024 •

edited

Loading

peterdesmet commented Aug 23, 2024

not required column and un-ordered columns in add_resource() #254

not required column and un-ordered columns in add_resource() #254

Comments

Rafnuss commented Aug 21, 2024

Rafnuss commented Aug 21, 2024 • edited Loading

peterdesmet commented Aug 23, 2024

not required column and un-ordered columns in `add_resource()` #254

not required column and un-ordered columns in `add_resource()` #254

Rafnuss commented Aug 21, 2024 •

edited

Loading