Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Diffing streams #1234

Open
jbothma opened this issue Sep 5, 2023 · 1 comment
Open

Diffing streams #1234

jbothma opened this issue Sep 5, 2023 · 1 comment

Comments

@jbothma
Copy link
Contributor

jbothma commented Sep 5, 2023

It would be nice to be able to diff two streams and see

  • which fields differ between the same entity in the two streams
  • which entities only occur in one or the other stream

This needs to ignore differences in the ordering, of entities in the stream, property values in an entity

It would also be nice if it could

  • ignore differences in timestamps, perhaps optionally
  • perhaps ignore user-defined properties and attributes

This, along with the ability to extract one or more enitities using a whitelist of IDs would make it much easier to pull out a sample of specific entities, and inspect differences.

My use case is when working on code producing a stream, and wanting to see the differences between a known version and a new version.

@pudo
Copy link
Contributor

pudo commented Sep 5, 2023

Some fraction of this is implemented here, but it's so bad we never really advertised it :/

https://github.com/opensanctions/opensanctions/tree/main/contrib/delta

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants