Sigchain Class API should provide paginated ordered claims by returning Array-POJO and indexed access #327

CMCDragonkai · 2022-02-08T11:39:35Z

Specification

The Sigchain API requires some rework as discovered here: #310 (comment)

1. The ChainData and ChainDataEncoded types are record POJOs, which are not the right data type to return to callers. Record POJOs do not preserve order, and when returning chain data we want callers to be able to iterate over it in the same order as the chain itself (as ordered by the claim ID).
2. When fetching a collection of resources in JS, always return an Array. Record POJOs are not ordered. Alternatively, to preserve both order and indexed access, a Map can be used. However, Arrays and POJOs are pure data; Map and Set are not. This is why, by convention, we only use Map as a persistent rich construct, and use arrays and POJOs when exchanging data between systems.
3. This means we need Sigchain.getChain and Sigchain.getChainEncoded, returning Array<Claim> and Array<ClaimEncoded> respectively. The encoded version is the base64-encoded JWT, but within our own application we would usually work with the Claim structure. There should also be obvious conversion functions from Claim to ClaimEncoded and back.
4. These 2 functions should expose seek and limit parameters to enable cursor pagination. Both parameters are optional. Note that seek must have the type of the seek key. The seek key of Sigchain is the ClaimId, which is based on IdSortable, which itself uses timestamps internally. One should also be able to synthetically construct a ClaimId to represent a point-in-time cursor. This is a secondary priority; do it after the main design works.
5. For indexed access, we would not expect the user to acquire a collection and then query that collection. Instead we would expect the user to directly call getClaim, passing the ClaimId to acquire the Claim itself. If they want an encoded version, we can expose getClaimEncoded.
6. The ChainData and ChainDataEncoded types may no longer be necessary. Just use Array<Claim> and Array<ClaimEncoded>. However, it is essential that one can iterate over these 2 arrays and use extractors/transformers/converters to acquire the ClaimId. This means the ClaimId must always be acquirable given a Claim or ClaimEncoded, ideally efficiently.
7. Remote users calling over GRPC should be able to pass pagination parameters in and acquire a collection of claims. They can then process it the way they want, including indexed access.
8. We should endeavour to always use ClaimId internally in our application, and ClaimIdString only when using IDs as POJO keys. The ClaimIdEncoded is purely for external reference. The Claim should use ClaimId, and ClaimEncoded can use ClaimIdEncoded since it has to be JSON anyway. Make sure to use the reviver and replacer pattern as we do inside the Status class.
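
As a rough illustration of the "pure data" and reviver/replacer points above, here is a minimal sketch. It assumes a binary ClaimId modelled as a plain Uint8Array and a hex string as the encoded form; the Claim shape and the helper names (toHex, fromHex, claimReplacer, claimReviver) are hypothetical and are not the actual Sigchain or Status code.

```typescript
// Hypothetical stand-ins for illustration only.
type ClaimId = Uint8Array;
type Claim = { id: ClaimId; seq: number; type: 'node' | 'identity' };

// Encode bytes as a lowercase hex string.
const toHex = (bytes: Uint8Array): string =>
  Array.from(bytes, (b) => (b < 16 ? '0' : '') + b.toString(16)).join('');

// Decode a hex string back into bytes.
const fromHex = (hex: string): Uint8Array => {
  const bytes = new Uint8Array(hex.length / 2);
  for (let i = 0; i < bytes.length; i++) {
    bytes[i] = parseInt(hex.slice(i * 2, i * 2 + 2), 16);
  }
  return bytes;
};

// Replacer: turn the binary id into its encoded string form when serialising.
function claimReplacer(key: string, value: unknown): unknown {
  return key === 'id' && value instanceof Uint8Array ? toHex(value) : value;
}

// Reviver: turn the encoded string back into a binary id when parsing.
function claimReviver(key: string, value: unknown): unknown {
  return key === 'id' && typeof value === 'string' ? fromHex(value) : value;
}

// An Array of POJOs keeps its order through a JSON round trip.
const claims: Array<Claim> = [
  { id: new Uint8Array([0, 1]), seq: 1, type: 'node' },
  { id: new Uint8Array([0, 2]), seq: 2, type: 'identity' },
];
const encoded = JSON.stringify(claims, claimReplacer);
const decoded = JSON.parse(encoded, claimReviver) as Array<Claim>;

// A Map is not pure data: JSON.stringify silently drops its entries.
const lossy = JSON.stringify(new Map([['a', 1]])); // '{}'
```

The key-based check is only one way to do this; a real implementation would probably tag encoded IDs more robustly than matching on the property name.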

Additional context

Extracting Node Connection Management out of NodeManager to NodeConnectionManager #310 (comment) - Discussion about Sigchain ClaimId, ClaimIdString, ClaimIdEncoded
GRPC API Review Changes #275 - GRPC API review and pagination concerns
Growing the Gestalt Graph and Implementing Social Discovery #320 - The discovery system has been refactored significantly, and will be impacted by changes to the Sigchain

Tasks

...
...
...

joshuakarp · 2022-02-11T02:16:20Z

This will also have implications for our Discovery process, once we provide support for revocations in the sigchain, such that we can iterate over claims in chronological order. See #320 (comment)

CMCDragonkai · 2022-10-17T07:38:47Z

In order to provide a "generic" paginated system, we have to change the Sigchain.getClaims method call.

It needs to take an options object like:

    {
      order = 'asc',
      seek
    }: {
      order?: 'asc' | 'desc';
      seek?: ClaimId;
    } = {},

The above means that by default we are going to get C1, C2, C3. But you can reverse it by changing the order to desc. Furthermore, the seek allows one to seek to a particular ClaimId which is lexicographically ordered. This is used as gte under asc or lte under desc. This means the seek is actually inclusive.

This does mean you need to have an ID to be able to seek. However, one could also "synthetically" construct a ClaimId because it is an IdSortable. That can be a bit weird to do, and there is no utility function for it atm.

There's no limit, because it is an async generator enabling you to just stop consuming when you want to stop. Finally the whole thing is transactional too.
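
A minimal sketch of the behaviour described above, assuming an in-memory array sorted ascending by ID and a string ClaimId whose lexicographic order stands in for the real IdSortable ordering. The store parameter, the TokenClaim shape and the take helper are hypothetical; the real method reads from the DB inside a transaction.

```typescript
// Hypothetical stand-ins for illustration only.
type ClaimId = string;
type TokenClaim = { typ: string };

// Yields claims in the requested order; seek is inclusive
// (gte under 'asc', lte under 'desc').
async function* getClaims(
  store: ReadonlyArray<[ClaimId, TokenClaim]>, // assumed sorted ascending by id
  {
    order = 'asc',
    seek,
  }: {
    order?: 'asc' | 'desc';
    seek?: ClaimId;
  } = {},
): AsyncGenerator<[ClaimId, TokenClaim]> {
  const ordered = order === 'asc' ? store : [...store].reverse();
  for (const [claimId, claim] of ordered) {
    if (seek != null) {
      if (order === 'asc' && claimId < seek) continue;
      if (order === 'desc' && claimId > seek) continue;
    }
    yield [claimId, claim];
  }
}

// No limit parameter is needed: the consumer simply stops iterating.
async function take(
  claims: AsyncIterable<[ClaimId, TokenClaim]>,
  n: number,
): Promise<Array<ClaimId>> {
  const ids: Array<ClaimId> = [];
  if (n <= 0) return ids;
  for await (const [claimId] of claims) {
    ids.push(claimId);
    if (ids.length >= n) break; // breaking out ends the generator early
  }
  return ids;
}

const store: Array<[ClaimId, TokenClaim]> = [
  ['c1', { typ: 'node' }],
  ['c2', { typ: 'identity' }],
  ['c3', { typ: 'node' }],
];
```

With this store, seeking to c2 would include c2 itself in both directions, followed by c3 under asc or c1 under desc.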

The original method had the ability to filter to specific kinds of claims. This should be generalised to a general indexing filter. To do this, you would need the ability to filter the records according to index. However there will only be some items that are indexed, not all of them would be, and we would statically limit these.

Examples of things to index to:

sequence number - so you can look up by sequence number instead
claim type: node or identity
claim about a node or about an identity

The third thing would be useful if you want to look up the most recent claim specified for a given NodeId or a given third party provider identity.

When done like this, the function can get a lot more complicated, because rather than iterating over all claims, it would instead iterate over a particular index.

This is where a "general" query engine would be good, and that's what SQL was good for. However with a lack of a query engine here, we have to instead build a fixed set of indices that can be used.
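
To make the idea of a fixed set of indices concrete, here is a small in-memory sketch. The IndexedClaims class, its method names and the Claim shape are all hypothetical, and it assumes claims are only ever appended in ascending ClaimId order. Only the sequence number and "about" indices are shown; a claim type index would follow the same pattern.

```typescript
// Hypothetical stand-ins for illustration only.
type ClaimId = string;
type Claim = { seq: number; type: 'node' | 'identity'; about: string };

class IndexedClaims {
  // Primary records keyed by claim id.
  protected claims = new Map<ClaimId, Claim>();
  // Static index: sequence number -> claim id.
  protected bySeq = new Map<number, ClaimId>();
  // Static index: subject (node or identity) -> claim ids in append order.
  protected byAbout = new Map<string, Array<ClaimId>>();

  // Every write updates the record and each fixed index together.
  public add(claimId: ClaimId, claim: Claim): void {
    this.claims.set(claimId, claim);
    this.bySeq.set(claim.seq, claimId);
    const ids = this.byAbout.get(claim.about) ?? [];
    ids.push(claimId);
    this.byAbout.set(claim.about, ids);
  }

  // Look up by sequence number instead of by claim id.
  public getClaimBySeq(seq: number): Claim | undefined {
    const claimId = this.bySeq.get(seq);
    return claimId == null ? undefined : this.claims.get(claimId);
  }

  // Most recent claim about a given node or identity.
  public getLatestClaimAbout(about: string): [ClaimId, Claim] | undefined {
    const ids = this.byAbout.get(about);
    if (ids == null || ids.length === 0) return undefined;
    const claimId = ids[ids.length - 1];
    return [claimId, this.claims.get(claimId)!];
  }
}
```

The trade-off is the one noted above: without a query engine, each supported lookup needs its own index that is maintained on every write.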

CMCDragonkai · 2022-10-17T07:41:54Z

The return type is also AsyncGenerator<[ClaimId, TokenClaim]>.

We can follow this pattern across other domains too. Although I realise we haven't really standardised on "collection" iteration and seeking.

Perhaps if custom indexing is required, one might move these into other functions since ultimately indexing in our key value system is going to be static.

getClaims()
getClaimsByX() // where X is an index
getClaimsByNodeId
getClaimsByIdentityId

Then this should translate to changes for getCerts and getTasks and whatever else.

CMCDragonkai · 2022-10-26T11:14:59Z

Everything here except 7. is done as part of #481, and put into PR #446.

CMCDragonkai · 2022-12-05T04:28:38Z

In order to solve 7. we need to do the following:

1. Refactor src/agent/service/nodesChainDataGet.ts to instead be src/agent/service/sigchainClaimsGet.ts, which should also receive pagination parameters. This includes seek, limit and order. The order should be a protobuf enum: https://developers.google.com/protocol-buffers/docs/proto3#enum
2. Sigchain.getClaims with the appropriate pagination parameters should be tested in tests/sigchain/Sigchain.test.ts. Right now this isn't being tested. We need to test both ascending and descending order and ensure that whatever ClaimId is being sought is also included in the output.
3. Remove the tests/sigchain/Sigchain.old.test.ts.
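
For the service handler, the mapping from a protobuf-style order enum to the 'asc' | 'desc' option could look roughly like the sketch below. The Order constant, the PaginationMessage shape and toPaginationOptions are hypothetical and do not come from the generated protobuf code. The sketch assumes proto3 fields without explicit presence, where an unset string reads as '' and an unset number as 0, and where the first enum value must be 0.

```typescript
// Hypothetical mirror of a proto3 enum; the first value must be 0,
// so ASC doubles as the default when the field is unset.
const Order = { ASC: 0, DESC: 1 } as const;
type Order = (typeof Order)[keyof typeof Order];

// Hypothetical request message shape as the handler would receive it.
type PaginationMessage = { seek: string; limit: number; order: Order };

// Options in the shape the domain method expects (seek still encoded here).
type PaginationOptions = {
  order: 'asc' | 'desc';
  seek?: string;
  limit?: number;
};

function toPaginationOptions(msg: PaginationMessage): PaginationOptions {
  return {
    order: msg.order === Order.DESC ? 'desc' : 'asc',
    // Treat proto3 default values as "not provided".
    seek: msg.seek === '' ? undefined : msg.seek,
    limit: msg.limit === 0 ? undefined : msg.limit,
  };
}
```

In a real handler the encoded seek string would still need decoding into a ClaimId before it is passed to the Sigchain.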

CMCDragonkai · 2022-12-05T04:34:28Z

Apparently we don't really use the seek/limit parameters that much. I think this is because they were a recent addition to js-db. But we do use lte in a few more places, such as TaskManager.

Points 2 and 3 should be done in #466.

It seems that other places would be suitable as well to incorporate pagination parameters.

CMCDragonkai · 2022-12-05T04:35:49Z

For 2. make sure that the tests cover both getClaims and getSignedClaims.

CMCDragonkai · 2022-12-05T04:36:24Z

ATM only the Sigchain has top-level methods taking the pagination parameters:

    {
      order = 'asc',
      seek,
      limit
    }: {
      order?: 'asc' | 'desc';
      seek?: ClaimId;
      limit?: number;
    } = {},
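
A sketch of how these three parameters might combine into page-by-page fetching, assuming string IDs in an in-memory sorted array. The getClaimIds and nextPage functions are hypothetical. Because seek is inclusive, a consumer that uses the last ID of the previous page as the cursor gets that ID back again, so this sketch requests one extra item and drops the duplicate.

```typescript
// Hypothetical stand-ins for illustration only.
type ClaimId = string;
type PageOptions = { order?: 'asc' | 'desc'; seek?: ClaimId; limit?: number };

// Yields at most `limit` ids, starting at `seek` (inclusive) in `order`.
function* getClaimIds(
  sorted: ReadonlyArray<ClaimId>, // assumed sorted ascending
  { order = 'asc', seek, limit }: PageOptions = {},
): Generator<ClaimId> {
  const ordered = order === 'asc' ? sorted : [...sorted].reverse();
  let remaining = limit ?? Infinity;
  for (const claimId of ordered) {
    if (remaining <= 0) return;
    if (seek != null && (order === 'asc' ? claimId < seek : claimId > seek)) {
      continue;
    }
    yield claimId;
    remaining--;
  }
}

// Fetches the ascending page after `last`, or the first page if there is no cursor.
function nextPage(
  sorted: ReadonlyArray<ClaimId>,
  last: ClaimId | undefined,
  size: number,
): Array<ClaimId> {
  if (last == null) return Array.from(getClaimIds(sorted, { limit: size }));
  // Inclusive seek returns `last` again: ask for one more and filter it out.
  return Array.from(getClaimIds(sorted, { seek: last, limit: size + 1 }))
    .filter((claimId) => claimId !== last)
    .slice(0, size);
}

const sortedIds: Array<ClaimId> = ['c1', 'c2', 'c3', 'c4', 'c5'];
```

An alternative would be an exclusive cursor, but that would be a different contract from the gte/lte behaviour described in this thread.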

CMCDragonkai · 2022-12-05T04:43:47Z

We will need a new epic to deploy pagination-based streaming to all the relevant domain collection structures, and bubbling that up to the service handlers and CLI commands. That epic can incorporate point 1.

CMCDragonkai · 2022-12-05T04:46:11Z

We can hijack #237 for this general pagination deployment.

CMCDragonkai added the development Standard development label Feb 8, 2022

CMCDragonkai mentioned this issue Feb 8, 2022

Extracting Node Connection Management out of NodeManager to NodeConnectionManager #310

Merged

46 tasks

joshuakarp mentioned this issue Feb 11, 2022

Growing the Gestalt Graph and Implementing Social Discovery #320

Merged

12 tasks

teebirdy added the r&d:polykey:core activity 3 Peer to Peer Federated Hierarchy label Jul 24, 2022

tegefaulkes mentioned this issue Sep 29, 2022

Integrating TaskManager into Discovery #451

Merged

10 tasks

CMCDragonkai self-assigned this Oct 17, 2022

CMCDragonkai mentioned this issue Oct 17, 2022

Updating Crypto to use WebCrypto API and to replace RSA with ECC #446

Merged

75 tasks

CMCDragonkai mentioned this issue Oct 26, 2022

Replace JOSE with our own tokens domain and specialise tokens for Sigchain, Notifications, Identities and Sessions #481

Closed

CMCDragonkai mentioned this issue Nov 6, 2022

Discovery Refactoring - Derived from JOSE replacement #493

Closed

11 tasks

tegefaulkes mentioned this issue Dec 6, 2022

Pagination Deployment to Service Handlers and CLI Commands and Domain Collections #237

Open

9 tasks

tegefaulkes closed this as completed in #446 Dec 8, 2022
