QName Deserializer #543

vitorpamplona · 2022-08-26T18:31:07Z

QName is a special case of an XML Deserializer. This PR parses the contents of the QName and assigns the correct namespace from the XML stack. This mimics the behavior of JAXB parsers.

An XML like this:

<parent xmlns:t="urn:example:types:r1">
    <level1 name="t:DateTime" />
</parent>

should automatically yield parent.level1.name as QName("urn:example:types:r1", "DateTime", "t").
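To make the expected result concrete, here is a minimal sketch that uses only the JDK's StAX reader, not Jackson or the code in this PR. The class and method names (QNameFromAttributeSketch, readLevel1Name) are hypothetical. It assumes the reader is still positioned on the element that carries the attribute, so the prefix binding declared on the parent is still in scope.

```java
import java.io.StringReader;
import javax.xml.namespace.QName;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

public class QNameFromAttributeSketch {

    /** Reads the "name" attribute of the first level1 element and resolves its prefix. */
    static QName readLevel1Name(String xml) {
        try {
            XMLStreamReader reader =
                    XMLInputFactory.newInstance().createXMLStreamReader(new StringReader(xml));
            try {
                while (reader.hasNext()) {
                    if (reader.next() == XMLStreamConstants.START_ELEMENT
                            && "level1".equals(reader.getLocalName())) {
                        String raw = reader.getAttributeValue(null, "name");
                        if (raw == null) {
                            return null; // element has no "name" attribute
                        }
                        int colon = raw.indexOf(':');
                        if (colon <= 0) {
                            return new QName(raw); // no prefix to resolve
                        }
                        String prefix = raw.substring(0, colon);
                        String localPart = raw.substring(colon + 1);
                        // Look the prefix up while the reader is still inside its scope.
                        String uri = reader.getNamespaceContext().getNamespaceURI(prefix);
                        if (uri == null || uri.isEmpty()) {
                            return new QName(raw); // unbound prefix: keep the raw text
                        }
                        return new QName(uri, localPart, prefix);
                    }
                }
            } finally {
                reader.close();
            }
        } catch (XMLStreamException e) {
            throw new IllegalStateException(e);
        }
        return null; // no level1 element found
    }

    public static void main(String[] args) {
        String xml = "<parent xmlns:t=\"urn:example:types:r1\">"
                + "<level1 name=\"t:DateTime\" /></parent>";
        QName name = readLevel1Name(xml);
        System.out.println(name.getNamespaceURI() + " / " + name.getLocalPart()
                + " / " + name.getPrefix());
    }
}
```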

cowtowncoder · 2022-08-30T23:37:55Z

src/main/java/com/fasterxml/jackson/dataformat/xml/deser/QNameDeserializer.java

+        if (qName.getLocalPart().indexOf(":") > 0) {
+            String prefix = qName.getLocalPart().split(":")[0];
+            String localPart = qName.getLocalPart().split(":")[1];
+            String namespace = ((FromXmlParser)ctxt.getParser()).getStaxReader().getNamespaceContext().getNamespaceURI(prefix);


This won't work if content is buffered (with TokenBuffer), so I am not sure this approach is valid unfortunately.

vitorpamplona · Aug 31, 2022

Hmm, interesting. Do you have a simple test case for the buffered use case?

I am using the same design in our solution so, even if it is not viable here, I would love to improve our code, or at least know exactly when it will fail.

cowtowncoder · Aug 31, 2022

It's a side effect of processing, not a feature to enable: if you create a POJO with @JsonCreator(mode = JsonCreator.Mode.PROPERTIES), property values will likely be buffered. The same applies to polymorphic types if the Type Id is not the first value deserialized.
Basically, values are buffered in any case where stream order differs from the order in which the values are needed.

vitorpamplona · Aug 31, 2022

So, the correct way to solve this would be to change the token streams so that each token carries the namespaces and prefixes in scope for it?

cowtowncoder · Sep 1, 2022

Yes. TokenBuffer, however, is not XML specific and cannot (should not) be changed.
2.13 allows replacing the buffer implementation, so we could have an XmlTokenBuffer sub-class or similar (the XML backend was the reason this was added), but then there is the question of how to access the information. Probably FromXmlParser and XmlTokenBuffer could implement an interface (to be added) with additional accessors.

I have not, unfortunately, had time to take this approach any further, but it would be a very valuable improvement and would unblock work on multiple issues.
So I would be happy to help you or anybody who has time to look into improvements in this area.
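The idea of a buffer that keeps namespace information alongside each value can be sketched with plain StAX. This is not Jackson's TokenBuffer API and not a proposed XmlTokenBuffer implementation. BufferedValue and bufferAttributeValues are hypothetical names, and the sketch only buffers attribute values. It snapshots the prefix bindings in scope when a value is read, so the value can be resolved after the reader has moved on or been closed.

```java
import java.io.StringReader;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import javax.xml.namespace.QName;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

public class NamespaceAwareBufferSketch {

    /** Hypothetical buffered value: raw text plus the prefix bindings in scope when it was read. */
    static final class BufferedValue {
        final String text;
        final Map<String, String> bindings;

        BufferedValue(String text, Map<String, String> bindings) {
            this.text = text;
            this.bindings = bindings;
        }

        /** Resolves the text as a QName using only the captured bindings, no live parser. */
        QName asQName() {
            int colon = text.indexOf(':');
            if (colon <= 0) {
                return new QName(text);
            }
            String prefix = text.substring(0, colon);
            String uri = bindings.get(prefix);
            if (uri == null || uri.isEmpty()) {
                return new QName(text); // prefix was not bound where the value appeared
            }
            return new QName(uri, text.substring(colon + 1), prefix);
        }
    }

    /** Buffers every attribute value together with a snapshot of the in-scope bindings. */
    static List<BufferedValue> bufferAttributeValues(String xml) {
        List<BufferedValue> buffered = new ArrayList<>();
        Deque<Map<String, String>> scopes = new ArrayDeque<>();
        scopes.push(new HashMap<String, String>());
        try {
            XMLStreamReader reader =
                    XMLInputFactory.newInstance().createXMLStreamReader(new StringReader(xml));
            while (reader.hasNext()) {
                int event = reader.next();
                if (event == XMLStreamConstants.START_ELEMENT) {
                    // Inherit the parent's bindings, then apply this element's declarations.
                    Map<String, String> scope = new HashMap<>(scopes.peek());
                    for (int i = 0; i < reader.getNamespaceCount(); i++) {
                        String prefix = reader.getNamespacePrefix(i);
                        scope.put(prefix == null ? "" : prefix, reader.getNamespaceURI(i));
                    }
                    scopes.push(scope);
                    for (int i = 0; i < reader.getAttributeCount(); i++) {
                        buffered.add(new BufferedValue(reader.getAttributeValue(i), scope));
                    }
                } else if (event == XMLStreamConstants.END_ELEMENT) {
                    scopes.pop();
                }
            }
            reader.close();
        } catch (XMLStreamException e) {
            throw new IllegalStateException(e);
        }
        return buffered;
    }
}
```

Resolution here happens after the reader is closed, which is the situation a buffered value is in when a deserializer finally asks for it.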

cowtowncoder · 2022-08-30T23:43:43Z

Ok, first of all: thank you for contributing this patch!
I can see what it is attempting to do and that makes sense.

Unfortunately I am not sure this can be implemented in a robust manner. The main problem is that there is no guarantee that:

1. There is a Stax parser associated with the content, in particular when buffering (using TokenBuffer) is used (most commonly when a @JsonCreator-annotated constructor is used).
2. The parser itself still points to the actual XML content element (for the textual content of an element, it is probably pointing to the closing element).

The latter might not be a huge issue as long as the namespace resolution context is still available, but the former is problematic.
I guess one could make the code check accessibility and avoid the lookup if the parser is not available. But that would be hugely confusing, since it "sometimes works, sometimes not" (with no obvious signs to the user).
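A guarded lookup of the kind described above might look like the following sketch. It uses only JDK types; resolve and singleBinding are hypothetical helpers, and passing null stands in for "no Stax parser is reachable" (for example, when the value came from a buffer). The two calls in main show why the behavior would be confusing: the same input text produces different QNames depending on whether a namespace context happened to be available.

```java
import java.util.Collections;
import java.util.Iterator;
import javax.xml.XMLConstants;
import javax.xml.namespace.NamespaceContext;
import javax.xml.namespace.QName;

public class GuardedQNameLookupSketch {

    /** Resolves "prefix:local" if a context is available; otherwise returns the raw text as-is. */
    static QName resolve(String raw, NamespaceContext contextOrNull) {
        int colon = raw.indexOf(':');
        if (contextOrNull == null || colon <= 0) {
            return new QName(raw); // silent fallback: no namespace, local part keeps the colon
        }
        String prefix = raw.substring(0, colon);
        String uri = contextOrNull.getNamespaceURI(prefix);
        if (uri == null || uri.isEmpty()) {
            return new QName(raw); // prefix not bound in this context
        }
        return new QName(uri, raw.substring(colon + 1), prefix);
    }

    /** Tiny NamespaceContext with one binding, standing in for a live parser's context. */
    static NamespaceContext singleBinding(final String prefix, final String uri) {
        return new NamespaceContext() {
            @Override
            public String getNamespaceURI(String p) {
                return prefix.equals(p) ? uri : XMLConstants.NULL_NS_URI;
            }

            @Override
            public String getPrefix(String u) {
                return uri.equals(u) ? prefix : null;
            }

            @Override
            public Iterator<String> getPrefixes(String u) {
                return uri.equals(u)
                        ? Collections.singletonList(prefix).iterator()
                        : Collections.<String>emptyList().iterator();
            }
        };
    }

    public static void main(String[] args) {
        QName streamed = resolve("t:DateTime", singleBinding("t", "urn:example:types:r1"));
        QName buffered = resolve("t:DateTime", null);
        System.out.println("with context:    " + streamed.getNamespaceURI()
                + " / " + streamed.getLocalPart());
        System.out.println("without context: " + buffered.getNamespaceURI()
                + " / " + buffered.getLocalPart());
    }
}
```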

vitorpamplona added 2 commits August 26, 2022 14:16

Correctly parses QName

31e70a3

Moving to the right packages and folders

99d8459

cowtowncoder reviewed Aug 30, 2022
