use ConcurrentLinkedHashMap as backing for LRUMap #3531

pjfanning · 2022-06-26T15:11:31Z

Relates to #2502

@cowtowncoder if this seems like a good way to go, I can add more tests

pom.xml

src/main/java/com/fasterxml/jackson/databind/util/LRUMap.java

ben-manes · 2022-06-26T16:54:41Z

...ava/com/fasterxml/jackson/databind/util/concurrentlinkedhashmap/ConcurrentLinkedHashMap.java

+ *      http://code.google.com/p/concurrentlinkedhashmap/</a>
+ */
+@ThreadSafe
+public final class ConcurrentLinkedHashMap<K, V> extends AbstractMap<K, V>


fwiw, there is an embedded fork in Groovy (and then taken by Micronaut) that adds support for computeIfAbsent and friends. Otherwise you inherit the non-atomic version from ConcurrentMap. That's fine in your usage, but it may be a desirable enhancement. However, since I did not add or review it all I know is the developers seem good but I don't know if their changes are good or not. But if helpful then you might check it out. I'd probably run Lincheck as a sanity test when code diving.

cowtowncoder · 2022-06-26T17:01:37Z

Ok, sounds good to me. LMK when things are ready, happy to merge. Hopefully can be merged relatively cleanly to 3.0 too.

cowtowncoder · 2022-06-26T19:32:39Z

src/main/java/com/fasterxml/jackson/databind/util/LRUMap.java

    protected Object readResolve() {
-        return new LRUMap<Object,Object>(_jdkSerializeMaxEntries, _jdkSerializeMaxEntries);
+        return new LRUMap<K,V>(_initialEntries, _maxEntries);


Possibly not a big deal, but the idea here was that JDK serialization would retain size settings, if possible.
But perhaps CLHM does not expose these settings after construction?

long capacity() and setCapacity(long) expose those if helpful. CLHM is serializable where it retains the entries (to mirror ConcurrentHashMap) and discards the eviction order. That differs in Guava/Caffeine, fwiw, which don't keep the entries and just the configuration.

I hit issues with the old _jdkSerializeMaxEntries values - after my initial changes, the cloned LRUMap was being constructed with _jdkSerializeMaxEntries set to 0 and this meant that CLHM immediately evicted all new entries.

@cowtowncoder would it be ok to allow the backing CLHM be non-transient, then we wouldn't need any special serialization logic? As @ben-manes highlights, the eviction order would be lost but that isn't too bad.

I think cached content serialization should not be attempted, since contents are not necessarily serializable -- and at least for serializers, deserializers, are not. Alternatively if we really want to retain contents in some cases, could make it configurable. But at this point I think I'd prefer just wiping out contents.

I think losing some of the settings is probably fine: ability to JDK serialize Jackson components is bit of an odd feature in general. Although, to be honest, I think the main important thing there is to retain configuration: as far as I know, this is relied upon by frameworks like Spark etc where workers are being sent serialized set up of handlers.

cowtowncoder · 2022-06-26T19:35:52Z

Ok, one non-code thing I'd like to figure out is how to give Ben credit for the code. An entry in CREDITS-2.x would be a start I guess, but probably he should be mentioned as co-author in that file as well, for specific subset of classes as they are copied?

Similarly, I think we should add @pjfanning as a co-author, and from this author(s) of fast FP read/write packages.

I am open to suggestions on how to do this beyond Javadocs (I think class Javadocs, or maybe package-info, should definitely have author notes too).

WDYT?

ben-manes · 2022-06-26T19:41:18Z

Thanks, but I don't have any strong opinions on credit and for me it is enough to retain the author tag. Happy just to contribute good code as I've benefited from so many other's open-source work (including jackson).

pjfanning · 2022-06-26T19:42:51Z

I'm happy to leave the source author tag with just Ben's name - I have copied his code and made very few changes

...com/fasterxml/jackson/databind/util/concurrentlinkedhashmap/ConcurrentLinkedHashMapTest.java

cowtowncoder · 2022-06-28T03:53:41Z

I'm happy to leave the source author tag with just Ben's name - I have copied his code and made very few changes

Sounds good. I just want to make sure to give where due -- that's the least we can do and easy to do.

...sterxml/jackson/databind/util/concurrentlinkedhashmap/ConcurrentLinkedHashMapStressTest.java

cowtowncoder · 2022-06-29T23:41:15Z

Reading through the code, I think there is the remaining work to strip the implementation down to struts -- only the minimal interface for LookupCache / LRUMap needs to be supported.
Note that I am fine merging a full(er) implementation first, and then starting to trim things down further, given decent test coverage. So not all of my suggestions here need to be done right away.

And as to JDK serialization: I think that is what LRUMap should implement, keeping track of size values it needs, and re-constructing backing CLHM but keeping that non-serializable to remove any need for it to be serialized on its own.

The very last thing I'd want would be for users to start using Jackson's copy of CLHM -- I know it is difficult to hide it (pre-module info :) ) but let's do our best.

ben-manes · 2022-06-29T23:48:58Z

The very last thing I'd want would be for users to start using Jackson's copy of CLHM -- I know it is difficult to hide it (pre-module info :) ) but let's do our best.

Eclipse hides the internal package (I think by default), no clue about IntelliJ.

cowtowncoder · 2022-06-29T23:55:11Z

Eclipse hides the internal package (I think by default), no clue about IntelliJ.

Interesting.

At one point I used shading to add ".private." to prevent source code references (class loader is not bothered by keywords in package names) but that was apparently frowned upon. :)

I guess we can also use some more obscure name since it's really the auto-completion that introduces lots of unintended (or poorly understood) accidental reuse of internal packages.

pjfanning · 2022-06-29T23:56:27Z

One option is to make jackson's CLHM non-serializable (saving code) because LRUMap does treats the underlying CLHM instance as transient.

This reverts commit 8e6897d.

This reverts commit a5581f7.

This reverts commit 8d29de8.

This reverts commit 1696ac0.

cowtowncoder · 2022-06-30T22:32:19Z

Looks good! Seems like there might be just 2 more things:

Renamed the main ConcurrentLinkedHashMap to something more obscure to avoid IDE auto-completion if anyone is looking for the OG one. I don't greatly care what the name is since this is effectively internal class, not part of API.
If possible, remove various Weighers given that we will use basic equal-weighted entries logic. With the current usage there is no real way to customize caches wrt variable weights.

Other than that, assuming tests pass reliably, I'd be ready to merge this in, I think?

pjfanning · 2022-06-30T23:01:23Z

I removed more Weigher logic and renamed the class as com.fasterxml.jackson.databind.util.internal.PrivateMaxEntriesMap

src/main/java/com/fasterxml/jackson/databind/util/internal/Weigher.java

cowtowncoder · 2022-07-01T00:24:46Z

Ok, good. How about I merge this now: we can continue tweaks if and as necessary.

ben-manes · 2022-07-01T00:33:58Z

Close FasterXML/jackson-module-scala#428?

cowtowncoder · 2022-11-16T05:12:25Z

Alas, I did not realize that the Map implementation here is apparently very heavy memory user, as per #3665... half a meg for all of 4 caches? Phew. I was hoping to see if the initial sizes might be too big but that doesn't seem to be the case; all are between 16 and 64.

I would like to figure out something; further discussion on #3665.

use ConcurrentLinkedHashMap as backing for LRUMap

dd7a597

pjfanning marked this pull request as draft June 26, 2022 15:12

move new package

cd4e033

ben-manes reviewed Jun 26, 2022

View reviewed changes

cowtowncoder reviewed Jun 26, 2022

View reviewed changes

pjfanning added 6 commits June 26, 2022 21:02

remove new jar dependency (jsr 305) - use apache cayenne version of CLHM

47e9ae3

Create ConcurrentLinkedHashMapTest.java

8befa70

Create LRUMapTest.java

8f1bdcd

move serialization test so that internal state can be checked

0ae71ae

Update TestJDKSerialization.java

7d901fb

Update LRUMapTest.java

111f3b3

ben-manes reviewed Jun 27, 2022

View reviewed changes

...com/fasterxml/jackson/databind/util/concurrentlinkedhashmap/ConcurrentLinkedHashMapTest.java Outdated Show resolved Hide resolved

pjfanning added 2 commits June 27, 2022 20:05

extra tests - some fail and will be investigated in coming days

bfc5302

prevent add being used on the map entrySet

3882f17

pjfanning added 8 commits June 28, 2022 15:29

Create ConcurrentLinkedHashMapStressTest.java

b2c9b08

Update ConcurrentLinkedHashMapStressTest.java

0c01d2b

wip

674a407

try to make test reliable

53c3205

Merge pull request #2 from pjfanning/stress2

f246b9c

Update ConcurrentLinkedHashMapStressTest.java

bac364a

Update ConcurrentLinkedHashMapStressTest.java

517f432

Update ConcurrentLinkedHashMapStressTest.java

c57d7e1

ben-manes reviewed Jun 28, 2022

View reviewed changes

...sterxml/jackson/databind/util/concurrentlinkedhashmap/ConcurrentLinkedHashMapStressTest.java Outdated Show resolved Hide resolved

pjfanning marked this pull request as ready for review June 28, 2022 19:39

pjfanning changed the title ~~WIP: use ConcurrentLinkedHashMap as backing for LRUMap~~ use ConcurrentLinkedHashMap as backing for LRUMap Jun 28, 2022

pjfanning added 9 commits June 30, 2022 00:57

remove more code for concurrency level

8d29de8

make this CLHM non-serializable

a5581f7

remove EvictionListener

8e6897d

Revert "remove EvictionListener"

6a5a5b2

This reverts commit 8e6897d.

Revert "make this CLHM non-serializable"

4a26b39

This reverts commit a5581f7.

Revert "remove more code for concurrency level"

2e132ca

This reverts commit 8d29de8.

Revert "remove unnecessary concurrency level"

dfcf976

This reverts commit 1696ac0.

Update package-info.java

fc6fa05

make some more classes package private

6820db2

pjfanning added 3 commits June 30, 2022 23:38

rename internal map

ff39d5f

remove more weigher logic

b438091

Update LRUMap.java

f85c808

ben-manes reviewed Jun 30, 2022

View reviewed changes

src/main/java/com/fasterxml/jackson/databind/util/internal/Weigher.java Outdated Show resolved Hide resolved

pjfanning added 2 commits July 1, 2022 01:02

remove Weigher interface

3bc8acd

remove EntryWeigher interface

a0e8839

cowtowncoder merged commit adf022c into FasterXML:2.14 Jul 1, 2022

cowtowncoder mentioned this pull request Jul 1, 2022

Add a way to configure caches Jackson uses #2502

Closed

mhalbritter mentioned this pull request Nov 15, 2022

ObjectMapper default heap consumption increased significantly from 2.13.x to 2.14.0 #3665

Closed

pjfanning deleted the rework-lrumap branch November 17, 2022 22:04

davseitsev mentioned this pull request Feb 27, 2023

Coordinator OOM caused by new jackson version regression trinodb/trino#16282

Closed

carterkozak mentioned this pull request Apr 12, 2023

TypeFactory cache performance degradation with constructSpecializedType() #3876

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use ConcurrentLinkedHashMap as backing for LRUMap #3531

use ConcurrentLinkedHashMap as backing for LRUMap #3531

pjfanning commented Jun 26, 2022 •

edited

Loading

ben-manes Jun 26, 2022

cowtowncoder commented Jun 26, 2022

cowtowncoder Jun 26, 2022

ben-manes Jun 26, 2022

pjfanning Jun 26, 2022

cowtowncoder Jun 26, 2022

cowtowncoder commented Jun 26, 2022

ben-manes commented Jun 26, 2022

pjfanning commented Jun 26, 2022

cowtowncoder commented Jun 28, 2022

cowtowncoder commented Jun 29, 2022

ben-manes commented Jun 29, 2022 •

edited

Loading

cowtowncoder commented Jun 29, 2022

pjfanning commented Jun 29, 2022

cowtowncoder commented Jun 30, 2022

pjfanning commented Jun 30, 2022

cowtowncoder commented Jul 1, 2022

ben-manes commented Jul 1, 2022

cowtowncoder commented Nov 16, 2022

use ConcurrentLinkedHashMap as backing for LRUMap #3531

use ConcurrentLinkedHashMap as backing for LRUMap #3531

Conversation

pjfanning commented Jun 26, 2022 • edited Loading

ben-manes Jun 26, 2022

Choose a reason for hiding this comment

cowtowncoder commented Jun 26, 2022

cowtowncoder Jun 26, 2022

Choose a reason for hiding this comment

ben-manes Jun 26, 2022

Choose a reason for hiding this comment

pjfanning Jun 26, 2022

Choose a reason for hiding this comment

cowtowncoder Jun 26, 2022

Choose a reason for hiding this comment

cowtowncoder commented Jun 26, 2022

ben-manes commented Jun 26, 2022

pjfanning commented Jun 26, 2022

cowtowncoder commented Jun 28, 2022

cowtowncoder commented Jun 29, 2022

ben-manes commented Jun 29, 2022 • edited Loading

cowtowncoder commented Jun 29, 2022

pjfanning commented Jun 29, 2022

cowtowncoder commented Jun 30, 2022

pjfanning commented Jun 30, 2022

cowtowncoder commented Jul 1, 2022

ben-manes commented Jul 1, 2022

cowtowncoder commented Nov 16, 2022

pjfanning commented Jun 26, 2022 •

edited

Loading

ben-manes commented Jun 29, 2022 •

edited

Loading