
feat: Instrument Lambda invocations in AWS SDK #2784

Open · wants to merge 12 commits into main
Conversation

@chynesNR (Member) commented Sep 27, 2024

Instrument calls to Lambdas and Lambda Aliases from the AWS SDK, with span attributes that allow linking the calls to the actual Lambda (specifically, the ARN is what's needed). Includes unit and integration tests. We are able to test without making a successful call to a Lambda, so no live testing resources are necessary.

Note that we are deliberately not creating an External segment, as that would result in an extra entity being created.
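For orientation, a minimal sketch of the idea, built from the wrapper APIs visible later in this review (the attribute name "cloud.resource_id" is an assumption here, not the PR's verbatim code):

// Hedged sketch: start a segment for the SDK call, keep it a leaf (see the
// review thread below), and attach the resolved ARN so the span can be
// linked to the Lambda entity. The attribute name is an assumption.
var segment = transaction.StartTransactionSegment(instrumentedMethodCall.MethodCall, "InvokeRequest");
segment.GetExperimentalApi().MakeLeaf();
transaction.AddCloudSdkAttribute("cloud.resource_id", arn);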

@chynesNR marked this pull request as ready for review October 15, 2024 15:18
@chynesNR requested a review from a team as a code owner October 15, 2024 15:18
{
if (string.IsNullOrWhiteSpace(name))
{
Log.Debug($"AddCloudSdkAttribute - Unable to set Cloud value on transaction because the key is null/empty");
Member

Should this say name instead of key?

Member Author

I was following the pattern in AddLambdaAttribute and AddFaasAttribute, but yeah, "name" is probably clearer.

Comment on lines 81 to 83
if (BadInvocations.Add(invocationName))
{
agent?.Logger.Debug($"Unable to parse function name '{invocationName}'");
Member

  1. It seems like it may be possible to run this code concurrently (two async Lambda invocations), and we are not using a thread-safe collection. This might cause some problems.
  2. Since we do not log this at the default log level, we may want to check whether that log level is enabled before storing that invocationName indefinitely.
  3. Do we need to put a cap on this collection so that it doesn't grow too large? Should we just log the first N times we encounter this problem?

Member

This comment applies to the other places where we use this collection and log a message.

Member

Would supportability metrics be useful for debugging these types of problems?

Member Author

  1. Ideally we're never going to hit this, but that is technically possible. I can make it thread-safe.
  2. Sure
  3. Since it's a Set, they'd have to be invoking a significant number of unique Lambda names, all of which are unparseable. That seems pretty unlikely, but a size check is easy enough.
  4. I've replaced all the logging calls with a helper method.
  5. I don't think so. This is more for troubleshooting individual cases of "my Lambda calls aren't linking". If we saw a large number of parsing failures, we'd still need to see some concrete examples in order to fix it. AWS provides a regex for what's a valid function identifier, and I've tried to cover all the possibilities in the unit tests, so I'm not expecting many surprises.

Member

If we don't think this is likely, and we just need to capture a few examples, maybe we don't need a collection at all and can just log the first 10 times we get something we can't parse?
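A minimal sketch of that capped approach, reusing the agent's logger surface seen elsewhere in this diff (the class and field names here are hypothetical):

using System.Threading;

internal static class ParseFailureLogger
{
    // Hypothetical names; illustrates logging only the first N failures.
    private const int MaxLoggedFailures = 10;
    private static int _failureCount;

    public static void LogParseFailure(IAgent agent, string invocationName)
    {
        // Interlocked keeps the counter safe across concurrent async
        // invocations, with no growing collection to cap.
        if (Interlocked.Increment(ref _failureCount) <= MaxLoggedFailures)
        {
            agent?.Logger.Debug($"Unable to parse function name '{invocationName}'");
        }
    }
}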

_arnCache.TryAdd(functionName, arn);
}
var segment = transaction.StartTransactionSegment(instrumentedMethodCall.MethodCall, "InvokeRequest");
segment.GetExperimentalApi().MakeLeaf();
Member

This will suppress the HttpClient instrumentation, so we will no longer get the distributed tracing headers added, or any of the other external call attributes that were previously collected for these calls (prior to this instrumentation).

Member Author

That's correct, and the expected behavior. If we leave in the HttpClient segment, an additional entity is created that can't be linked to the Lambda itself, because the URI is not unique enough. I take your point about the missing attributes, though, and will double-check with the other devs on this initiative.

Member

Yes, and we need to ensure that we are still generating the expected metrics so that the externals UI will work, and the span.kind is correct for the span that is ultimately generated.
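If header propagation does need to be preserved alongside the leaf segment, one option is for the wrapper to insert the distributed tracing headers itself. A sketch only; the dictionary carrier is hypothetical, and how those headers would reach the actual outbound InvokeRequest is an open question here:

using System.Collections.Generic;

// Sketch only: with a leaf segment the inner HttpClient call is not
// instrumented, so the wrapper would have to propagate DT headers
// explicitly. The carrier below is hypothetical.
var dtHeaders = new Dictionary<string, string>();
transaction.InsertDistributedTraceHeaders(dtHeaders,
    (carrier, key, value) => carrier[key] = value);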

}
catch (Exception e)
{
agent.Logger.Debug(e, "Unable to get RequestId from response metadata.");
Member

Is it possible that this could be logged too frequently at the Debug level?

Member Author

It's possible, though we don't usually limit error logging in our wrappers. I can make it a one-time thing.
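A hedged sketch of that one-time variant (hypothetical names; CompareExchange flips the flag exactly once, even under concurrency):

using System;
using System.Threading;

internal static class RequestIdLogger
{
    private static int _logged;

    public static void LogOnce(IAgent agent, Exception e)
    {
        // Only the first caller to swap 0 -> 1 actually logs.
        if (Interlocked.CompareExchange(ref _logged, 1, 0) == 0)
        {
            agent.Logger.Debug(e, "Unable to get RequestId from response metadata.");
        }
    }
}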

return null;
}

var getResponse = _getResultFromGenericTask.GetOrAdd(task.GetType(), t => VisibilityBypasser.Instance.GeneratePropertyAccessor<object>(t, "Result"));
Member

If you are using dynamic anyway, do you need the concurrent dictionary and visibility bypasser combination here?

Member Author

That's the pattern we use elsewhere, though now that I look at it I'm not sure why. I can simplify it.
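A sketch of that simplification (assumes the task is actually a completed Task<TResponse>; runtime binding would throw on a plain Task):

using System.Threading.Tasks;

internal static class TaskResultHelper
{
    // Illustrative: runtime binding resolves Result on the concrete
    // Task<T>, so no ConcurrentDictionary/VisibilityBypasser cache is needed.
    public static object GetTaskResult(Task task)
    {
        if (task == null || task.Status != TaskStatus.RanToCompletion)
            return null;

        return ((dynamic)task).Result;
    }
}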

@tippmar-nr
Member

Looks like we might want to review the Codecov report; patch coverage is a bit low...

@codecov-commenter

Codecov Report

Attention: Patch coverage is 66.34615% with 35 lines in your changes missing coverage. Please review.

Project coverage is 81.27%. Comparing base (27a78cb) to head (28a5ebe).
Report is 2 commits behind head on main.

Files with missing lines                                Patch %   Lines
...nsions/NewRelic.Agent.Extensions/Helpers/AwsSdk.cs   84.61%    9 Missing and 3 partials ⚠️
...gent/Core/Attributes/AttributeDefinitionService.cs    9.09%   10 Missing ⚠️
...nt/NewRelic/Agent/Core/Transactions/Transaction.cs   20.00%    8 Missing ⚠️
....Agent.Extensions/Collections/ConcurrentHashSet.cs    0.00%    5 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2784      +/-   ##
==========================================
- Coverage   81.31%   81.27%   -0.05%     
==========================================
  Files         460      461       +1     
  Lines       29239    29358     +119     
  Branches     3231     3252      +21     
==========================================
+ Hits        23777    23861      +84     
- Misses       4669     4701      +32     
- Partials      793      796       +3     
Flag       Coverage Δ
Agent      82.16% <66.34%> (-0.06%) ⬇️
Profiler   73.33% <ø> (ø)

Flags with carried forward coverage won't be shown.

Files with missing lines                                Coverage Δ
....Agent.Extensions/Collections/ConcurrentHashSet.cs   30.00% <0.00%> (-2.73%) ⬇️
...nt/NewRelic/Agent/Core/Transactions/Transaction.cs   79.92% <20.00%> (-0.81%) ⬇️
...gent/Core/Attributes/AttributeDefinitionService.cs   94.36% <9.09%> (-1.28%) ⬇️
...nsions/NewRelic.Agent.Extensions/Helpers/AwsSdk.cs   84.61% <84.61%> (ø)

... and 1 file with indirect coverage changes
