Add client span to SignalR .NET client #57101

JamesNK · 2024-07-31T13:05:14Z

Addresses #51557. Specifically, the client activity source item.

Follow up to #55439 and builds on top of #57049.

Changes:

Invocations by the SignalR client now start a local activity.
Activities are started for regular invocations and streaming methods.
Activity starts with the invocation and ends when the client receives a completion message.

Not planned in this PR:

Tracing in other clients.

Nothing is blocking additional work in the future.

JamesNK · 2024-07-31T13:10:06Z

@BrennanConroy A problem I found when implementing this is the SignalR client doesn't know the name of the hub on the server. The client just specifies a URL and then sends messages to a method at that URL.

Because the hub name isn't available, as a substitute I replaced it with the URL path, e.g. builder.WithUrl("https://localhost:5001/chat") will result in the URL path chat being recorded as the service name.

Is there a better option? If not, should the server record the URL path as the OTEL service name instead of the .NET type name? That will keep consistency between what the client and server record.
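The URL-path substitution described above can be sketched roughly as follows. This is only an illustration, not the code in the PR: the class and method names (`ServiceNameSketch`, `FromUrl`) are hypothetical, and the sketch assumes the path is used as-is with its surrounding slashes trimmed.

```csharp
using System;

public static class ServiceNameSketch
{
    // Hypothetical helper: derives a service name from the hub URL path,
    // since the client does not know the hub's .NET type name.
    public static string FromUrl(string url)
    {
        var uri = new Uri(url, UriKind.Absolute);

        // AbsolutePath is "/chat" for https://localhost:5001/chat.
        // Trimming the slashes leaves "chat".
        return uri.AbsolutePath.Trim('/');
    }
}
```

With this sketch, `ServiceNameSketch.FromUrl("https://localhost:5001/chat")` returns `chat`. A URL with no path would produce an empty string, which a real implementation would need to handle.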

JamesNK · 2024-07-31T14:06:24Z

@tarekgh @noahfalk Do you know how the Activity name interacts with OTEL? Activity has OperationName and DisplayName. Right now, Activity.OperationName is set to the required OTEL span format. Is this the right thing to do?

It seems like opentelemetry-dotnet uses Activity.DisplayName (which defaults to OperationName if not set). And the docs for Activity.OperationName say its value should be a constant.

Would it be more correct to do this?

var activity = activitySource.CreateActivity("Microsoft.AspNetCore.SignalR.Client.InvocationOut", ...);
activity.DisplayName = $"{serviceName}/{serviceMethod}";
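A slightly fuller sketch of that idea, for discussion only. The source name `Example.SignalR.Client` and the helper names are assumptions made for illustration; the constant operation name is the one from the snippet above. `CreateActivity` returns null when nothing is listening, so the sketch checks for that before setting `DisplayName`.

```csharp
using System.Diagnostics;

public static class ClientSpanNamingSketch
{
    // Hypothetical source name, used only in this sketch.
    public const string SourceName = "Example.SignalR.Client";

    // Constant operation name, as proposed in the snippet above.
    public const string OperationName = "Microsoft.AspNetCore.SignalR.Client.InvocationOut";

    private static readonly ActivitySource Source = new ActivitySource(SourceName);

    public static Activity? StartInvocation(string serviceName, string serviceMethod)
    {
        // Returns null when no listener samples this source.
        var activity = Source.CreateActivity(OperationName, ActivityKind.Client);
        if (activity is null)
        {
            return null;
        }

        // The per-call name goes on DisplayName; OperationName stays constant.
        activity.DisplayName = $"{serviceName}/{serviceMethod}";
        activity.Start();
        return activity;
    }

    // Registers a listener so that activities from the source are created.
    public static ActivityListener ListenToSource()
    {
        var listener = new ActivityListener
        {
            ShouldListenTo = source => source.Name == SourceName,
            Sample = (ref ActivityCreationOptions<ActivityContext> options) => ActivitySamplingResult.AllData,
        };
        ActivitySource.AddActivityListener(listener);
        return listener;
    }
}
```

After calling `ListenToSource()`, `StartInvocation("chat", "SendMessage")` yields an activity whose `OperationName` is the constant and whose `DisplayName` is `chat/SendMessage`.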

tarekgh · 2024-07-31T16:19:57Z

CC @cijothomas @CodeBlanch @reyang to answer the question in #57101 (comment). What @JamesNK proposes matches the runtime guidelines: OperationName is used for grouping and filtering, while DisplayName is used for the UI.

BrennanConroy

Can we see a screenshot of these changes in a UI like the Aspire Dashboard?

SendCoreAsync doesn't have an activity being created for it. It's for "fire and forget" calls. Do we just create an activity, send the message, and stop the activity immediately?

There should be a few more tests:
Multiple different invokes to make sure there is a different activity every time.
Creating an activity in "user code" and seeing that it's the parent of the activity created in an invoke.
Making sure the activity stopped.

src/SignalR/clients/csharp/Client.Core/src/HubConnection.cs

src/SignalR/clients/csharp/Client.Core/src/Internal/InvocationRequest.cs

cijothomas · 2024-07-31T18:17:42Z

> CC @cijothomas @CodeBlanch @reyang to answer the question in #57101 (comment). What @JamesNK proposes matches the runtime guidelines: OperationName is used for grouping and filtering, while DisplayName is used for the UI.

> Do you know how the Activity name interacts with OTEL? Activity has OperationName and DisplayName.

OTel has a single concept of span name, which is expected to be low cardinality and is also used for the UI. The name can be changed after span creation. Activity.DisplayName is what maps to this. (OperationName exists as legacy on Activity and is not used by OTel.)

JamesNK · 2024-07-31T22:27:18Z

@cijothomas Thanks!

JamesNK · 2024-07-31T22:30:54Z

> SendCoreAsync doesn't have an activity being created for it. It's for "fire and forget" calls. Do we just create an activity, send the message, and stop the activity immediately?

Yes.

> Multiple different invokes to make sure there is a different activity every time.

This is done in HubConnectionTests.InvokeAsync_SendTraceHeader.

> Creating an activity in "user code" and seeing that it's the parent of the activity created in an invoke.

Done in HubConnectionTests.InvokeAsync_SendTraceHeader and StreamAsyncCore_SendTraceHeader.

> Making sure the activity stopped.

I'll add some asserts for Activity.IsStopped.
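The behaviors discussed above (a fire-and-forget activity that stops immediately, a user-code activity becoming the parent, and an `IsStopped` assertion) can be illustrated with a small stand-alone sketch. It does not use the SignalR client or the tests named above; the source name, operation name and helper names are hypothetical, and `Activity.IsStopped` assumes .NET 7 or later.

```csharp
using System.Diagnostics;

public static class ClientSpanLifetimeSketch
{
    // Hypothetical source name, used only in this sketch.
    public const string SourceName = "Example.SignalR.Client.Lifetime";

    private static readonly ActivitySource Source = new ActivitySource(SourceName);

    // Stand-in for a fire-and-forget send: start, "send", stop immediately.
    public static Activity? SendFireAndForget(string displayName)
    {
        // StartActivity uses Activity.Current as the parent when one exists.
        var activity = Source.StartActivity("InvocationOut", ActivityKind.Client);
        if (activity is null)
        {
            return null;
        }

        activity.DisplayName = displayName;

        // A real client would write the message to the connection here.

        activity.Stop();
        return activity;
    }

    // Registers a listener so that activities from the source are created.
    public static ActivityListener ListenToSource()
    {
        var listener = new ActivityListener
        {
            ShouldListenTo = source => source.Name == SourceName,
            Sample = (ref ActivityCreationOptions<ActivityContext> options) => ActivitySamplingResult.AllData,
        };
        ActivitySource.AddActivityListener(listener);
        return listener;
    }
}
```

A test along these lines would start an activity in user code with `new Activity("UserCode").Start()`, call `SendFireAndForget`, and then check that the returned activity's `Parent` is the user activity and that `IsStopped` is true.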

BrennanConroy · 2024-07-31T23:12:33Z

>> Multiple different invokes to make sure there is a different activity every time.

> This is done in HubConnectionTests.InvokeAsync_SendTraceHeader.

>> Creating an activity in "user code" and seeing that it's the parent of the activity created in an invoke.

> Done in HubConnectionTests.InvokeAsync_SendTraceHeader and StreamAsyncCore_SendTraceHeader.

Huh, maybe there is something weird happening here since it's merging into another PR, but those tests look like they're being deleted, which would be odd since they seem specific to this change.

JamesNK · 2024-07-31T23:17:39Z

They're deleted from their original location and moved to HubConnectionTests.Tracing.cs.

BrennanConroy · 2024-07-31T23:21:05Z

🙈 The file was collapsed and when I looked for HubConnectionTests.Tracing.cs I only noticed one of them but there are two files with that name.

JamesNK · 2024-08-01T04:05:59Z

Updates:

Client activity for SendAsync added.
Changed OperationName to a constant. Will fix server activity in another PR.
Changed client activity to include acquiring an active connection.

I don't think there are any outstanding changes. There is still this open question: #57101 (comment). I think the server should use the path. ChatHub will almost always have a URL like /chat, so it will be understandable, and it will be consistent with the client. @BrennanConroy

JamesNK · 2024-08-01T06:00:13Z

> Can we see a screenshot of these changes in a UI like the Aspire Dashboard?

😎 😎 😎

src/SignalR/clients/csharp/Client.Core/src/Internal/InvocationRequest.cs

BrennanConroy

I think this is good to go after my last questions/comments.

src/SignalR/clients/csharp/Client.Core/src/HubConnection.cs

src/SignalR/clients/csharp/Client.Core/src/Internal/InvocationRequest.cs

src/SignalR/clients/csharp/Client.Core/src/HubConnection.cs

src/SignalR/clients/csharp/Http.Connections.Client/src/HttpConnection.cs

JamesNK · 2024-08-01T22:48:20Z

> I think this is good to go after my last questions/comments.

This PR depends on the propagation PR (#57049) being merged first.

BrennanConroy · 2024-08-06T18:37:25Z

src/SignalR/clients/csharp/Client.Core/src/HubConnection.cs

+        {
+            ConnectionState connectionState;
+            var connectionStateTask = _state.WaitForActiveConnectionAsync(sendingMethodName, token);
+            if (connectionStateTask.Status == TaskStatus.RanToCompletion)


Do we need to set any server tags if connectionStateTask throws? It throws when you're either not connected or the user canceled the passed-in CTS. In either case it's not actually doing anything except creating and stopping an activity.

Plus, it'd clean up the code a bit.

JamesNK

If there is a problem starting the connection, then it would be useful to know the server address being called.

BrennanConroy

But the connection failing to start, or closing before the send calls, or not even calling start in the first place, is not reflected in SendAsync or InvokeAsync calls. All it knows is that there is no connection.

JamesNK

With telemetry it's useful to view it from the perspective of the user. The mechanics of what happens internally that could cause the call to fail are invisible to them. They're focused on the end result: the call they want to make failed and there was an error. Having information about what they were trying to do (make a call to a hub with the specified rpc.service, rpc.method, server.address, server.port) is what they care about. Then the error reason would be in error.type.

For an example of prior art, HttpClient always reports server.address and server.port when making an HTTP request. It does this even if the server connection couldn't be established, or the request was canceled before sending any data.

If you want to make it clearer that the call failed because of the connection not being available, there is room to do that by providing good values to error.type. By default, it is the exception type name, but it can be customized in known scenarios. For example, if there is a problem with the connection, error.type could be something like: negotiate-failed, hub-connection-not-started, etc.

Also, server.address and server.port are considered required attributes according to the spec: https://github.com/open-telemetry/semantic-conventions/blob/1e34b57b9f73b08b109cdc0e8841e857e5f5c205/docs/rpc/rpc-spans.md#client-attributes

BrennanConroy

Ok thanks, I wanted to know some concrete reason we were including it.
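As a rough illustration of the tagging approach argued for in this thread, the sketch below sets the call and server tags up front and records error.type on failure. It is not the code in HubConnection.cs: the class, method and parameter names are hypothetical, using the URL path for rpc.service follows the earlier discussion, and the sketch assumes .NET 6 or later for `Activity.SetStatus`.

```csharp
using System;
using System.Diagnostics;

public static class ClientSpanTagsSketch
{
    // Sets the tags that describe what the caller was trying to do.
    // These are set even if the call later fails because there is no connection.
    public static void SetCallTags(Activity activity, Uri hubUrl, string method)
    {
        activity.SetTag("rpc.service", hubUrl.AbsolutePath.Trim('/'));
        activity.SetTag("rpc.method", method);
        activity.SetTag("server.address", hubUrl.Host);
        activity.SetTag("server.port", hubUrl.Port);
    }

    // Records the failure reason. knownErrorType is a hypothetical hook for
    // customized values such as "hub-connection-not-started".
    public static void SetError(Activity activity, Exception exception, string? knownErrorType = null)
    {
        // Falls back to the exception type name when no custom value is given.
        activity.SetTag("error.type", knownErrorType ?? exception.GetType().FullName);
        activity.SetStatus(ActivityStatusCode.Error);
    }
}
```

For a hub URL of https://localhost:5001/chat, this records server.address as localhost and server.port as 5001, and an `InvalidOperationException` with no custom value gives an error.type of System.InvalidOperationException.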

src/SignalR/clients/csharp/Client/test/UnitTests/HubConnectionTests.Tracing.cs

JamesNK · 2024-08-07T05:18:09Z

/azp run

azure-pipelines · 2024-08-07T05:18:28Z

Azure Pipelines successfully started running 3 pipeline(s).

JamesNK · 2024-08-07T12:12:40Z

/azp run

azure-pipelines · 2024-08-07T12:13:00Z

Azure Pipelines successfully started running 3 pipeline(s).

JamesNK added the area-signalr Includes: SignalR clients and servers label Jul 31, 2024

JamesNK requested review from mgravell, captainsafia, noahfalk and tarekgh July 31, 2024 13:05

JamesNK requested review from BrennanConroy and halter73 as code owners July 31, 2024 13:05

JamesNK mentioned this pull request Jul 31, 2024

Propagate trace parent to SignalR hub invocations #57049

Merged

BrennanConroy reviewed Jul 31, 2024

JamesNK commented Aug 1, 2024

BrennanConroy reviewed Aug 1, 2024

JamesNK force-pushed the jamesnk/signalr-distributed-tracing branch from eff6c7e to 2f4adcc Compare August 2, 2024 01:22

build-analysis bot mentioned this pull request Aug 2, 2024

Roslyn analyzer throws error AD0001 NullReferenceException dotnet/dnceng#3305

Open

3 tasks

Base automatically changed from jamesnk/signalr-distributed-tracing to main August 2, 2024 07:14

JamesNK force-pushed the jamesnk/signalr-client-span branch from ab2ff3f to 691bf0f Compare August 2, 2024 08:16

JamesNK added 3 commits August 6, 2024 08:11

Add client span to SignalR .NET client

af8f01c

Fix merge

cf474bf

Handle negotiate changing the connection URL

a698667

JamesNK force-pushed the jamesnk/signalr-client-span branch from 691bf0f to a698667 Compare August 6, 2024 04:02

BrennanConroy reviewed Aug 6, 2024

Test port

4f676a3

BrennanConroy approved these changes Aug 7, 2024

JamesNK enabled auto-merge (squash) August 7, 2024 05:18

Fix test

3e3fc7d

build-analysis bot mentioned this pull request Aug 7, 2024

The active test run was aborted. Reason: Test host process crashed dotnet/dnceng#451

Open

3 tasks

JamesNK merged commit d96d272 into main Aug 7, 2024
26 checks passed

JamesNK deleted the jamesnk/signalr-client-span branch August 7, 2024 14:25

dotnet-policy-service bot added this to the 9.0-rc1 milestone Aug 7, 2024

JamesNK mentioned this pull request Oct 27, 2024

Add docs about new SignalR tracing in .NET 9 dotnet/AspNetCore.Docs#33941

Closed
