
Exception in NIO client when broker sends a large message over SSL #307

Closed

dimas opened this issue Sep 20, 2017 · 4 comments

dimas (Contributor) commented Sep 20, 2017

It looks like there is a bug in how the SslEngineByteBufferInputStream class "reassembles" the stream.
The piece of code below either passes or fails depending on where I set a breakpoint.

                int bytesRead = NioHelper.read(channel, cipherIn);
                if (bytesRead > 0) {
                    cipherIn.flip();
                } else {
                    bytesRead = NioHelper.retryRead(channel, cipherIn);
                    if(bytesRead <= 0) {
                        throw new IllegalStateException("Should be reading something from the network");
                    }
                }

If the breakpoint is at NioHelper.read (so there is some time for data from the network to become available and for bytesRead to be above zero), it works.
However, if the breakpoint is set on the next line, data from the network is not yet available and the else branch is executed. It eventually reads the data but never calls cipherIn.flip(), so unlike the "then" branch, which exits with cipherIn(position=0, limit=4629) in my test, the "else" branch exits with cipherIn(position=4629, limit=16921); I guess the buffer has not been flipped for reading. That later fails on sslEngine.unwrap(). If I manually invoke cipherIn.flip() in the debugger, the packet seems to decode OK.
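For context, the failure mode comes down to standard java.nio.ByteBuffer semantics: after a channel read the buffer is positioned for filling, and flip() must be called before a consumer such as SSLEngine.unwrap() can read the data. A minimal standalone sketch of that behaviour (the buffer size and contents are arbitrary):

    import java.nio.ByteBuffer;

    public class FlipDemo {
        public static void main(String[] args) {
            ByteBuffer buf = ByteBuffer.allocate(16);

            // Simulate what channel.read(buf) does: bytes are written into the buffer,
            // advancing its position. The buffer is now set up for further filling.
            buf.put(new byte[] {1, 2, 3, 4});
            System.out.println("after put:  position=" + buf.position() + " limit=" + buf.limit());
            // prints position=4 limit=16 -- consuming from here would read past the data

            // flip() sets limit to the current position and position to 0,
            // exposing exactly the bytes that were just read.
            buf.flip();
            System.out.println("after flip: position=" + buf.position() + " limit=" + buf.limit());
            // prints position=0 limit=4 -- this is the state a consumer expects
        }
    }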

acogoluegnes self-assigned this Sep 20, 2017
acogoluegnes added this to the 4.2.2 milestone Sep 20, 2017
acogoluegnes (Contributor)

Thread on mailing list

acogoluegnes added a commit that referenced this issue Sep 20, 2017
acogoluegnes (Contributor)

@dimas I cannot reproduce locally, but your analysis looks correct. Can you try the latest 4.2.2 SNAPSHOT? Thanks.

dimas (Contributor, Author) commented Sep 20, 2017

It seems to work.
Regarding your PR: isn't it simpler to move cipherIn.flip() outside of the if? Like this:

                int bytesRead = NioHelper.read(channel, cipherIn);
                if (bytesRead <= 0) {
                    bytesRead = NioHelper.retryRead(channel, cipherIn);
                    if(bytesRead <= 0) {
                        throw new IllegalStateException("Should be reading something from the network");
                    }
                }
                cipherIn.flip();

?
And re the test: my test involves a remote broker; I am not sure it is reproducible with localhost because the timing is completely different in that case.
Also, my message is not as large as yours: a JSON body of 4224 bytes is sent, plus some headers (probably less than 256 bytes). For better coverage it may make sense to run the test multiple times with different message sizes.
Cheers

michaelklishin (Contributor)

Adding a few more tests with varying sizes is fine. Refactoring suggestions are best presented as pull requests :)
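A rough idea of what such a size-sweep could look like against the client's public API; the queue handling, the exact sizes, and the SSL/NIO setup here are placeholders rather than the project's actual test code:

    import com.rabbitmq.client.Channel;
    import com.rabbitmq.client.Connection;
    import com.rabbitmq.client.ConnectionFactory;
    import com.rabbitmq.client.GetResponse;

    public class VaryingSizeSslRoundTrip {
        public static void main(String[] args) throws Exception {
            ConnectionFactory factory = new ConnectionFactory();
            factory.useNio();          // exercise the NIO code path from this issue
            factory.useSslProtocol();  // trust-everything SSL, acceptable against a local test broker

            // Sweep payload sizes around typical TLS record boundaries instead of
            // relying on a single "large" message.
            int[] sizes = {128, 4224, 16 * 1024, 64 * 1024, 1024 * 1024};

            try (Connection connection = factory.newConnection()) {
                Channel channel = connection.createChannel();
                String queue = channel.queueDeclare().getQueue(); // server-named, auto-delete queue
                for (int size : sizes) {
                    channel.basicPublish("", queue, null, new byte[size]);
                    GetResponse response = null;
                    for (int attempt = 0; attempt < 50 && response == null; attempt++) {
                        response = channel.basicGet(queue, true);
                        if (response == null) {
                            Thread.sleep(100); // give the broker a moment to route the message
                        }
                    }
                    if (response == null || response.getBody().length != size) {
                        throw new AssertionError("Round trip failed for size " + size);
                    }
                }
            }
        }
    }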

michaelklishin added a commit that referenced this issue Sep 26, 2018
Recovery delay was a connection recovery feature from day 1 for a reason:

 * In practice connection recovery often won't succeed the first time
   because network failures don't always go away in milliseconds.
 * There's a natural race condition between server state changes
   (cleanup of queues and such) and the operations a recovered connection
   will perform, potentially on entities *with the same identifier* (e.g. name)

An initial delay avoids a lot of scenarios that stem from the above
race condition and that can waste a lot of time for operators, developers,
and the RabbitMQ core team.

References #307.
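As an aside, the delay being discussed corresponds to the network recovery interval that applications can tune on ConnectionFactory; a minimal sketch of that configuration (the 5-second value is just an example, not a recommendation taken from the commit):

    import com.rabbitmq.client.Connection;
    import com.rabbitmq.client.ConnectionFactory;

    public class RecoveryDelayConfig {
        public static void main(String[] args) throws Exception {
            ConnectionFactory factory = new ConnectionFactory();
            factory.setAutomaticRecoveryEnabled(true);  // enable automatic connection recovery
            // Delay between recovery attempts, in milliseconds. Keeping this well above
            // zero gives transient network failures and server-side cleanup of queues
            // and other entities time to settle before the client reconnects.
            factory.setNetworkRecoveryInterval(5000);
            try (Connection connection = factory.newConnection()) {
                // use the connection; it will be recovered after a network failure
            }
        }
    }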
acogoluegnes pushed four commits that referenced this issue Sep 26, 2018, each carrying the same message as above (cherry picked from commit 60eb44d).