Skip to content

Add BLE guide for optimising performance #1460

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 5 commits into from
Jul 19, 2021
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions docs.json
Original file line number Diff line number Diff line change
Expand Up @@ -834,6 +834,9 @@
},
{
"path": "docs/api/connectivity/bluetooth/SecurityManager.md"
},
{
"path": "docs/api/connectivity/bluetooth/Optimising_for_performance.md"
}
]
},
Expand Down
173 changes: 173 additions & 0 deletions docs/api/connectivity/bluetooth/Optimising_for_performance.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,173 @@
# Optimising application for throughput and power consumption

BLE devices are usually battery powered so performance might mean different things in different applications. You have
to decide what is more important in yours - minimising power consumption on one or both sides of the communication or
maximising throughput and/or latency. Some optimisation steps can in fact achieve both.

This guide will discuss some trade-offs that should be considered and best practices that improve performance on all
fronts.

## Power consumtpion

Any radio activity will consume power. Depending on what the stack is doing you have to power the radio even when no
data is being sent. It is important to understand when radio is active.

### Connections

The most intuitive power consumption rate to understand is when using connections. Each take turns sending and receiving
packets at set interval.


CENTRAL
┌────┐ ┌────┐ ┌────┐ ┌────┐ ┌────┐ ┌────┐ ┌────┐ ┌────┐
│send│ │recv│ │send│ │recv│ │send│ │recv│ │send│ │recv│
└────┘ └────┘ └────┘ └────┘ └────┘ └────┘ └────┘ └────┘
connection interval
◄─────────────────────►

PERIPHERAL
┌────┐ ┌────┐ ┌────┐ ┌────┐ ┌────┐ ┌────┐
│recv│ │send│ │recv│ │send│ │recv│ │send│
└────┘ └────┘ └────┘ └────┘ └────┘ └────┘
slave latency
◄───────────────────────────────────────────►

▲ ▲ ▲
connection event connection event connection event


To maintain a connection, regardless if there is data transfer to be transferred, the central needs to transmit and
receive once very connection interval.

The peripheral needs to receive and may transmit data if it has any. It may skip a number of these connection events
set by `slaveLatency`. If it has no data to transmit empty packets are sent.

More power is consumed if there is data to be exchanged. The exchange can continue until the next connection event would
take place.

It's worth considering if keeping the connection active is worth it. Connection in BLE is extremely fast and if you plan
to only send a quick burst of data every minute it is better to connect and disconnect each time.

The cost of the connection is proportionate to the negotiable connection interval. This can be set during `connect` or
later through `updateConnectionParameters`. The lower the interval the more often radio is active. This is especially
important for the peripheral which needs to enable the radio to receive packets.

This can be further helped through setting a high `slaveLatency` parameter. This allows the peripheral to skip
connection events and save power not just by not sending any packets but by not even listening. This is not free for
central as it increases latency of data transmission from central to peripheral. Central may have to attempt sending
data multiple times before the peripheral accepts the transmission. The peripheral may send data at any connection event
as the central must listen after every transmission.

### Advertising and scanning

Power draw during advertising affected by:
- the advertising interval - lower interval uses more power,
- amount of data sent,
- number of channels used - each advertising event is sent by default to three channels which you can limit to 2 or 1,
- whether extended advertising is used - this will send additional packets on regular channels,
- whether the type is connectable or scannable - it means the advertiser needs to listen on the radio after each
advertisement for potential connection of scan requests.


PERIPHERAL
┌────┐ advertising interval ┌────┐
channel 37 │adv │◄───────────────────────────────────────────►│adv │
└────┘ └────┘
┌────┐ ┌────┐
channel 38 │adv │ │adv │
└────┘ └────┘
┌────┐ ┌────┐
channel 39 │adv │ │adv │
└────┘ └────┘

non-advertising ┌────────────────────┐
channel │extended advertising│
(indicated in regular └────────────────────┘
advertising payload)


Scanning power draw is proportional to time spent scanning. Additional power will be used if you run active scanning
which will send a scan request and listen for the reply.

The interaction between scanning an advertising means that the less power the advertiser spends advertising, the more
power the scanner will have to spend to see the advertising packets. The decision on balance will be dictated by your
design of your devices (which one is more constrained).

### Connection vs advertising

Instead of connecting to the device you can consider transferring data in advertising packets. This depends on the
nature of the data.

A transfer over a connection will allow you to use the ATT protocol, this can handle acknowledgement for you. This might
be a good choice if you're sending data that must get through reliably.

If your data is non-critical then advertising might be cheaper. You might have to accept less reliability and no built
in acknowledgment. Additional benefit is that multiple devices may receive the data and each scanner may make their own
decisions about power consumption.

### Periodic advertising

Periodic advertising allows you to get best of both worlds by having the power characteristics of advertising for the
peripheral but also saving power for the scanner. After finding periodic advertising through `createSync` the scanner
will only have to turn on the radio when the expected packet is due.

## Increasing throughput

### Modulation schemes

Depending on controller support different modulation schemes are available in BLE through `setPreferredPhys()` and
`setPhy()`. While the coded PHY will increase reliability in noisy environments and increase range 2M PHY will increase
the throughput saving power ber bit. If both devices support it and the signal quality is good then this is recommended
to be enabled.

### Data length and ATT_MTU

Packet overhead strongly affects throughput. Newer controllers allow you to negotiate bigger MTUs thus decreasing the
fragmentation overhead.

There are two separate MTUs to consider: the `ATT_MTU` (which affects ATT protocol operations) and data length extension
(which affects transport packets). Increasing the sizes will increase memory usage but greatly increase throughput.
`ATT_MTU` and data length are independent of each other.

The size of ATT_MTU doesn't have any other overhead than memory and should only be limited by your biggest attribute
and available memory.

The default value of data length supported by all controllers is 23 octets. If both controllers support data length
extension and a higher value is negotiated, the BLE stack will call `onDataLengthChange` in the `Gap::EventHandler`
registered by the user. The supported length is set in the link layer with a maximum of 251. For Cordio Link Layer it
is derived from the config option `cordio_ll.max-acl-size`.

Larger data length greatly increases throughput (although diminishing returns quickly set in above 80). The only
potential drawback is in noisy environments where longer packets may cause slower effective transfer due to
retransmissions (this is only related to data length, ATT_MTU does not affect this).

### ATT protocol

GATT client writes and GATT server updates come in two versions - with and without confirmation. Requiring confirmations
limits the throughput severely so to maximise throughput you can move reliability up from the stack to your application.

### Packet timings

If you're not constrained by battery power it might be tempting to use maximum/minimum values where possible.
Advertising at maximum frequency and scanning continuously will speed up connecting. Setting intervals on connections
will minimise latency and maximise number of connection events.

One key thing to consider when setting the connection interval low is that you are creating a boundary between which a
sequence of packets must fit. This means that the last transfer must end before the next connection event starts. This
dead time may become significant if the connection interval is short and packet length is long.

The connection interval shouldn't be shorter than what your data requires in terms of latency.

# Test and measure

Due to complexity of the stack the only reliable way to truly maximise performance is to test your application with
representative data and measure the throughput and power usage. It's important to keep in mind that tweaking
parameters by trial and error and fine-tuning them will only be reliable for sequential operations on known stacks.

Many behaviours are implementation dependant and many operations are best effort and not guaranteed to succeed. The
stack has a lot of latitude to change its behaviour in accordance with resource constrains and other commitments. For
example your advertising may be severely affected by other operations that take precedence like keeping up a connection.

If your device needs to communicate with an unknown device or you run a non-trivial combination of concurrent
operations your fine-tuning should give way to sound principles since stack behaviours vary and you cannot test against
all stacks and sequences of operations.