Information

Table Of Contents
Bosch Sensortec | BME AI-Studio Documentation 42 | 49
Modifications reserved | Data subject to change
without notice Document number: BST-BME688-AN001-00
5.7 Data Balance
This indicates how the data for your algorithm training is distributed over your classes. Ideally, the distribution should be
balanced. If there is a little checkmark, everything is OK and the data is balanced enough for training.
How data balance is calculated
Each class consists of multiple specimens and each specimen has a duration. The application checks if the total duration
(sum of all specimen durations) of any of the classes is below the following threshold:
threshold = 24% / number of classes
E.g.
5.8 Data Channels
Specimen data includes four data channels. You can choose which data channel of each specimen should be part of
the training:
Gas Data Channel (10 data points)
Humidity Data Channel (1 data point)
Temperature Data Channel (1 data point)
Barometric Pressure Data Channel (1 data point)
By default, only the gas data channel is used for training. If other channels play a key role in your use case, you can try
to include additional channels in the training and compare the training results.
Please note
Be careful with using additional channels for your training. Using additional data channels does not automatically
mean better training results. The algorithm might focus only on one of the additional data channels during training,
which might not be what you actually want. E.g. if one of your recorded specimens has a strong impact on humidity,
the algorithm might only focus on the correlation between the specimen and the corresponding rise in humidity,
ignoring all other data. Your training results may look very good, but the algorithm only learnedto distinguish your
specimen by looking at the humidity, completely ignoring the respective gas data.
threshold for 2 classes: 12%
threshold for 3 classes: 8%
threshold for 4 classes: 6%