Bosch Sensortec | BME AI-Studio Documentation 42 | 49

Modifications reserved | Data subject to change without notice

Document number: BST-BME688-AN001-00

5.7 Data Balance

The data balance indicates how the data for your algorithm training is distributed over your classes. Ideally, the distribution should be balanced. If a small checkmark is shown, the data is balanced enough for training.

How data balance is calculated

Each class consists of multiple specimens and each specimen has a duration. The application checks if the total duration (sum of all specimen durations) of any of the classes is below the following threshold:

threshold = 24% / number of classes
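
The check described above can be sketched in a few lines of Python. This is only an illustration, not the application's actual implementation: the function names are hypothetical, and the sketch assumes that the threshold is a percentage of the total duration recorded across all classes, which the text implies but does not state explicitly.

```python
from typing import Dict, List


def balance_threshold(num_classes: int) -> float:
    """Return the threshold in percent: 24% / number of classes."""
    if num_classes < 1:
        raise ValueError("at least one class is required")
    return 24.0 / num_classes


def underrepresented_classes(durations: Dict[str, List[float]]) -> List[str]:
    """List the classes whose total duration falls below the threshold.

    `durations` maps a class name to the durations of its specimens
    (any consistent unit, e.g. seconds).
    ASSUMPTION: each class share is measured against the summed
    duration of all classes.
    """
    # Total duration per class = sum of all specimen durations.
    totals = {name: sum(values) for name, values in durations.items()}
    grand_total = sum(totals.values())
    if grand_total <= 0:
        raise ValueError("no recorded duration available")

    threshold = balance_threshold(len(totals))
    return [
        name
        for name, total in totals.items()
        if 100.0 * total / grand_total < threshold
    ]


if __name__ == "__main__":
    example = {"coffee": [600.0, 600.0], "air": [50.0]}
    # "air" covers 4% of the total duration, below the 12% threshold.
    print(underrepresented_classes(example))
```

With two classes the threshold is 12%, so a class that contributes only 4% of the total duration would be reported as underrepresented, while an empty result corresponds to the checkmark case.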

Example thresholds:

• threshold for 2 classes: 12%
• threshold for 3 classes: 8%
• threshold for 4 classes: 6%

5.8 Data Channels

Specimen data includes four data channels. You can choose which data channels of each specimen should be part of the training:

• Gas Data Channel (10 data points)
• Humidity Data Channel (1 data point)
• Temperature Data Channel (1 data point)
• Barometric Pressure Data Channel (1 data point)

By default, only the gas data channel is used for training. If other channels play a key role in your use case, you can try to include additional channels in the training and compare the training results.

Please note

Be careful when using additional channels for your training. Using additional data channels does not automatically mean better training results. The algorithm might focus only on one of the additional data channels during training, which might not be what you actually want. E.g. if one of your recorded specimens has a strong impact on humidity, the algorithm might only focus on the correlation between the specimen and the corresponding rise in humidity, ignoring all other data. Your training results may look very good, but the algorithm only “learned” to distinguish your specimen by looking at the humidity, completely ignoring the respective gas data.
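
To make the data channel selection from section 5.8 more concrete, the following sketch shows how the chosen channels could determine the size of a training input. It is not the BME AI-Studio implementation: the channel keys, `CHANNEL_SIZES`, and `build_feature_vector` are hypothetical names chosen for illustration, and only the data point counts (10, 1, 1, 1) and the gas-only default come from the text.

```python
from typing import Dict, List, Sequence

# Data points per channel, as listed in section 5.8.
# The dictionary keys are hypothetical names used only in this sketch.
CHANNEL_SIZES = {
    "gas": 10,
    "humidity": 1,
    "temperature": 1,
    "pressure": 1,
}


def build_feature_vector(
    sample: Dict[str, Sequence[float]],
    channels: Sequence[str] = ("gas",),  # default: gas data channel only
) -> List[float]:
    """Concatenate the selected channels of one sample into one vector."""
    features: List[float] = []
    for name in channels:
        if name not in CHANNEL_SIZES:
            raise KeyError(f"unknown data channel: {name}")
        values = list(sample[name])
        if len(values) != CHANNEL_SIZES[name]:
            raise ValueError(
                f"channel {name!r} expects {CHANNEL_SIZES[name]} data points"
            )
        features.extend(values)
    return features


if __name__ == "__main__":
    sample = {
        "gas": [float(i) for i in range(10)],
        "humidity": [45.0],
        "temperature": [22.5],
        "pressure": [1013.0],
    }
    print(len(build_feature_vector(sample)))                       # 10
    print(len(build_feature_vector(sample, ("gas", "humidity"))))  # 11
```

Training once with the default selection and once with an extra channel, then comparing the results, is one way to notice when a model relies mainly on the added channel (for example humidity) rather than on the gas data.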