Tool to calculate the Shannon index. The Shannon index is a measure of entropy for characters strings (or any computer data)

Shannon Index - dCode

Tag(s) : Informatics

dCode is free and its tools are a valuable help in games, maths, geocaching, puzzles and problems to solve every day!

A suggestion ? a feedback ? a bug ? an idea ? *Write to dCode*!

Tool to calculate the Shannon index. The Shannon index is a measure of entropy for characters strings (or any computer data)

**Shannon**'s entropy index, as the name suggests, is a measure of the entropy that applies to any numerical data. It measures the frequencies of appearance of the characters, and the more they are different, the more difficult it will be to predict the content (thus a greater uncertainty, and thus a greater entropy).

For a string of characters with $ n $ distinct characters, each character having a frequency of $ p $. The entropy of **Shannon** $ H $ is calculated according to the formula $$ H = -\sum_{i=1}^n p_i \log_2 p_i $$

__Example:__ DCODE has 5 characters including 4 distinct characters, the letter D appears 2 times (frequency: 2/5), and the 3 letters C, O and E each appear 1 time (frequency: 1/5), the calculation is: $ H = -\left( \frac{2}{5} \log_2{\frac{2}{5}} + 3 \times \frac{1}{5} \log_2{\frac{1}{5}} \right) \approx 1.921928 $

The value is always positive, the logarithms of numbers less than 1 are always negative, their sum too, the sign - makes it possible to obtain a positive result.

From the **Shannon** index, the optimal encoding of a string can be deduced. If the **Shannon** index of a string is 3.5, then it will take 4 bits (rounded up) by characters to encode it optimally. The **Shannon** index can then be useful for evaluating a compression ratio, the higher the entropy, the better the compression.

dCode retains ownership of the online 'Shannon Index' tool source code. Except explicit open source licence (indicated CC / Creative Commons / free), any algorithm, applet or snippet (converter, solver, encryption / decryption, encoding / decoding, ciphering / deciphering, translator), or any function (convert, solve, decrypt / encrypt, decipher / cipher, decode / encode, translate) written in any informatic language (PHP, Java, C#, Python, Javascript, Matlab, etc.) no data, script or API access will be for free, same for Shannon Index download for offline use on PC, tablet, iPhone or Android !

Please, check our community Discord for help requests!

shannon,index,entropy,string,character,encoding,compression

Source : https://www.dcode.fr/shannon-index

© 2021 dCode — The ultimate 'toolkit' to solve every games / riddles / geocaching / CTF.

Feedback

▲