Abstract:
An approach to the information analysis is considered for the case when the information is presented by words of finite length over a finite alphabet. A method of generating a measure of symbolic diverseness of words based on peak characteristics of a shift entropy function is proposed. The shift entropy function is formally defined using a unit translation operator and the entropy of discrete distributions. A model example is presented together with some results of application of the proposed measure in the clustering of families of plants using the analysis of genome of their representatives.
Key words:
shift entropy, measure of symbolic diverseness, clustering of plant genomes.
Citation:
Yu. G. Smetanin, M. V. Ulyanov, A. S. Pestova, “Entropy approach to the construction of a measure of word symbolic diverseness and its application to clustering of plant genomes”, Mat. Biolog. Bioinform., 11:1 (2016), 114–126
\Bibitem{SmeUlyPes16}
\by Yu.~G.~Smetanin, M.~V.~Ulyanov, A.~S.~Pestova
\paper Entropy approach to the construction of a measure of word symbolic diverseness and its application to clustering of plant genomes
\jour Mat. Biolog. Bioinform.
\yr 2016
\vol 11
\issue 1
\pages 114--126
\mathnet{http://mi.mathnet.ru/mbb254}
\crossref{https://doi.org/10.17537/2016.11.114}
Linking options:
https://www.mathnet.ru/eng/mbb254
https://www.mathnet.ru/eng/mbb/v11/i1/p114
This publication is cited in the following 3 articles:
V. D. Gusev, L. A. Miroshnichenko, “Slozhnost DNK-posledovatelnostei. Razlichnye podkhody i opredeleniya”, Matem. biologiya i bioinform., 15:2 (2020), 313–337
G. N. Zhukova, Yu. G. Smetanin, M. V. Uljanov, “Informative symbolic representations as a way to qualitatively analyze time series”, 2019 International Conference on Engineering Technologies and Computer Science (Ent): Innovation & Application, ed. S. Prokhorov, IEEE, 2019, 43–47
Mikhail V Ulyanov, Yuri G Smetanin, Mikhail M Shulga, Andrei V Eserkepov, Yuri Yu Tarasevich, “Characterisation of diffusion-driven self-organisation of rodlike particles by means of entropy of generalised two-dimensional words”, J. Phys.: Conf. Ser., 1141 (2018), 012137