This paper addresses the classification task of data mining (a form of supervised learning) in the context of an important
bioinformatics problem, namely the prediction of protein functions. This problem is cast as a hierarchical classification
problem, where the protein functions to be predicted correspond to classes that are arranged in a hierarchical structure,
in the form of a class tree. The main contribution of this paper is to propose a new Artificial Immune System that creates
a new representation for proteins, in order to maximize the predictive accuracy of a hierarchical classification algorithm
applied to the corresponding protein function prediction problem.
Keywords artificial immune systems - data mining - bioinformatics - classification - clustering