The Wikipedia XML documents are considered an interesting challenge to any XML retrieval system that is capable of indexing
and retrieving XML without prior knowledge of the structure. Although the structure of the Wikipedia XML documents is highly
irregular and thus unpredictable, EXTIRP manages to handle all the well-formed XML documents without problems. Whether the
high flexibility of EXTIRP also implies high performance concerning the quality of IR has so far been a question without definite
answers. The initial results do not confirm any positive answers, but instead, they tempt us to define some requirements for
the XML documents that EXTIRP is expected to index. The most interesting question stemming from our results is about the line
between high-quality XML markup which aids accurate IR and noisy “XML spam” that misleads flexible XML search engines.