Patent US20080275870 - Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance
http://www.google.ca/patents/US20080275870?utm_source=gb-gplus-share

A computer-readable medium comprises a data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity...
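
As an illustration of the kind of structure the abstract describes (one similarity value per pair of N documents), the following is a minimal sketch in Python. It is not the structure claimed in the patent, whose details are truncated above. It assumes a generic upper-triangular layout in which each unordered pair (i, j) with i < j gets exactly one entry, so N documents need N(N-1)/2 values rather than N². The class name `PairwiseSimilarity` and its methods are hypothetical.

```python
class PairwiseSimilarity:
    """Hypothetical store holding one similarity value per unordered document pair.

    Assumption: similarity is symmetric, so only pairs (i, j) with i < j are kept,
    flattened row by row into a single list of length n * (n - 1) / 2.
    """

    def __init__(self, n):
        self.n = n
        self.values = [0.0] * (n * (n - 1) // 2)

    def _index(self, i, j):
        """Map a pair of distinct document numbers to its slot in the flat list."""
        if i == j:
            raise ValueError("a document is not paired with itself")
        if i > j:
            i, j = j, i  # order does not matter for a symmetric measure
        if i < 0 or j >= self.n:
            raise IndexError("document number out of range")
        # Entries in the rows before row i, plus the offset within row i.
        return i * self.n - i * (i + 1) // 2 + (j - i - 1)

    def set(self, i, j, value):
        self.values[self._index(i, j)] = value

    def get(self, i, j):
        return self.values[self._index(i, j)]


if __name__ == "__main__":
    sims = PairwiseSimilarity(4)   # 4 documents, 6 stored pairs
    sims.set(2, 0, 0.75)           # stored under the pair (0, 2)
    print(sims.get(0, 2))
    print(len(sims.values))
```

Storing only one triangle of the matrix halves the memory compared with a full N×N table and makes lookups symmetric by construction. Whether the patent uses this layout or a different compaction scheme cannot be determined from the truncated abstract.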