Inventors:
Hinrich Schuetze - San Francisco CA
James E. Pitkow - Palo Alto CA
Peter L. Pirolli - San Francisco CA
Ed H. Chi - Palo Alto CA
Jun Li - Seattle WA
Assignee:
Xerox Corporation - Stamford CT
International Classification:
G06F 700
US Classification:
707 2, 707 3, 707 4, 707 5, 707 10, 709203
Abstract:
A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.