Inventors:
Sanjay G. Mahadi - Dublin CA, US
Richard V. Rifredi - Los Gatos CA, US
International Classification:
G06F 16/903
G06N 20/00
G06F 16/16
G06F 40/58
Abstract:
A method and system for refactoring document content and deriving relationships therefrom are described. For each page of a document to be processed, a processing engine processes a page of the document to create a summary and metadata relating to the page, determines a keyphrase relating to the summary, generates links to other content based on the keyphrase, and stores the summary, the keyphrase, the links, and the metadata. A search engine processes a search term, retrieves a page of a document containing the search term, and returns only the page that contains the search term and not the entire document that contains the search term.