WO2000008572A1 - Search and index hosting system - Google Patents
Search and index hosting system Download PDFInfo
- Publication number
- WO2000008572A1 WO2000008572A1 PCT/US1999/017359 US9917359W WO0008572A1 WO 2000008572 A1 WO2000008572 A1 WO 2000008572A1 US 9917359 W US9917359 W US 9917359W WO 0008572 A1 WO0008572 A1 WO 0008572A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- database
- documents
- user
- code
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/338—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/34—Browsing; Visualisation therefor
- G06F16/345—Summarisation for human users
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
Definitions
- the present invention relates to a system for searching a remote database, and more particularly to a system which indexes documents in the database, which identifies documents in the index that include user-specified data, and which outputs a list of documents that contain such data and, optionally, excerpts from those documents.
- the invention has particular utility in connection with text indexing and retrieval systems, such as World Wide Web search engines.
- search engines search through a database for specific data and retrieve titles of documents in the database which contain that data.
- World Wide Web search engines such as
- AltavistaTM and Yahoo! ® provide users with the ability to search the Web for documents containing user-specified words, phrases, or the like.
- conventional search engines, and Web search engines in particular, suffer from a drawback in that they do not allow a user to direct a search to a single database.
- conventional Web search engines operate by generating an index for all sites on the Web, and then retrieving data from that index in response to user queries. Since these search engines generate an index for all sites on the Web, however, they are limited to searching the entire Web. This is disadvantageous, particularly for those users who only want to search specific sites.
- software manufacturers have developed site-specific searching systems, such as UltraseekTM, which ostensibly allow users to limit their searches to specific Web sites. To use these systems, however, they must be installed at each Web site, e.g., by the Web site's provider. Once installed, the systems create an index of the Web site at the provider's location.
- conventional database searching systems provide the user with only names/titles of documents in response to a query.
- conventional Web search engines provide only the name of a document containing a search term, together with a uniform resource locator ("URL") for that document.
- URL uniform resource locator
- a searching system which provides the user with the ability to search a portion of a database, such as one or more sites on the World Wide Web, and which is more cost effective, efficient, and easy to use than the conventional systems described above.
- a database searching system which is able to provide a user with the context of each search term in documents retrieved as a result of the search.
- the present invention addresses the foregoing needs by providing a way to search through a database at one network site (e.g., a Web site) using a host computer which is at another network site.
- a Web site provider for example, is able to create a search engine for the Web site simply by accessing the present invention via the Web and entering a request for a new account.
- the invention assigns the Web site a provider identifier, and then extracts URL(s) from the Web site.
- the invention "crawls" through the site in order to create an index of the site, which comprises data from the site indexed by document (e.g., Web page) and provider identifier.
- document e.g., Web page
- provider identifier e.g., the site provider need simply copy a few (e.g., 10) lines of code into any sites for which searching capabilities are desired.
- each time the Web site is visited it will automatically transmit its provider identifier to the visiting user's site.
- the Web site will display a search line, from which the visiting user may enter queries to search the site for specific data.
- the query is passed from the user's site to the host computer's site, where the actual searching takes place.
- an index corresponding to the provider identifier is retrieved from memory and searched for the data specified in the user's query. Thereafter, a list of documents which contain the data (including
- URLs in the case of the Web is output from the host computer's site to the user's site and displayed there.
- the present invention By conducting the search at the host computer's site, rather than at the Web site itself, the present invention reduces the difficulties involved with installing and maintaining an entire software application at the Web site. As a result, the present invention provides a way to search specified Web sites (and other types of databases as well), which is more efficient and less costly and time consuming than the conventional site-specific searching systems described above.
- the present invention is a system (i.e., a method, an apparatus, and computer-executable process steps) for initiating a search at a first network site for user-specified data in a remote database at a second network site and for conducting the search at a third network site (e.g., at a host computer's site).
- the system receives, at the first network site, a provider identifier associated with the database from the second network site.
- the user-specified data is input at the first network site, following which the user-specified data and the provider identifier are output from the first network site to the third network site.
- the system searches for the user-specified data in a database at the third network site using the provider identifier.
- this database at the third network site includes data that corresponds to data stored in the remote database at the second network site.
- the present invention is a way to configure a computerized searching system (such as the searching system resident at the host computer's site described above) so that the searching system can be used to search a database.
- information identifying the database is input, a provider identifier is assigned to the database, and a search through the database is conducted using the input information in order to identify locations of documents in the database.
- the locations of the documents in the database are stored in memory together with the provider identifier, and the documents in the database are indexed.
- An index of the documents is then stored in memory together with the provider identifier; and data corresponding to data in the database is also stored in memory together with the provider identifier.
- the provider identifier is then output to the database. As noted above, this provider identifier is transmitted to those who visit the site.
- the present invention is a system for identifying which documents in a database contain user-specified data.
- the system stores, in memory, indices of data in plural databases, such as those noted above.
- the system receives the user-specified data and a provider identifier which corresponds to one of the plural databases, and retrieves, from memory, an index of data for a database that corresponds to the provider identified. Thereafter documents in the retrieved index that contain the user-specified data are identified, and identities thereof are output to the user.
- the present invention also provides an optional feature for displaying excerpts from documents identified by a database search.
- the index of documents in the database is stored, and pointers to data segments in the database are generated based on the index.
- These data segments comprise target data together with data surrounding the target data. That is, assuming that the target data comprises a word which matches an input user query, the data segment for that word might comprise, e.g., five words to the left of the word, the word itself, and five words to the right of the word.
- the invention searches the index for the word, it compiles a list of pointers to data segments which include the word. These data segments may then be extracted and passed to a user's site for display along with the list of documents.
- the invention is a system for retrieving a list of documents in a database which include user-specified data, and of retrieving one or more data segments from each document on the list.
- the system includes storing an index of documents from the database, the index including pointers corresponding to data in the database, where the pointers define data segments having a predetermined size.
- One or more documents in the database that contain the user- specified data are then identified based on the index; and a list is created which includes one or more pointers corresponding to each occurrence of the user-specified data in the identified documents.
- Each data segment in the database that contains the user-specified data is extracted based on the list of pointers created in the creating step, whereafter a list of the documents in the database that contain the user-specified data is output, together with the extracted data segments.
- the invention makes it possible to display data excerpts (i.e. , segments) from each document found in the search. A user may then refer to these excerpts in order to determine whether each document is relevant, instead of actually opening the document.
- the present invention facilitates database searching.
- Figure 1 is a perspective view of a host computer system, including a personal computer and a query server, which is used in the present invention.
- Figure 2 shows the architecture of the personal computer shown in Figure 1.
- Figure 3 shows the architecture of the query server shown in Figure 1.
- Figure 4 is a flow diagram showing process steps for setting- up the search hosting system of the present invention on a Web site.
- FIG 5 shows the software architecture used to implement the process steps shown in Figure 4.
- Figures 6A and 6B show a representative new account form used in setting-up the search hosting system of the present invention.
- Figure 7 shows an example of a Web site which does not include the search hosting system of the present invention.
- Figure 8 shows an example of a Web site which includes the search hosting system of the present invention.
- Figure 9 shows the software architecture of the search hosting system of the present invention.
- Figures 10A and 10B show process steps implemented by the software architecture of Figure 9 to perform a search of a Web site at a host computer system.
- Figure 11 shows operation of a concordance generator included in the software architecture of Figure 9.
- Figure 12 shows an example of a results page generated by the present invention.
- FIG. 1 shows a representative embodiment of a host computer system which can be used to implement the present invention.
- host computer system 1 includes PC 2 and query server 4.
- PC 2 includes a local area network connection 5 for interfacing to query server 4, a general network connection 6 for interfacing to a network such as the Internet, and fax/modem connection 7 for interfacing with other remote sources.
- PC 2 also includes display screen 9 for displaying information, keyboard 10 for inputting text and user commands, mouse 11 for positioning a cursor on display screen 9 and for inputting user commands, disk drive 12 for reading from and writing to floppy disks installed therein, and CD-ROM drive 14 for accessing information stored on CD-ROM.
- PC 2 may also have one or more peripheral devices (not shown) attached thereto.
- FIG. 2 shows the internal structure of PC 2.
- PC 2 includes memory 15, which comprises one or more computer- readable media, such as a computer hard disk.
- Memory 15 stores data 16, applications, and an operating system 17.
- page generator 19 the function of which is described in detail below.
- display interface 20 keyboard interface 21, mouse interface 22, disk drive interface 24, CD-ROM drive interface 25, computer bus 26, RAM 27, and processor 29.
- Processor 29 preferably comprises a microprocessor or the like for executing applications out of RAM 27.
- these applications may be stored in memory 15 or, alternatively, on a floppy disk in disk drive 12 or a CD-ROM in CD-ROM drive 14.
- processor 29 accesses applications (or other data) stored on a floppy disk via disk drive interface 24 and accesses applications (or other data) stored on a CD-ROM via CD-ROM drive interface 25.
- Application execution and other tasks of PC 2 and query server 4 may be controlled and/or altered using keyboard 10 or mouse 11, commands from which are transmitted to processor 29 via keyboard interface 21 and mouse interface 22, respectively.
- Output results from some applications running on PC 2 may be processed by display interface 20 and then displayed to a user on display 9. To this end, display interface
- results page generator 19 receive their inputs from, and/or provide outputs to, network interfaces 5 and 6.
- Figure 3 shows a representative architecture for query server 4.
- query server 4 includes processor 30, RAM 31, computer bus 32, and memory 34.
- Memory 34 preferably comprises one or more computer-readable media, such as a computer hard disk.
- Memory 34 stores data and applications, with the applications including Web robot
- Search and index hosting application 39 comprises computer-executable process steps to generate and execute a database-specific search engine A detailed description of these process steps is provided below.
- search engine 36 and indexer module 37 are based upon the search engine and indexer described in "The SMART Retrieval System: Experiments in Automatic
- Web robot 35 can comprise any commercially available Web robot.
- search and index hosting application 39 is executed by processor 30 out of RAM 31, and results therefrom are output to host computer 2 via network connection 5.
- search and index hosting application 39 comprises a database-specific search engine, in that it identifies which documents in a database contain a user- specified query term. The application then retrieves a list of such documents and, optionally, one or more data segments from each document on the list.
- the database searched by application 39 is a Web site and the documents comprise one or more Web pages.
- the invention is not limited to searching Web sites and, in fact, can be used to search a variety of different types of databases.
- Figures 4 and 5 show how this is done. More specifically, as shown in Figure 5, a Web site provider's representative, such as their system administrator, issues request 40, over network 41 (e.g. , the Internet), to a host computer system running search and index hosting application 39. In response, search and index hosting application 39 retrieves Web page forms required to set up a new account from storage 42, and transmits these forms back to the provider's representative, where the forms are displayed on the representative's computer. As shown in Figure 5, displayed form
- contact information e.g., information regarding the provider's Web site
- entry point URLs for the site. This contact information may be input in step S401 of Figure 4.
- FIGS 6A and 6B show a representative example of a new account form generated by a preferred embodiment of the present invention, which will be sold under the tradename PicoSearchTM.
- new account form 45 requests information regarding the provider's representative, such as their name, E-mail address, and account password, together with three entry point URL's to the provider's Web site.
- the invention is not limited to using three entry point
- the preferred embodiment of the invention also provides a way in which to control searching throughout a specified site. For example, in this embodiment, it is possible to restrict searching to the directory level, the server level, or the domain level. Of course, the invention is not limited to the restrictions shown in Figure 6B, and can be modified to include other searching options as well.
- the "filled-out" form is transmitted over the network to the host computer system. There, contact information
- entry point URLs 48 are extracted from the new account form, as shown in Figure 5.
- the contact information i.e. , the provider representative's name, E-mail address, etc.
- user database 49 which may reside on memory 34 or any other memory on the host computer system.
- Search and index hosting application 39 then assigns the provider representative a new provider identifier 50 in step S402.
- This provider identifier is then combined with entry point URLs 48, the combination of which is provided to Web robot 35.
- Web site provider 52 provides data from its Web site (e.g. , plural Web documents/pages) to host computer system 1 via network 41.
- Web robot 35 searches through the data from the Web site for locations (i.e., URLs) and titles of documents therein, starting at the entry point URLs specified in the new account form.
- step S404 the location and title of each Web document that is found in the search is stored in user database 49, together with the provider identifier and a predetermined document identifier for each document.
- the use of these document identifiers is described below.
- indexer module 37 indexes data stored in documents on the Web site. More specifically, indexer module 37 reviews each of Web documents 55 identified by Web robot 35, and creates an index of pointers to data in the Web site, where the data is indexed according to document, location in the document, and provider identifier. This index, together with the provider identifier and predetermined document identifiers, is then stored in multiple index storage 56 in step S406.
- multiple index storage 56 resides in memory 34 of query server 4; although the invention is not limited to storing multiple index storage 56 there.
- indexer module 37 compresses data from the Web site and stores that data in compressed document full text storage 57, together with the provider identifier for the site.
- storage 57 can reside in memory 34; although the invention is not limited to this.
- indexer 37 outputs processing results 59, including the provider identifier, to the provider's representative, as shown in Figure 5.
- Figure 5 shows the provider's representative at a location which is different from that of the Web site provider, this need not be the case. That is, the provider's representative and the provider may also be at the same location.
- FIG. 7 shows an example of a Web site 60 which does not include the site-specific searching capabilities of the present invention
- Figure 8 shows that same Web site with the site-specific searching capabilities.
- the site with the searching capabilities includes entry line 61 and search button 62. To search for a text or the like at Web site 60, all that must be done is to input the text on the entry line, and click on the search button The searching process of the present invention is described below with respect to the remaining figures.
- the invention has been described with respect to only one Web site provider, the invention is not limited to use with only one provider.
- the invention is able to handle searches of any number of Web sites. That is, any number of Web site providers' representatives may set up accounts with the host computer system.
- the host computer system may store indices of data from plural databases (e.g., plural Web sites), locations of documents in those plural databases, and compressed text from the plural databases.
- the invention also stores with each of these a provider identifier, the invention is able to select the appropriate data readily, and thus to perform site-specific searches. This feature of the invention is described more fully below.
- a user initiates a search at first network site 64 (e.g. , the user's computer) for data in a remote database at second network site 65 (e.g., an Internet site or Web site server) and the invention conducts the search at third network site 66 (e.g. , the host computer system).
- second network site 65 e.g., an Internet site or Web site server
- third network site 66 e.g. , the host computer system.
- Figure 9 shows the three network sites 64, 65 and 66 at three different locations (i.e., at three computer systems) on the network, these network sites may reside at the same network location or, alternatively, at only two different network locations.
- first network site 64 receives a provider identifier associated with the database from second network site 65 and then outputs a query containing user-specified data and the provider identifier from first network site 64 to third network site 66.
- Third network site 66 searches for the user-specified data in a database at the third network site using the provider identifier.
- This database at third network site 66 includes data that corresponds to data stored in the remote database at the second network site.
- Web site 65 automatically provides the user's computer with its provider identifier 67, together with Web page(s) 68 from the site.
- An example of such a page is shown in Figure 8.
- the user may enter a query 71 comprised, e.g., of user-specified data such as character strings, text, or the like, into the entry line provided with the Web page.
- the query and the provider identifier are then packetized and packet 72 is passed, via network 69 (e.g., the Internet), to the host computer (i.e., PC 2).
- the host computer receives this packet in step SI 001 of Figure 10A, extracts the provider identifier and data from the query, and passes this information on to query server 4 via local network 70 (i.e. , over local network connection 5 shown in Figure 1).
- index retriever 74 uses provider identifier 67 to retrieve, from multiple index storage 56, an index of data that has the same provider identifier as the Web site. This index with the same provider identifier, labeled 75 in Figure 9, is then passed to search engine 36, along with query 71.
- search engine 36 searches through index 75 for data specified in query 71 in order to identify documents that contain the specified data.
- the search engine retrieves identifiers 76 (e.g., numbers) for these documents from the index. Thereafter, processing proceeds to step S1004.
- Step SI 004 determines whether data segments in each identified document should be output along with a list of identified documents. In a case that step SI 004 determines that no data segments should be output, processing proceeds to step S1005 of Figure 10B. As shown in Figure 9, this processing corresponds to dashed line 77, which bypasses concordance generator 78 (described below). Thus, in step S1005, search and index hosting application 39 extracts locations (e.g.,
- results page generator 19 generates a results page showing titles of, and
- This results page 80 is then passed to the user's computer via network 69, where it is displayed.
- this results page also includes an entry line, such as entry line 81 shown in Figure 9, from which the user may initiate a new search of the
- the present invention includes the capability of also outputting, from each document on the list, a text or data segment which contains the user- specified data.
- These segments comprise target data (i.e. , the user- specified data) together with data adjacent to the target data. That is, assuming that the target data comprises a word which matches the user- specified data, the data segment for that word might comprise, e.g., five words to the left of the word, the word itself, and five words to the right of the word.
- the invention is not limited to using eleven-word data segments, meaning that any size data segment may be used.
- the size and/or locations of the data segments relative to the user-specified data are settable by the user, and thus can be varied as desired.
- step SI 004 concordance generator 78 searches the documents identified in step S1003 for locations of the user-specified data.
- Figure 11 is a close-up view of the operation of concordance generator 78. More specifically, as shown in Figure 11, in step SI 007 concordance generator 78 performs a boolean search through documents 76 for data that matches data specified in query 71. When matching data is found, step SI 008 generates pointers into the corresponding compressed data based on pointers to locations in the index. The generated pointers identify the user-specified data in the compressed data, together with a predetermined amount of additional adjacent data needed to make up a data segment.
- concordance generator 78 extracts the provider identifier for the current Web site from index 75, and selects and retrieves compressed data 82 for that Web site from full compressed text storage 57. Thereafter, in step S1010, concordance generator 78 decompresses selected data segments that correspond to the pointers generated in step SI 008. In preferred embodiments of the invention, only compressed data corresponding to the selected data segments is decompressed. As noted in Figure 11, a data segment may start m characters before the query term and end n characters after the query term - only this segment would be decompressed. Thereafter, arrays
- concordance strings i.e., decompressed data segments
- concordance generator 78 outputs concordance generator 78, together with corresponding document identifiers specifying the document from which each string originated.
- Results page generator 19 uses the decompressed data segments to formulate its results page. In addition, results page generator 19 extracts document identifiers
- step S1011 query server 4 extracts locations (e.g., URLs) and titles of documents from user database 49 that match the document identifiers, and passes this information back to results page generator 19.
- Results page generator 19 formulates a results page which includes document titles, data segments, URLs, and the original query term, and passes the results page, together with the provider identifier, back to the user in step S1012.
- the results page is then displayed on the user's computer.
- the results page may include an entry line, from which the user may conduct another search of the Web site specified by the provider identifier. If no additional search is to be conducted, processing ends. Otherwise, the foregoing process is repeated for the new search.
- Figure 12 is an example of a results page generated by the preferred embodiment of the present invention. Specifically, this results page was generated from a search of the Web site shown in Figure 8, in which the user-specified character string was "labor and employment". As shown in Figure 12, the results page includes document titles 86a, 86b and 86c, and corresponding URLs 87a, 87b and 87c for each document title. In addition, the results page shown in Figure 12 includes excerpts 88a, 88b and 88c from each document, which include the user-specified character string. As noted above, however, this feature of the invention is optional, and the results page need not include these excerpts. As also shown in Figure 12, the results page includes entry line 89, whereby the user may conduct a new search of the Web site shown in Figure 8 for a new search term.
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP99938912A EP1101175A1 (en) | 1998-08-06 | 1999-08-02 | Search and index hosting system |
CA002339543A CA2339543A1 (en) | 1998-08-06 | 1999-08-02 | Search and index hosting system |
AU53294/99A AU5329499A (en) | 1998-08-06 | 1999-08-03 | Search and index hosting system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/130,420 | 1998-08-06 | ||
US09/130,420 US6336116B1 (en) | 1998-08-06 | 1998-08-06 | Search and index hosting system |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2000008572A1 true WO2000008572A1 (en) | 2000-02-17 |
Family
ID=22444614
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1999/017359 WO2000008572A1 (en) | 1998-08-06 | 1999-08-02 | Search and index hosting system |
Country Status (5)
Country | Link |
---|---|
US (4) | US6336116B1 (en) |
EP (1) | EP1101175A1 (en) |
AU (1) | AU5329499A (en) |
CA (1) | CA2339543A1 (en) |
WO (1) | WO2000008572A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2807537A1 (en) * | 2000-04-06 | 2001-10-12 | France Telecom | HYPERMEDIA RESOURCE SEARCH ENGINE AND INDEXING METHOD THEREOF |
WO2001082129A1 (en) * | 2000-04-27 | 2001-11-01 | Mondosoft A/S | Local search via global search |
WO2002008940A2 (en) * | 2000-07-20 | 2002-01-31 | Johnson Rodney D | Information archival and retrieval system for internetworked computers |
Families Citing this family (63)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6336116B1 (en) * | 1998-08-06 | 2002-01-01 | Ryan Brown | Search and index hosting system |
US6745177B2 (en) * | 1999-04-09 | 2004-06-01 | Metro One Telecommunications, Inc. | Method and system for retrieving data from multiple data sources using a search routing database |
JP2001084169A (en) * | 1999-09-10 | 2001-03-30 | Ricoh Co Ltd | Document database access device |
US6823330B1 (en) * | 1999-11-16 | 2004-11-23 | Gateway, Inc. | Site home navigation control |
WO2001037134A1 (en) | 1999-11-16 | 2001-05-25 | Searchcraft Corporation | Method for searching from a plurality of data sources |
US6625594B1 (en) * | 2000-01-18 | 2003-09-23 | With1Click, Inc. | System and method for searching a global communication system using a sub-root domain name agent |
US6757669B1 (en) * | 2000-01-28 | 2004-06-29 | Xerox Corporation | Self-contained indexing system for an intranet |
EP1269347A1 (en) * | 2000-03-31 | 2003-01-02 | Kapow APS | Method of retrieving attributes from at least two data sources |
US6983320B1 (en) * | 2000-05-23 | 2006-01-03 | Cyveillance, Inc. | System, method and computer program product for analyzing e-commerce competition of an entity by utilizing predetermined entity-specific metrics and analyzed statistics from web pages |
US7013323B1 (en) * | 2000-05-23 | 2006-03-14 | Cyveillance, Inc. | System and method for developing and interpreting e-commerce metrics by utilizing a list of rules wherein each rule contain at least one of entity-specific criteria |
US7003517B1 (en) | 2000-05-24 | 2006-02-21 | Inetprofit, Inc. | Web-based system and method for archiving and searching participant-based internet text sources for customer lead data |
US7082427B1 (en) * | 2000-05-24 | 2006-07-25 | Reachforce, Inc. | Text indexing system to index, query the archive database document by keyword data representing the content of the documents and by contact data associated with the participant who generated the document |
US7096220B1 (en) | 2000-05-24 | 2006-08-22 | Reachforce, Inc. | Web-based customer prospects harvester system |
US7120629B1 (en) | 2000-05-24 | 2006-10-10 | Reachforce, Inc. | Prospects harvester system for providing contact data about customers of product or service offered by business enterprise extracting text documents selected from newsgroups, discussion forums, mailing lists, querying such data to provide customers who confirm to business profile data |
US6823491B1 (en) * | 2000-08-31 | 2004-11-23 | International Business Machines Corporation | System and method for a dynamically integrated search engine |
US6661432B1 (en) * | 2000-08-31 | 2003-12-09 | International Business Machines Corporation | System and method for dynamically searching an identified web site |
US7043531B1 (en) | 2000-10-04 | 2006-05-09 | Inetprofit, Inc. | Web-based customer lead generator system with pre-emptive profiling |
US7330850B1 (en) | 2000-10-04 | 2008-02-12 | Reachforce, Inc. | Text mining system for web-based business intelligence applied to web site server logs |
US20020178213A1 (en) * | 2001-04-11 | 2002-11-28 | Parry John Chad | Remote URL munging |
US20020178186A1 (en) * | 2001-04-11 | 2002-11-28 | Parry John Chad | Remote URL munging business method |
US6934702B2 (en) * | 2001-05-04 | 2005-08-23 | Sun Microsystems, Inc. | Method and system of routing messages in a distributed search network |
US6961723B2 (en) * | 2001-05-04 | 2005-11-01 | Sun Microsystems, Inc. | System and method for determining relevancy of query responses in a distributed network search mechanism |
US7099871B2 (en) * | 2001-05-04 | 2006-08-29 | Sun Microsystems, Inc. | System and method for distributed real-time search |
US6950821B2 (en) * | 2001-05-04 | 2005-09-27 | Sun Microsystems, Inc. | System and method for resolving distributed network search queries to information providers |
US6820078B2 (en) * | 2002-05-09 | 2004-11-16 | Kevin Nip | Method and system for the automated collection and display of database information via a communications network |
US7496636B2 (en) * | 2002-06-19 | 2009-02-24 | International Business Machines Corporation | Method and system for resolving Universal Resource Locators (URLs) from script code |
US6983273B2 (en) * | 2002-06-27 | 2006-01-03 | International Business Machines Corporation | Iconic representation of linked site characteristics |
US20040207659A1 (en) * | 2003-04-02 | 2004-10-21 | International Business Machines Corporation | Program creation by combining web services using graphic user interface controls |
US20050256868A1 (en) * | 2004-03-17 | 2005-11-17 | Shelton Michael J | Document search system |
US20060072723A1 (en) * | 2004-10-05 | 2006-04-06 | Chung Michael M | Method and system for managing folders of email accounts and voice messages |
US7792811B2 (en) * | 2005-02-16 | 2010-09-07 | Transaxtions Llc | Intelligent search with guiding info |
US8131736B1 (en) | 2005-03-01 | 2012-03-06 | Google Inc. | System and method for navigating documents |
US20060235830A1 (en) * | 2005-04-14 | 2006-10-19 | Microsoft Corporation | Web content administration information discovery |
US8533199B2 (en) | 2005-12-14 | 2013-09-10 | Unifi Scientific Advances, Inc | Intelligent bookmarks and information management system based on the same |
US9767184B2 (en) * | 2006-03-14 | 2017-09-19 | Robert D. Fish | Methods and apparatus for facilitating context searching |
US20080015878A1 (en) * | 2006-07-17 | 2008-01-17 | Yahoo! Inc. | Real-time user profile platform for targeted online advertisement and personalization |
US20080033964A1 (en) * | 2006-08-07 | 2008-02-07 | Bea Systems, Inc. | Failure recovery for distributed search |
US20080033943A1 (en) * | 2006-08-07 | 2008-02-07 | Bea Systems, Inc. | Distributed index search |
US20080033910A1 (en) * | 2006-08-07 | 2008-02-07 | Bea Systems, Inc. | Dynamic checkpointing for distributed search |
US9015197B2 (en) * | 2006-08-07 | 2015-04-21 | Oracle International Corporation | Dynamic repartitioning for changing a number of nodes or partitions in a distributed search system |
US20080033925A1 (en) * | 2006-08-07 | 2008-02-07 | Bea Systems, Inc. | Distributed search analysis |
US20080033958A1 (en) * | 2006-08-07 | 2008-02-07 | Bea Systems, Inc. | Distributed search system with security |
US7725470B2 (en) * | 2006-08-07 | 2010-05-25 | Bea Systems, Inc. | Distributed query search using partition nodes |
US8014308B2 (en) * | 2006-09-28 | 2011-09-06 | Microsoft Corporation | Hardware architecture for cloud services |
US20080319991A1 (en) * | 2007-01-08 | 2008-12-25 | Lsr Technologies | System for Searching Network Accessible Data Sets |
US7424471B2 (en) * | 2007-01-08 | 2008-09-09 | Lsr Technologies | System for searching network accessible data sets |
WO2009102765A2 (en) * | 2008-02-11 | 2009-08-20 | Nuix North America Inc. | Parallelization of electronic discovery document indexing |
US9785700B2 (en) | 2008-02-11 | 2017-10-10 | Nuix Pty Ltd | Systems and methods for load-balancing by secondary processors in parallelized indexing |
US9928260B2 (en) | 2008-02-11 | 2018-03-27 | Nuix Pty Ltd | Systems and methods for scalable delocalized information governance |
US9326843B2 (en) | 2009-01-16 | 2016-05-03 | Claret Medical, Inc. | Intravascular blood filters and methods of use |
WO2010083527A2 (en) | 2009-01-16 | 2010-07-22 | Claret Medical, Inc. | Intravascular blood filter |
US20170202657A1 (en) | 2009-01-16 | 2017-07-20 | Claret Medical, Inc. | Intravascular blood filters and methods of use |
WO2011017103A2 (en) | 2009-07-27 | 2011-02-10 | Claret Medical, Inc. | Dual endovascular filter and methods of use |
US8876796B2 (en) | 2010-12-30 | 2014-11-04 | Claret Medical, Inc. | Method of accessing the left common carotid artery |
CN104035955B (en) * | 2014-03-18 | 2018-07-10 | 北京百度网讯科技有限公司 | searching method and device |
US10826930B2 (en) | 2014-07-22 | 2020-11-03 | Nuix Pty Ltd | Systems and methods for parallelized custom data-processing and search |
US11200249B2 (en) | 2015-04-16 | 2021-12-14 | Nuix Limited | Systems and methods for data indexing with user-side scripting |
EP3700464B1 (en) | 2017-10-27 | 2024-02-14 | Boston Scientific Scimed, Inc. | Systems for protecting the cerebral vasculature |
US20190171749A1 (en) * | 2017-12-04 | 2019-06-06 | Sap Se | Data integration framework for multiple data providers using service url |
US11154390B2 (en) | 2017-12-19 | 2021-10-26 | Claret Medical, Inc. | Systems for protection of the cerebral vasculature during a cardiac procedure |
EP3784168B1 (en) | 2018-04-26 | 2024-03-20 | Boston Scientific Scimed, Inc. | Systems for protecting the cerebral vasculature |
JP2021535778A (en) | 2018-08-21 | 2021-12-23 | ボストン サイエンティフィック サイムド, インコーポレイテッドBoston Scientific Scimed, Inc. | A system to protect the cerebrovascular system |
US20210192571A1 (en) * | 2019-12-20 | 2021-06-24 | Microsoft Technology Licensing, Llc | Cobranded search experience for publishing websites |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0530993A2 (en) * | 1991-08-16 | 1993-03-10 | Xerox Corporation | An iterative technique for phrase query formation and an information retrieval system employing same |
WO1998007105A1 (en) * | 1996-08-14 | 1998-02-19 | Infoseek Corporation | Real-time document collection search engine with phrase indexing |
US5745890A (en) * | 1996-08-09 | 1998-04-28 | Digital Equipment Corporation | Sequential searching of a database index using constraints on word-location pairs |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2066559A1 (en) * | 1991-07-29 | 1993-01-30 | Walter S. Rosenbaum | Non-text object storage and retrieval |
US5793966A (en) * | 1995-12-01 | 1998-08-11 | Vermeer Technologies, Inc. | Computer system and computer-implemented process for creation and maintenance of online services |
US5732219A (en) * | 1995-03-17 | 1998-03-24 | Vermeer Technologies, Inc. | Computer system and computer-implemented process for remote editing of computer files |
US5748954A (en) * | 1995-06-05 | 1998-05-05 | Carnegie Mellon University | Method for searching a queued and ranked constructed catalog of files stored on a network |
US5649186A (en) * | 1995-08-07 | 1997-07-15 | Silicon Graphics Incorporated | System and method for a computer-based dynamic information clipping service |
US5764906A (en) * | 1995-11-07 | 1998-06-09 | Netword Llc | Universal electronic resource denotation, request and delivery system |
US5974455A (en) * | 1995-12-13 | 1999-10-26 | Digital Equipment Corporation | System for adding new entry to web page table upon receiving web page including link to another web page not having corresponding entry in web page table |
US5778367A (en) * | 1995-12-14 | 1998-07-07 | Network Engineering Software, Inc. | Automated on-line information service and directory, particularly for the world wide web |
US6021433A (en) * | 1996-01-26 | 2000-02-01 | Wireless Internet, Inc. | System and method for transmission of data |
US5920859A (en) * | 1997-02-05 | 1999-07-06 | Idd Enterprises, L.P. | Hypertext document retrieval system and method |
US5745889A (en) * | 1996-08-09 | 1998-04-28 | Digital Equipment Corporation | Method for parsing information of databases records using word-location pairs and metaword-location pairs |
JPH1069423A (en) * | 1996-08-28 | 1998-03-10 | Nec Corp | Hypermedia system and its directory data managing method |
US5978842A (en) * | 1997-01-14 | 1999-11-02 | Netmind Technologies, Inc. | Distributed-client change-detection tool with change-detection augmented by multiple clients |
US5961593A (en) * | 1997-01-22 | 1999-10-05 | Lucent Technologies, Inc. | System and method for providing anonymous personalized browsing by a proxy system in a network |
US6278992B1 (en) * | 1997-03-19 | 2001-08-21 | John Andrew Curtis | Search engine using indexing method for storing and retrieving data |
US5987454A (en) * | 1997-06-09 | 1999-11-16 | Hobbs; Allen | Method and apparatus for selectively augmenting retrieved text, numbers, maps, charts, still pictures and/or graphics, moving pictures and/or graphics and audio information from a network resource |
US5982370A (en) * | 1997-07-18 | 1999-11-09 | International Business Machines Corporation | Highlighting tool for search specification in a user interface of a computer system |
US6044372A (en) * | 1997-07-18 | 2000-03-28 | Dazel Corporation | Method and apparatus for publishing information to a communications network and enabling subscriptions to such information |
US5983216A (en) * | 1997-09-12 | 1999-11-09 | Infoseek Corporation | Performing automated document collection and selection by providing a meta-index with meta-index values indentifying corresponding document collections |
US6085229A (en) * | 1998-05-14 | 2000-07-04 | Belarc, Inc. | System and method for providing client side personalization of content of web pages and the like |
US6070170A (en) * | 1997-10-01 | 2000-05-30 | International Business Machines Corporation | Non-blocking drain method and apparatus used to reorganize data in a database |
US5848410A (en) * | 1997-10-08 | 1998-12-08 | Hewlett Packard Company | System and method for selective and continuous index generation |
US6094649A (en) * | 1997-12-22 | 2000-07-25 | Partnet, Inc. | Keyword searches of structured databases |
US5913210A (en) * | 1998-03-27 | 1999-06-15 | Call; Charles G. | Methods and apparatus for disseminating product information via the internet |
US6256623B1 (en) * | 1998-06-22 | 2001-07-03 | Microsoft Corporation | Network search access construct for accessing web-based search services |
US6336116B1 (en) * | 1998-08-06 | 2002-01-01 | Ryan Brown | Search and index hosting system |
US6345252B1 (en) * | 1999-04-09 | 2002-02-05 | International Business Machines Corporation | Methods and apparatus for retrieving audio information using content and speaker information |
US6345253B1 (en) * | 1999-04-09 | 2002-02-05 | International Business Machines Corporation | Method and apparatus for retrieving audio information using primary and supplemental indexes |
-
1998
- 1998-08-06 US US09/130,420 patent/US6336116B1/en not_active Expired - Lifetime
-
1999
- 1999-08-02 WO PCT/US1999/017359 patent/WO2000008572A1/en not_active Application Discontinuation
- 1999-08-02 EP EP99938912A patent/EP1101175A1/en not_active Withdrawn
- 1999-08-02 CA CA002339543A patent/CA2339543A1/en not_active Abandoned
- 1999-08-03 AU AU53294/99A patent/AU5329499A/en not_active Abandoned
-
2001
- 2001-11-30 US US09/998,940 patent/US6651065B2/en not_active Expired - Lifetime
-
2003
- 2003-02-28 US US10/377,562 patent/US7047246B2/en not_active Expired - Lifetime
-
2006
- 2006-05-09 US US11/430,779 patent/US20060206469A1/en not_active Abandoned
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0530993A2 (en) * | 1991-08-16 | 1993-03-10 | Xerox Corporation | An iterative technique for phrase query formation and an information retrieval system employing same |
US5745890A (en) * | 1996-08-09 | 1998-04-28 | Digital Equipment Corporation | Sequential searching of a database index using constraints on word-location pairs |
WO1998007105A1 (en) * | 1996-08-14 | 1998-02-19 | Infoseek Corporation | Real-time document collection search engine with phrase indexing |
Non-Patent Citations (2)
Title |
---|
K. RAILSBACK: "Compass Serever 3.0", ZDNET REVIEWS, 13 February 1998 (1998-02-13), XP002126467, Retrieved from the Internet <URL:http://www.zdnet.com/products/stories/reviews/0,4161,280094,00.html> [retrieved on 19991221] * |
KOSTER M: "ALIWEB - Archie-like indexing in the WEB", COMPUTER NETWORKS AND ISDN SYSTEMS,NL,NORTH HOLLAND PUBLISHING. AMSTERDAM, vol. 27, no. 2, pages 175-182, XP004037988, ISSN: 0169-7552 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2807537A1 (en) * | 2000-04-06 | 2001-10-12 | France Telecom | HYPERMEDIA RESOURCE SEARCH ENGINE AND INDEXING METHOD THEREOF |
WO2001077890A1 (en) * | 2000-04-06 | 2001-10-18 | France Telecom | Hypermedia resource search engine and related indexing method |
WO2001082129A1 (en) * | 2000-04-27 | 2001-11-01 | Mondosoft A/S | Local search via global search |
WO2002008940A2 (en) * | 2000-07-20 | 2002-01-31 | Johnson Rodney D | Information archival and retrieval system for internetworked computers |
WO2002008940A3 (en) * | 2000-07-20 | 2003-09-25 | Rodney D Johnson | Information archival and retrieval system for internetworked computers |
US7792682B2 (en) | 2000-07-20 | 2010-09-07 | Johnson Rodney D | Information archival and retrieval system for internetworked computers |
Also Published As
Publication number | Publication date |
---|---|
US20060206469A1 (en) | 2006-09-14 |
US6651065B2 (en) | 2003-11-18 |
US20020107848A1 (en) | 2002-08-08 |
US7047246B2 (en) | 2006-05-16 |
US20030149684A1 (en) | 2003-08-07 |
AU5329499A (en) | 2000-02-28 |
US6336116B1 (en) | 2002-01-01 |
EP1101175A1 (en) | 2001-05-23 |
CA2339543A1 (en) | 2000-02-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6336116B1 (en) | Search and index hosting system | |
US6006217A (en) | Technique for providing enhanced relevance information for documents retrieved in a multi database search | |
US6490579B1 (en) | Search engine system and method utilizing context of heterogeneous information resources | |
US6567800B1 (en) | System and method for searching information stored on a network | |
US7099861B2 (en) | System and method for facilitating internet search by providing web document layout image | |
US6907425B1 (en) | System and method for searching information stored on a network | |
US7062707B1 (en) | System and method of providing multiple items of index information for a single data object | |
EP1211616A2 (en) | Data storage and retrieval system | |
CA2279546A1 (en) | Intelligent network browser using incremental conceptual indexer | |
KR100312591B1 (en) | Method for formulating query in information search | |
JP3501799B2 (en) | Information search support device, computer program, and program storage medium | |
De Bra et al. | Searching for arbitrary information in the WWW: the fish-search for mosaic | |
KR100379635B1 (en) | A system for retrieving world wide web and a method for storing, viewing and using the search result | |
US20030145087A1 (en) | Generating a list of addresses in a server | |
WO1997049048A1 (en) | Hypertext document retrieval system and method | |
US20030163452A1 (en) | Direct navigation for information retrieval | |
US7490082B2 (en) | System and method for searching internet domains | |
US20030145112A1 (en) | Assisted entering of addresses | |
KR19990078876A (en) | Information search method by URL input | |
US20040201631A1 (en) | Generating a list of addresses in an index page | |
CN1122232C (en) | Method for simultaneously implementing several searches of engine retrieval | |
US7496600B2 (en) | System and method for accessing web-based search services | |
US20030145046A1 (en) | Generating a list of addresses on a proxy server | |
US20030145065A1 (en) | Generating a list of document addresses on a local network | |
US6980986B1 (en) | System and method for bookmark set search and execution refinement |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
ENP | Entry into the national phase |
Ref document number: 2339543 Country of ref document: CA Ref country code: CA Ref document number: 2339543 Kind code of ref document: A Format of ref document f/p: F |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1999938912 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1999938912 Country of ref document: EP |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 1999938912 Country of ref document: EP |