WO1997034242A1 - Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching - Google Patents
Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching Download PDFInfo
- Publication number
- WO1997034242A1 WO1997034242A1 PCT/US1997/003185 US9703185W WO9734242A1 WO 1997034242 A1 WO1997034242 A1 WO 1997034242A1 US 9703185 W US9703185 W US 9703185W WO 9734242 A1 WO9734242 A1 WO 9734242A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- records
- query
- terms
- thesaurus
- collections
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 239000013589 supplement Substances 0.000 claims 1
- 230000008569 process Effects 0.000 abstract description 17
- 238000011109 contamination Methods 0.000 abstract description 3
- 238000012545 processing Methods 0.000 description 9
- 238000000605 extraction Methods 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 238000010187 selection method Methods 0.000 description 3
- 230000003068 static effect Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/374—Thesaurus
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/912—Applications of a database
- Y10S707/917—Text
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99937—Sorting
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002248793A CA2248793C (en) | 1996-03-15 | 1997-03-07 | Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching |
AU20609/97A AU2060997A (en) | 1996-03-15 | 1997-03-07 | Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching |
EP97908789A EP0901660A4 (en) | 1996-03-15 | 1997-03-07 | Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/616,883 US5926811A (en) | 1996-03-15 | 1996-03-15 | Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching |
US08/616,883 | 1996-03-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1997034242A1 true WO1997034242A1 (en) | 1997-09-18 |
Family
ID=24471373
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1997/003185 WO1997034242A1 (en) | 1996-03-15 | 1997-03-07 | Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching |
Country Status (5)
Country | Link |
---|---|
US (1) | US5926811A (en) |
EP (1) | EP0901660A4 (en) |
AU (1) | AU2060997A (en) |
CA (1) | CA2248793C (en) |
WO (1) | WO1997034242A1 (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999017224A1 (en) * | 1997-09-29 | 1999-04-08 | Fujun Bi | A multi-element confidence matching system and the method therefor |
EP0952535A1 (en) * | 1998-04-22 | 1999-10-27 | Het Babbage Instituut voor Kennis en Informatie Technologie "B.I.K.I.T." | Method and system for retrieving documents via an electronic data file |
EP1081613A1 (en) * | 1999-08-13 | 2001-03-07 | Mindpass A/S | A method and an apparatus for generically and transparently expanding and contracting a query |
AU761169B2 (en) * | 1999-05-10 | 2003-05-29 | Jollify Management Limited | A search engine with two-dimensional linearly scalable parallel architecture |
GB2391648A (en) * | 2002-08-07 | 2004-02-11 | Sharp Kk | Method of and Apparatus for Retrieving an Illustration of Text |
EP1411448A2 (en) * | 2002-10-17 | 2004-04-21 | Matsushita Electric Industrial Co., Ltd. | Data searching apparatus |
KR100434902B1 (en) * | 2000-08-28 | 2004-06-07 | 주식회사 에이전트엑스퍼트 | Knowledge base custom made information offer system and service method thereof |
FR2960669A1 (en) * | 2010-05-28 | 2011-12-02 | Mobeo | Information searching method for e.g. Internet, involves transmitting derived request to sites to have search information, receiving response provided by receiving site to derived request, and presenting response from site |
EP2793146A1 (en) * | 2013-04-16 | 2014-10-22 | Wal-Mart Stores, Inc. | Relevance-based cutoff for search results |
Families Citing this family (141)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5724571A (en) * | 1995-07-07 | 1998-03-03 | Sun Microsystems, Inc. | Method and apparatus for generating query responses in a computer-based document retrieval system |
US6377965B1 (en) * | 1997-11-07 | 2002-04-23 | Microsoft Corporation | Automatic word completion system for partially entered data |
US6128634A (en) * | 1998-01-06 | 2000-10-03 | Fuji Xerox Co., Ltd. | Method and apparatus for facilitating skimming of text |
GB2336696B (en) * | 1998-04-24 | 2002-12-18 | Dialog Corp Plc The | Analysing data files |
US6178416B1 (en) * | 1998-06-15 | 2001-01-23 | James U. Parker | Method and apparatus for knowledgebase searching |
JP3760057B2 (en) * | 1998-11-19 | 2006-03-29 | 株式会社日立製作所 | Document search method and document search service for multiple document databases |
US6523028B1 (en) * | 1998-12-03 | 2003-02-18 | Lockhead Martin Corporation | Method and system for universal querying of distributed databases |
US7003719B1 (en) * | 1999-01-25 | 2006-02-21 | West Publishing Company, Dba West Group | System, method, and software for inserting hyperlinks into documents |
AU4328000A (en) * | 1999-03-31 | 2000-10-16 | Verizon Laboratories Inc. | Techniques for performing a data query in a computer system |
US8275661B1 (en) | 1999-03-31 | 2012-09-25 | Verizon Corporate Services Group Inc. | Targeted banner advertisements |
US8572069B2 (en) * | 1999-03-31 | 2013-10-29 | Apple Inc. | Semi-automatic index term augmentation in document retrieval |
US7024416B1 (en) | 1999-03-31 | 2006-04-04 | Verizon Laboratories Inc. | Semi-automatic index term augmentation in document retrieval |
US20020032564A1 (en) * | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
US6269361B1 (en) * | 1999-05-28 | 2001-07-31 | Goto.Com | System and method for influencing a position on a search result list generated by a computer network search engine |
JP3791877B2 (en) * | 1999-06-15 | 2006-06-28 | 富士通株式会社 | An apparatus for searching information using the reason for referring to a document |
US7089236B1 (en) * | 1999-06-24 | 2006-08-08 | Search 123.Com, Inc. | Search engine interface |
US6424969B1 (en) * | 1999-07-20 | 2002-07-23 | Inmentia, Inc. | System and method for organizing data |
US6718363B1 (en) * | 1999-07-30 | 2004-04-06 | Verizon Laboratories, Inc. | Page aggregation for web sites |
US6477524B1 (en) * | 1999-08-18 | 2002-11-05 | Sharp Laboratories Of America, Incorporated | Method for statistical text analysis |
AU7701700A (en) * | 1999-09-13 | 2001-04-17 | David J Weitz | Competitive information management system |
US6556992B1 (en) * | 1999-09-14 | 2003-04-29 | Patent Ratings, Llc | Method and system for rating patents and other intangible assets |
US20090259506A1 (en) * | 1999-09-14 | 2009-10-15 | Barney Jonathan A | Method and system for rating patents and other intangible assets |
US7310629B1 (en) | 1999-12-15 | 2007-12-18 | Napster, Inc. | Method and apparatus for controlling file sharing of multimedia files over a fluid, de-centralized network |
US6742023B1 (en) | 2000-04-28 | 2004-05-25 | Roxio, Inc. | Use-sensitive distribution of data files between users |
US6366907B1 (en) | 1999-12-15 | 2002-04-02 | Napster, Inc. | Real-time search engine |
US6704727B1 (en) * | 2000-01-31 | 2004-03-09 | Overture Services, Inc. | Method and system for generating a set of search terms |
US8612245B2 (en) * | 2000-02-24 | 2013-12-17 | Webmd Llc | Personalized health history system with accommodation for consumer health terminology |
US6912525B1 (en) * | 2000-05-08 | 2005-06-28 | Verizon Laboratories, Inc. | Techniques for web site integration |
US7062483B2 (en) * | 2000-05-18 | 2006-06-13 | Endeca Technologies, Inc. | Hierarchical data-driven search and navigation system and method for information retrieval |
US7617184B2 (en) | 2000-05-18 | 2009-11-10 | Endeca Technologies, Inc. | Scalable hierarchical data-driven navigation system and method for information retrieval |
US7035864B1 (en) * | 2000-05-18 | 2006-04-25 | Endeca Technologies, Inc. | Hierarchical data-driven navigation system and method for information retrieval |
US7325201B2 (en) * | 2000-05-18 | 2008-01-29 | Endeca Technologies, Inc. | System and method for manipulating content in a hierarchical data-driven search and navigation system |
DE10031351A1 (en) * | 2000-06-28 | 2002-01-17 | Guru Netservices Gmbh | Automatic research procedure |
US7089301B1 (en) | 2000-08-11 | 2006-08-08 | Napster, Inc. | System and method for searching peer-to-peer computer networks by selecting a computer based on at least a number of files shared by the computer |
US7392238B1 (en) * | 2000-08-23 | 2008-06-24 | Intel Corporation | Method and apparatus for concept-based searching across a network |
US6665661B1 (en) * | 2000-09-29 | 2003-12-16 | Battelle Memorial Institute | System and method for use in text analysis of documents and records |
US20020091879A1 (en) * | 2000-12-21 | 2002-07-11 | James Beriker | System, method and apparatus for dynamic traffic management on a network |
CN1191540C (en) * | 2000-12-29 | 2005-03-02 | 国际商业机器公司 | Lossy index compression |
US6823333B2 (en) * | 2001-03-02 | 2004-11-23 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for conducting a keyterm search |
US6697793B2 (en) * | 2001-03-02 | 2004-02-24 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for generating phrases from a database |
US6741981B2 (en) * | 2001-03-02 | 2004-05-25 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration (Nasa) | System, method and apparatus for conducting a phrase search |
US8484177B2 (en) * | 2001-03-21 | 2013-07-09 | Eugene M. Lee | Apparatus for and method of searching and organizing intellectual property information utilizing a field-of-search |
US6944619B2 (en) * | 2001-04-12 | 2005-09-13 | Primentia, Inc. | System and method for organizing data |
US7536413B1 (en) | 2001-05-07 | 2009-05-19 | Ixreveal, Inc. | Concept-based categorization of unstructured objects |
US7627588B1 (en) | 2001-05-07 | 2009-12-01 | Ixreveal, Inc. | System and method for concept based analysis of unstructured data |
USRE46973E1 (en) | 2001-05-07 | 2018-07-31 | Ureveal, Inc. | Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information |
US7194483B1 (en) | 2001-05-07 | 2007-03-20 | Intelligenxia, Inc. | Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information |
JP4489994B2 (en) * | 2001-05-11 | 2010-06-23 | 富士通株式会社 | Topic extraction apparatus, method, program, and recording medium for recording the program |
SG103289A1 (en) * | 2001-05-25 | 2004-04-29 | Meng Soon Cheo | System for indexing textual and non-textual files |
JP4025517B2 (en) * | 2001-05-31 | 2007-12-19 | 株式会社日立製作所 | Document search system and server |
US7028024B1 (en) * | 2001-07-20 | 2006-04-11 | Vignette Corporation | Information retrieval from a collection of information objects tagged with hierarchical keywords |
US20030023643A1 (en) * | 2001-07-27 | 2003-01-30 | International Business Machines Corporation | Method and apparatus for providing context-sensitive code ahead input |
EP1288794A1 (en) * | 2001-08-29 | 2003-03-05 | Tarchon BV | Methods of ordering and of retrieving information from a corpus of documents and database system for the same |
EP1300773A1 (en) * | 2001-10-02 | 2003-04-09 | Sun Microsystems, Inc. | Information service using a thesaurus |
KR100501079B1 (en) * | 2001-11-12 | 2005-07-18 | 주식회사 아이니드 | Application system for network-based search service using resemblant words and method thereof |
US7356527B2 (en) * | 2001-12-19 | 2008-04-08 | International Business Machines Corporation | Lossy index compression |
US20030120630A1 (en) * | 2001-12-20 | 2003-06-26 | Daniel Tunkelang | Method and system for similarity search and clustering |
US7333966B2 (en) | 2001-12-21 | 2008-02-19 | Thomson Global Resources | Systems, methods, and software for hyperlinking names |
US8589413B1 (en) | 2002-03-01 | 2013-11-19 | Ixreveal, Inc. | Concept-based method and system for dynamically analyzing results from search engines |
US20040205660A1 (en) * | 2002-04-23 | 2004-10-14 | Joe Acton | System and method for generating and displaying attribute-enhanced documents |
US20030216930A1 (en) * | 2002-05-16 | 2003-11-20 | Dunham Carl A. | Cost-per-action search engine system, method and apparatus |
US20040064447A1 (en) * | 2002-09-27 | 2004-04-01 | Simske Steven J. | System and method for management of synonymic searching |
JP2004126840A (en) * | 2002-10-01 | 2004-04-22 | Hitachi Ltd | Document retrieval method, program, and system |
US8868543B1 (en) * | 2002-11-20 | 2014-10-21 | Google Inc. | Finding web pages relevant to multimedia streams |
US20040117366A1 (en) * | 2002-12-12 | 2004-06-17 | Ferrari Adam J. | Method and system for interpreting multiple-term queries |
US8375008B1 (en) | 2003-01-17 | 2013-02-12 | Robert Gomes | Method and system for enterprise-wide retention of digital or electronic data |
US8943024B1 (en) | 2003-01-17 | 2015-01-27 | Daniel John Gardner | System and method for data de-duplication |
US8065277B1 (en) | 2003-01-17 | 2011-11-22 | Daniel John Gardner | System and method for a data extraction and backup database |
US8630984B1 (en) | 2003-01-17 | 2014-01-14 | Renew Data Corp. | System and method for data extraction from email files |
US20040158561A1 (en) * | 2003-02-04 | 2004-08-12 | Gruenwald Bjorn J. | System and method for translating languages using an intermediate content space |
US20040162824A1 (en) * | 2003-02-13 | 2004-08-19 | Burns Roland John | Method and apparatus for classifying a document with respect to reference corpus |
US6947930B2 (en) * | 2003-03-21 | 2005-09-20 | Overture Services, Inc. | Systems and methods for interactive search query refinement |
US7395256B2 (en) * | 2003-06-20 | 2008-07-01 | Agency For Science, Technology And Research | Method and platform for term extraction from large collection of documents |
US20050060290A1 (en) * | 2003-09-15 | 2005-03-17 | International Business Machines Corporation | Automatic query routing and rank configuration for search queries in an information retrieval system |
JP4995072B2 (en) * | 2003-12-31 | 2012-08-08 | トムソン ルーターズ グローバル リソーシーズ | Systems, methods, software, and interfaces for integrating cases with litigation summary, litigation documents, and / or other litigation evidence documents |
US20050160080A1 (en) * | 2004-01-16 | 2005-07-21 | The Regents Of The University Of California | System and method of context-specific searching in an electronic database |
US20050160082A1 (en) * | 2004-01-16 | 2005-07-21 | The Regents Of The University Of California | System and method of context-specific searching in an electronic database |
US8375048B1 (en) * | 2004-01-20 | 2013-02-12 | Microsoft Corporation | Query augmentation |
US20050203924A1 (en) * | 2004-03-13 | 2005-09-15 | Rosenberg Gerald B. | System and methods for analytic research and literate reporting of authoritative document collections |
US7428528B1 (en) | 2004-03-31 | 2008-09-23 | Endeca Technologies, Inc. | Integrated application for manipulating content in a hierarchical data-driven search and navigation system |
US8069151B1 (en) | 2004-12-08 | 2011-11-29 | Chris Crafford | System and method for detecting incongruous or incorrect media in a data recovery process |
EP1851616A2 (en) * | 2005-01-31 | 2007-11-07 | Musgrove Technology Enterprises, LLC | System and method for generating an interlinked taxonomy structure |
WO2006086179A2 (en) * | 2005-01-31 | 2006-08-17 | Textdigger, Inc. | Method and system for semantic search and retrieval of electronic documents |
US8296162B1 (en) | 2005-02-01 | 2012-10-23 | Webmd Llc. | Systems, devices, and methods for providing healthcare information |
US8527468B1 (en) | 2005-02-08 | 2013-09-03 | Renew Data Corp. | System and method for management of retention periods for content in a computing system |
US7937396B1 (en) | 2005-03-23 | 2011-05-03 | Google Inc. | Methods and systems for identifying paraphrases from an index of information items and associated sentence fragments |
WO2006110684A2 (en) * | 2005-04-11 | 2006-10-19 | Textdigger, Inc. | System and method for searching for a query |
US20060229999A1 (en) * | 2005-04-11 | 2006-10-12 | Herbert Dodell | Decision support system for litigation evaluation |
US20060242130A1 (en) * | 2005-04-23 | 2006-10-26 | Clenova, Llc | Information retrieval using conjunctive search and link discovery |
EP1889181A4 (en) * | 2005-05-16 | 2009-12-02 | Ebay Inc | Method and system to process a data search request |
WO2006128183A2 (en) | 2005-05-27 | 2006-11-30 | Schwegman, Lundberg, Woessner & Kluth, P.A. | Method and apparatus for cross-referencing important ip relationships |
US7949581B2 (en) * | 2005-09-07 | 2011-05-24 | Patentratings, Llc | Method of determining an obsolescence rate of a technology |
US7716226B2 (en) | 2005-09-27 | 2010-05-11 | Patentratings, Llc | Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects |
US7937265B1 (en) | 2005-09-27 | 2011-05-03 | Google Inc. | Paraphrase acquisition |
CN1940915B (en) * | 2005-09-29 | 2010-05-05 | 国际商业机器公司 | Corpus expansion system and method |
EP1952280B8 (en) * | 2005-10-11 | 2016-11-30 | Ureveal, Inc. | System, method&computer program product for concept based searching&analysis |
US8019752B2 (en) | 2005-11-10 | 2011-09-13 | Endeca Technologies, Inc. | System and method for information retrieval from object collections with complex interrelationships |
US7814102B2 (en) * | 2005-12-07 | 2010-10-12 | Lexisnexis, A Division Of Reed Elsevier Inc. | Method and system for linking documents with multiple topics to related documents |
WO2007081681A2 (en) | 2006-01-03 | 2007-07-19 | Textdigger, Inc. | Search system with query refinement and search method |
US20070168344A1 (en) * | 2006-01-19 | 2007-07-19 | Brinson Robert M Jr | Data product search using related concepts |
US20080021887A1 (en) * | 2006-01-19 | 2008-01-24 | Intelliscience Corporation | Data product search using related concepts |
US20070175674A1 (en) * | 2006-01-19 | 2007-08-02 | Intelliscience Corporation | Systems and methods for ranking terms found in a data product |
US7676485B2 (en) * | 2006-01-20 | 2010-03-09 | Ixreveal, Inc. | Method and computer program product for converting ontologies into concept semantic networks |
US8195683B2 (en) * | 2006-02-28 | 2012-06-05 | Ebay Inc. | Expansion of database search queries |
WO2007114932A2 (en) | 2006-04-04 | 2007-10-11 | Textdigger, Inc. | Search system and method with text function tagging |
US7735010B2 (en) * | 2006-04-05 | 2010-06-08 | Lexisnexis, A Division Of Reed Elsevier Inc. | Citation network viewer and method |
CA2652409A1 (en) * | 2006-05-19 | 2007-11-29 | Jorn Lyseggen | Source search engine |
US7558725B2 (en) * | 2006-05-23 | 2009-07-07 | Lexisnexis, A Division Of Reed Elsevier Inc. | Method and apparatus for multilingual spelling corrections |
US8150827B2 (en) * | 2006-06-07 | 2012-04-03 | Renew Data Corp. | Methods for enhancing efficiency and cost effectiveness of first pass review of documents |
US8676802B2 (en) * | 2006-11-30 | 2014-03-18 | Oracle Otc Subsidiary Llc | Method and system for information retrieval with clustering |
CA2571172C (en) * | 2006-12-14 | 2012-02-14 | University Of Regina | Interactive web information retrieval using graphical word indicators |
US8380530B2 (en) | 2007-02-02 | 2013-02-19 | Webmd Llc. | Personalized health records with associative relationships |
US7925644B2 (en) * | 2007-03-01 | 2011-04-12 | Microsoft Corporation | Efficient retrieval algorithm by query term discrimination |
US9002869B2 (en) * | 2007-06-22 | 2015-04-07 | Google Inc. | Machine translation for query expansion |
JP2009026083A (en) * | 2007-07-19 | 2009-02-05 | Fujifilm Corp | Content retrieval device |
EP3104288A1 (en) | 2007-10-15 | 2016-12-14 | Lexisnexis Group | System and method for searching for documents |
WO2009059297A1 (en) * | 2007-11-01 | 2009-05-07 | Textdigger, Inc. | Method and apparatus for automated tag generation for digital content |
US8725756B1 (en) | 2007-11-12 | 2014-05-13 | Google Inc. | Session-based query suggestions |
US7856434B2 (en) * | 2007-11-12 | 2010-12-21 | Endeca Technologies, Inc. | System and method for filtering rules for manipulating search results in a hierarchical search and navigation system |
WO2009068072A1 (en) * | 2007-11-30 | 2009-06-04 | Kinkadee Systems Gmbh | Scalable associative text mining network and method |
US10733223B2 (en) * | 2008-01-08 | 2020-08-04 | International Business Machines Corporation | Term-driven records file plan and thesaurus design |
US8615490B1 (en) | 2008-01-31 | 2013-12-24 | Renew Data Corp. | Method and system for restoring information from backup storage media |
US7930322B2 (en) * | 2008-05-27 | 2011-04-19 | Microsoft Corporation | Text based schema discovery and information extraction |
US20100131513A1 (en) | 2008-10-23 | 2010-05-27 | Lundberg Steven W | Patent mapping |
US8463806B2 (en) | 2009-01-30 | 2013-06-11 | Lexisnexis | Methods and systems for creating and using an adaptive thesaurus |
US9245243B2 (en) | 2009-04-14 | 2016-01-26 | Ureveal, Inc. | Concept-based analysis of structured and unstructured data using concept inheritance |
EP2275953B1 (en) * | 2009-06-30 | 2018-10-24 | LG Electronics Inc. | Mobile terminal |
WO2011072172A1 (en) * | 2009-12-09 | 2011-06-16 | Renew Data Corp. | System and method for quickly determining a subset of irrelevant data from large data content |
US8738668B2 (en) | 2009-12-16 | 2014-05-27 | Renew Data Corp. | System and method for creating a de-duplicated data set |
US8938466B2 (en) * | 2010-01-15 | 2015-01-20 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems and methods for ranking documents |
US9582575B2 (en) | 2010-07-09 | 2017-02-28 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems and methods for linking items to a matter |
US9721006B2 (en) | 2011-03-21 | 2017-08-01 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems and methods for enabling searches of a document corpus and generation of search queries |
US9904726B2 (en) | 2011-05-04 | 2018-02-27 | Black Hills IP Holdings, LLC. | Apparatus and method for automated and assisted patent claim mapping and expense planning |
US10242066B2 (en) | 2011-10-03 | 2019-03-26 | Black Hills Ip Holdings, Llc | Systems, methods and user interfaces in a patent management system |
US9201969B2 (en) | 2013-01-31 | 2015-12-01 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems and methods for identifying documents based on citation history |
US9251292B2 (en) | 2013-03-11 | 2016-02-02 | Wal-Mart Stores, Inc. | Search result ranking using query clustering |
US9244952B2 (en) | 2013-03-17 | 2016-01-26 | Alation, Inc. | Editable and searchable markup pages automatically populated through user query monitoring |
US10474702B1 (en) | 2014-08-18 | 2019-11-12 | Street Diligence, Inc. | Computer-implemented apparatus and method for providing information concerning a financial instrument |
US11144994B1 (en) | 2014-08-18 | 2021-10-12 | Street Diligence, Inc. | Computer-implemented apparatus and method for providing information concerning a financial instrument |
CN105988704B (en) * | 2015-03-03 | 2020-10-02 | 上海触乐信息科技有限公司 | Efficient touch screen text input system and method |
US10140273B2 (en) | 2016-01-19 | 2018-11-27 | International Business Machines Corporation | List manipulation in natural language processing |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4870568A (en) * | 1986-06-25 | 1989-09-26 | Thinking Machines Corporation | Method for searching a database system including parallel processors |
US4876643A (en) * | 1987-06-24 | 1989-10-24 | Kabushiki Kaisha Toshiba | Parallel searching system having a master processor for controlling plural slave processors for independently processing respective search requests |
US5136289A (en) * | 1990-08-06 | 1992-08-04 | Fujitsu Limited | Dictionary searching system |
US5297039A (en) * | 1991-01-30 | 1994-03-22 | Mitsubishi Denki Kabushiki Kaisha | Text search system for locating on the basis of keyword matching and keyword relationship matching |
US5469355A (en) * | 1992-11-24 | 1995-11-21 | Fujitsu Limited | Near-synonym generating method |
US5615378A (en) * | 1993-07-19 | 1997-03-25 | Fujitsu Limited | Dictionary retrieval device |
US5619709A (en) * | 1993-09-20 | 1997-04-08 | Hnc, Inc. | System and method of context vector generation and retrieval |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5481742A (en) * | 1990-05-04 | 1996-01-02 | Reed Elsevier Inc. | Printer control apparatus for remotely modifying local printer by configuration signals from remote host to produce customized printing control codes |
US5410475A (en) * | 1993-04-19 | 1995-04-25 | Mead Data Central, Inc. | Short case name generating method and apparatus |
US5675819A (en) * | 1994-06-16 | 1997-10-07 | Xerox Corporation | Document information retrieval using global word co-occurrence patterns |
US5721902A (en) * | 1995-09-15 | 1998-02-24 | Infonautics Corporation | Restricted expansion of query terms using part of speech tagging |
US5717914A (en) * | 1995-09-15 | 1998-02-10 | Infonautics Corporation | Method for categorizing documents into subjects using relevance normalization for documents retrieved from an information retrieval system in response to a query |
-
1996
- 1996-03-15 US US08/616,883 patent/US5926811A/en not_active Expired - Lifetime
-
1997
- 1997-03-07 EP EP97908789A patent/EP0901660A4/en not_active Withdrawn
- 1997-03-07 WO PCT/US1997/003185 patent/WO1997034242A1/en active Application Filing
- 1997-03-07 AU AU20609/97A patent/AU2060997A/en not_active Abandoned
- 1997-03-07 CA CA002248793A patent/CA2248793C/en not_active Expired - Lifetime
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4870568A (en) * | 1986-06-25 | 1989-09-26 | Thinking Machines Corporation | Method for searching a database system including parallel processors |
US4876643A (en) * | 1987-06-24 | 1989-10-24 | Kabushiki Kaisha Toshiba | Parallel searching system having a master processor for controlling plural slave processors for independently processing respective search requests |
US5136289A (en) * | 1990-08-06 | 1992-08-04 | Fujitsu Limited | Dictionary searching system |
US5297039A (en) * | 1991-01-30 | 1994-03-22 | Mitsubishi Denki Kabushiki Kaisha | Text search system for locating on the basis of keyword matching and keyword relationship matching |
US5469355A (en) * | 1992-11-24 | 1995-11-21 | Fujitsu Limited | Near-synonym generating method |
US5615378A (en) * | 1993-07-19 | 1997-03-25 | Fujitsu Limited | Dictionary retrieval device |
US5619709A (en) * | 1993-09-20 | 1997-04-08 | Hnc, Inc. | System and method of context vector generation and retrieval |
Non-Patent Citations (3)
Title |
---|
CROUCH C J, YANG B: "EXPERIMENTS IN AUTOMATIC STATISTICAL THESAURUS CONSTRUCTION", SIGIR FORUM., ACM, NEW YORK, NY., US, 1 June 1992 (1992-06-01), US, pages 77 - 88, XP002940409, ISSN: 0163-5840 * |
CROUCH C J: "AN APPROACH TO THE AUTOMATIC CONSTRUCTION OF GLOBAL THESAURI", INFORMATION PROCESSING & MANAGEMENT., ELSEVIER, BARKING., GB, vol. 26, no. 05, 1 January 1990 (1990-01-01), GB, pages 629 - 640, XP002945186, ISSN: 0306-4573, DOI: 10.1016/0306-4573(90)90106-C * |
MINKER J, WILSON G A, ZIMMERMAN B H: "AN EVALUATION OF QUERY EXPANSION BY THE ADDITION OF CLUSTERED TERMSFOR A DOCUMENT RETRIEVAL SYSTEM", INFORMATION STORAGE AND RETRIEVAL., PERGAMON PRESS. OXFORD., GB, vol. 08, 1 January 1972 (1972-01-01), GB, pages 329 - 348, XP002945185, DOI: 10.1016/0020-0271(72)90021-6 * |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999017224A1 (en) * | 1997-09-29 | 1999-04-08 | Fujun Bi | A multi-element confidence matching system and the method therefor |
EP0952535A1 (en) * | 1998-04-22 | 1999-10-27 | Het Babbage Instituut voor Kennis en Informatie Technologie "B.I.K.I.T." | Method and system for retrieving documents via an electronic data file |
BE1012981A3 (en) * | 1998-04-22 | 2001-07-03 | Het Babbage Inst Voor Kennis E | Method and system for the weather find of documents from an electronic database. |
US6807545B1 (en) | 1998-04-22 | 2004-10-19 | Het Babbage Instituut voor Kennis en Informatie Technologie “B.I.K.I.T.” | Method and system for retrieving documents via an electronic data file |
AU761169B2 (en) * | 1999-05-10 | 2003-05-29 | Jollify Management Limited | A search engine with two-dimensional linearly scalable parallel architecture |
EP1081613A1 (en) * | 1999-08-13 | 2001-03-07 | Mindpass A/S | A method and an apparatus for generically and transparently expanding and contracting a query |
KR100434902B1 (en) * | 2000-08-28 | 2004-06-07 | 주식회사 에이전트엑스퍼트 | Knowledge base custom made information offer system and service method thereof |
GB2391648A (en) * | 2002-08-07 | 2004-02-11 | Sharp Kk | Method of and Apparatus for Retrieving an Illustration of Text |
EP1411448A2 (en) * | 2002-10-17 | 2004-04-21 | Matsushita Electric Industrial Co., Ltd. | Data searching apparatus |
EP1411448A3 (en) * | 2002-10-17 | 2007-12-05 | Matsushita Electric Industrial Co., Ltd. | Data searching apparatus |
FR2960669A1 (en) * | 2010-05-28 | 2011-12-02 | Mobeo | Information searching method for e.g. Internet, involves transmitting derived request to sites to have search information, receiving response provided by receiving site to derived request, and presenting response from site |
EP2793146A1 (en) * | 2013-04-16 | 2014-10-22 | Wal-Mart Stores, Inc. | Relevance-based cutoff for search results |
Also Published As
Publication number | Publication date |
---|---|
US5926811A (en) | 1999-07-20 |
CA2248793C (en) | 2002-04-30 |
AU2060997A (en) | 1997-10-01 |
EP0901660A4 (en) | 2001-07-04 |
EP0901660A1 (en) | 1999-03-17 |
CA2248793A1 (en) | 1997-09-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2248793C (en) | Statistical thesaurus, method of forming same, and use thereof in query expansion in automated text searching | |
US7185001B1 (en) | Systems and methods for document searching and organizing | |
US5893092A (en) | Relevancy ranking using statistical ranking, semantics, relevancy feedback and small pieces of text | |
US6236987B1 (en) | Dynamic content organization in information retrieval systems | |
US6826576B2 (en) | Very-large-scale automatic categorizer for web content | |
EP0590858B1 (en) | Method for performing a search of a plurality of documents for similarity to a query | |
US5062074A (en) | Information retrieval system and method | |
Anick et al. | The paraphrase search assistant: terminological feedback for iterative information seeking | |
US6859800B1 (en) | System for fulfilling an information need | |
US4972349A (en) | Information retrieval system and method | |
USRE36727E (en) | Method of indexing and retrieval of electronically-stored documents | |
US5940624A (en) | Text management system | |
US6286000B1 (en) | Light weight document matcher | |
US20020123994A1 (en) | System for fulfilling an information need using extended matching techniques | |
US20020073079A1 (en) | Method and apparatus for searching a database and providing relevance feedback | |
EP1342177A1 (en) | Method for structuring and searching information | |
JP2001117946A (en) | Associated text search and retrieval system | |
US7024405B2 (en) | Method and apparatus for improved internet searching | |
US5893094A (en) | Method and apparatus using run length encoding to evaluate a database | |
JPH11120203A (en) | Method for combining data base and device for retrieving document from data base | |
Lin et al. | ACIRD: intelligent Internet document organization and retrieval | |
Smeaton et al. | User-chosen phrases in interactive query formulation for information retrieval | |
TWI290684B (en) | Incremental thesaurus construction method | |
WO2002037328A2 (en) | Integrating search, classification, scoring and ranking | |
Wan et al. | Experiments with automatic indexing and a relational thesaurus in a Chinese information retrieval system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA JP |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
ENP | Entry into the national phase |
Ref document number: 2248793 Country of ref document: CA Ref country code: CA Ref document number: 2248793 Kind code of ref document: A Format of ref document f/p: F |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1997908789 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: JP Ref document number: 97532637 Format of ref document f/p: F |
|
WWP | Wipo information: published in national office |
Ref document number: 1997908789 Country of ref document: EP |