CN100565506C - 基于图像文档的索引和检索 - Google Patents
基于图像文档的索引和检索 Download PDFInfo
- Publication number
- CN100565506C CN100565506C CNB2005100062210A CN200510006221A CN100565506C CN 100565506 C CN100565506 C CN 100565506C CN B2005100062210 A CNB2005100062210 A CN B2005100062210A CN 200510006221 A CN200510006221 A CN 200510006221A CN 100565506 C CN100565506 C CN 100565506C
- Authority
- CN
- China
- Prior art keywords
- image
- signature
- document
- word
- catching
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/5846—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
Abstract
Description
Claims (39)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/758,370 | 2004-01-15 | ||
US10/758,370 US7475061B2 (en) | 2004-01-15 | 2004-01-15 | Image-based document indexing and retrieval |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1641646A CN1641646A (zh) | 2005-07-20 |
CN100565506C true CN100565506C (zh) | 2009-12-02 |
Family
ID=34620698
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2005100062210A Active CN100565506C (zh) | 2004-01-15 | 2005-01-17 | 基于图像文档的索引和检索 |
Country Status (5)
Country | Link |
---|---|
US (1) | US7475061B2 (zh) |
EP (1) | EP1555626A3 (zh) |
JP (1) | JP4718841B2 (zh) |
KR (1) | KR101027851B1 (zh) |
CN (1) | CN100565506C (zh) |
Families Citing this family (161)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7082427B1 (en) * | 2000-05-24 | 2006-07-25 | Reachforce, Inc. | Text indexing system to index, query the archive database document by keyword data representing the content of the documents and by contact data associated with the participant who generated the document |
US7330850B1 (en) | 2000-10-04 | 2008-02-12 | Reachforce, Inc. | Text mining system for web-based business intelligence applied to web site server logs |
US8694510B2 (en) * | 2003-09-04 | 2014-04-08 | Oracle International Corporation | Indexing XML documents efficiently |
US8229932B2 (en) * | 2003-09-04 | 2012-07-24 | Oracle International Corporation | Storing XML documents efficiently in an RDBMS |
US7475061B2 (en) | 2004-01-15 | 2009-01-06 | Microsoft Corporation | Image-based document indexing and retrieval |
JP4380400B2 (ja) * | 2004-04-16 | 2009-12-09 | キヤノン株式会社 | 文書処理装置及びその制御方法、並びにコンピュータプログラム |
EP1747529A1 (en) * | 2004-05-18 | 2007-01-31 | Silverbrook Research Pty. Ltd | Method and apparatus for security document tracking |
US7729538B2 (en) * | 2004-08-26 | 2010-06-01 | Microsoft Corporation | Spatial recognition and grouping of text and graphics |
US7574048B2 (en) * | 2004-09-03 | 2009-08-11 | Microsoft Corporation | Freeform digital ink annotation recognition |
US7523098B2 (en) * | 2004-09-15 | 2009-04-21 | International Business Machines Corporation | Systems and methods for efficient data searching, storage and reduction |
US8725705B2 (en) * | 2004-09-15 | 2014-05-13 | International Business Machines Corporation | Systems and methods for searching of storage data with reduced bandwidth requirements |
US7970171B2 (en) * | 2007-01-18 | 2011-06-28 | Ricoh Co., Ltd. | Synthetic image and video generation from ground truth data |
US9530050B1 (en) * | 2007-07-11 | 2016-12-27 | Ricoh Co., Ltd. | Document annotation sharing |
US7551780B2 (en) | 2005-08-23 | 2009-06-23 | Ricoh Co., Ltd. | System and method for using individualized mixed document |
US8510283B2 (en) * | 2006-07-31 | 2013-08-13 | Ricoh Co., Ltd. | Automatic adaption of an image recognition system to image capture devices |
US8825682B2 (en) | 2006-07-31 | 2014-09-02 | Ricoh Co., Ltd. | Architecture for mixed media reality retrieval of locations and registration of images |
US7639387B2 (en) * | 2005-08-23 | 2009-12-29 | Ricoh Co., Ltd. | Authoring tools using a mixed media environment |
US8838591B2 (en) * | 2005-08-23 | 2014-09-16 | Ricoh Co., Ltd. | Embedding hot spots in electronic documents |
US9405751B2 (en) * | 2005-08-23 | 2016-08-02 | Ricoh Co., Ltd. | Database for mixed media document system |
US8521737B2 (en) * | 2004-10-01 | 2013-08-27 | Ricoh Co., Ltd. | Method and system for multi-tier image matching in a mixed media environment |
US8335789B2 (en) * | 2004-10-01 | 2012-12-18 | Ricoh Co., Ltd. | Method and system for document fingerprint matching in a mixed media environment |
US8176054B2 (en) * | 2007-07-12 | 2012-05-08 | Ricoh Co. Ltd | Retrieving electronic documents by converting them to synthetic text |
US8086038B2 (en) * | 2007-07-11 | 2011-12-27 | Ricoh Co., Ltd. | Invisible junction features for patch recognition |
US8600989B2 (en) | 2004-10-01 | 2013-12-03 | Ricoh Co., Ltd. | Method and system for image matching in a mixed media environment |
US7917554B2 (en) * | 2005-08-23 | 2011-03-29 | Ricoh Co. Ltd. | Visibly-perceptible hot spots in documents |
US8369655B2 (en) * | 2006-07-31 | 2013-02-05 | Ricoh Co., Ltd. | Mixed media reality recognition using multiple specialized indexes |
US7991778B2 (en) * | 2005-08-23 | 2011-08-02 | Ricoh Co., Ltd. | Triggering actions with captured input in a mixed media environment |
US7669148B2 (en) * | 2005-08-23 | 2010-02-23 | Ricoh Co., Ltd. | System and methods for portable device for mixed media system |
US8144921B2 (en) | 2007-07-11 | 2012-03-27 | Ricoh Co., Ltd. | Information retrieval using invisible junctions and geometric constraints |
US8949287B2 (en) * | 2005-08-23 | 2015-02-03 | Ricoh Co., Ltd. | Embedding hot spots in imaged documents |
US8856108B2 (en) * | 2006-07-31 | 2014-10-07 | Ricoh Co., Ltd. | Combining results of image retrieval processes |
US8276088B2 (en) | 2007-07-11 | 2012-09-25 | Ricoh Co., Ltd. | User interface for three-dimensional navigation |
US8195659B2 (en) * | 2005-08-23 | 2012-06-05 | Ricoh Co. Ltd. | Integration and use of mixed media documents |
US7920759B2 (en) * | 2005-08-23 | 2011-04-05 | Ricoh Co. Ltd. | Triggering applications for distributed action execution and use of mixed media recognition as a control input |
US8332401B2 (en) * | 2004-10-01 | 2012-12-11 | Ricoh Co., Ltd | Method and system for position-based image matching in a mixed media environment |
US8156115B1 (en) | 2007-07-11 | 2012-04-10 | Ricoh Co. Ltd. | Document-based networking with mixed media reality |
US7672543B2 (en) * | 2005-08-23 | 2010-03-02 | Ricoh Co., Ltd. | Triggering applications based on a captured text in a mixed media environment |
US8385589B2 (en) * | 2008-05-15 | 2013-02-26 | Berna Erol | Web-based content detection in images, extraction and recognition |
US8156116B2 (en) | 2006-07-31 | 2012-04-10 | Ricoh Co., Ltd | Dynamic presentation of targeted information in a mixed media reality recognition system |
US8489583B2 (en) * | 2004-10-01 | 2013-07-16 | Ricoh Company, Ltd. | Techniques for retrieving documents using an image capture device |
US9384619B2 (en) * | 2006-07-31 | 2016-07-05 | Ricoh Co., Ltd. | Searching media content for objects specified using identifiers |
US8184155B2 (en) * | 2007-07-11 | 2012-05-22 | Ricoh Co. Ltd. | Recognition and tracking using invisible junctions |
US7885955B2 (en) * | 2005-08-23 | 2011-02-08 | Ricoh Co. Ltd. | Shared document annotation |
US8156427B2 (en) | 2005-08-23 | 2012-04-10 | Ricoh Co. Ltd. | User interface for mixed media reality |
US8868555B2 (en) | 2006-07-31 | 2014-10-21 | Ricoh Co., Ltd. | Computation of a recongnizability score (quality predictor) for image retrieval |
US7587412B2 (en) * | 2005-08-23 | 2009-09-08 | Ricoh Company, Ltd. | Mixed media reality brokerage network and methods of use |
US9171202B2 (en) * | 2005-08-23 | 2015-10-27 | Ricoh Co., Ltd. | Data organization and access for mixed media document system |
US7812986B2 (en) | 2005-08-23 | 2010-10-12 | Ricoh Co. Ltd. | System and methods for use of voice mail and email in a mixed media environment |
US9373029B2 (en) | 2007-07-11 | 2016-06-21 | Ricoh Co., Ltd. | Invisible junction feature recognition for document security or annotation |
US8005831B2 (en) * | 2005-08-23 | 2011-08-23 | Ricoh Co., Ltd. | System and methods for creation and use of a mixed media environment with geographic location information |
US7702673B2 (en) * | 2004-10-01 | 2010-04-20 | Ricoh Co., Ltd. | System and methods for creation and use of a mixed media environment |
JP4455358B2 (ja) * | 2005-01-31 | 2010-04-21 | キヤノン株式会社 | 画像処理装置およびその方法 |
US10176338B2 (en) * | 2005-11-23 | 2019-01-08 | Salesforce.Com | Secure distributed storage of documents containing restricted information, via the use of keysets |
US8782087B2 (en) | 2005-03-18 | 2014-07-15 | Beyondcore, Inc. | Analyzing large data sets to find deviation patterns |
US10127130B2 (en) | 2005-03-18 | 2018-11-13 | Salesforce.Com | Identifying contributors that explain differences between a data set and a subset of the data set |
US7546524B1 (en) | 2005-03-30 | 2009-06-09 | Amazon Technologies, Inc. | Electronic input device, system, and method using human-comprehensible content to automatically correlate an annotation of a paper document with a digital version of the document |
JP4688542B2 (ja) * | 2005-03-31 | 2011-05-25 | 株式会社日立製作所 | 計算機システム、ホストコンピュータ及びコピーペア処理方法 |
US7570816B2 (en) * | 2005-03-31 | 2009-08-04 | Microsoft Corporation | Systems and methods for detecting text |
US20060242568A1 (en) * | 2005-04-26 | 2006-10-26 | Xerox Corporation | Document image signature identification systems and methods |
US20060282430A1 (en) * | 2005-06-10 | 2006-12-14 | Diamond David L | Fuzzy matching of text at an expected location |
US7526129B2 (en) * | 2005-06-23 | 2009-04-28 | Microsoft Corporation | Lifting ink annotations from paper |
US8762410B2 (en) * | 2005-07-18 | 2014-06-24 | Oracle International Corporation | Document level indexes for efficient processing in multiple tiers of a computer system |
US20070030523A1 (en) * | 2005-08-02 | 2007-02-08 | Kabushiki Kaisha Toshiba | System and method for identifying a submitter of a printed or scanned document |
JP4533273B2 (ja) * | 2005-08-09 | 2010-09-01 | キヤノン株式会社 | 画像処理装置及び画像処理方法、プログラム |
US10733308B2 (en) * | 2005-08-17 | 2020-08-04 | Cambium Learning, Inc. | Tags for unlocking digital content |
US10296854B2 (en) * | 2005-08-17 | 2019-05-21 | Cambium Learning, Inc. | Techniques for protected viewing of digital files |
US7861307B2 (en) * | 2005-08-17 | 2010-12-28 | Kurzweil Educational Systems, Inc. | Unlocking digital content on remote systems |
US9009078B2 (en) * | 2005-08-17 | 2015-04-14 | Kurzweil/Intellitools, Inc. | Optical character recognition technique for protected viewing of digital files |
KR100979457B1 (ko) * | 2005-08-23 | 2010-09-02 | 가부시키가이샤 리코 | 혼합 미디어 환경에서의 이미지 정합 방법 및 시스템 |
WO2007023994A1 (en) * | 2005-08-23 | 2007-03-01 | Ricoh Company, Ltd. | System and methods for creation and use of a mixed media environment |
EP2482210A3 (en) * | 2005-08-23 | 2013-10-16 | Ricoh Company, Ltd. | System and methods for creation and use of a mixed media environment |
JP4897795B2 (ja) * | 2005-08-23 | 2012-03-14 | 株式会社リコー | 処理装置、インデックステーブル作成方法及びコンピュータプログラム |
WO2007023991A1 (en) * | 2005-08-23 | 2007-03-01 | Ricoh Company, Ltd. | Embedding hot spots in electronic documents |
CN101297319B (zh) * | 2005-08-23 | 2013-02-27 | 株式会社理光 | 在电子文档中嵌入热点 |
US7769772B2 (en) * | 2005-08-23 | 2010-08-03 | Ricoh Co., Ltd. | Mixed media reality brokerage network with layout-independent recognition |
US20070061319A1 (en) * | 2005-09-09 | 2007-03-15 | Xerox Corporation | Method for document clustering based on page layout attributes |
US7475072B1 (en) | 2005-09-26 | 2009-01-06 | Quintura, Inc. | Context-based search visualization and context management using neural networks |
US7620607B1 (en) * | 2005-09-26 | 2009-11-17 | Quintura Inc. | System and method for using a bidirectional neural network to identify sentences for use as document annotations |
JP2007102545A (ja) * | 2005-10-05 | 2007-04-19 | Ricoh Co Ltd | 電子文書作成装置、電子文書作成方法及び電子文書作成プログラム |
US8095876B1 (en) | 2005-11-18 | 2012-01-10 | Google Inc. | Identifying a primary version of a document |
US8949455B2 (en) * | 2005-11-21 | 2015-02-03 | Oracle International Corporation | Path-caching mechanism to improve performance of path-related operations in a repository |
JP4742839B2 (ja) * | 2005-12-09 | 2011-08-10 | 富士ゼロックス株式会社 | ワークフロー処理のためのプログラム及びシステム |
KR100767114B1 (ko) * | 2005-12-16 | 2007-10-17 | 삼성전자주식회사 | 인쇄할 문서와 관련문서를 함께 인쇄하는 방법 및 그에사용되는 호스트와 프린터 |
US20070226321A1 (en) * | 2006-03-23 | 2007-09-27 | R R Donnelley & Sons Company | Image based document access and related systems, methods, and devices |
US10152712B2 (en) * | 2006-05-10 | 2018-12-11 | Paypal, Inc. | Inspecting event indicators |
WO2007136870A2 (en) * | 2006-05-19 | 2007-11-29 | Sciencemedia Inc. | Document annotation |
JP2008009572A (ja) * | 2006-06-27 | 2008-01-17 | Fuji Xerox Co Ltd | ドキュメント処理システム、ドキュメント処理方法及びプログラム |
US20080033967A1 (en) * | 2006-07-18 | 2008-02-07 | Ravi Murthy | Semantic aware processing of XML documents |
US8489987B2 (en) | 2006-07-31 | 2013-07-16 | Ricoh Co., Ltd. | Monitoring and analyzing creation and usage of visual content using image and hotspot interaction |
US9020966B2 (en) * | 2006-07-31 | 2015-04-28 | Ricoh Co., Ltd. | Client device for interacting with a mixed media reality recognition system |
US9063952B2 (en) * | 2006-07-31 | 2015-06-23 | Ricoh Co., Ltd. | Mixed media reality recognition with image tracking |
US8073263B2 (en) * | 2006-07-31 | 2011-12-06 | Ricoh Co., Ltd. | Multi-classifier selection and monitoring for MMR-based image recognition |
US9176984B2 (en) * | 2006-07-31 | 2015-11-03 | Ricoh Co., Ltd | Mixed media reality retrieval of differentially-weighted links |
US8676810B2 (en) * | 2006-07-31 | 2014-03-18 | Ricoh Co., Ltd. | Multiple index mixed media reality recognition using unequal priority indexes |
US8201076B2 (en) | 2006-07-31 | 2012-06-12 | Ricoh Co., Ltd. | Capturing symbolic information from documents upon printing |
US20080046738A1 (en) * | 2006-08-04 | 2008-02-21 | Yahoo! Inc. | Anti-phishing agent |
KR100834293B1 (ko) * | 2006-11-06 | 2008-05-30 | 엔에이치엔(주) | 문서 처리 시스템 및 방법 |
JP4310356B2 (ja) * | 2006-11-13 | 2009-08-05 | シャープ株式会社 | 画像処理方法、画像処理装置、画像読取装置、画像形成装置、コンピュータプログラム及び記録媒体 |
JP4352274B2 (ja) * | 2006-11-16 | 2009-10-28 | コニカミノルタビジネステクノロジーズ株式会社 | 画像形成装置及び印刷方法並びに制御プログラム |
US8290311B1 (en) | 2007-01-11 | 2012-10-16 | Proofpoint, Inc. | Apparatus and method for detecting images within spam |
US8290203B1 (en) * | 2007-01-11 | 2012-10-16 | Proofpoint, Inc. | Apparatus and method for detecting images within spam |
US20090232032A1 (en) * | 2007-01-17 | 2009-09-17 | Verbal World, Inc. | Methods and Apparatus for the Manipulation of Conferenced Data |
US7437370B1 (en) * | 2007-02-19 | 2008-10-14 | Quintura, Inc. | Search engine graphical interface using maps and images |
CN101276363B (zh) * | 2007-03-30 | 2011-02-16 | 夏普株式会社 | 文档图像的检索装置及文档图像的检索方法 |
US8254692B2 (en) * | 2007-07-23 | 2012-08-28 | Hewlett-Packard Development Company, L.P. | Document comparison method and apparatus |
US20090031203A1 (en) * | 2007-07-26 | 2009-01-29 | Hewlett-Packard Development Company, L.P. | Hyperlinks |
JP4960796B2 (ja) * | 2007-08-03 | 2012-06-27 | キヤノン株式会社 | 画像処理装置、画像処理方法ならびにそのプログラム及び記憶媒体 |
US8180754B1 (en) | 2008-04-01 | 2012-05-15 | Dranias Development Llc | Semantic neural network for aggregating query searches |
US8166042B1 (en) * | 2008-04-14 | 2012-04-24 | Google Inc. | Height based indexing |
US8724930B2 (en) * | 2008-05-30 | 2014-05-13 | Abbyy Development Llc | Copying system and method |
US8538941B2 (en) * | 2008-07-31 | 2013-09-17 | Adobe Systems Incorporated | Visual information search tool |
US8867779B2 (en) | 2008-08-28 | 2014-10-21 | Microsoft Corporation | Image tagging user interface |
US8396246B2 (en) | 2008-08-28 | 2013-03-12 | Microsoft Corporation | Tagging images with labels |
US8249343B2 (en) * | 2008-10-15 | 2012-08-21 | Xerox Corporation | Representing documents with runlength histograms |
TW201027375A (en) * | 2008-10-20 | 2010-07-16 | Ibm | Search system, search method and program |
JP2010134700A (ja) * | 2008-12-04 | 2010-06-17 | Toshiba Corp | 画像評価装置および画像評価方法 |
US8385660B2 (en) * | 2009-06-24 | 2013-02-26 | Ricoh Co., Ltd. | Mixed media reality indexing and retrieval for repeated content |
US7953679B2 (en) | 2009-07-22 | 2011-05-31 | Xerox Corporation | Scalable indexing for layout based document retrieval and ranking |
US9367523B2 (en) | 2009-09-25 | 2016-06-14 | Adobe Systems Incorporated | System and method for using design features to search for page layout designs |
US8606789B2 (en) | 2010-07-02 | 2013-12-10 | Xerox Corporation | Method for layout based document zone querying |
US9262390B2 (en) * | 2010-09-02 | 2016-02-16 | Lexis Nexis, A Division Of Reed Elsevier Inc. | Methods and systems for annotating electronic documents |
US8559765B2 (en) * | 2011-01-05 | 2013-10-15 | International Business Machines Corporation | System and method for image storage and analysis |
US20120188248A1 (en) * | 2011-01-26 | 2012-07-26 | The Boeing Company | Image Management and Presentation |
US8458796B2 (en) * | 2011-03-08 | 2013-06-04 | Hewlett-Packard Development Company, L.P. | Methods and systems for full pattern matching in hardware |
US9058331B2 (en) | 2011-07-27 | 2015-06-16 | Ricoh Co., Ltd. | Generating a conversation in a social network based on visual search results |
JP5742545B2 (ja) * | 2011-07-27 | 2015-07-01 | ブラザー工業株式会社 | 画像処理プログラム、情報処理装置および画像処理方法 |
CN102955784B (zh) * | 2011-08-19 | 2018-03-06 | 北京百度网讯科技有限公司 | 一种基于数字签名对多个图像进行相似判断的设备和方法 |
US8831350B2 (en) * | 2011-08-29 | 2014-09-09 | Dst Technologies, Inc. | Generation of document fingerprints for identification of electronic document types |
US11055334B2 (en) * | 2011-09-23 | 2021-07-06 | Avaya Inc. | System and method for aligning messages to an event based on semantic similarity |
US9317544B2 (en) * | 2011-10-05 | 2016-04-19 | Microsoft Corporation | Integrated fuzzy joins in database management systems |
US8996350B1 (en) | 2011-11-02 | 2015-03-31 | Dub Software Group, Inc. | System and method for automatic document management |
US10796232B2 (en) | 2011-12-04 | 2020-10-06 | Salesforce.Com, Inc. | Explaining differences between predicted outcomes and actual outcomes of a process |
US10802687B2 (en) | 2011-12-04 | 2020-10-13 | Salesforce.Com, Inc. | Displaying differences between different data sets of a process |
US8687886B2 (en) | 2011-12-29 | 2014-04-01 | Konica Minolta Laboratory U.S.A., Inc. | Method and apparatus for document image indexing and retrieval using multi-level document image structure and local features |
US9111140B2 (en) | 2012-01-10 | 2015-08-18 | Dst Technologies, Inc. | Identification and separation of form and feature elements from handwritten and other user supplied elements |
US8942515B1 (en) * | 2012-10-26 | 2015-01-27 | Lida Huang | Method and apparatus for image retrieval |
US9906608B2 (en) * | 2013-04-30 | 2018-02-27 | International Business Machines Corporation | Intelligent adaptation of mobile applications based on constraints and contexts |
JP6242087B2 (ja) * | 2013-06-07 | 2017-12-06 | キヤノン株式会社 | 文書管理サーバ、文書管理方法、コンピュータプログラム |
CN104376317B (zh) * | 2013-08-12 | 2018-12-14 | 福建福昕软件开发股份有限公司北京分公司 | 一种将纸质文件转换为电子文件的方法 |
US20150163545A1 (en) * | 2013-12-11 | 2015-06-11 | Echostar Technologies L.L.C. | Identification of video content segments based on signature analysis of the video content |
WO2015175824A1 (en) * | 2014-05-16 | 2015-11-19 | AppCard, Inc. | Method and system for improved optical character recognition |
KR101713197B1 (ko) * | 2015-04-01 | 2017-03-09 | 주식회사 씨케이앤비 | 서버 컴퓨팅 장치 및 이를 이용한 콘텐츠 인식 기반의 영상 검색 시스템 |
US9411547B1 (en) * | 2015-07-28 | 2016-08-09 | Dst Technologies, Inc. | Compensation for print shift in standardized forms to facilitate extraction of data therefrom |
US10095920B2 (en) * | 2016-07-28 | 2018-10-09 | Intuit Inc | Optical character recognition utilizing hashed templates |
US11416680B2 (en) * | 2016-08-18 | 2022-08-16 | Sap Se | Classifying social media inputs via parts-of-speech filtering |
JP6906946B2 (ja) * | 2016-12-22 | 2021-07-21 | キヤノン株式会社 | 情報処理装置、その制御方法、及びプログラム |
GB201708767D0 (en) * | 2017-06-01 | 2017-07-19 | Microsoft Technology Licensing Llc | Managing electronic documents |
US11106867B2 (en) | 2017-08-15 | 2021-08-31 | Oracle International Corporation | Techniques for document marker tracking |
US10599761B2 (en) * | 2017-09-07 | 2020-03-24 | Qualtrics, Llc | Digitally converting physical document forms to electronic surveys |
CN109740007B (zh) * | 2018-08-27 | 2022-03-11 | 广州麦仑信息科技有限公司 | 一种基于图像特征签名的静脉图像快速检索方法 |
US11755659B2 (en) * | 2018-10-04 | 2023-09-12 | Resonac Corporation | Document search device, document search program, and document search method |
CN109167977B (zh) * | 2018-10-28 | 2020-10-23 | 广州中元软件有限公司 | 一种监控视频仿生长期保存方法 |
CN109933691B (zh) * | 2019-02-11 | 2023-06-09 | 北京百度网讯科技有限公司 | 用于内容检索的方法、装置、设备和存储介质 |
CN109960737B (zh) * | 2019-03-15 | 2020-12-08 | 西安电子科技大学 | 半监督深度对抗自编码哈希学习的遥感影像内容检索方法 |
CN109960738B (zh) * | 2019-03-15 | 2020-12-08 | 西安电子科技大学 | 基于深度对抗哈希学习的大规模遥感影像内容检索方法 |
US11449545B2 (en) | 2019-05-13 | 2022-09-20 | Snap Inc. | Deduplication of media file search results |
US20210319136A1 (en) * | 2020-04-02 | 2021-10-14 | UST Global (Singapore) Pte. Ltd. | Verifying authenticity of content of electronic documents |
CN116075820A (zh) * | 2020-05-29 | 2023-05-05 | 卡米洛英国竞标有限公司 | 用于搜索图像数据库的方法、非暂时性计算机可读存储介质和设备 |
US11734445B2 (en) * | 2020-12-02 | 2023-08-22 | International Business Machines Corporation | Document access control based on document component layouts |
JP2022170799A (ja) * | 2021-04-30 | 2022-11-11 | コニカミノルタ株式会社 | 文書検索システム、文書検索方法および文書検索プログラム |
US11783605B1 (en) * | 2022-06-30 | 2023-10-10 | Intuit, Inc. | Generalizable key-value set extraction from documents using machine learning models |
Family Cites Families (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US1095393A (en) * | 1912-12-07 | 1914-05-05 | Harry C Gerlach | Safety switch-chain. |
US1165070A (en) * | 1915-04-12 | 1915-12-21 | Fred O Lake | Knife-sharpener. |
US1171064A (en) * | 1915-08-11 | 1916-02-08 | Lyon Metallic Mfg Company | Shelving. |
JPS5035379B1 (zh) | 1970-05-25 | 1975-11-15 | ||
US4955066A (en) | 1989-10-13 | 1990-09-04 | Microsoft Corporation | Compressing and decompressing text files |
US5109433A (en) | 1989-10-13 | 1992-04-28 | Microsoft Corporation | Compressing and decompressing text files |
US5181255A (en) * | 1990-12-13 | 1993-01-19 | Xerox Corporation | Segmentation of handwriting and machine printed text |
US5526444A (en) | 1991-12-10 | 1996-06-11 | Xerox Corporation | Document image decoding using modified branch-and-bound methods |
US5499294A (en) * | 1993-11-24 | 1996-03-12 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | Digital camera with apparatus for authentication of images produced from an image file |
US6869023B2 (en) * | 2002-02-12 | 2005-03-22 | Digimarc Corporation | Linking documents through digital watermarking |
US5465353A (en) | 1994-04-01 | 1995-11-07 | Ricoh Company, Ltd. | Image matching and retrieval by multi-access redundant hashing |
US5542006A (en) | 1994-06-21 | 1996-07-30 | Eastman Kodak Company | Neural network based character position detector for use in optical character recognition |
US5594809A (en) | 1995-04-28 | 1997-01-14 | Xerox Corporation | Automatic training of character templates using a text line image, a text line transcription and a line image source model |
US5812698A (en) | 1995-05-12 | 1998-09-22 | Synaptics, Inc. | Handwriting recognition system and method |
US5867597A (en) | 1995-09-05 | 1999-02-02 | Ricoh Corporation | High-speed retrieval by example |
US6658623B1 (en) | 1997-09-15 | 2003-12-02 | Fuji Xerox Co., Ltd. | Displaying in a first document a selectable link to a second document based on a passive query |
US6587217B1 (en) | 1997-09-15 | 2003-07-01 | International Business Machines Corporation | Method for organizing files in a library in a network printing system |
JPH1178176A (ja) * | 1997-09-17 | 1999-03-23 | Seiko Epson Corp | 印刷物発行管理システム、印刷物発行管理方法及びプリンタ |
US6009198A (en) * | 1997-11-21 | 1999-12-28 | Xerox Corporation | Method for matching perceptual shape similarity layouts across multiple 2D objects |
US7062497B2 (en) | 1998-01-22 | 2006-06-13 | Adobe Systems Incorporated | Maintaining document state history |
US6487301B1 (en) * | 1998-04-30 | 2002-11-26 | Mediasec Technologies Llc | Digital authentication with digital and analog documents |
JPH11328417A (ja) * | 1998-05-20 | 1999-11-30 | Toshiba Corp | 画像処理装置、画像処理方法及び画像処理プログラムを記録したコンピュータ読み取り可能な記録媒体 |
US6523134B2 (en) | 1998-09-18 | 2003-02-18 | International Business Machines Corporation | Selective undo |
US6363381B1 (en) | 1998-11-03 | 2002-03-26 | Ricoh Co., Ltd. | Compressed document matching |
US6580806B1 (en) * | 1998-11-20 | 2003-06-17 | Canon Kabushiki Kaisha | Image processing apparatus, image processing method and storage |
US6397212B1 (en) | 1999-03-04 | 2002-05-28 | Peter Biffar | Self-learning and self-personalizing knowledge search engine that delivers holistic results |
US6546385B1 (en) | 1999-08-13 | 2003-04-08 | International Business Machines Corporation | Method and apparatus for indexing and searching content in hardcopy documents |
US6470094B1 (en) * | 2000-03-14 | 2002-10-22 | Intel Corporation | Generalized text localization in images |
US6594393B1 (en) | 2000-05-12 | 2003-07-15 | Thomas P. Minka | Dynamic programming operation with skip mode for text line image decoding |
US7058223B2 (en) | 2000-09-14 | 2006-06-06 | Cox Ingemar J | Identifying works for initiating a work-based action, such as an action on the internet |
US6928548B1 (en) * | 2000-09-29 | 2005-08-09 | Intel Corporation | System and method for verifying the integrity of stored information within an electronic device |
US7266765B2 (en) | 2001-08-31 | 2007-09-04 | Fuji Xerox Co., Ltd. | Detection and processing of annotated anchors |
US7747943B2 (en) | 2001-09-07 | 2010-06-29 | Microsoft Corporation | Robust anchoring of annotations to content |
GB2380277B (en) | 2001-09-28 | 2005-12-14 | Hewlett Packard Co | A solid state memory device and a method of document reproduction |
US7120299B2 (en) | 2001-12-28 | 2006-10-10 | Intel Corporation | Recognizing commands written onto a medium |
WO2003063067A1 (en) * | 2002-01-24 | 2003-07-31 | Chatterbox Systems, Inc. | Method and system for locating positions in printed texts and delivering multimedia information |
CA2375355A1 (en) | 2002-03-11 | 2003-09-11 | Neo Systems Inc. | Character recognition system and method |
US7243301B2 (en) | 2002-04-10 | 2007-07-10 | Microsoft Corporation | Common annotation framework |
JP2003337683A (ja) * | 2002-05-17 | 2003-11-28 | Fuji Xerox Co Ltd | 印刷物発行管理システム、印刷物検証装置、コンテンツ管理装置 |
JP2004040246A (ja) * | 2002-06-28 | 2004-02-05 | Canon Inc | 情報処理装置、情報処理方法 |
US7360093B2 (en) * | 2002-07-22 | 2008-04-15 | Xerox Corporation | System and method for authentication of JPEG image data |
US20040090439A1 (en) | 2002-11-07 | 2004-05-13 | Holger Dillner | Recognition and interpretation of graphical and diagrammatic representations |
JP2004180278A (ja) * | 2002-11-15 | 2004-06-24 | Canon Inc | 情報処理装置、サーバ装置、電子データ管理システム、情報処理システム、情報処理方法、コンピュータプログラム及びコンピュータ読み取り可能な記憶媒体 |
US7486294B2 (en) | 2003-03-27 | 2009-02-03 | Microsoft Corporation | Vector graphics element-based model, application programming interface, and markup language |
US7218783B2 (en) | 2003-06-13 | 2007-05-15 | Microsoft Corporation | Digital ink annotation process and system for recognizing, anchoring and reflowing digital ink annotations |
US7475061B2 (en) | 2004-01-15 | 2009-01-06 | Microsoft Corporation | Image-based document indexing and retrieval |
US7729538B2 (en) | 2004-08-26 | 2010-06-01 | Microsoft Corporation | Spatial recognition and grouping of text and graphics |
US7574048B2 (en) | 2004-09-03 | 2009-08-11 | Microsoft Corporation | Freeform digital ink annotation recognition |
US7570816B2 (en) * | 2005-03-31 | 2009-08-04 | Microsoft Corporation | Systems and methods for detecting text |
US7526129B2 (en) * | 2005-06-23 | 2009-04-28 | Microsoft Corporation | Lifting ink annotations from paper |
-
2004
- 2004-01-15 US US10/758,370 patent/US7475061B2/en active Active
-
2005
- 2005-01-14 KR KR1020050003672A patent/KR101027851B1/ko not_active IP Right Cessation
- 2005-01-14 EP EP05000750A patent/EP1555626A3/en not_active Ceased
- 2005-01-17 JP JP2005009686A patent/JP4718841B2/ja not_active Expired - Fee Related
- 2005-01-17 CN CNB2005100062210A patent/CN100565506C/zh active Active
Non-Patent Citations (4)
Title |
---|
Block Selection:A Method for Segmenting Page ImageofVarious Editing Styles. Shin-Ywan Wang,Toshiaki Yagasaki.IEEE. 1995 |
Block Selection:A Method for Segmenting Page ImageofVarious Editing Styles. Shin-Ywan Wang,Toshiaki Yagasaki.IEEE. 1995 * |
Document Image Matching Techniques. Jonathan J.Hull,John Cullen,Mark Peairs.Symposium on Document Image Understanding Technology. 1997 |
Document Image Matching Techniques. Jonathan J.Hull,John Cullen,Mark Peairs.Symposium on Document Image Understanding Technology. 1997 * |
Also Published As
Publication number | Publication date |
---|---|
EP1555626A3 (en) | 2006-02-15 |
JP4718841B2 (ja) | 2011-07-06 |
CN1641646A (zh) | 2005-07-20 |
KR101027851B1 (ko) | 2011-04-07 |
US20050165747A1 (en) | 2005-07-28 |
KR20050075301A (ko) | 2005-07-20 |
EP1555626A2 (en) | 2005-07-20 |
JP2005251169A (ja) | 2005-09-15 |
US7475061B2 (en) | 2009-01-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100565506C (zh) | 基于图像文档的索引和检索 | |
US8538184B2 (en) | Systems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category | |
Nagy | Twenty years of document image analysis in PAMI | |
US8897563B1 (en) | Systems and methods for automatically processing electronic documents | |
US7630962B2 (en) | Electronic filing system searchable by a handwritten search query | |
US7593961B2 (en) | Information processing apparatus for retrieving image data similar to an entered image | |
JP4533273B2 (ja) | 画像処理装置及び画像処理方法、プログラム | |
US6178417B1 (en) | Method and means of matching documents based on text genre | |
CN100474880C (zh) | 图像处理装置及其方法 | |
EP1917627B1 (en) | Classifying regions defined within a digital image | |
JP2011018316A (ja) | 文書区分識別用の区分モデルを生成するための方法及びプログラム、文書の区分を識別するための方法及びプログラム、及び画像処理システム | |
EP2106599A2 (en) | Feature matching method | |
CN104346415A (zh) | 图像文档命名的方法 | |
Jain et al. | Practicing vision: Integration, evaluation and applications | |
US20150169510A1 (en) | Method and system of extracting structured data from a document | |
CN112464907A (zh) | 一种文档处理系统及方法 | |
Nagy | Document analysis systems that improve with use | |
Grana et al. | Picture extraction from digitized historical manuscripts | |
JPH1063813A (ja) | イメージ文書管理方法及びその装置 | |
Sarkar et al. | Perceptual organization in semantic role labeling | |
Philip et al. | Development of an image retrieval model for biomedical image databases | |
Downton et al. | User-configurable OCR enhancement for online natural history archives | |
Syeda-Mahmood | Locating indexing structures in engineering drawing databases using location hashing | |
Suda et al. | How can document analysis help in capturing five million pages? | |
Bruce et al. | The DocBrowse system for information retrieval from document image data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: MICROSOFT TECHNOLOGY LICENSING LLC Free format text: FORMER OWNER: MICROSOFT CORP. Effective date: 20150429 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20150429 Address after: Washington State Patentee after: Micro soft technique license Co., Ltd Address before: Washington State Patentee before: Microsoft Corp. |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20160801 Address after: Grand Cayman, Georgetown, Cayman Islands Patentee after: IValley Holding Co., Ltd. Address before: Washington State Patentee before: Micro soft technique license Co., Ltd |