CN1641646A - 基于图像文档的索引和检索 - Google Patents
基于图像文档的索引和检索 Download PDFInfo
- Publication number
- CN1641646A CN1641646A CNA2005100062210A CN200510006221A CN1641646A CN 1641646 A CN1641646 A CN 1641646A CN A2005100062210 A CNA2005100062210 A CN A2005100062210A CN 200510006221 A CN200510006221 A CN 200510006221A CN 1641646 A CN1641646 A CN 1641646A
- Authority
- CN
- China
- Prior art keywords
- image
- document
- signature
- word
- produced
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/5846—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
Abstract
Description
Claims (42)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/758,370 | 2004-01-15 | ||
US10/758,370 US7475061B2 (en) | 2004-01-15 | 2004-01-15 | Image-based document indexing and retrieval |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1641646A true CN1641646A (zh) | 2005-07-20 |
CN100565506C CN100565506C (zh) | 2009-12-02 |
Family
ID=34620698
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2005100062210A Active CN100565506C (zh) | 2004-01-15 | 2005-01-17 | 基于图像文档的索引和检索 |
Country Status (5)
Country | Link |
---|---|
US (1) | US7475061B2 (zh) |
EP (1) | EP1555626A3 (zh) |
JP (1) | JP4718841B2 (zh) |
KR (1) | KR101027851B1 (zh) |
CN (1) | CN100565506C (zh) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101276363B (zh) * | 2007-03-30 | 2011-02-16 | 夏普株式会社 | 文档图像的检索装置及文档图像的检索方法 |
CN102693253A (zh) * | 2011-01-26 | 2012-09-26 | 波音公司 | 图像管理和呈现 |
CN101292258B (zh) * | 2005-08-23 | 2012-11-21 | 株式会社理光 | 混合介质环境的创建和使用的系统和方法 |
CN102955784A (zh) * | 2011-08-19 | 2013-03-06 | 北京百度网讯科技有限公司 | 一种基于数字签名对多个图像进行相似判断的设备和方法 |
US8867779B2 (en) | 2008-08-28 | 2014-10-21 | Microsoft Corporation | Image tagging user interface |
US9020183B2 (en) | 2008-08-28 | 2015-04-28 | Microsoft Technology Licensing, Llc | Tagging images with labels |
CN109167977A (zh) * | 2018-10-28 | 2019-01-08 | 广州中元软件有限公司 | 一种监控视频仿生长期保存方法 |
CN109740007A (zh) * | 2018-08-27 | 2019-05-10 | 广州麦仑信息科技有限公司 | 一种基于图像特征签名的静脉图像快速检索方法 |
CN109933691A (zh) * | 2019-02-11 | 2019-06-25 | 北京百度网讯科技有限公司 | 用于内容检索的方法、装置、设备和存储介质 |
Families Citing this family (152)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7082427B1 (en) * | 2000-05-24 | 2006-07-25 | Reachforce, Inc. | Text indexing system to index, query the archive database document by keyword data representing the content of the documents and by contact data associated with the participant who generated the document |
US7330850B1 (en) | 2000-10-04 | 2008-02-12 | Reachforce, Inc. | Text mining system for web-based business intelligence applied to web site server logs |
US8694510B2 (en) * | 2003-09-04 | 2014-04-08 | Oracle International Corporation | Indexing XML documents efficiently |
US8229932B2 (en) * | 2003-09-04 | 2012-07-24 | Oracle International Corporation | Storing XML documents efficiently in an RDBMS |
US7475061B2 (en) | 2004-01-15 | 2009-01-06 | Microsoft Corporation | Image-based document indexing and retrieval |
JP4380400B2 (ja) * | 2004-04-16 | 2009-12-09 | キヤノン株式会社 | 文書処理装置及びその制御方法、並びにコンピュータプログラム |
JP2007538320A (ja) * | 2004-05-18 | 2007-12-27 | シルバーブルック リサーチ ピーティワイ リミテッド | 製品アイテムを追跡する方法およびコンピュータシステム |
US7729538B2 (en) * | 2004-08-26 | 2010-06-01 | Microsoft Corporation | Spatial recognition and grouping of text and graphics |
US7574048B2 (en) * | 2004-09-03 | 2009-08-11 | Microsoft Corporation | Freeform digital ink annotation recognition |
US7523098B2 (en) * | 2004-09-15 | 2009-04-21 | International Business Machines Corporation | Systems and methods for efficient data searching, storage and reduction |
US8725705B2 (en) * | 2004-09-15 | 2014-05-13 | International Business Machines Corporation | Systems and methods for searching of storage data with reduced bandwidth requirements |
US9405751B2 (en) * | 2005-08-23 | 2016-08-02 | Ricoh Co., Ltd. | Database for mixed media document system |
US7702673B2 (en) * | 2004-10-01 | 2010-04-20 | Ricoh Co., Ltd. | System and methods for creation and use of a mixed media environment |
US9171202B2 (en) * | 2005-08-23 | 2015-10-27 | Ricoh Co., Ltd. | Data organization and access for mixed media document system |
US8369655B2 (en) * | 2006-07-31 | 2013-02-05 | Ricoh Co., Ltd. | Mixed media reality recognition using multiple specialized indexes |
US8005831B2 (en) * | 2005-08-23 | 2011-08-23 | Ricoh Co., Ltd. | System and methods for creation and use of a mixed media environment with geographic location information |
US8600989B2 (en) | 2004-10-01 | 2013-12-03 | Ricoh Co., Ltd. | Method and system for image matching in a mixed media environment |
US9530050B1 (en) * | 2007-07-11 | 2016-12-27 | Ricoh Co., Ltd. | Document annotation sharing |
US8385589B2 (en) * | 2008-05-15 | 2013-02-26 | Berna Erol | Web-based content detection in images, extraction and recognition |
US8856108B2 (en) * | 2006-07-31 | 2014-10-07 | Ricoh Co., Ltd. | Combining results of image retrieval processes |
US8156116B2 (en) | 2006-07-31 | 2012-04-10 | Ricoh Co., Ltd | Dynamic presentation of targeted information in a mixed media reality recognition system |
US8949287B2 (en) * | 2005-08-23 | 2015-02-03 | Ricoh Co., Ltd. | Embedding hot spots in imaged documents |
US7970171B2 (en) * | 2007-01-18 | 2011-06-28 | Ricoh Co., Ltd. | Synthetic image and video generation from ground truth data |
US8335789B2 (en) * | 2004-10-01 | 2012-12-18 | Ricoh Co., Ltd. | Method and system for document fingerprint matching in a mixed media environment |
US7669148B2 (en) * | 2005-08-23 | 2010-02-23 | Ricoh Co., Ltd. | System and methods for portable device for mixed media system |
US8156427B2 (en) | 2005-08-23 | 2012-04-10 | Ricoh Co. Ltd. | User interface for mixed media reality |
US8521737B2 (en) * | 2004-10-01 | 2013-08-27 | Ricoh Co., Ltd. | Method and system for multi-tier image matching in a mixed media environment |
US8838591B2 (en) * | 2005-08-23 | 2014-09-16 | Ricoh Co., Ltd. | Embedding hot spots in electronic documents |
US9373029B2 (en) | 2007-07-11 | 2016-06-21 | Ricoh Co., Ltd. | Invisible junction feature recognition for document security or annotation |
US9384619B2 (en) * | 2006-07-31 | 2016-07-05 | Ricoh Co., Ltd. | Searching media content for objects specified using identifiers |
US7551780B2 (en) | 2005-08-23 | 2009-06-23 | Ricoh Co., Ltd. | System and method for using individualized mixed document |
US7885955B2 (en) * | 2005-08-23 | 2011-02-08 | Ricoh Co. Ltd. | Shared document annotation |
US7917554B2 (en) * | 2005-08-23 | 2011-03-29 | Ricoh Co. Ltd. | Visibly-perceptible hot spots in documents |
US8195659B2 (en) * | 2005-08-23 | 2012-06-05 | Ricoh Co. Ltd. | Integration and use of mixed media documents |
US7991778B2 (en) * | 2005-08-23 | 2011-08-02 | Ricoh Co., Ltd. | Triggering actions with captured input in a mixed media environment |
US8276088B2 (en) | 2007-07-11 | 2012-09-25 | Ricoh Co., Ltd. | User interface for three-dimensional navigation |
US8868555B2 (en) | 2006-07-31 | 2014-10-21 | Ricoh Co., Ltd. | Computation of a recongnizability score (quality predictor) for image retrieval |
US10192279B1 (en) * | 2007-07-11 | 2019-01-29 | Ricoh Co., Ltd. | Indexed document modification sharing with mixed media reality |
US7587412B2 (en) * | 2005-08-23 | 2009-09-08 | Ricoh Company, Ltd. | Mixed media reality brokerage network and methods of use |
US8144921B2 (en) | 2007-07-11 | 2012-03-27 | Ricoh Co., Ltd. | Information retrieval using invisible junctions and geometric constraints |
US7639387B2 (en) * | 2005-08-23 | 2009-12-29 | Ricoh Co., Ltd. | Authoring tools using a mixed media environment |
US7812986B2 (en) | 2005-08-23 | 2010-10-12 | Ricoh Co. Ltd. | System and methods for use of voice mail and email in a mixed media environment |
US7920759B2 (en) * | 2005-08-23 | 2011-04-05 | Ricoh Co. Ltd. | Triggering applications for distributed action execution and use of mixed media recognition as a control input |
US8184155B2 (en) * | 2007-07-11 | 2012-05-22 | Ricoh Co. Ltd. | Recognition and tracking using invisible junctions |
US8086038B2 (en) * | 2007-07-11 | 2011-12-27 | Ricoh Co., Ltd. | Invisible junction features for patch recognition |
US8176054B2 (en) * | 2007-07-12 | 2012-05-08 | Ricoh Co. Ltd | Retrieving electronic documents by converting them to synthetic text |
US7672543B2 (en) * | 2005-08-23 | 2010-03-02 | Ricoh Co., Ltd. | Triggering applications based on a captured text in a mixed media environment |
US8332401B2 (en) * | 2004-10-01 | 2012-12-11 | Ricoh Co., Ltd | Method and system for position-based image matching in a mixed media environment |
US8510283B2 (en) * | 2006-07-31 | 2013-08-13 | Ricoh Co., Ltd. | Automatic adaption of an image recognition system to image capture devices |
US8825682B2 (en) | 2006-07-31 | 2014-09-02 | Ricoh Co., Ltd. | Architecture for mixed media reality retrieval of locations and registration of images |
US8489583B2 (en) * | 2004-10-01 | 2013-07-16 | Ricoh Company, Ltd. | Techniques for retrieving documents using an image capture device |
JP4455358B2 (ja) * | 2005-01-31 | 2010-04-21 | キヤノン株式会社 | 画像処理装置およびその方法 |
US10127130B2 (en) | 2005-03-18 | 2018-11-13 | Salesforce.Com | Identifying contributors that explain differences between a data set and a subset of the data set |
US10176338B2 (en) * | 2005-11-23 | 2019-01-08 | Salesforce.Com | Secure distributed storage of documents containing restricted information, via the use of keysets |
US8782087B2 (en) | 2005-03-18 | 2014-07-15 | Beyondcore, Inc. | Analyzing large data sets to find deviation patterns |
US7546524B1 (en) * | 2005-03-30 | 2009-06-09 | Amazon Technologies, Inc. | Electronic input device, system, and method using human-comprehensible content to automatically correlate an annotation of a paper document with a digital version of the document |
US7570816B2 (en) * | 2005-03-31 | 2009-08-04 | Microsoft Corporation | Systems and methods for detecting text |
JP4688542B2 (ja) * | 2005-03-31 | 2011-05-25 | 株式会社日立製作所 | 計算機システム、ホストコンピュータ及びコピーペア処理方法 |
US20060242568A1 (en) * | 2005-04-26 | 2006-10-26 | Xerox Corporation | Document image signature identification systems and methods |
US20060282430A1 (en) * | 2005-06-10 | 2006-12-14 | Diamond David L | Fuzzy matching of text at an expected location |
US7526129B2 (en) * | 2005-06-23 | 2009-04-28 | Microsoft Corporation | Lifting ink annotations from paper |
US8762410B2 (en) * | 2005-07-18 | 2014-06-24 | Oracle International Corporation | Document level indexes for efficient processing in multiple tiers of a computer system |
US20070030523A1 (en) * | 2005-08-02 | 2007-02-08 | Kabushiki Kaisha Toshiba | System and method for identifying a submitter of a printed or scanned document |
JP4533273B2 (ja) * | 2005-08-09 | 2010-09-01 | キヤノン株式会社 | 画像処理装置及び画像処理方法、プログラム |
US7861307B2 (en) * | 2005-08-17 | 2010-12-28 | Kurzweil Educational Systems, Inc. | Unlocking digital content on remote systems |
US10296854B2 (en) * | 2005-08-17 | 2019-05-21 | Cambium Learning, Inc. | Techniques for protected viewing of digital files |
US10733308B2 (en) * | 2005-08-17 | 2020-08-04 | Cambium Learning, Inc. | Tags for unlocking digital content |
US9009078B2 (en) * | 2005-08-17 | 2015-04-14 | Kurzweil/Intellitools, Inc. | Optical character recognition technique for protected viewing of digital files |
KR100980748B1 (ko) * | 2005-08-23 | 2010-09-07 | 가부시키가이샤 리코 | 혼합 미디어 환경을 생성 및 사용하는 시스템 및 방법 |
US7769772B2 (en) * | 2005-08-23 | 2010-08-03 | Ricoh Co., Ltd. | Mixed media reality brokerage network with layout-independent recognition |
EP1917635A4 (en) * | 2005-08-23 | 2008-12-03 | Ricoh Kk | INSERTING HOT POINTS IN ELECTRONIC DOCUMENTS |
EP2482210A3 (en) * | 2005-08-23 | 2013-10-16 | Ricoh Company, Ltd. | System and methods for creation and use of a mixed media environment |
EP1917637A4 (en) * | 2005-08-23 | 2008-12-03 | Ricoh Kk | DATA ORGANIZATION AND ACCESS FOR A MIXED MEDIA DOCUMENT SYSTEM |
WO2007023992A1 (en) * | 2005-08-23 | 2007-03-01 | Ricoh Company, Ltd. | Method and system for image matching in a mixed media environment |
US20070061319A1 (en) * | 2005-09-09 | 2007-03-15 | Xerox Corporation | Method for document clustering based on page layout attributes |
US7475072B1 (en) | 2005-09-26 | 2009-01-06 | Quintura, Inc. | Context-based search visualization and context management using neural networks |
US7620607B1 (en) * | 2005-09-26 | 2009-11-17 | Quintura Inc. | System and method for using a bidirectional neural network to identify sentences for use as document annotations |
JP2007102545A (ja) * | 2005-10-05 | 2007-04-19 | Ricoh Co Ltd | 電子文書作成装置、電子文書作成方法及び電子文書作成プログラム |
US8095876B1 (en) | 2005-11-18 | 2012-01-10 | Google Inc. | Identifying a primary version of a document |
US8949455B2 (en) | 2005-11-21 | 2015-02-03 | Oracle International Corporation | Path-caching mechanism to improve performance of path-related operations in a repository |
JP4742839B2 (ja) * | 2005-12-09 | 2011-08-10 | 富士ゼロックス株式会社 | ワークフロー処理のためのプログラム及びシステム |
KR100767114B1 (ko) * | 2005-12-16 | 2007-10-17 | 삼성전자주식회사 | 인쇄할 문서와 관련문서를 함께 인쇄하는 방법 및 그에사용되는 호스트와 프린터 |
US20070226321A1 (en) * | 2006-03-23 | 2007-09-27 | R R Donnelley & Sons Company | Image based document access and related systems, methods, and devices |
US10152712B2 (en) * | 2006-05-10 | 2018-12-11 | Paypal, Inc. | Inspecting event indicators |
CA2652986A1 (en) * | 2006-05-19 | 2007-11-29 | Sciencemedia Inc. | Interactive learning and assessment platform |
JP2008009572A (ja) * | 2006-06-27 | 2008-01-17 | Fuji Xerox Co Ltd | ドキュメント処理システム、ドキュメント処理方法及びプログラム |
US20080033967A1 (en) * | 2006-07-18 | 2008-02-07 | Ravi Murthy | Semantic aware processing of XML documents |
US9176984B2 (en) * | 2006-07-31 | 2015-11-03 | Ricoh Co., Ltd | Mixed media reality retrieval of differentially-weighted links |
US9063952B2 (en) * | 2006-07-31 | 2015-06-23 | Ricoh Co., Ltd. | Mixed media reality recognition with image tracking |
US8489987B2 (en) | 2006-07-31 | 2013-07-16 | Ricoh Co., Ltd. | Monitoring and analyzing creation and usage of visual content using image and hotspot interaction |
US9020966B2 (en) * | 2006-07-31 | 2015-04-28 | Ricoh Co., Ltd. | Client device for interacting with a mixed media reality recognition system |
US8201076B2 (en) | 2006-07-31 | 2012-06-12 | Ricoh Co., Ltd. | Capturing symbolic information from documents upon printing |
US8676810B2 (en) * | 2006-07-31 | 2014-03-18 | Ricoh Co., Ltd. | Multiple index mixed media reality recognition using unequal priority indexes |
US8073263B2 (en) * | 2006-07-31 | 2011-12-06 | Ricoh Co., Ltd. | Multi-classifier selection and monitoring for MMR-based image recognition |
US20080046738A1 (en) * | 2006-08-04 | 2008-02-21 | Yahoo! Inc. | Anti-phishing agent |
KR100834293B1 (ko) * | 2006-11-06 | 2008-05-30 | 엔에이치엔(주) | 문서 처리 시스템 및 방법 |
JP4310356B2 (ja) * | 2006-11-13 | 2009-08-05 | シャープ株式会社 | 画像処理方法、画像処理装置、画像読取装置、画像形成装置、コンピュータプログラム及び記録媒体 |
JP4352274B2 (ja) * | 2006-11-16 | 2009-10-28 | コニカミノルタビジネステクノロジーズ株式会社 | 画像形成装置及び印刷方法並びに制御プログラム |
US8290203B1 (en) * | 2007-01-11 | 2012-10-16 | Proofpoint, Inc. | Apparatus and method for detecting images within spam |
US8290311B1 (en) | 2007-01-11 | 2012-10-16 | Proofpoint, Inc. | Apparatus and method for detecting images within spam |
US20090232032A1 (en) * | 2007-01-17 | 2009-09-17 | Verbal World, Inc. | Methods and Apparatus for the Manipulation of Conferenced Data |
US7437370B1 (en) * | 2007-02-19 | 2008-10-14 | Quintura, Inc. | Search engine graphical interface using maps and images |
US8254692B2 (en) * | 2007-07-23 | 2012-08-28 | Hewlett-Packard Development Company, L.P. | Document comparison method and apparatus |
US20090031203A1 (en) * | 2007-07-26 | 2009-01-29 | Hewlett-Packard Development Company, L.P. | Hyperlinks |
JP4960796B2 (ja) * | 2007-08-03 | 2012-06-27 | キヤノン株式会社 | 画像処理装置、画像処理方法ならびにそのプログラム及び記憶媒体 |
US8180754B1 (en) | 2008-04-01 | 2012-05-15 | Dranias Development Llc | Semantic neural network for aggregating query searches |
US8166042B1 (en) * | 2008-04-14 | 2012-04-24 | Google Inc. | Height based indexing |
US8724930B2 (en) * | 2008-05-30 | 2014-05-13 | Abbyy Development Llc | Copying system and method |
US8538941B2 (en) * | 2008-07-31 | 2013-09-17 | Adobe Systems Incorporated | Visual information search tool |
US8249343B2 (en) * | 2008-10-15 | 2012-08-21 | Xerox Corporation | Representing documents with runlength histograms |
TW201027375A (en) * | 2008-10-20 | 2010-07-16 | Ibm | Search system, search method and program |
JP2010134700A (ja) * | 2008-12-04 | 2010-06-17 | Toshiba Corp | 画像評価装置および画像評価方法 |
US8385660B2 (en) * | 2009-06-24 | 2013-02-26 | Ricoh Co., Ltd. | Mixed media reality indexing and retrieval for repeated content |
US7953679B2 (en) | 2009-07-22 | 2011-05-31 | Xerox Corporation | Scalable indexing for layout based document retrieval and ranking |
US9367523B2 (en) | 2009-09-25 | 2016-06-14 | Adobe Systems Incorporated | System and method for using design features to search for page layout designs |
US8606789B2 (en) | 2010-07-02 | 2013-12-10 | Xerox Corporation | Method for layout based document zone querying |
US9262390B2 (en) | 2010-09-02 | 2016-02-16 | Lexis Nexis, A Division Of Reed Elsevier Inc. | Methods and systems for annotating electronic documents |
US8559765B2 (en) | 2011-01-05 | 2013-10-15 | International Business Machines Corporation | System and method for image storage and analysis |
US8458796B2 (en) | 2011-03-08 | 2013-06-04 | Hewlett-Packard Development Company, L.P. | Methods and systems for full pattern matching in hardware |
US9058331B2 (en) | 2011-07-27 | 2015-06-16 | Ricoh Co., Ltd. | Generating a conversation in a social network based on visual search results |
JP5742545B2 (ja) * | 2011-07-27 | 2015-07-01 | ブラザー工業株式会社 | 画像処理プログラム、情報処理装置および画像処理方法 |
US8831350B2 (en) | 2011-08-29 | 2014-09-09 | Dst Technologies, Inc. | Generation of document fingerprints for identification of electronic document types |
US11055334B2 (en) * | 2011-09-23 | 2021-07-06 | Avaya Inc. | System and method for aligning messages to an event based on semantic similarity |
US9317544B2 (en) * | 2011-10-05 | 2016-04-19 | Microsoft Corporation | Integrated fuzzy joins in database management systems |
US8996350B1 (en) | 2011-11-02 | 2015-03-31 | Dub Software Group, Inc. | System and method for automatic document management |
US10796232B2 (en) | 2011-12-04 | 2020-10-06 | Salesforce.Com, Inc. | Explaining differences between predicted outcomes and actual outcomes of a process |
US10802687B2 (en) | 2011-12-04 | 2020-10-13 | Salesforce.Com, Inc. | Displaying differences between different data sets of a process |
US8687886B2 (en) | 2011-12-29 | 2014-04-01 | Konica Minolta Laboratory U.S.A., Inc. | Method and apparatus for document image indexing and retrieval using multi-level document image structure and local features |
US9111140B2 (en) | 2012-01-10 | 2015-08-18 | Dst Technologies, Inc. | Identification and separation of form and feature elements from handwritten and other user supplied elements |
US8942515B1 (en) * | 2012-10-26 | 2015-01-27 | Lida Huang | Method and apparatus for image retrieval |
US9906608B2 (en) * | 2013-04-30 | 2018-02-27 | International Business Machines Corporation | Intelligent adaptation of mobile applications based on constraints and contexts |
JP6242087B2 (ja) * | 2013-06-07 | 2017-12-06 | キヤノン株式会社 | 文書管理サーバ、文書管理方法、コンピュータプログラム |
CN104376317B (zh) * | 2013-08-12 | 2018-12-14 | 福建福昕软件开发股份有限公司北京分公司 | 一种将纸质文件转换为电子文件的方法 |
US20150163545A1 (en) * | 2013-12-11 | 2015-06-11 | Echostar Technologies L.L.C. | Identification of video content segments based on signature analysis of the video content |
WO2015175824A1 (en) * | 2014-05-16 | 2015-11-19 | AppCard, Inc. | Method and system for improved optical character recognition |
KR101713197B1 (ko) * | 2015-04-01 | 2017-03-09 | 주식회사 씨케이앤비 | 서버 컴퓨팅 장치 및 이를 이용한 콘텐츠 인식 기반의 영상 검색 시스템 |
US9411547B1 (en) * | 2015-07-28 | 2016-08-09 | Dst Technologies, Inc. | Compensation for print shift in standardized forms to facilitate extraction of data therefrom |
US10095920B2 (en) * | 2016-07-28 | 2018-10-09 | Intuit Inc | Optical character recognition utilizing hashed templates |
US11416680B2 (en) * | 2016-08-18 | 2022-08-16 | Sap Se | Classifying social media inputs via parts-of-speech filtering |
JP6906946B2 (ja) * | 2016-12-22 | 2021-07-21 | キヤノン株式会社 | 情報処理装置、その制御方法、及びプログラム |
GB201708767D0 (en) * | 2017-06-01 | 2017-07-19 | Microsoft Technology Licensing Llc | Managing electronic documents |
US11106867B2 (en) | 2017-08-15 | 2021-08-31 | Oracle International Corporation | Techniques for document marker tracking |
US10599761B2 (en) * | 2017-09-07 | 2020-03-24 | Qualtrics, Llc | Digitally converting physical document forms to electronic surveys |
CN112868001A (zh) * | 2018-10-04 | 2021-05-28 | 昭和电工株式会社 | 文档检索装置、文档检索程序、文档检索方法 |
CN109960738B (zh) * | 2019-03-15 | 2020-12-08 | 西安电子科技大学 | 基于深度对抗哈希学习的大规模遥感影像内容检索方法 |
CN109960737B (zh) * | 2019-03-15 | 2020-12-08 | 西安电子科技大学 | 半监督深度对抗自编码哈希学习的遥感影像内容检索方法 |
US11449545B2 (en) | 2019-05-13 | 2022-09-20 | Snap Inc. | Deduplication of media file search results |
US20210319136A1 (en) * | 2020-04-02 | 2021-10-14 | UST Global (Singapore) Pte. Ltd. | Verifying authenticity of content of electronic documents |
US11908053B2 (en) * | 2020-05-29 | 2024-02-20 | Camelot Uk Bidco Limited | Method, non-transitory computer-readable storage medium, and apparatus for searching an image database |
US11734445B2 (en) * | 2020-12-02 | 2023-08-22 | International Business Machines Corporation | Document access control based on document component layouts |
JP2022170799A (ja) * | 2021-04-30 | 2022-11-11 | コニカミノルタ株式会社 | 文書検索システム、文書検索方法および文書検索プログラム |
US11783605B1 (en) * | 2022-06-30 | 2023-10-10 | Intuit, Inc. | Generalizable key-value set extraction from documents using machine learning models |
Family Cites Families (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US1095393A (en) * | 1912-12-07 | 1914-05-05 | Harry C Gerlach | Safety switch-chain. |
US1165070A (en) * | 1915-04-12 | 1915-12-21 | Fred O Lake | Knife-sharpener. |
US1171064A (en) * | 1915-08-11 | 1916-02-08 | Lyon Metallic Mfg Company | Shelving. |
JPS5035379B1 (zh) * | 1970-05-25 | 1975-11-15 | ||
US4955066A (en) * | 1989-10-13 | 1990-09-04 | Microsoft Corporation | Compressing and decompressing text files |
US5109433A (en) * | 1989-10-13 | 1992-04-28 | Microsoft Corporation | Compressing and decompressing text files |
US5181255A (en) * | 1990-12-13 | 1993-01-19 | Xerox Corporation | Segmentation of handwriting and machine printed text |
US5526444A (en) * | 1991-12-10 | 1996-06-11 | Xerox Corporation | Document image decoding using modified branch-and-bound methods |
US5499294A (en) * | 1993-11-24 | 1996-03-12 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | Digital camera with apparatus for authentication of images produced from an image file |
US6869023B2 (en) * | 2002-02-12 | 2005-03-22 | Digimarc Corporation | Linking documents through digital watermarking |
US5465353A (en) * | 1994-04-01 | 1995-11-07 | Ricoh Company, Ltd. | Image matching and retrieval by multi-access redundant hashing |
US5542006A (en) * | 1994-06-21 | 1996-07-30 | Eastman Kodak Company | Neural network based character position detector for use in optical character recognition |
US5594809A (en) * | 1995-04-28 | 1997-01-14 | Xerox Corporation | Automatic training of character templates using a text line image, a text line transcription and a line image source model |
US5812698A (en) * | 1995-05-12 | 1998-09-22 | Synaptics, Inc. | Handwriting recognition system and method |
US5867597A (en) * | 1995-09-05 | 1999-02-02 | Ricoh Corporation | High-speed retrieval by example |
US6587217B1 (en) * | 1997-09-15 | 2003-07-01 | International Business Machines Corporation | Method for organizing files in a library in a network printing system |
US6658623B1 (en) * | 1997-09-15 | 2003-12-02 | Fuji Xerox Co., Ltd. | Displaying in a first document a selectable link to a second document based on a passive query |
JPH1178176A (ja) * | 1997-09-17 | 1999-03-23 | Seiko Epson Corp | 印刷物発行管理システム、印刷物発行管理方法及びプリンタ |
US6009198A (en) * | 1997-11-21 | 1999-12-28 | Xerox Corporation | Method for matching perceptual shape similarity layouts across multiple 2D objects |
US7062497B2 (en) * | 1998-01-22 | 2006-06-13 | Adobe Systems Incorporated | Maintaining document state history |
US6487301B1 (en) * | 1998-04-30 | 2002-11-26 | Mediasec Technologies Llc | Digital authentication with digital and analog documents |
JPH11328417A (ja) * | 1998-05-20 | 1999-11-30 | Toshiba Corp | 画像処理装置、画像処理方法及び画像処理プログラムを記録したコンピュータ読み取り可能な記録媒体 |
US6523134B2 (en) * | 1998-09-18 | 2003-02-18 | International Business Machines Corporation | Selective undo |
US6363381B1 (en) * | 1998-11-03 | 2002-03-26 | Ricoh Co., Ltd. | Compressed document matching |
US6580806B1 (en) * | 1998-11-20 | 2003-06-17 | Canon Kabushiki Kaisha | Image processing apparatus, image processing method and storage |
US6397212B1 (en) * | 1999-03-04 | 2002-05-28 | Peter Biffar | Self-learning and self-personalizing knowledge search engine that delivers holistic results |
US6546385B1 (en) * | 1999-08-13 | 2003-04-08 | International Business Machines Corporation | Method and apparatus for indexing and searching content in hardcopy documents |
US6470094B1 (en) * | 2000-03-14 | 2002-10-22 | Intel Corporation | Generalized text localization in images |
US6594393B1 (en) * | 2000-05-12 | 2003-07-15 | Thomas P. Minka | Dynamic programming operation with skip mode for text line image decoding |
US7058223B2 (en) * | 2000-09-14 | 2006-06-06 | Cox Ingemar J | Identifying works for initiating a work-based action, such as an action on the internet |
US6928548B1 (en) * | 2000-09-29 | 2005-08-09 | Intel Corporation | System and method for verifying the integrity of stored information within an electronic device |
US7266765B2 (en) * | 2001-08-31 | 2007-09-04 | Fuji Xerox Co., Ltd. | Detection and processing of annotated anchors |
US7747943B2 (en) * | 2001-09-07 | 2010-06-29 | Microsoft Corporation | Robust anchoring of annotations to content |
GB2380277B (en) * | 2001-09-28 | 2005-12-14 | Hewlett Packard Co | A solid state memory device and a method of document reproduction |
US7120299B2 (en) * | 2001-12-28 | 2006-10-10 | Intel Corporation | Recognizing commands written onto a medium |
WO2003063067A1 (en) * | 2002-01-24 | 2003-07-31 | Chatterbox Systems, Inc. | Method and system for locating positions in printed texts and delivering multimedia information |
CA2375355A1 (en) * | 2002-03-11 | 2003-09-11 | Neo Systems Inc. | Character recognition system and method |
US7243301B2 (en) * | 2002-04-10 | 2007-07-10 | Microsoft Corporation | Common annotation framework |
JP2003337683A (ja) * | 2002-05-17 | 2003-11-28 | Fuji Xerox Co Ltd | 印刷物発行管理システム、印刷物検証装置、コンテンツ管理装置 |
JP2004040246A (ja) * | 2002-06-28 | 2004-02-05 | Canon Inc | 情報処理装置、情報処理方法 |
US7360093B2 (en) * | 2002-07-22 | 2008-04-15 | Xerox Corporation | System and method for authentication of JPEG image data |
US20040090439A1 (en) * | 2002-11-07 | 2004-05-13 | Holger Dillner | Recognition and interpretation of graphical and diagrammatic representations |
JP2004180278A (ja) * | 2002-11-15 | 2004-06-24 | Canon Inc | 情報処理装置、サーバ装置、電子データ管理システム、情報処理システム、情報処理方法、コンピュータプログラム及びコンピュータ読み取り可能な記憶媒体 |
US7486294B2 (en) * | 2003-03-27 | 2009-02-03 | Microsoft Corporation | Vector graphics element-based model, application programming interface, and markup language |
US7218783B2 (en) * | 2003-06-13 | 2007-05-15 | Microsoft Corporation | Digital ink annotation process and system for recognizing, anchoring and reflowing digital ink annotations |
US7475061B2 (en) | 2004-01-15 | 2009-01-06 | Microsoft Corporation | Image-based document indexing and retrieval |
US7729538B2 (en) * | 2004-08-26 | 2010-06-01 | Microsoft Corporation | Spatial recognition and grouping of text and graphics |
US7574048B2 (en) * | 2004-09-03 | 2009-08-11 | Microsoft Corporation | Freeform digital ink annotation recognition |
US7570816B2 (en) * | 2005-03-31 | 2009-08-04 | Microsoft Corporation | Systems and methods for detecting text |
US7526129B2 (en) * | 2005-06-23 | 2009-04-28 | Microsoft Corporation | Lifting ink annotations from paper |
-
2004
- 2004-01-15 US US10/758,370 patent/US7475061B2/en active Active
-
2005
- 2005-01-14 KR KR1020050003672A patent/KR101027851B1/ko not_active IP Right Cessation
- 2005-01-14 EP EP05000750A patent/EP1555626A3/en not_active Ceased
- 2005-01-17 CN CNB2005100062210A patent/CN100565506C/zh active Active
- 2005-01-17 JP JP2005009686A patent/JP4718841B2/ja not_active Expired - Fee Related
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101297319B (zh) * | 2005-08-23 | 2013-02-27 | 株式会社理光 | 在电子文档中嵌入热点 |
CN101292258B (zh) * | 2005-08-23 | 2012-11-21 | 株式会社理光 | 混合介质环境的创建和使用的系统和方法 |
CN101276363B (zh) * | 2007-03-30 | 2011-02-16 | 夏普株式会社 | 文档图像的检索装置及文档图像的检索方法 |
US9020183B2 (en) | 2008-08-28 | 2015-04-28 | Microsoft Technology Licensing, Llc | Tagging images with labels |
US8867779B2 (en) | 2008-08-28 | 2014-10-21 | Microsoft Corporation | Image tagging user interface |
CN102132312B (zh) * | 2008-08-28 | 2016-07-06 | 微软技术许可有限责任公司 | 用标签标记图像的方法和计算设备 |
CN102693253A (zh) * | 2011-01-26 | 2012-09-26 | 波音公司 | 图像管理和呈现 |
CN102693253B (zh) * | 2011-01-26 | 2017-08-25 | 波音公司 | 图像管理和呈现 |
CN102955784A (zh) * | 2011-08-19 | 2013-03-06 | 北京百度网讯科技有限公司 | 一种基于数字签名对多个图像进行相似判断的设备和方法 |
CN109740007A (zh) * | 2018-08-27 | 2019-05-10 | 广州麦仑信息科技有限公司 | 一种基于图像特征签名的静脉图像快速检索方法 |
CN109740007B (zh) * | 2018-08-27 | 2022-03-11 | 广州麦仑信息科技有限公司 | 一种基于图像特征签名的静脉图像快速检索方法 |
CN109167977A (zh) * | 2018-10-28 | 2019-01-08 | 广州中元软件有限公司 | 一种监控视频仿生长期保存方法 |
CN109167977B (zh) * | 2018-10-28 | 2020-10-23 | 广州中元软件有限公司 | 一种监控视频仿生长期保存方法 |
CN109933691A (zh) * | 2019-02-11 | 2019-06-25 | 北京百度网讯科技有限公司 | 用于内容检索的方法、装置、设备和存储介质 |
CN109933691B (zh) * | 2019-02-11 | 2023-06-09 | 北京百度网讯科技有限公司 | 用于内容检索的方法、装置、设备和存储介质 |
Also Published As
Publication number | Publication date |
---|---|
EP1555626A2 (en) | 2005-07-20 |
US7475061B2 (en) | 2009-01-06 |
JP4718841B2 (ja) | 2011-07-06 |
CN100565506C (zh) | 2009-12-02 |
JP2005251169A (ja) | 2005-09-15 |
EP1555626A3 (en) | 2006-02-15 |
KR20050075301A (ko) | 2005-07-20 |
US20050165747A1 (en) | 2005-07-28 |
KR101027851B1 (ko) | 2011-04-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100565506C (zh) | 基于图像文档的索引和检索 | |
US7593961B2 (en) | Information processing apparatus for retrieving image data similar to an entered image | |
CN1625741A (zh) | 可以通过手写检索查询来检索的电子文件管理系统 | |
Nagy | Twenty years of document image analysis in PAMI | |
JP4533273B2 (ja) | 画像処理装置及び画像処理方法、プログラム | |
US6178417B1 (en) | Method and means of matching documents based on text genre | |
JP4577931B2 (ja) | ドキュメント処理システム及びインデックス情報獲得方法 | |
JP4920928B2 (ja) | 画像処理装置及びその制御方法、プログラム | |
US20090116746A1 (en) | Systems and methods for parallel processing of document recognition and classification using extracted image and text features | |
EP1993064A2 (en) | Image processing apparatus and image retrieval method | |
EP1917627B1 (en) | Classifying regions defined within a digital image | |
CN1542656A (zh) | 信息处理装置、信息处理方法、存储介质及程序 | |
US20110194736A1 (en) | Fine-grained visual document fingerprinting for accurate document comparison and retrieval | |
CN1900933A (zh) | 图像搜索系统、图像搜索方法和存储介质 | |
JP2009022009A (ja) | 書類セキュリティ又は注釈のためのインビジブルジャンクション特徴の認識 | |
JP2011018316A (ja) | 文書区分識別用の区分モデルを生成するための方法及びプログラム、文書の区分を識別するための方法及びプログラム、及び画像処理システム | |
CN104346415A (zh) | 图像文档命名的方法 | |
CN1336604A (zh) | 中文古籍数字化及内容检索自动化方法和系统 | |
US20060176521A1 (en) | Digitization of microfiche | |
CN1577382A (zh) | 文档交接系统以及文档交接方法 | |
WO2001013279A9 (en) | Word searchable database from high volume scanning of newspaper data | |
CN1107280C (zh) | 中英文表单的识别系统及识别方法 | |
CN1112653C (zh) | 图像处理方法及设备 | |
JP2005149323A (ja) | 画像処理システム及び画像処理装置並びに画像処理方法 | |
JP4047222B2 (ja) | 画像処理装置及びその制御方法、プログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: MICROSOFT TECHNOLOGY LICENSING LLC Free format text: FORMER OWNER: MICROSOFT CORP. Effective date: 20150429 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20150429 Address after: Washington State Patentee after: Micro soft technique license Co., Ltd Address before: Washington State Patentee before: Microsoft Corp. |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20160801 Address after: Grand Cayman, Georgetown, Cayman Islands Patentee after: IValley Holding Co., Ltd. Address before: Washington State Patentee before: Micro soft technique license Co., Ltd |