WO2009053613A1 - Method and system for annotating multimedia documents - Google Patents
Method and system for annotating multimedia documents Download PDFInfo
- Publication number
- WO2009053613A1 WO2009053613A1 PCT/FR2008/051823 FR2008051823W WO2009053613A1 WO 2009053613 A1 WO2009053613 A1 WO 2009053613A1 FR 2008051823 W FR2008051823 W FR 2008051823W WO 2009053613 A1 WO2009053613 A1 WO 2009053613A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sections
- annotation
- automatic
- document
- section
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/169—Annotation, e.g. comment data or footnotes
Definitions
- the present invention relates to a method and a system for annotating multimedia documents as well as a computer program product for implementing the method.
- An annotation system is a system for adding high-level information called metadata to a multimedia document, that is, a text, image, audio and / or video document.
- Annotations are of various granularities and apply to a complete document as to any section of a document. They are also of varied nature. For example, annotations are temporal, spatial, semantic, and so on. and apply to either a document section or multiple sections in the case of relation extraction.
- the annotation then allows advanced processing on documents. For example, it allows filtering against annotations, reasoning, or advanced annotation searches.
- the annotation is usually done manually by a person responsible for reading the documents. Manually annotating documents, however, is a particularly time-consuming task. Also, in some cases, the annotation is performed completely automatically but then no improvement of the system over time is possible, except to change version of the annotation engine.
- All automatic systems are based on an annotation model that contains the relationships between the annotations and the characteristics of the document, or a part / section of the document. Thus, when a new document is to be annotated, the system looks for characteristics identical to those contained in the model in order to apply the corresponding annotations to the document.
- a semantic annotation platform is based on matching (in English "matching") the instances of a domain ontology with the content of a document, generally with a step of semantic disambiguation making it possible to find the best instance of ontology according to the context of the section to annotate.
- Some annotation platforms allow you to learn an annotation model from examples. In this case, the generated models are often unclear to a non-expert user and can not be easily validated by him. It would be desirable to define a method and an annotation system that combine the efficiency of automatic systems with the flexibility and versatility of manual systems.
- a method of annotating a plurality of multimedia documents, each multimedia document comprising at least one section, and each section comprising at least one characteristic comprises the steps of:
- the annotation method advantageously creates a model based on a set of annotations provided by the user. It is therefore understandable that, by a suitable selection of manually annotated documents, the user has a decisive influence on the quality of this model.
- Particular features or embodiments of this method are:
- It also includes a step of manual validation of the automatic annotation, followed by an iteration of at least the step of automatic model creation to replace the annotation model to take into account the validated annotations, the steps automatic creation, automatic annotation and manual validation thus forming an iterative loop for improving the annotation model.
- a relation between a representative graph of the annotation classes and a representative graph of the characteristics of the documents is defined as the product of exponential families based on the sections, and a cost function is defined as the log likelihood of at least a part sections so that the automatic creation of the model consists of minimizing the cost function by selecting the most representative sections and adjusting the weights.
- the method advantageously makes it possible to create an iterative loop for improving the annotation model because the validation and the possible correction of the annotations generated automatically makes it possible to on the one hand, to provide new input data for the automatic creation of the model and, on the other hand, to validate the quality of the model by the number of corrections to be made.
- the method of automatic creation of the model also advantageously allows to limit as much as possible the number of sections used by it while retaining only those which are most representative of the links of the model.
- a computer program product includes program code instructions recorded on a computer readable medium, for implementing the steps of the preceding method when said program is running on a computer.
- a system for annotating a plurality of multimedia documents each multimedia document comprising at least one section, and each section comprising at least one characteristic, comprises:
- User interface means adapted to manually annotate at least one document of the plurality of documents by assigning to each section of said document at least one annotation class; means for automatically creating a document; annotation model defining relations between the annotation classes and the characteristics of the sections, said automatic creation means comprising iterative learning means based on the selection of relevant sections, as well as means for • selection of the document section most representative of all the existing sections and the farthest from the previously selected sections,
- the interface means are further adapted to manually validate the automatic annotation, and in that the automatic model creation means are adapted to take account of the validated annotations.
- a relation between a representative graph of the annotation classes and a representative graph of the characteristics of the documents is defined as the product of exponential families based on the sections, and a cost function is defined as the log likelihood of at least a part sections so that the automatic creation of the model consists of minimizing the cost function by selecting the most representative sections and adjusting the weights.
- FIG. 1 is a schematic view of an annotation system according to one embodiment of the invention.
- FIG. 2 is a flowchart of an annotation method according to one embodiment of the invention.
- an annotation system comprises a terminal 1 having a man / machine interface 3.
- This interface 3 is adapted to annotate a document manually, it is classically based on a hardware information presentation interface composed, for example, of a screen, and on information input means composed, for example, of a keyboard and a mouse.
- This interface 3 allows various elementary operations related to manual document annotation.
- a section corresponds to a variable granularity representing a certain homogeneity.
- a section is a word in a text or a sequence of images in a video.
- the interface 3 allows the user to assign each section at least one class, or type, of annotation. For example, the user assigns syntactic annotations to text and / or semantic annotations as a class of an ontology.
- the interface includes selection tools. In a relatively simple form, these tools may be only list forms allowing a choice of annotation among a predefined list. In more sophisticated forms, these tools may offer annotations based on a first automatic analysis of the document or section concerned, for example using a pre-existing annotation template.
- the human / machine interface 3 thus includes a specialized editor for adding, modifying or deleting annotations to a multimedia document.
- the terminal 1 is connected to a learning server 5 by a data link 6.
- the learning server 5 comprises means 7 for automatically creating an annotation model defining relations between the annotation classes and the characteristics of the sections.
- the automatic creation means 7 use manually annotated documents from the terminal 1 as input parameters.
- the learning server 5 also comprises means 9 for automatic annotation of a non-annotated document by application of the created annotation model.
- a user annotates a multimedia document using the terminal 1 and the adapted man-machine interface 3.
- the annotated document is sent at 13 to the learning server 5.
- the learning server 5 then starts, step 15, the execution of the means 7 of automatic model creation.
- These perform iteratively the following steps: • selection, step 17, of the section of the most representative document of the set and the furthest away from the sections that have already been possibly selected.
- distant section, or near, another section is meant a distance in the mathematical sense of the term defined in a section metric.
- a close section is a section that has substantially the same characteristics or very similar characteristics of other sections.
- step 19 Adjustment, in step 19, of the weightings associated with the various selected sections, deletion, step 21, of the least representative sections of all the documents or sections closest to the selected sections.
- a new document is annotated, step 23, automatically either at the request of the user or as part of a batch process.
- the automatically annotated document is sent to the terminal 1 so that the user can study, and possibly modify, in step 27, the annotations proposed by the system.
- the step 15 of launching the model creation is again executed by integrating into the input data of this creation the new document with its annotations modified by the user. .
- the model is refined to reach a level of quality such that no user intervention is necessary. is no longer necessary.
- the creation of the model then consists in minimizing the log likelihood of a sample while limiting the number of cores to use.
- the selection of the nuclei and the adjustment of the weights ⁇ is done according to the iterative algorithm below.
- the selected nuclei are then those that minimize the gradient: A- - 1 T ⁇ £ (Xjk (X jx k) ⁇ (k yj.y) +] TK (x, .x k) (f ⁇ ) (y k ⁇ x) - ⁇ (y, .y k ))
- kernels correspond to the sections of annotated documents and thus constitute the annotation model.
- the distribution between terminal and learning server may actually correspond to a functional distribution, all the functions of the system being realized on a workstation programmed accordingly.
- an embodiment corresponds to a software implementation of the annotation method and that thus a computer program product includes instructions such as, executed on a computer, the annotation method is implemented.
- the method can also be implemented in hardware form, for example, by programming a network of doors of type FPGA (user programmable gate array) or in a combined hardware-software form according to design rules well known to those skilled in the art.
- FPGA field programmable gate array
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE112008002713T DE112008002713T8 (en) | 2007-10-10 | 2008-10-08 | Annotation and annotation system for multimedia documents |
GB1007180A GB2466752A (en) | 2007-10-10 | 2008-10-08 | Method and system for annotating multimedia documents |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0707100 | 2007-10-10 | ||
FR0707100A FR2922338A1 (en) | 2007-10-10 | 2007-10-10 | METHOD AND SYSTEM FOR ANNOTATING MULTIMEDIA DOCUMENTS |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2009053613A1 true WO2009053613A1 (en) | 2009-04-30 |
Family
ID=39276102
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FR2008/051823 WO2009053613A1 (en) | 2007-10-10 | 2008-10-08 | Method and system for annotating multimedia documents |
Country Status (4)
Country | Link |
---|---|
DE (1) | DE112008002713T8 (en) |
FR (1) | FR2922338A1 (en) |
GB (1) | GB2466752A (en) |
WO (1) | WO2009053613A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110880021A (en) * | 2019-11-06 | 2020-03-13 | 创新奇智(北京)科技有限公司 | Model-assisted data annotation system and annotation method |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040205482A1 (en) * | 2002-01-24 | 2004-10-14 | International Business Machines Corporation | Method and apparatus for active annotation of multimedia content |
-
2007
- 2007-10-10 FR FR0707100A patent/FR2922338A1/en active Pending
-
2008
- 2008-10-08 GB GB1007180A patent/GB2466752A/en not_active Withdrawn
- 2008-10-08 WO PCT/FR2008/051823 patent/WO2009053613A1/en active Application Filing
- 2008-10-08 DE DE112008002713T patent/DE112008002713T8/en not_active Ceased
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040205482A1 (en) * | 2002-01-24 | 2004-10-14 | International Business Machines Corporation | Method and apparatus for active annotation of multimedia content |
Non-Patent Citations (4)
Title |
---|
DENOYER L ET AL ASSOCIATION FOR COMPUTING MACHINERY: "Structured Multimedia Document Classification", PROCEEDINGS OF THE 2003 ACM SYMPOSIUM ON DOCUMENT ENGINEERING. DOCENG 2003. GRENOBLE, FRANCE, NOV. 20 - 22, 2003, ACM SYMPOSIUM ON DOCUMENT ENGINEERING, NEW YORK, NY : ACM, US, 20 November 2003 (2003-11-20), pages 153 - 160, XP002382731, ISBN: 1-58113-724-9 * |
GRILHERES B ET AL: "A Platform for Semantic Annotations and Ontology Population Using Conditional Random Fields", WEB INTELLIGENCE, 2005. PROCEEDINGS. THE 2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON COMPIEGNE, FRANCE 19-22 SEPT. 2005, PISCATAWAY, NJ, USA,IEEE, 19 September 2005 (2005-09-19), pages 790 - 793, XP010841825, ISBN: 0-7695-2415-X * |
HANNA WALLACH: "Efficient Training of Conditional Random Fields", MASTER OF SCIENCE THESIS, 2002, University of Edinburgh, UK, XP002476899, Retrieved from the Internet <URL:http://citeseer.ist.psu.edu/wallach02efficient.html> [retrieved on 20080416] * |
JOHN LAFFERTY, ANDREW MCCALLUM, FERNANDO PEREIRA: "Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data", PROC. 18TH INTERNATIONAL CONF. ON MACHINE LEARNING, 28 June 2001 (2001-06-28) - 1 July 2001 (2001-07-01), Williamstown, MA, USA, XP002476900, Retrieved from the Internet <URL:http://citeseer.ist.psu.edu/lafferty01conditional.html> [retrieved on 20080416] * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110880021A (en) * | 2019-11-06 | 2020-03-13 | 创新奇智(北京)科技有限公司 | Model-assisted data annotation system and annotation method |
CN110880021B (en) * | 2019-11-06 | 2021-03-16 | 创新奇智(北京)科技有限公司 | Model-assisted data annotation system and annotation method |
Also Published As
Publication number | Publication date |
---|---|
DE112008002713T8 (en) | 2011-03-24 |
GB2466752A (en) | 2010-07-07 |
FR2922338A1 (en) | 2009-04-17 |
DE112008002713T5 (en) | 2010-11-11 |
GB201007180D0 (en) | 2010-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11263250B2 (en) | Method and system for analyzing entities | |
US11562169B2 (en) | Generating multimodal image edits | |
US11335004B2 (en) | Generating refined segmentation masks based on uncertain pixels | |
FR2821186A1 (en) | DEVICE FOR EXTRACTING INFORMATION FROM A KNOWLEDGE BASED TEXT | |
FR2698977A1 (en) | Multimedia information system. | |
CN111950287A (en) | Text-based entity identification method and related device | |
FR2914081A1 (en) | METHOD FOR PROTECTING DIGITAL DOCUMENTS AGAINST UNAUTHORIZED USES | |
WO2007077378A1 (en) | Assistance method and device for building the arborescence of an electronic document group | |
EP2996040A1 (en) | A method for determining by optimization a multi-core architecture | |
WO2009053613A1 (en) | Method and system for annotating multimedia documents | |
US9501684B1 (en) | Providing non-destructive editing reconciliation | |
WO2021257195A1 (en) | Topic graph-based comment generation | |
EP4073765A1 (en) | Method and system for generating 3d digital models | |
FR2944633A1 (en) | METHOD FOR ASSISTING THE DEVELOPMENT OR USE OF A COMPLEX SYSTEM. | |
CN112559680A (en) | Text classification method, model and storage medium based on multi-level feature extraction | |
EP3924850A1 (en) | Method for constructing behavioural software signatures | |
FR3022659A1 (en) | RELATIVE COPY-GLUE METHOD | |
Samii et al. | Iterative learning: Leveraging the computer as an on-demand expert artist | |
EP1713243A1 (en) | Method and system of automatic generation of software components for the design of voice services | |
FR2986882A1 (en) | METHOD FOR IDENTIFYING A SET OF PHRASES OF A DIGITAL DOCUMENT, METHOD FOR GENERATING A DIGITAL DOCUMENT, ASSOCIATED DEVICE | |
US20240069870A1 (en) | Computer-based software development and product management | |
US20160371236A1 (en) | Optimizing update operations in hierarchically structured documents | |
EP1435054A2 (en) | Method for indexing and comparing multimedia documents | |
EP3114597A1 (en) | Method for analysing a plurality of messages, and associated computer programme product and device | |
FR3132155A3 (en) | Method for automatically selecting and displaying the selection of a significant digest in textual content |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 08842467 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1120080027137 Country of ref document: DE |
|
ENP | Entry into the national phase |
Ref document number: 1007180 Country of ref document: GB Kind code of ref document: A Free format text: PCT FILING DATE = 20081008 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1007180.1 Country of ref document: GB |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 08842467 Country of ref document: EP Kind code of ref document: A1 |
|
RET | De translation (de og part 6b) |
Ref document number: 112008002713 Country of ref document: DE Date of ref document: 20101111 Kind code of ref document: P |