WO2000034890A3 - Text translation system - Google Patents

Text translation system Download PDF

Info

Publication number
WO2000034890A3
WO2000034890A3 PCT/US1999/029273 US9929273W WO0034890A3 WO 2000034890 A3 WO2000034890 A3 WO 2000034890A3 US 9929273 W US9929273 W US 9929273W WO 0034890 A3 WO0034890 A3 WO 0034890A3
Authority
WO
WIPO (PCT)
Prior art keywords
text
text fragment
database
sentence
language
Prior art date
Application number
PCT/US1999/029273
Other languages
French (fr)
Other versions
WO2000034890A8 (en
WO2000034890A2 (en
Inventor
Emmanuel Roche
Yves Schabes
Original Assignee
Global Information Res And Tec
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Global Information Res And Tec filed Critical Global Information Res And Tec
Priority to EP99968105A priority Critical patent/EP1138003A2/en
Priority to AU24791/00A priority patent/AU2479100A/en
Publication of WO2000034890A2 publication Critical patent/WO2000034890A2/en
Publication of WO2000034890A3 publication Critical patent/WO2000034890A3/en
Publication of WO2000034890A8 publication Critical patent/WO2000034890A8/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/45Example-based machine translation; Alignment

Abstract

A system builds a text fragment database for use in translating fragments of text from a source language into a target language. The system first stores a sentence database in memory, the sentence database comprising a plurality of sentence pairs, each sentence pair including a sentence in the source language and a corresponding sentence in the target language. The system then locates corresponding source and target text fragments in corresponding source and target language sentences, respectively, and stores the source text fragment together with the target text fragment in the text fragment database. The text fragment database can then be used to translate text from the source language into the target language. To this end, the system inputs text in the source language, extracts a text fragment from the input text, and locates the extracted text fragment in the text fragment database. The system then retrieves, from the text fragment database, a text fragment in the target language that corresponds to the extracted text fragment, and outputs the retrieved text fragment.
PCT/US1999/029273 1998-12-10 1999-12-10 Text translation system WO2000034890A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP99968105A EP1138003A2 (en) 1998-12-10 1999-12-10 Text translation system
AU24791/00A AU2479100A (en) 1998-12-10 1999-12-10 Text translation system

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US11170298P 1998-12-10 1998-12-10
US60/111,702 1998-12-10
US09/457,801 1999-12-09
US09/457,801 US6535842B1 (en) 1998-12-10 1999-12-09 Automatic bilingual translation memory system

Publications (3)

Publication Number Publication Date
WO2000034890A2 WO2000034890A2 (en) 2000-06-15
WO2000034890A3 true WO2000034890A3 (en) 2000-11-23
WO2000034890A8 WO2000034890A8 (en) 2001-03-01

Family

ID=26809151

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/029273 WO2000034890A2 (en) 1998-12-10 1999-12-10 Text translation system

Country Status (3)

Country Link
US (1) US6535842B1 (en)
EP (1) EP1138003A2 (en)
WO (1) WO2000034890A2 (en)

Families Citing this family (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001167092A (en) * 1999-12-13 2001-06-22 Nec Corp Translation server system
US6473729B1 (en) * 1999-12-20 2002-10-29 Xerox Corporation Word phrase translation using a phrase index
JP2001175683A (en) * 1999-12-21 2001-06-29 Nec Corp Translation server system
US20030200535A1 (en) * 2000-06-09 2003-10-23 Mcnamara Benedict Bede System for program source code conversion
US7054803B2 (en) * 2000-12-19 2006-05-30 Xerox Corporation Extracting sentence translations from translated documents
US20030083860A1 (en) * 2001-03-16 2003-05-01 Eli Abir Content conversion method and apparatus
JP3969628B2 (en) * 2001-03-19 2007-09-05 富士通株式会社 Translation support apparatus, method, and translation support program
US8214196B2 (en) 2001-07-03 2012-07-03 University Of Southern California Syntax-based statistical translation model
US20030061026A1 (en) * 2001-08-30 2003-03-27 Umpleby Stuart A. Method and apparatus for translating one species of a generic language into another species of a generic language
US7124073B2 (en) * 2002-02-12 2006-10-17 Sunflare Co., Ltd Computer-assisted memory translation scheme based on template automaton and latent semantic index principle
WO2004001623A2 (en) 2002-03-26 2003-12-31 University Of Southern California Constructing a translation lexicon from comparable, non-parallel corpora
DK1353280T3 (en) * 2002-04-12 2006-10-16 Targit As Process for processing multilingual queries
JP4355138B2 (en) * 2002-12-13 2009-10-28 インターナショナル・ビジネス・マシーンズ・コーポレーション Translation server, collaboration server and program
US8548794B2 (en) * 2003-07-02 2013-10-01 University Of Southern California Statistical noun phrase translation
US7711545B2 (en) * 2003-07-02 2010-05-04 Language Weaver, Inc. Empirical methods for splitting compound words with application to machine translation
US8296127B2 (en) 2004-03-23 2012-10-23 University Of Southern California Discovery of parallel text portions in comparable collections of corpora and training using comparable texts
EP1733304A2 (en) * 2004-04-02 2006-12-20 Star AG Computer-assisted aid (dynamic linking) for a method for creating documents in a foreign-language
US8666725B2 (en) * 2004-04-16 2014-03-04 University Of Southern California Selection and use of nonstatistical translation components in a statistical machine translation framework
GB2415518A (en) * 2004-06-24 2005-12-28 Sharp Kk Method and apparatus for translation based on a repository of existing translations
WO2006042321A2 (en) * 2004-10-12 2006-04-20 University Of Southern California Training for a text-to-text application which uses string to tree conversion for training and decoding
US8676563B2 (en) 2009-10-01 2014-03-18 Language Weaver, Inc. Providing human-generated and machine-generated trusted translations
US8886517B2 (en) 2005-06-17 2014-11-11 Language Weaver, Inc. Trust scoring for language translation systems
CN101052964B (en) * 2005-07-06 2010-05-19 松下电器产业株式会社 Dialog supporting device
US20070011160A1 (en) * 2005-07-07 2007-01-11 Denis Ferland Literacy automation software
US10319252B2 (en) * 2005-11-09 2019-06-11 Sdl Inc. Language capability assessment and training apparatus and techniques
US8943080B2 (en) 2006-04-07 2015-01-27 University Of Southern California Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections
WO2008009682A2 (en) 2006-07-17 2008-01-24 Total Recall Aps A computer-implemented translation tool
US8886518B1 (en) 2006-08-07 2014-11-11 Language Weaver, Inc. System and method for capitalizing machine translated text
JP5082374B2 (en) * 2006-10-19 2012-11-28 富士通株式会社 Phrase alignment program, translation program, phrase alignment device, and phrase alignment method
US8433556B2 (en) 2006-11-02 2013-04-30 University Of Southern California Semi-supervised training for statistical word alignment
GB2444084A (en) * 2006-11-23 2008-05-28 Sharp Kk Selecting examples in an example based machine translation system
US9122674B1 (en) 2006-12-15 2015-09-01 Language Weaver, Inc. Use of annotations in statistical machine translation
US8468149B1 (en) 2007-01-26 2013-06-18 Language Weaver, Inc. Multi-lingual online community
US8615389B1 (en) 2007-03-16 2013-12-24 Language Weaver, Inc. Generation and exploitation of an approximate language model
US8831928B2 (en) * 2007-04-04 2014-09-09 Language Weaver, Inc. Customizable machine translation service
WO2008131509A1 (en) * 2007-04-30 2008-11-06 Fireswirl Systems Inc. Systems and methods for improving translation systems
US8825466B1 (en) 2007-06-08 2014-09-02 Language Weaver, Inc. Modification of annotated bilingual segment pairs in syntax-based machine translation
US8635539B2 (en) * 2008-10-31 2014-01-21 Microsoft Corporation Web-based language translation memory compilation and application
US8990064B2 (en) 2009-07-28 2015-03-24 Language Weaver, Inc. Translating documents based on content
US8380486B2 (en) 2009-10-01 2013-02-19 Language Weaver, Inc. Providing machine-generated translations and corresponding trust levels
US10417646B2 (en) * 2010-03-09 2019-09-17 Sdl Inc. Predicting the cost associated with translating textual content
US11003838B2 (en) 2011-04-18 2021-05-11 Sdl Inc. Systems and methods for monitoring post translation editing
US8694303B2 (en) 2011-06-15 2014-04-08 Language Weaver, Inc. Systems and methods for tuning parameters in statistical machine translation
US8805672B2 (en) 2011-06-30 2014-08-12 International Business Machines Corporation Translation cache prediction
US8886515B2 (en) 2011-10-19 2014-11-11 Language Weaver, Inc. Systems and methods for enhancing machine translation post edit review processes
US8942973B2 (en) 2012-03-09 2015-01-27 Language Weaver, Inc. Content page URL translation
US10261994B2 (en) 2012-05-25 2019-04-16 Sdl Inc. Method and system for automatic management of reputation of translators
USD742914S1 (en) * 2012-08-01 2015-11-10 Isaac S. Daniel Computer screen with icon
US9152622B2 (en) 2012-11-26 2015-10-06 Language Weaver, Inc. Personalized machine translation via online adaptation
USD734343S1 (en) * 2012-12-27 2015-07-14 Nissan Jidosha Kabushiki Kaisha Display screen or portion thereof with graphical user interface
US9213694B2 (en) 2013-10-10 2015-12-15 Language Weaver, Inc. Efficient online domain adaptation
WO2016033618A2 (en) * 2014-08-28 2016-03-03 Duy Thang Nguyen Method of data storage and language conversion
US10747419B2 (en) * 2015-09-25 2020-08-18 CSOFT International Systems, methods, devices, and computer readable media for facilitating distributed processing of documents
US20170308526A1 (en) * 2016-04-21 2017-10-26 National Institute Of Information And Communications Technology Compcuter Implemented machine translation apparatus and machine translation method
US10922496B2 (en) 2018-11-07 2021-02-16 International Business Machines Corporation Modified graphical user interface-based language learning

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5497319A (en) * 1990-12-31 1996-03-05 Trans-Link International Corp. Machine translation and telecommunications system
JP3220560B2 (en) * 1992-05-26 2001-10-22 シャープ株式会社 Machine translation equipment
GB2279164A (en) * 1993-06-18 1994-12-21 Canon Res Ct Europe Ltd Processing a bilingual database.
JPH07210558A (en) * 1994-01-20 1995-08-11 Fujitsu Ltd Machine translation device
EP0672989A3 (en) * 1994-03-15 1998-10-28 Toppan Printing Co., Ltd. Machine translation system
JPH07282055A (en) * 1994-04-06 1995-10-27 Fujitsu Ltd Sentence analysis method and device
US6139201A (en) * 1994-12-22 2000-10-31 Caterpillar Inc. Integrated authoring and translation system
JP3066274B2 (en) * 1995-01-12 2000-07-17 シャープ株式会社 Machine translation equipment
AU5969896A (en) * 1995-06-07 1996-12-30 International Language Engineering Corporation Machine assisted translation tools
JPH09259127A (en) * 1996-03-21 1997-10-03 Sharp Corp Translation device
US5848386A (en) * 1996-05-28 1998-12-08 Ricoh Company, Ltd. Method and system for translating documents using different translation resources for different portions of the documents

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"TRANSLATION METHOD BY SENTENCE EXAMPLES", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 31, no. 5, October 1988 (1988-10-01), Armonk, NY, US, pages 438 - 439, XP000049761 *
"TRANSLATION-EXAMPLE RETRIEVING METHOD", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 31, no. 3, August 1988 (1988-08-01), Armonk, NY, US, pages 59 - 61, XP000023398 *
HARRIS: "Bi-text, a new concept in translation theory", LANGUAGE MONTHLY, no. 54, 1988, pages 8 - 11, XP000747663 *
HARRIS: "Interlinear Bitext", LANGUAGE TECHNOLOGY, no. 10, December 1988 (1988-12-01), Amsterdam, NL, pages 12 - 13, XP000747836 *

Also Published As

Publication number Publication date
US6535842B1 (en) 2003-03-18
EP1138003A2 (en) 2001-10-04
WO2000034890A8 (en) 2001-03-01
WO2000034890A2 (en) 2000-06-15

Similar Documents

Publication Publication Date Title
WO2000034890A8 (en) Text translation system
WO2006121849A3 (en) E-services translation utilizing machine translation and translation memory
WO2010135204A3 (en) Mining phrase pairs from an unstructured resource
CA2020058A1 (en) Machine translation apparatus having a process function for proper nouns with acronyms
ES2101613A1 (en) Automatic interlingual translation system.
CA2125200A1 (en) Language Translation Apparatus and Method Using Context-Based Translation Models
WO2006016171A3 (en) Computer implemented method for use in a translation system
WO2001086491A3 (en) Machine translation techniques
SE0101127D0 (en) Method of finding answers to questions
CA2364999A1 (en) Method and apparatus for terminology translation
TW428137B (en) Sentence processing apparatus and method thereof
WO2003038664A3 (en) Machine translation
WO2005065033A3 (en) In-context analysis and automatic translation
WO2001022277A3 (en) Language translation using a constrained grammar in the form of structured sentences
EA200301188A1 (en) METHOD AND MEANS OF CONTAINING CONVERSION
WO2002033587A3 (en) Database linking method and apparatus
Huhtala Fran teori till praktik: Analys av oversattningar fran finska till svenska.
CA2297905A1 (en) Interlingual translation system and method
Tukeyev et al. Neural machine translation system for the kazakh language based on synthetic corpora
Tosco A parsing view on inconsistent word order: Articles in Tigre and its relatives
Zanon Boito et al. Unwritten languages demand attention too! word discovery with encoder-decoder models
GR1002453B (en) Intelligent device for retrieving multilingual texts.
Рустамова Theoretical foundations of the features of the translation of legal texts
WO2016033617A2 (en) Method of asynchronous machine translation
Cozma Dealing with Syntactic Discontinuities in the Translation of Legal Texts

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: C1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C1

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

CFP Corrected version of a pamphlet front page
CR1 Correction of entry in section i

Free format text: PAT. BUL. 24/2000 UNDER (30) REPLACE "NOT FURNISHED" BY "09/457801"

WWW Wipo information: withdrawn in national office

Ref document number: 1999968105

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1999968105

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1999968105

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642