CN103870493A - Page automatic input system and method - Google Patents

Page automatic input system and method Download PDF

Info

Publication number
CN103870493A
CN103870493A CN201210541898.4A CN201210541898A CN103870493A CN 103870493 A CN103870493 A CN 103870493A CN 201210541898 A CN201210541898 A CN 201210541898A CN 103870493 A CN103870493 A CN 103870493A
Authority
CN
China
Prior art keywords
data
dictionary
module
page
library module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210541898.4A
Other languages
Chinese (zh)
Inventor
宋鑫晶
吕苏
尹祥龙
翟威
陆寄豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Unionpay Co Ltd
Original Assignee
China Unionpay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Unionpay Co Ltd filed Critical China Unionpay Co Ltd
Priority to CN201210541898.4A priority Critical patent/CN103870493A/en
Publication of CN103870493A publication Critical patent/CN103870493A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/84Mapping; Conversion
    • G06F16/86Mapping to a database

Abstract

The invention discloses a page automatic input system. The page automatic input system comprises a data dictionary database module and a client-side engine module; the data dictionary database module is used for storing mass data; the client-side engine module is used for performing analysis on a page to extract page elements and selecting appropriate data in the data dictionary database module so as to perform automatic input on the page according to the page elements. The invention also discloses a page automatic input method.

Description

The page is carried out to the system and method for automatic input
Technical field
the present invention relates to computer realm, more particularly, relate to, based on Dictionary Database, the page is carried out to the system and method for automatic input.
Background technology
the major way of current page automatic input is: the automatic test script that operation writes carries out page automatic input, and wherein, the data of typing are preprepareds, are called by automatized script.
be that CN 201010103441, name are called in the patent of invention of " text message intelligence typing search toolbar " in the patent No., method and the device of a kind of automatic extraction and typing word is provided.The method comprises the following steps: the information relevant to the content that needs extraction and typing is set, and this information is stored in memory module; Input key word, searches for the current Web page according to canned data in the key word of input and memory module, and with forms mode display of search results; Select a line or multirow in above table; Need to carry out if user triggers typing operation and selects and sign in to the external system that typing operates, by the information automatic input in selected form in this external system.By the technical scheme of this invention, can automatically extract and typing Word message the Web page, and the Word message extracting is carried out to relevant operation.
but, the shortcoming of this mode is: 1, need to take a long time the data of writing automatic test script and preparation and treat typing with more manpower, 2, the data of this preparation and the coupling of measurand are larger, and be only suitable for this specific script, be inconvenient to share, be difficult to transplant multiplexing, 3, be engaged in this need of work tester and possess higher Develop ability.
thereby, lack and a kind ofly not only adapted to various scenes, but also experience constantly can be accumulated to utilization, can also be easy to allow the settling mode of non-specialized-technical personnel's operation.
Summary of the invention
for this problem, according to an aspect of the present invention, provide a kind of for the page being carried out to the system of automatic input, comprising: data dictionary library module, described digital dictionary library module is for storing the data of magnanimity; And client engine module, the page, to extract page elements, and according to described page elements, selects suitable data to carry out automatic input to the described page in described data dictionary library module described in described client engine module analysis.
in said system, described data dictionary library module is the database with tree structure storage magnanimity test data.
in said system, described data dictionary library module comprises: dictionary tree, the basic level concept of carrying out organising data according to the mankind's natural cognition; Data mapping tables, for reflecting the relation between each dictionary and its corresponding data set; With near synonym storehouse, for the dictionary being close in meaning is divided into groups, realize the multiplexing of data.
in said system, described client engine module comprises: page elements analysis module, treat the page elements of typing for extracting the described page; Data retrieval module, for retrieving at described database dictionary library module according to described page elements, to find and the same or analogous dictionary of described page elements; And automatic input module, according to the result for retrieval of described data retrieval module, automatically complete the typing of described dictionary.Described client engine module can further comprise: automatic learning module, and for user of service's typing value being added to the data centralization of the described data dictionary library module corresponding with this typing value.
in said system, described client engine module can be configured to, in the time that dictionary is not present in described data dictionary library module, automatically in described database module, create this dictionary, and the data of user's typing is deposited in to the data centralization of this dictionary mapping.
according to another aspect of the present invention, provide a kind of method of the page being carried out to automatic input, having comprised: set up data dictionary library module, described digital dictionary library module is for having stored the data of magnanimity; Analyze the described page to extract page elements; Retrieve in described database dictionary library module according to described page elements, to find and the same or analogous dictionary of described page elements; And according to the result for retrieval of described data retrieval module, automatically complete the typing of described dictionary.
in said method, described data dictionary library module is the database with tree structure storage magnanimity test data.
in said method, set up data dictionary library module and comprise: substantially carry out the level concept of organising data according to the mankind's natural cognition, to form dictionary tree; The data set corresponding with it to each dictionary shines upon, to form data mapping tables; And the dictionary being close in meaning is divided into groups, to form near synonym storehouse, thereby realize the multiplexing of data.
said method also can comprise following both one or a combination set of: (1), in the time that dictionary is not present in described data dictionary library module, creates this dictionary automatically in described database module; (2) user of service's typing value is added to the data centralization of the described data dictionary library module corresponding with this typing value.
compared with prior art, the system and method for this motion at least tool have the following advantages:
the first, replaced by a general page elements analysis module for the different automatized scripts of different measurands, user of service need to not carry out the page developing shell script in the system of automatic input realizing.The general degree that this has promoted the system and method for this motion, has reduced use threshold.
the second, test data preliminary work no longer needs artificially to arrange form, and the data that automatic learning module is obtained and stored can be shared use in Dictionary Database.This for the mode of script input playback, easier multiplex data.
Brief description of the drawings
after having read the specific embodiment of the present invention with reference to accompanying drawing, those skilled in the art will become apparent various aspects of the present invention.Those skilled in the art should be understood that: these accompanying drawings are only for coordinating embodiment that technical scheme of the present invention is described, and are not intended to protection scope of the present invention to be construed as limiting.
fig. 1 is according to an embodiment of the invention for the page being carried out to the system schematic of automatic input.
fig. 2 is according to an embodiment of the invention for the page being carried out to the method flow diagram of automatic input.
fig. 3 illustrates the book information registration page of certain book management system.
Embodiment
what introduce below is some in multiple possibility embodiment of the present invention, aims to provide basic understanding of the present invention, is not intended to confirm key of the present invention or conclusive key element or limits claimed scope.Easily understand, according to technical scheme of the present invention, do not changing under connotation of the present invention other implementation that one of ordinary skill in the art can propose mutually to replace.Therefore, below embodiment and accompanying drawing be only the exemplary illustration to technical scheme of the present invention, and should not be considered as of the present invention all or be considered as restriction or the restriction to technical solution of the present invention.
with reference to figure 1, it shows a kind of for the page being carried out to the system of automatic input.As shown in Figure 1, can comprise data dictionary library module and client engine module for the system of the page being carried out to automatic input.Numeral dictionary library module is for storing the data of magnanimity, and client engine module is used for analyzing the page to extract page elements, and according to extracted page elements, selects suitable data to carry out automatic input to the page in data dictionary library module.
in a concrete embodiment, data dictionary library module is a database with tree structure storage magnanimity test data.The level concept of organising data is substantially according to the mankind's natural cognition.This data dictionary can comprise following three parts: dictionary tree, data mapping tables, near synonym storehouse.
The example of a simple dictionary tree below:
Figure 928169DEST_PATH_GDA00002859536800041
For dictionary tree above, may there is following data mapping tables:
Books the title: { The Romance of the Three Kingdoms; Journey to the West; A Dream of Red Mansions; The Water Margin }
Person names: { Cao behaviour; Liu Bei; Sun Quan }
Writer's name: { Shakespear; Cao Xueqin }
The writer of foreign country name: { Mark Twain }
Language: { English; Chinese; Python}
Computerese: { Java; C}
Currency code: { 156; 840; 344}
Trade company's code: { 102290053110001; 102290053110002}
Institution Code: { 01022900; 00010000; 00013000}
S class Institution Code: { 00012900; 00013000}
It should be noted that, each dictionary default packet is containing the data set of all dictionary mappings under its branch.And Dictionary Database itself does not retrain for level mechanism and the content of dictionary.
near synonym storehouse is exactly that the dictionary being close in meaning is divided into groups, and realizes the multiplexing of data.For example:
group number 001:{ person names; Name }
group number 002:{ writer name; Author; Author }
generally, the business of measurand all concentrates in a certain field, and a lot of concepts (corresponding to " dictionary " as herein described) in this field are all to have strict hierarchical relationship, and these concepts and hierarchical relationship are generally well known to those skilled in the art.Thereby it is feasible setting up available, an easy-to-use Dictionary Database.And the process of establishing of Dictionary Database is the process of experience record accumulation, but not creation process, this point has ensured that Dictionary Database can share multiplexing completely.In addition, in a preferred embodiment, the content of Dictionary Database can be managed personnel and be optimized improvement by backstage.
continue with reference to figure 1, client engine module can comprise page elements analysis module, data retrieval module, automatic input module and automatic learning module.Wherein, page elements analysis module is treated the page elements of typing for extracting the page; Data retrieval module is for retrieving at database dictionary library module according to page elements, to find and the same or analogous dictionary of page elements; Automatic input module, according to the result for retrieval of data retrieval module, completes the typing of described dictionary automatically.
suppose to exist the page to be measured (the book information registration page of certain book management system) as shown in Figure 3.Page elements analysis module by described in this motion can obtain following dictionary after processing: title, publishing house, author, book edition number, publication date, books classification number, and typing frame corresponding with it these dictionaries can be shone upon.
then, the dictionary that the data retrieval module of this motion can be analyzed out by page elements analysis module is taken in Dictionary Database and is retrieved, and finds identical dictionary or similar dictionary.Searching algorithm can be divided following three steps:
the 1st step: retrieve in Dictionary Database dictionary tree by dictionary value to be detected, jump to the 3rd step as found.As do not found, add to and do not find dictionary value table, and carry out next step.
the 2nd step: retrieve in Dictionary Database near synonym storehouse by dictionary value to be measured, as find the 3rd step of jumping.As do not found reminding user " not this dictionary, please typing voluntarily in Dictionary Database ".
the 3rd step: use the dictionary value finding to obtain the data set that this dictionary is corresponding in data mapping tables, and select at random data for automatic input module.If the data centralization that this dictionary is corresponding does not have data, reminding user " Dictionary Database is data corresponding to this dictionary not, please typing voluntarily ".
finally, automatic input module, for according to the result for retrieval of data retrieval module, completes the typing of described dictionary automatically.In a specific embodiment, open after automatic input function at user's button click, client engine module enters automatic input pattern, first transfers page elements analysis module and analyzes first dictionary and its corresponding typing frame, then dictionary is transferred to data retrieval module.If data retrieval module return data, automatic input module is inserted these data in the typing frame that dictionary is corresponding automatically, completes the typing of first dictionary, and completes successively the automatic input of all the other dictionaries.
optionally, the client engine module of this motion also can comprise automatic learning module.For example, when opening after autolearn feature, client engine enters auto learn mode, no longer carry out automatic input, page typing is completed by user of service oneself, automatic learning module is added the dictionary of this page in dictionary tree on backstage, and adds user of service's typing value to data centralization that this dictionary is corresponding.
generally, carrying out before automatic input work, often need to open in advance auto learn mode, make system can set up the Dictionary Database of measurand.More preferably, automatic learning module can be opened the mode of learning of mourning in silence.Under the learning state of mourning in silence, as long as comprise when the client engine module of automatic learning module has detected page typing, carry out page elements analysis, scan the dictionary that this page exists, if dictionary is not present in dictionary tree, creates dictionary, and the data of user's typing are deposited in to the data centralization of this dictionary mapping.Whole process is invisible for user, and can not exert an influence to user's use.
by above-mentioned, for the page being carried out to the system of automatic input, client engine module can become a kind of testing tool.Adopt this testing tool, user of service does not need to be grasped any system correlation technique, as long as a key just can complete automatic input, or one-key operation automatic learning, automatic learning also can adopt silent mode.
with reference to figure 2, show a kind of method of the page being carried out to automatic input.The method comprises: step 210, set up data dictionary library module, and this numeral dictionary library module is for storing the data of magnanimity; Step 220, analyzes the page to extract page elements; Step 230, retrieves in database dictionary library module according to page elements, to find and the same or analogous dictionary of this page elements; Step 240, according to the result for retrieval of data retrieval module, completes the typing of dictionary automatically.
in a concrete embodiment, data dictionary library module can tree structure be stored magnanimity test data.The step of setting up data dictionary library module specifically can comprise the steps: the first, substantially carrys out the level concept of organising data according to the mankind's natural cognition, to form dictionary tree; The second, the data set corresponding with it to each dictionary shines upon, to form data mapping tables; The 3rd, the dictionary being close in meaning is divided into groups, to form near synonym storehouse, thereby realize the multiplexing of data.
in another concrete embodiment, the method that the page is carried out to automatic input also can comprise following both one or a combination set of: (1), in the time that dictionary is not present in data dictionary library module, creates this dictionary automatically in database module; (2) user of service's typing value is added to the data centralization of the data dictionary library module corresponding with this typing value.
the above-mentioned method that the page is carried out to automatic input does not need to write automatized script, do not need for the special setup test data of specific object to be measured, because the client engine module in this method can be analyzed page elements, and in Dictionary Database, select suitable data automatic input according to page elements.
in addition, the above-mentioned method that the page is carried out to automatic input does not need for the specific special setup test data of measurand, and Data Preparation can, by all client engine by automatic learning, submit to Dictionary Database constantly to accumulate, continue to optimize.Dictionary Database is shared and is used and use in design pure natural thinking between all client engine.
in sum, this motion proposes a kind of page automatic input method and system based on Dictionary Database.In one embodiment, this system can comprise client engine module and data dictionary library module.Client engine module operates on user of service's machine, mainly completes the functions such as page elements analysis, data retrieval, automatic input, automatic learning.Dictionary Database can be local, can be also far-end, high in the clouds, it is shared for one or more client engine modules as a public knowledge base, and the learning outcome of all clients of persistence.The method and system of this motion are applicable to the test job of the system of various types of inclusion information input interfaces.By adopting the method and system of this motion, can improve the testing efficiency for the bulk information typing page.In addition, Dictionary Database has realized information sharing, and full decoupled with measurand, can simulating nature thinking.Moreover, the autolearn feature of client engine module make Dictionary Database can be constantly oneself grow, operating characteristic that also can analysis user in abundant information amount, thus test data is continued to optimize.
above, describe the specific embodiment of the present invention with reference to the accompanying drawings.But those skilled in the art can understand, without departing from the spirit and scope of the present invention in the situation that, can also do various changes and replacement to the specific embodiment of the present invention.These changes and replacement all drop in the claims in the present invention book limited range.

Claims (11)

1. for the page being carried out to a system for automatic input, comprising:
Data dictionary library module, described digital dictionary library module is for storing the data of magnanimity; And
Client engine module, the page, to extract page elements, and according to described page elements, selects suitable data to carry out automatic input to the described page in described data dictionary library module described in described client engine module analysis.
2. the system as claimed in claim 1, wherein, described data dictionary library module is the database with tree structure storage magnanimity test data.
3. system as claimed in claim 2, wherein, described data dictionary library module comprises:
Dictionary tree, the basic level concept of carrying out organising data according to the mankind's natural cognition;
Data mapping tables, for reflecting the relation between each dictionary and its corresponding data set; With
Near synonym storehouse, for the dictionary being close in meaning is divided into groups, realizes the multiplexing of data.
4. the system as claimed in claim 1, wherein, described client engine module comprises:
Page elements analysis module, treats the page elements of typing for extracting the described page;
Data retrieval module, for retrieving at described database dictionary library module according to described page elements, to find and the same or analogous dictionary of described page elements; And
Automatic input module, according to the result for retrieval of described data retrieval module, completes the typing of described dictionary automatically.
5. system as claimed in claim 4, wherein, described client engine module further comprises:
Automatic learning module, for adding user of service's typing value to the data centralization of the described data dictionary library module corresponding with this typing value.
6. the system as claimed in claim 1, wherein, described client engine block configuration becomes in the time that dictionary is not present in described data dictionary library module, automatically in described database module, creates this dictionary, and the data of user's typing is deposited in to the data centralization of this dictionary mapping.
7. a method of the page being carried out to automatic input, comprising:
Set up data dictionary library module, described digital dictionary library module is for storing the data of magnanimity;
Analyze the described page to extract page elements;
Retrieve in described database dictionary library module according to described page elements, to find and the same or analogous dictionary of described page elements; And
According to the result for retrieval of described data retrieval module, automatically complete the typing of described dictionary.
8. method as claimed in claim 7, wherein, described data dictionary library module is the database with tree structure storage magnanimity test data.
9. method as claimed in claim 8, wherein, set up data dictionary library module and comprise:
Substantially carry out the level concept of organising data according to the mankind's natural cognition, to form dictionary tree;
The data set corresponding with it to each dictionary shines upon, to form data mapping tables; And
The dictionary being close in meaning is divided into groups, to form near synonym storehouse, thereby realize the multiplexing of data.
10. method as claimed in claim 7, also comprises: in the time that dictionary is not present in described data dictionary library module, automatically in described database module, create this dictionary.
11. methods as described in claim 7 or 10, also comprise: the data centralization of user of service's typing value being added to the described data dictionary library module corresponding with this typing value.
CN201210541898.4A 2012-12-14 2012-12-14 Page automatic input system and method Pending CN103870493A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210541898.4A CN103870493A (en) 2012-12-14 2012-12-14 Page automatic input system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210541898.4A CN103870493A (en) 2012-12-14 2012-12-14 Page automatic input system and method

Publications (1)

Publication Number Publication Date
CN103870493A true CN103870493A (en) 2014-06-18

Family

ID=50909036

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210541898.4A Pending CN103870493A (en) 2012-12-14 2012-12-14 Page automatic input system and method

Country Status (1)

Country Link
CN (1) CN103870493A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105488219A (en) * 2015-12-21 2016-04-13 珠海威泓医疗科技有限公司 Automatic webpage information entering method and system
CN107341080A (en) * 2017-07-07 2017-11-10 郑州云海信息技术有限公司 A kind of test case management system and method
CN108549722A (en) * 2018-04-25 2018-09-18 稿定(厦门)科技有限公司 Multi-platform data publication method, system and medium
CN110287222A (en) * 2019-05-29 2019-09-27 贵州观云易优游科技发展有限公司 A kind of book information storage method and system
CN110704524A (en) * 2019-09-06 2020-01-17 平安城市建设科技(深圳)有限公司 House source recording method, device and equipment and computer readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6564213B1 (en) * 2000-04-18 2003-05-13 Amazon.Com, Inc. Search query autocompletion
CN101127050A (en) * 2007-07-03 2008-02-20 北京大学 Method for automatically extracting website owner administrative apanage information from web page
CN101196885A (en) * 2006-12-06 2008-06-11 鸿富锦精密工业(深圳)有限公司 Web page data auto-filling system and method
CN101782998A (en) * 2009-01-20 2010-07-21 复旦大学 Intelligent judging method for illegal on-line product information and system
CN102084363A (en) * 2008-07-03 2011-06-01 加利福尼亚大学董事会 A method for efficiently supporting interactive, fuzzy search on structured data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6564213B1 (en) * 2000-04-18 2003-05-13 Amazon.Com, Inc. Search query autocompletion
CN101196885A (en) * 2006-12-06 2008-06-11 鸿富锦精密工业(深圳)有限公司 Web page data auto-filling system and method
CN101127050A (en) * 2007-07-03 2008-02-20 北京大学 Method for automatically extracting website owner administrative apanage information from web page
CN102084363A (en) * 2008-07-03 2011-06-01 加利福尼亚大学董事会 A method for efficiently supporting interactive, fuzzy search on structured data
CN101782998A (en) * 2009-01-20 2010-07-21 复旦大学 Intelligent judging method for illegal on-line product information and system

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105488219A (en) * 2015-12-21 2016-04-13 珠海威泓医疗科技有限公司 Automatic webpage information entering method and system
CN105488219B (en) * 2015-12-21 2019-06-04 珠海威泓医疗科技有限公司 Webpage information method for automatically inputting and system
CN107341080A (en) * 2017-07-07 2017-11-10 郑州云海信息技术有限公司 A kind of test case management system and method
CN108549722A (en) * 2018-04-25 2018-09-18 稿定(厦门)科技有限公司 Multi-platform data publication method, system and medium
CN108549722B (en) * 2018-04-25 2021-12-28 稿定(厦门)科技有限公司 Multi-platform data publishing method, system and medium
CN110287222A (en) * 2019-05-29 2019-09-27 贵州观云易优游科技发展有限公司 A kind of book information storage method and system
CN110704524A (en) * 2019-09-06 2020-01-17 平安城市建设科技(深圳)有限公司 House source recording method, device and equipment and computer readable storage medium

Similar Documents

Publication Publication Date Title
CN107766371B (en) Text information classification method and device
US10579736B2 (en) Method and device for determining comment, server and storage medium
US10423649B2 (en) Natural question generation from query data using natural language processing system
CN102708096B (en) Network intelligence public sentiment monitoring system based on semantics and work method thereof
CN102207948B (en) Method for generating incident statement sentence material base
US20170193393A1 (en) Automated Knowledge Graph Creation
US20090094189A1 (en) Methods, systems, and computer program products for managing tags added by users engaged in social tagging of content
US20200265074A1 (en) Searching multilingual documents based on document structure extraction
CN106537370A (en) Method and system for robust tagging of named entities in the presence of source or translation errors
CN110377908B (en) Semantic understanding method, semantic understanding device, semantic understanding equipment and readable storage medium
CN101840406A (en) Place name searching device and system
Chrupała Text segmentation with character-level text embeddings
CN101093478A (en) Method and system for identifying Chinese full name based on Chinese shortened form of entity
CN103870493A (en) Page automatic input system and method
WO2014000130A1 (en) Method or system for automated extraction of hyper-local events from one or more web pages
Sarmento et al. Repentino–a wide-scope gazetteer for entity recognition in portuguese
CN112395858A (en) Multi-knowledge point marking method and system fusing test question data and answer data
CN102207947B (en) Direct speech material library generation method
Strobel et al. Metadata for scientific audiovisual media: current practices and perspectives of the TIB| AV-Portal
CN114491209A (en) Method and system for mining enterprise business label based on internet information capture
Sabty et al. Techniques for named entity recognition on arabic-english code-mixed data
Breuing Improving human-agent conversations by accessing contextual knowledge from wikipedia
Ya et al. A Customized Tourism System Using Log Data on Hadoop
Du et al. A novel pattern learning method for open domain question answering
Thottempudi A visual narrative of ramayana using extractive summarization topic modeling and named entity recognition

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140618

RJ01 Rejection of invention patent application after publication