WO2000077668A2 - A method for caching xml documents viewable on devices with different displays - Google Patents

A method for caching xml documents viewable on devices with different displays Download PDF

Info

Publication number
WO2000077668A2
WO2000077668A2 PCT/US2000/016206 US0016206W WO0077668A2 WO 2000077668 A2 WO2000077668 A2 WO 2000077668A2 US 0016206 W US0016206 W US 0016206W WO 0077668 A2 WO0077668 A2 WO 0077668A2
Authority
WO
WIPO (PCT)
Prior art keywords
document
processed version
unprocessed
client
markup language
Prior art date
Application number
PCT/US2000/016206
Other languages
French (fr)
Other versions
WO2000077668A3 (en
Inventor
Venkata S. Ayyagari
Polina Kuznetsov
Lutfiye Umit Valcinalp
Original Assignee
Sun Microsystems, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sun Microsystems, Inc. filed Critical Sun Microsystems, Inc.
Priority to EP00939831A priority Critical patent/EP1299812A2/en
Priority to AU54851/00A priority patent/AU5485100A/en
Publication of WO2000077668A2 publication Critical patent/WO2000077668A2/en
Publication of WO2000077668A3 publication Critical patent/WO2000077668A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching

Definitions

  • the present invention relates to the transfer and storage of information over an electronic network. More particularly, the invention relates to a proxy server caching mechanism that provides a method for retrieving, processing and storing XML documents for access by an end user.
  • HTML Hypertext Markup Language
  • HTML is the most successful electronic-publishing language ever invented, it is superficial. In essence, it describes how a Web browser should arrange text, images and push-buttons on a page, but its lack of structure creates significant barriers to using HTML for applications beyond simple browsing. For example, more and more people are trying to configure their Web sites to take orders from customers, transmit records, even run factories and scientific instruments from half a world away. Consequently, several enhancements and additions have been made to the suite of software systems operating on the Internet to make HTML sufficiently functional. Functionality is often added in HTML documents using Java, JavaScript, and Common Gateway Interface (CGI) programs. Unfortunately, adding functionality in this way obscures the clean, basic revisions arising from the efforts of browser vendors to extend the Web, followed by the efforts of standard bodies such as the World Wide Web Consortium (W3C) to keep up with the ever-changing state of the art.
  • W3C World Wide Web Consortium
  • XML Extensible Markup Language
  • HTML Extensible Markup Language
  • An XML document is composed of data embedded within markup tags. These tags are similar to those used in HTML, except that the XML tags may be self-defined. In other words, XML tags define the internal structure of the embedded data. This convention change leads to a greatly increased number of tags in an XML document, compared to the identical HTML document.
  • This example represents information about an employee: name, phone, address,
  • the employee is described by a name, an email address, phone and fax numbers, a location, and an
  • An employee object consists of name, email,
  • a name consists of first, middle, and last
  • a location contains a building and a room object, etc.
  • XML representations in further contrast to HTML, do not contain a description of how to display the content. XML instead stores the rendering (e.g., fonts, colors,
  • stylesheet Language XSL
  • a stylesheet can be associated with a client and/or a server.
  • a server can
  • processor on the server side means that one can make use of the flexibility and power of
  • a user is able to use a stylesheet that the server does not know about. This
  • client side allows the client to use the same data and feed it into a database or send it to
  • Stylesheets can also take XML documents and render them into multi-color
  • the XML-powered Web will be faster, friendlier and a better
  • PDAs personal assistants
  • embedded devices etc.
  • Systems and methods consistent with this invention provide for efficient processing, caching and routing of XML documents through the use of a proxy server.
  • the proxy server interfaces with at least one client computer and a plurality of remote
  • the proxy server is adapted to receive a document request in the Internet.
  • the proxy server is further adapted to search a local cache for a processed version of the document and to transmit the processed document to the requesting client. In the event the document is not found
  • the proxy server is adapted to process the XML document, route it to the
  • the proxy server simply monitors traffic
  • the proxy server searches a local cache for a processed version of the XML
  • Figure 1 is a block diagram of a computer network including an arrangement
  • FIG. 2 is a block diagram of the client computer of Figure 1;
  • FIG. 3 is a block diagram of the XML proxy server of Figure 1;
  • Figure 4 is a block diagram of the remote server of Figure 1;
  • Figure 5 is a flow chart of steps for retrieving a document in accordance with the
  • Figure 6 is a flow chart of an alternate set of steps for retrieving a document in
  • a system in accordance with the present invention comprises a proxy server
  • the proxy server is adapted to receive a document request in the form of a
  • URL uniform resource locator
  • remote server and to determine whether the document is an unprocessed XML document.
  • the proxy server is further adapted to search a local cache for a processed version of the document, and to transmit the processed version to the requesting client computer. In the event the document is not found in local cache, the proxy server is adapted to process the XML document, route it
  • an XML proxy network system 10 is comprised of a
  • the network can be comprised of PentiumTM based
  • XML proxy network 10 includes client devices 100, an XML proxy server 200,
  • FIG. 1 are coupled to a network 20 which may be the Internet, via a series of network cable interfaces. More specifically, client devices 100 are connected to network 20 via
  • XML proxy server 200 is coupled to network 20 via cable interface 40,
  • remote server computers 300 are coupled to network 20 via cable interface 50.
  • FIG. 2 A detailed diagram of client device 100 is shown in FIG. 2. As shown in FIG. 2,
  • client device 100 is comprised of a central processor unit (CPU) 101, a memory 102, a
  • a display adapter 106 a display 108, a user interface (UI) adapter 110, a pointing device
  • a keyboard 112 a keyboard 112
  • IO input/output
  • disk storage unit 115 a disk storage unit 115
  • Communications adapter 120 for providing a communications function.
  • Memory 102
  • each client includes an operating system 130 for operating the device and a browser program 132 for rendering and displaying content. As shown, the various components of each client
  • display adaptor 106 is
  • user interface adaptor 110 is coupled to pointing device 111 and
  • I/O adaptor 114 is coupled to disk storage unit 115 and communications
  • adaptor 120 is coupled to network interface cable 30 for providing connectivity between
  • client computer 100 and network 20.
  • XML proxy server 200 is comprised of a CPU 201 , a memory 202, a display
  • Memory 202 includes an operating system 230 and an XML processor program 232. As shown, the
  • each XML proxy server 200 communicates through a system bus
  • communications adaptor 220 is coupled to network
  • interface cable 40 for providing connectivity between XML proxy server 200 and network
  • FIG.4 A detailed diagram of remote server computer 300 is shown in FIG.4. As shown,
  • remote server computer 300 is comprised of the same hardware components as client
  • Internet content 332 associated with remote server computer 300 is associated with Internet content 332 associated with remote server computer 300.
  • FIG. 5 there is shown exemplary steps of XML processor
  • program 232 for implementing the method for processing and routing XML documents
  • the XML proxy server 200 is
  • proxy server 200 are generally discussed as being performed by such single device.
  • XML proxy server 200 may actually comprise multiple physical and logical
  • XML proxy server 200 which are provided by XML proxy server 200 may actually be distributed among multiple server devices. As shown in FIG. 5, a user action in the form of a uniform resource
  • the XML proxy server 200 decodes the received URL and then ascertains whether the document is an XML document (step 410). For a
  • MIME refers to an official Internet standard that specifies how messages must be formatted so that they can be exchanged between different email
  • XML proxy server 200 determines that
  • the document is an XML document
  • step 415 XML proxy server 200 determines whether the document has been previously retrieved by the same or a different client computer 100 (i.e., is the
  • proxy server 200 routes the document to the client device If the document has not been cached, proxy server 200 routes the request to the client device.
  • step 420 Processing then flows to step 425 where the
  • remote server 300 identifies the requested document, and routes it to XML proxy server
  • step 425 Processing flows from step 425 to step 435 where XML proxy server 200 locates
  • XML proxy server 200 retrieves client
  • step 440 Processing then flows to step 445 where XML proxy server 200 applies any stylesheets to the document.
  • step 450 XML proxy
  • server 200 stores the processed document in local cache 215.
  • step 455 the document
  • FIG. 1 schematic overview illustrated by FIG. 1 and the detailed schematics of FIGs.2-4 that our
  • the client computer 100 stores within the system in which our invention is situated.
  • the client computer 100 is the server computer 100
  • the proxy server 200 with the facilities acts as an agent to review the data
  • remote servers 300 are provided with reference to FIG. 6.
  • FIG. 6 shows a flowchart of an alternate implementation of the process used by
  • proxy server 200 for processing requests for XML documents.
  • the XML proxy server 200 does not receive document requests from client computers 100.
  • XML proxy server 200 simply monitors document flow to client computer 100
  • proxy server 200 receives the document from the remote server 300.
  • XML proxy server 200 ascertains whether the document is
  • XML proxy server 200 an XML document.
  • XML proxy server 200 XML proxy server 200
  • XML proxy server 200 determines that the document is an XML document, processing flows to step 525, otherwise processing flows
  • step 525 XML proxy server 200 determines whether the document has
  • step 555 If the document has not been cached, processing flows from step 555.
  • XML proxy server 200 locates and retrieves the document's
  • XML proxy server 200 retrieves client computer's 100 stylesheet in step
  • step 540 Processing then flows to step 545 where XML proxy server 200 applies one or both stylesheets to the document.
  • XML proxy server 200 stores the processed document in local cache 215.
  • step 555 the document is routed to client

Abstract

Systems and methods consistent with this invention provide for efficient processing, caching and routing of XML documents through the use of a proxy server. The proxy server is coupled to at least one client computer and a plurality of remote servers on the Internet. The proxy server is adapted in this preferred embodiment to receive a document request in the form of a uniform resource locator (URL) from a client computer and to determine whether the document is an unprocessed XML document. If the document is an unprocessed XML document, the proxy server is further adapted to search a local cache for a processed version of the document, and to transmit the processed document to the requesting client. In the event the document is not found in local storage, the proxy server is adapted to process the XML document, route it to the client and then store the file in local storage in anticipation of subsequent requests for the same document.

Description

A METHOD FOR CACHING XML DOCUMENTS VIEWABLE ON DEVICES WITH DIFFERENT DISPLAYS
Related Applications
This application claims priority in U.S. Provisional Application No. 60/138,685, entitled "Method and System for Offloading Processing of XML Documents to a Proxy Server" filed June 14, 1999, bearing attorney docket no. 06502-6009.
Field of the Invention The present invention relates to the transfer and storage of information over an electronic network. More particularly, the invention relates to a proxy server caching mechanism that provides a method for retrieving, processing and storing XML documents for access by an end user.
Description of the Prior Art
Computers need to be told exactly what things are, how they are related and how to deal with them. From their beginning, computers have relied on programming languages to transform ideas and algorithms into useful digital processing system output. A network of computers like the Internet or its subset, the World Wide Web is no different. The Web relies on software to collect, process and disseminate data to its network users. Shortly after its inception, the Hypertext Markup Language (HTML) became the Web's de facto markup language with a set of symbols or codes that tell a Web browser how to display a Web page's content. HTML gained widespread popularity by providing an outstanding mechanism to deliver simple documents over the Web. It makes surfing the Web so simple, most people can effectively use it with little or no training.
Although HTML is the most successful electronic-publishing language ever invented, it is superficial. In essence, it describes how a Web browser should arrange text, images and push-buttons on a page, but its lack of structure creates significant barriers to using HTML for applications beyond simple browsing. For example, more and more people are trying to configure their Web sites to take orders from customers, transmit records, even run factories and scientific instruments from half a world away. Consequently, several enhancements and additions have been made to the suite of software systems operating on the Internet to make HTML sufficiently functional. Functionality is often added in HTML documents using Java, JavaScript, and Common Gateway Interface (CGI) programs. Unfortunately, adding functionality in this way obscures the clean, basic revisions arising from the efforts of browser vendors to extend the Web, followed by the efforts of standard bodies such as the World Wide Web Consortium (W3C) to keep up with the ever-changing state of the art.
To stem this flow of incessant change, a working group of the W3C developed a new kind of markup language now known as the Extensible Markup Language (XML). XML addresses many of HTML's limitations, by creating a whole new way to approach how Web sites are structured and designed. It also expands how relationships between content (the messages a Web site seeks to communicate) and form (the ways that content is formatted or presented, and the ways links between elements operate) may be expressed. An XML document is composed of data embedded within markup tags. These tags are similar to those used in HTML, except that the XML tags may be self-defined. In other words, XML tags define the internal structure of the embedded data. This convention change leads to a greatly increased number of tags in an XML document, compared to the identical HTML document. Consider this trivial HTML example: <B>Nenkata S. Ayyagari</B>
<P> vsa@eng.sun.com<BR> phone: 408-517-aaaa<BR> fax: 408-863-bbbb<BR> 901 San Antonio Rd.<BR>
SUN Microsystems<BR>
MS UCUP02-20KBR>
Palo Alto, CA<BR>
94303<BR>
This example represents information about an employee: name, phone, address,
etc. The markup describes how the lines should be displayed (<B> Bold, <P> Paragraph
Break, <BR> Line Break). When the HTML is processed by the browser, no semantics
can be inferred. Consequently, the computer has no understanding of the kind of
information being rendered and therefore cannot provide meaningful insight into its
contents. No w consider a possible XML representation of the same information that conveys
the relationship between various data objects. In the XML version below, the employee is described by a name, an email address, phone and fax numbers, a location, and an
address. Note that each conceptual piece of information is represented by its own XML
element, such as <EMPLOYEE>, <NAME>, and <ADDRESS>.
<EMPLOYEE> <NAME>
<FIRST>Venkata</FIRST> <MIDDLE> S</MIDDLE> <LAST>Ayyagari</LAST>
</NAME>
<EMAIL>vsa@eng.sun.com</EMAIL> <PHONE>408-517-aaaa</PHONE> <FAX>408-863-bbbb</FAX> <LOCATION>
<STREET NUMBER>90K/STREET NUMBER> <STREET NAME>San Antonio Rd</STREET NAME> </LOCATION> <ADDRESS> <ORG>SUN Microsystems</ORG>
<MAILSTOP>MS UCUP02-201 </MAILSTOP> <CITY>Palo Alto</CITY> <STATE>CA</STATE> <ZIP>94303</ZIP> </ADDRESS>
</EMPLOYEE>
The advantage of XML in this example is that it preserves the semantics and
structure of the data in a hierarchical form. An employee object consists of name, email,
phone, fax, location, and address objects. A name consists of first, middle, and last
components, a location contains a building and a room object, etc.
XML representations, in further contrast to HTML, do not contain a description of how to display the content. XML instead stores the rendering (e.g., fonts, colors,
leading, margins, typefaces, and other aspects of style) in an Extensible Stylesheet Language (XSL) document called a "stylesheet." Separating the functionality in this way
allows publishers, who would often like to "write once and publish everywhere "--to distill
the substance of a publication and then pour it into myriad forms, both printed and
electronic. Further complicating the differences between XML and HTML is the fact that
a stylesheet can be associated with a client and/or a server. In other words, a server can
apply a particular stylesheet to an XML document and then transmit the document to a
client computer that also applies its own stylesheet to the document. Having an XSL
processor on the server side means that one can make use of the flexibility and power of
XML without having to worry about whether a particular client provides XSL. However,
in order to take full advantage of XML and XSL, a scenario where the rendering occurs on the client side is most appropriate. Having the server send XML data in conjunction with XSL stylesheets to a client allows that client to use different stylesheets based on
user preferences and the like, without having to refer back to the XSL stylesheet stored on the remote server. Having the client send out XML and XSL provides other benefits
as well. A user is able to use a stylesheet that the server does not know about. This
allows for full customization of renderings based on a user's needs. Also, XML on the
client side allows the client to use the same data and feed it into a database or send it to
a speech synthesis program. It is also easy to envision one stylesheet for rendering in the
browser and yet another one, working on the same data, for preparing high-quality printouts. Stylesheets can also take XML documents and render them into multi-color,
multi-font documents, braille, audible speech or any format desired simply by altering the associated stylesheet. Similar alterations to HTML documents could not be done without significant alterations to the underlying HTML program. Experts in the field believe that the change from HTML to XML has the potential to extend the Internet beyond mere information delivery to many other kinds of unimaginable human activity.
Thus, for its users, the XML-powered Web will be faster, friendlier and a better
place to do business. Referring back to the previous example, it is obvious that a search
for SUN Microsystems employees using the XML document would be much quicker, and
more accurate from a similar search of the HTML document. Web site designers, on the
other hand, will find it more demanding. Battalions of programmers will be needed to
exploit new XML languages to their fullest. Future Web designers will need to be versed
not just in the production of words and graphics but also in the construction of
multilayered, interdependent systems of DTDs, data trees, hyperlink structures and stylesheets. Web clients correspondingly will need to possess more processing power to read an XML document, retrieve the appropriate stylesheet, and use it to sort and format
the information on the screen. Just as it is obvious that Web servers will become more
efficient and less burdened due to the organization and efficiencies provided by XML, it
is equally clear that the current configuration of Web clients could quickly become
inundated by the multiple file access operations necessary to process a single XML
document. This reality is particularly true with respect to "thin clients" like personal data
assistants (PDAs), embedded devices, etc.
There is a need therefore for an apparatus and method that minimizes unnecessary network traffic; provides XML processing capability to clients not otherwise capable of
such processing; optimizes processing of XML documents on the Internet; and prevents
redundant processing of XML documents. Summary of the Invention
Systems and methods consistent with this invention provide for efficient processing, caching and routing of XML documents through the use of a proxy server.
The proxy server interfaces with at least one client computer and a plurality of remote
servers on the Internet. The proxy server is adapted to receive a document request in the
form of a uniform resource locator (URL) from a client computer, to forward the request
to a designated remote server based on the URL, to receive the requested document from
the remote server, and to determine whether the document is an unprocessed XML
document. If the document is an unprocessed XML document, the proxy server is further adapted to search a local cache for a processed version of the document and to transmit the processed document to the requesting client. In the event the document is not found
in local storage, the proxy server is adapted to process the XML document, route it to the
client and then store the file in local storage in anticipation of subsequent requests for the
same document. In another aspect of the subject invention, the proxy server simply monitors traffic
destined for a client. When it identifies an unprocessed XML document being routed to
the client, the proxy server searches a local cache for a processed version of the XML
document, and routes the processed document to the client, if a processed version was found. In the event the document is not found in local storage, again the proxy server is
adapted to process the XML document, route it to the client and then store the file in local
storage in anticipation of subsequent requests for the same document. Brief Description of the Drawings
The accompanying drawings, which are incorporated in and constitute a part of
this specification, illustrate embodiments of the invention and, together with the
description, serve to explain the objects, advantages, and principles of the invention.
Figure 1 is a block diagram of a computer network including an arrangement
constructed in accordance with the subject invention;
Figure 2 is a block diagram of the client computer of Figure 1;
Figure 3 is a block diagram of the XML proxy server of Figure 1;
Figure 4 is a block diagram of the remote server of Figure 1; Figure 5 is a flow chart of steps for retrieving a document in accordance with the
subject invention; and
Figure 6 is a flow chart of an alternate set of steps for retrieving a document in
accordance with the subject invention.
Detailed Description
A system in accordance with the present invention comprises a proxy server
interfacing with at least one client computer and a plurality of remote servers on the
Internet. The proxy server is adapted to receive a document request in the form of a
uniform resource locator (URL) from a client computer, to forward the request to a
designated remote server based on the URL, to receive the requested document from the
remote server, and to determine whether the document is an unprocessed XML document.
If the document is an unprocessed XML document, the proxy server is further adapted to search a local cache for a processed version of the document, and to transmit the processed version to the requesting client computer. In the event the document is not found in local cache, the proxy server is adapted to process the XML document, route it
to the client computer and then store the processed XML document in the server's local
cache.
Referring first to FIG. 1, an XML proxy network system 10 is comprised of a
plurality of interconnected computers and microprocessors hosting a plurality of operating
systems. By way of example, the network can be comprised of Pentium™ based
microprocessors operating on Windows/NT, UNIX and/or Windows/CE operating systems. XML proxy network 10 includes client devices 100, an XML proxy server 200,
and a plurality of remote server computers 300. The devices and computers, as shown in
FIG. 1, are coupled to a network 20 which may be the Internet, via a series of network cable interfaces. More specifically, client devices 100 are connected to network 20 via
cable interface 30, XML proxy server 200 is coupled to network 20 via cable interface 40,
and remote server computers 300 are coupled to network 20 via cable interface 50.
A detailed diagram of client device 100 is shown in FIG. 2. As shown in FIG. 2,
client device 100 is comprised of a central processor unit (CPU) 101, a memory 102, a
display adapter 106, a display 108, a user interface (UI) adapter 110, a pointing device
111, a keyboard 112, an input/output (IO) adapter 114, a disk storage unit 115, and a
communications adapter 120 for providing a communications function. Memory 102
includes an operating system 130 for operating the device and a browser program 132 for rendering and displaying content. As shown, the various components of each client
device 100 communicate through a system bus 113 or similar architecture. There may
be other components as well, but these are not shown to facilitate description of the
unique aspects of this embodiment of the invention. The hardware arrangement of this computer, as well as the other computers discussed in this specification is intentionally
shown as general, and is meant to represent a broad variety of architectures, which depend
on the particular computing device used. As shown in FIG. 2, display adaptor 106 is
coupled to display 108, user interface adaptor 110 is coupled to pointing device 111 and
keyboard 112, I/O adaptor 114 is coupled to disk storage unit 115 and communications
adaptor 120 is coupled to network interface cable 30 for providing connectivity between
client computer 100 and network 20.
A detailed diagram of XML proxy server 200 is shown in FIG. 3. As with client devices 100, XML proxy server 200 is comprised of a CPU 201 , a memory 202, a display
adapter 206, a display 208, a UI adapter 210, a pointing device 211, a keyboard 212, an IO adapter 214, a disk storage unit 215, and a communications adapter 220. Memory 202 includes an operating system 230 and an XML processor program 232. As shown, the
various components of each XML proxy server 200 communicate through a system bus
213 or similar architecture. Also, communications adaptor 220 is coupled to network
interface cable 40 for providing connectivity between XML proxy server 200 and network
20.
A detailed diagram of remote server computer 300 is shown in FIG.4. As shown,
remote server computer 300 is comprised of the same hardware components as client
computer 100 and XML proxy server 200. In addition, stored in memory 302 is the
Internet content 332 associated with remote server computer 300.
Referring now to FIG. 5, there is shown exemplary steps of XML processor
program 232 for implementing the method for processing and routing XML documents
in system 10 in accordance with an implementation of the present invention. Note that in this description, in order to facilitate explanation, the XML proxy server 200 is
generally discussed as if it were a single device, and functions provided by the XML
proxy server 200 are generally discussed as being performed by such single device.
However, XML proxy server 200 may actually comprise multiple physical and logical
devices connected in a distributed architecture, and the various functions discussed below
which are provided by XML proxy server 200 may actually be distributed among multiple server devices. As shown in FIG. 5, a user action in the form of a uniform resource
locator (URL) is received at step 405. The XML proxy server 200 decodes the received URL and then ascertains whether the document is an XML document (step 410). For a
web page this can be done by examining the URL for a reference to "XML". For a mail message this can be done by examining the multipurpose Internet mail extension (MIME)
for a reference to "XML". MIME refers to an official Internet standard that specifies how messages must be formatted so that they can be exchanged between different email
systems. "Text/xml" and "application/xml" are two media types that enable the exchange
of XML documents with various email systems. If XML proxy server 200 determines that
the document is an XML document, processing flows to step 415, otherwise processing
flows to step 455. In step 415, XML proxy server 200 determines whether the document has been previously retrieved by the same or a different client computer 100 (i.e., is the
document cached?) If the document is cached, processing flows to step 430 and the XML document is retrieved from the local cache 215 of XML proxy server 200. Processing
then flows to step 455 and the proxy server 200 routes the document to the client device If the document has not been cached, proxy server 200 routes the request to the
appropriate remote server 300 (step 420). Processing then flows to step 425 where the
remote server 300 identifies the requested document, and routes it to XML proxy server
200. Processing flows from step 425 to step 435 where XML proxy server 200 locates
and retrieves the document's stylesheet. Next, XML proxy server 200 retrieves client
computer's 100 stylesheet in step 440. Processing then flows to step 445 where XML proxy server 200 applies any stylesheets to the document. Next, in step 450, XML proxy
server 200 stores the processed document in local cache 215. In step 455 the document
is routed to client device 100.
Thus, in connection with one embodiment, it should be appreciated from the
schematic overview illustrated by FIG. 1 and the detailed schematics of FIGs.2-4 that our
invention may be employed in a distributed computer system environment which has
internal, external and intranet networks collectively represented in our schematic
overview by the network 20 to connect clients to World Wide Web servers and other
servers within the system in which our invention is situated. The client computer 100
makes a request to the network 20 for a access to a Web site located anywhere on the
entire network. The proxy server 200 with the facilities acts as an agent to review the data
gathered from one or more of the multiple remote servers 300 coupled to the network 20
and convert it, if necessary into a format acceptable to the requesting client 100. Further
details with respect to the use of our invention for information retrieval from the multiple
remote servers 300 are provided with reference to FIG. 6.
FIG. 6 shows a flowchart of an alternate implementation of the process used by
proxy server 200 for processing requests for XML documents. In this implementation the XML proxy server 200 does not receive document requests from client computers 100.
Instead, XML proxy server 200 simply monitors document flow to client computer 100
to determine whether a document is an XML document and then either routes a previously
stored document to the client computer 100 or performs processing on the unprocessed
XML document before routing it to the client computer 100. As shown in step 505, the
process begins when a user transmits a request for a document directly to the appropriate
remote server 300. In step 510, proxy server 200 receives the document from the remote
server 300, and in step 520, XML proxy server 200 ascertains whether the document is
an XML document. As in the case of the preferred embodiment, XML proxy server 200
performs this step by examining the URL. If XML proxy server 200 determines that the document is an XML document, processing flows to step 525, otherwise processing flows
to step 555. In step 525, XML proxy server 200 determines whether the document has
been cached. If the document is cached, processing flows to step 530 and the XML
document is retrieved from the local cache 215 of XML proxy server 200 and processing
then flows to step 555. If the document has not been cached, processing flows from step
525 to step 535 where XML proxy server 200 locates and retrieves the document's
stylesheet. Next, XML proxy server 200 retrieves client computer's 100 stylesheet in step
540. Processing then flows to step 545 where XML proxy server 200 applies one or both stylesheets to the document. Next, in step 550, XML proxy server 200 stores the processed document in local cache 215. In step 555 the document is routed to client
computer 100, and in step 560, processing completes.
It will be apparent to those skilled in the art that various modifications and
variations can be made in the disclosed process and product without departing from the scope or spirit of the invention. Other embodiments of the invention will be apparent to
those skilled in the art from consideration of the specification and practice of the
invention disclosed herein. It is intended that the specification and examples be
considered as exemplary only, with a true scope and spirit of the invention being indicated
by the following claims.

Claims

WHAT IS CLAIMED IS:
1. A method for processing content requests in a network having at least one
content provider having stored documents in an unprocessed format, a proxy server and
at least one client, the method comprising the following steps performed by the proxy
server, of:
receiving a request from the client for a document associated with the content
provider;
determining whether a processed version of the document is located in a local
cache;
when it is determined that the processed version of the document is located in the
local cache, providing the processed version of the document to the client; and
when it is determined that the processed version of the document is not located in
the local cache, (i) obtaining the unprocessed document associated with the content
provider; (ii) processing the unprocessed document in accordance with predetermined
instructions associated with the unprocessed document; and (iii) providing the processed
version of the identified processed document to the client.
2. The method of claim 1, wherein the step of processing the unprocessed
document, further comprises the step of storing the processed version of the document in
the local cache.
3. The method of claim 1, wherein the step of determining whether a
processed version of the document is located in local cache, further comprises the step of ascertaining whether the document is written in a second markup language, wherein said
second markup language is a processed version of said first markup language.
4. The method of claim 1, wherein the step of processing the unprocessed
document in accordance with predetermined instructions associated with the unprocessed
document, further comprises the step of converting the unprocessed document from a first
markup language to a second markup language.
5. The method of claim 1, wherein said second markup language is the
extensible markup language (XML).
6. A method for processing content requests in a network having at least one
content provider having stored documents in an unprocessed format, a proxy server and
at least one client, the method comprising the following steps performed by the proxy
server, of:
receiving an unprocessed document from the content provider, wherein the
document was requested by the client;
determining whether a processed version of the document is located in a local
cache;
when it is determined that the processed version of the document is located in the
local cache, providing the processed version of the document to the client; and
when it is determined that the processed version of the document is not located in
the local cache, (i) processing the unprocessed document in accordance with predetermined instructions associated with the unprocessed document; and (ii) providing
the processed version of the identified processed document to the client.
7. The method of claim 6, wherein the step of processing the unprocessed
document, further comprises the step of storing the processed version of the document in
the local cache.
8. The method of claim 6, wherein the step of determining whether a
processed version of the document is located in local cache, further comprises the step of
ascertaining whether the document is written in a second markup language, wherein said
second markup language is a processed version of said first markup language.
9. The method of claim 6, wherein the step of processing the unprocessed
document in accordance with predetermined instructions associated with the unprocessed
document, further comprises the step of converting the unprocessed document from a first
markup language to a second markup language.
10. An Internet Proxy server comprising:
a memory having program instructions; and
a processor configured to use the program instructions to receive a request from
the client for a document associated with the content provider; to determine whether a
processed version of the document is located in a local cache; when it is determined that
the processed version of the document is located in the local cache, to provide the processed version of the document to the client; and when it is determined that the
processed version of the document is not located in the local cache, (i) obtain the
unprocessed document associated with the content provider; (ii) process the unprocessed
document in accordance with predetermined instructions associated with the unprocessed
document; and (iii) provide the processed version of the identified processed document
to the client.
11. The server of claim 10, wherein the instruction to process the unprocessed
document, further comprises the step of storing the processed version of the document in
the local cache.
12. The server of claim 10, wherein the instruction to determine whether a
processed version of the document is located in local cache, further comprises the
instruction to ascertain whether the document is written in a second markup language,
wherein said second markup language is a processed version of said first markup
language.
13. The server of claim 10, wherein the instruction to process the unprocessed
document in accordance with predetermined instructions associated with the unprocessed
document, further comprises the instruction to convert the unprocessed document from
a first markup language to a second markup language.
14. The server of claim 10, wherein said second markup language is the
extensible markup language (XML).
15. An Internet Proxy server comprising:
a memory having program instructions; and
a processor configured to use the program instructions to receive an unprocessed
document from the content provider, wherein the document was requested by the client;
to determine whether a processed version of the document is located in a local cache;
when it is determined that the processed version of the document is located in the local
cache, to provide the processed version of the document to the client; and when it is
determined that the processed version of the document is not located in the local cache,
(i) process the unprocessed document in accordance with predetermined instructions
associated with the unprocessed document; and (ii) provide the processed version of the
identified processed document to the client.
16. The server of claim 15 , wherein the instruction to process the unprocessed
document, further comprises the instruction of storing the processed version of the
document in the local cache.
17. The server of claim 15, wherein the instruction to determine whether a
processed version of the document is located in local cache, further comprises the step to
ascertain whether the document is written in a second markup language, wherein said
second markup language is a processed version of said first markup language.
18. The method of claim 15, wherein the instruction to process the
unprocessed document in accordance with predetermined instructions associated with the
unprocessed document, further comprises the instruction to convert the unprocessed
document from a first markup language to a second markup language.
19. A data processing system for processing content requests in a network
having at least one content provider having stored documents in an unprocessed format,
a proxy server and at least one client, the system comprising:
means for receiving a request from the client for a document associated with the
content provider;
means for determining whether a processed version of the document is located in
a local cache;
when it is determined that the processed version of the document is located in the
local cache, means for providing the processed version of the document to the client; and
when it is determined that the processed version of the document is not located in
the local cache, (i) means for obtaining the unprocessed document associated with the
content provider; (ii) means for processing the unprocessed document in accordance with
predetermined instructions associated with the unprocessed document; and (iii) means for
providing the processed version of the identified processed document to the client.
PCT/US2000/016206 1999-06-14 2000-06-14 A method for caching xml documents viewable on devices with different displays WO2000077668A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP00939831A EP1299812A2 (en) 1999-06-14 2000-06-14 A method for caching xml documents viewable on devices with different displays
AU54851/00A AU5485100A (en) 1999-06-14 2000-06-14 A method for caching xml documents viewable on devices with different displays

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13868599P 1999-06-14 1999-06-14
US60/138,685 1999-06-14

Publications (2)

Publication Number Publication Date
WO2000077668A2 true WO2000077668A2 (en) 2000-12-21
WO2000077668A3 WO2000077668A3 (en) 2002-12-12

Family

ID=22483161

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/016206 WO2000077668A2 (en) 1999-06-14 2000-06-14 A method for caching xml documents viewable on devices with different displays

Country Status (3)

Country Link
EP (1) EP1299812A2 (en)
AU (1) AU5485100A (en)
WO (1) WO2000077668A2 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002075604A2 (en) * 2001-03-15 2002-09-26 Kodak Polychrome Graphics Web page color accuracy using color-customized style sheets
EP1249765A1 (en) * 2001-04-12 2002-10-16 Alcatel Personalized service
KR100445922B1 (en) * 2000-02-24 2004-08-25 인터내셔널 비지네스 머신즈 코포레이션 System and method for collaborative multi-device web browsing
GB2407668A (en) * 2003-10-29 2005-05-04 Ibm A method and system for archiving and retrieving a markup language data stream
EP1830275A1 (en) * 2004-11-12 2007-09-05 JustSystems Corporation Information distribution system
US7584215B2 (en) 2001-08-20 2009-09-01 Helsingin Kauppakoreakoulu User-specific personalization of information services
EP2434728A1 (en) * 2010-09-27 2012-03-28 Research In Motion Limited Mobile wireless communications system providing enhanced user interface (ui) content display features and related methods
GB2503452A (en) * 2012-06-26 2014-01-01 Nds Ltd Supplying a request for content together with a caching recommendation to cloud equipment
CN110958186A (en) * 2019-12-09 2020-04-03 中国科学技术大学 Network equipment data processing method and system

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998043177A1 (en) * 1997-03-25 1998-10-01 Intel Corporation System for dynamically transcoding data transmitted between computers

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998043177A1 (en) * 1997-03-25 1998-10-01 Intel Corporation System for dynamically transcoding data transmitted between computers

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JOHNSON D: "CONVERTING PC GUIS FOR NON PC DEVICES" CIRCUIT CELLUR INK, VERNON, CT, US, vol. 91, February 1998 (1998-02), pages 40-42,44-45, XP000852859 ISSN: 0896-8985 *
MICHAEL FLOYD: "Building an XML Workbench" NEW ARCHITECT MAGAZINE, [Online] May 1999 (1999-05), XP002208575 Retrieved from the Internet: <URL:http://www.webtechniques.com/archives/1999/05/beyo/>> [retrieved on 2002-08-02] *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100445922B1 (en) * 2000-02-24 2004-08-25 인터내셔널 비지네스 머신즈 코포레이션 System and method for collaborative multi-device web browsing
WO2002075604A3 (en) * 2001-03-15 2003-11-27 Kodak Polychrome Graphics Co Web page color accuracy using color-customized style sheets
WO2002075604A2 (en) * 2001-03-15 2002-09-26 Kodak Polychrome Graphics Web page color accuracy using color-customized style sheets
EP1249765A1 (en) * 2001-04-12 2002-10-16 Alcatel Personalized service
US7584215B2 (en) 2001-08-20 2009-09-01 Helsingin Kauppakoreakoulu User-specific personalization of information services
GB2407668A (en) * 2003-10-29 2005-05-04 Ibm A method and system for archiving and retrieving a markup language data stream
US7206777B2 (en) 2003-10-29 2007-04-17 International Business Machines Corporation Method and system for archiving and retrieving a markup language document
EP1830275A1 (en) * 2004-11-12 2007-09-05 JustSystems Corporation Information distribution system
EP1830275A4 (en) * 2004-11-12 2008-01-16 Justsystems Corp Information distribution system
EP2434728A1 (en) * 2010-09-27 2012-03-28 Research In Motion Limited Mobile wireless communications system providing enhanced user interface (ui) content display features and related methods
US20120188264A1 (en) * 2010-09-27 2012-07-26 Research In Motion Limited Mobile wireless communications system providing enhanced user interface (ui) content display features and related methods
GB2503452A (en) * 2012-06-26 2014-01-01 Nds Ltd Supplying a request for content together with a caching recommendation to cloud equipment
CN110958186A (en) * 2019-12-09 2020-04-03 中国科学技术大学 Network equipment data processing method and system

Also Published As

Publication number Publication date
AU5485100A (en) 2001-01-02
EP1299812A2 (en) 2003-04-09
WO2000077668A3 (en) 2002-12-12

Similar Documents

Publication Publication Date Title
US7020681B1 (en) Method for caching XML documents viewable on devices with different displays
US8793341B2 (en) Web page content translator
KR100317401B1 (en) Apparatus and method for printing related web pages
US6925595B1 (en) Method and system for content conversion of hypertext data using data mining
US20180032629A1 (en) Transport and administration model for offline browsing
KR100398711B1 (en) Content publication system for supporting real-time integration and processing of multimedia contents including dynamic data and method thereof
US20020143523A1 (en) System and method for providing a file in multiple languages
US20020188631A1 (en) Method, system, and software for transmission of information
US6948117B2 (en) Methods and systems for generating a viewable document using view descriptors and generic view stylesheets
US20040225749A1 (en) Transformation of web site summary via taglibs
US10860391B2 (en) System and method for automatic generation of service-specific data conversion templates
GB2344197A (en) Content conversion of electronic documents
WO2000039666A1 (en) Converting content of markup data for wireless devices
JP2004527041A (en) System and method for content delivery over a wireless communication medium to a portable computing device
US20020188435A1 (en) Interface for submitting richly-formatted documents for remote processing
US20070027897A1 (en) Selectively structuring a table of contents for accesing a database
KR100855997B1 (en) Configurable transformation of electronic documents
KR100456022B1 (en) An XML-based method of supplying Web-pages and its system for non-PC information terminals
US6253229B1 (en) Hotspots only interfaces to links in hypertext document pages in network display stations
US7085807B2 (en) System and method for providing links to available services over a local network by a thin portal service configured to access imaging data stored in a personal imaging repository
WO2000077668A2 (en) A method for caching xml documents viewable on devices with different displays
US7424509B2 (en) System and method for printing from a web application
US20030076526A1 (en) Method and apparatus for printing documents using a document repository in a distributed data processing system
US6934912B2 (en) System and method for providing a history list of existing imaging compositions
US20060123107A1 (en) Web link management systems and methods

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2000939831

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWP Wipo information: published in national office

Ref document number: 2000939831

Country of ref document: EP

NENP Non-entry into the national phase in:

Ref country code: JP