US7716238B2 - Systems and methods for server management - Google Patents

Systems and methods for server management Download PDF

Info

Publication number
US7716238B2
US7716238B2 US11/367,174 US36717406A US7716238B2 US 7716238 B2 US7716238 B2 US 7716238B2 US 36717406 A US36717406 A US 36717406A US 7716238 B2 US7716238 B2 US 7716238B2
Authority
US
United States
Prior art keywords
computing devices
query
server
task
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US11/367,174
Other versions
US20070198528A1 (en
Inventor
Adam Pierce Harris
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Interactive Entertainment LLC
Original Assignee
Sony Computer Entertainment America LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/355,327 external-priority patent/US7979460B2/en
Assigned to SONY COMPUTER ENTERTAINMENT AMERICA INC. reassignment SONY COMPUTER ENTERTAINMENT AMERICA INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: HARRIS, ADAM PIERCE
Priority to US11/367,174 priority Critical patent/US7716238B2/en
Application filed by Sony Computer Entertainment America LLC filed Critical Sony Computer Entertainment America LLC
Priority to DE602007004076T priority patent/DE602007004076D1/en
Priority to EP07750128A priority patent/EP1987454B1/en
Priority to PCT/US2007/003258 priority patent/WO2007097915A2/en
Priority to JP2008555276A priority patent/JP5015965B2/en
Priority to AT07750128T priority patent/ATE453898T1/en
Publication of US20070198528A1 publication Critical patent/US20070198528A1/en
Publication of US7716238B2 publication Critical patent/US7716238B2/en
Application granted granted Critical
Assigned to SONY COMPUTER ENTERTAINMENT AMERICA LLC reassignment SONY COMPUTER ENTERTAINMENT AMERICA LLC MERGER (SEE DOCUMENT FOR DETAILS). Assignors: SONY COMPUTER ENTERTAINMENT AMERICA INC.
Assigned to SONY INTERACTIVE ENTERTAINMENT AMERICA LLC reassignment SONY INTERACTIVE ENTERTAINMENT AMERICA LLC CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: SONY COMPUTER ENTERTAINMENT AMERICA LLC
Assigned to Sony Interactive Entertainment LLC reassignment Sony Interactive Entertainment LLC MERGER (SEE DOCUMENT FOR DETAILS). Assignors: SONY INTERACTIVE ENTERTAINMENT AMERICA LLC
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1029Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers using data related to the state of servers by a load balancer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1034Reaction to server failures by a load balancer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed

Definitions

  • the present invention relates generally to computer networks, and more particularly to network server balancing and redundancy.
  • FIG. 1 shows a simplified architecture 100 for a prior art approach to network server management.
  • Three servers are illustrated: server 110 containing a first segment of a database; server 120 containing a second segment of the database; and server 130 containing a third segment of the same database.
  • communications network 140 responsible for transferring information between users 150 through 170 and one of the three servers responsible for responding to a particular query.
  • server A may comprise entries A thru H
  • server B may comprise entries I through Q
  • server C may comprise entries R-Z. Accordingly, in the event one of the three servers illustrated in FIG. 1 experiences a delay or failure, the other servers in the network are unable to respond on behalf of the failed server, because they lack the requisite data. As a result, certain responses to queries may be delayed or go unprocessed.
  • the prior art architecture illustrated in FIG. 1 is not scalable. Server limitations (e.g. processor speed or storage capacity) dictate the number of queries a server can process. Installation of additional servers to store additional information and process additional queries often requires shutting down an entire network. Additional downtime is often imposed by rearranging data on existing servers. Consequently, as evidenced by the prior art architecture illustrated in FIG. 1 , there is a need for improved systems and methods of server management.
  • Server limitations e.g. processor speed or storage capacity
  • the present invention provides exemplary systems and methods for server management.
  • An exemplary system comprises a plurality of servers, each server having the ability to access a database.
  • An exemplary communications network allows for queries to be received by all servers in the network, while a look-up table identifies the servers in the network responsible for processing particular queries. Further embodiments of the system feature a program logic controller for tracking and rebalancing the workload of the network servers.
  • An exemplary method of server management comprises installing a copy of a database (or portion thereof) on each network server, receiving all queries at all servers, and processing queries as directed by a look-up table.
  • An exemplary method of query processing comprises submitting a query to a network, sending the query to every server in the network and each server looking-up on a look-up table the server in the network having primary responsibility for processing the particular query. Monitoring of the processing of queries in the network is performed by all network servers. A query result may be transmitted to the user thereby completing the process.
  • An exemplary method of rebalancing server load includes determining an overall query response rate for a network, comparing the overall query response rate for the network to a target overall query response rate, determining a query response rate for each server in the network, and comparing the query response rates for all of the servers in the network. Based on this method, the primary responsibility for one or more data segments can be transferred from one server in the network having a relatively slow query response rate to a server in the network having a relatively fast query response rate. This method can be performed either manually or with the support of an optional program logic controller. In the event server load is not rebalanced, further embodiments of the present invention include methods for introducing an additional server to the network.
  • Exemplary methods of introducing an additional server to the network include installing a copy of the database (or portions thereof) used on the existing servers in the network on the additional server, configuring the additional server to receive all queries and installing a look-up table that either resides on or is accessible by the additional server to the network. Further embodiments include configuring the additional server to monitor the processing of queries in the network.
  • FIG. 1 is a simplified architecture of a prior art, non-scalable, non-redundant and failure-prone approach to network server management;
  • FIG. 2 is an exemplary network architecture in which a scalable, redundant and reliable server network may be implemented
  • FIG. 3 is an exemplary look-up table according to one embodiment of the present invention.
  • FIG. 4 is an exemplary look-up table according to an exemplary scenario of server load balancing
  • FIG. 5 is an exemplary look-up table according to an exemplary scenario of server load rebalancing by the addition of a server to the network;
  • FIG. 6 is a flowchart for one exemplary method of establishing a scalable, redundant and reliable server network according to various embodiments of the invention.
  • FIG. 7 is a flowchart for one exemplary method of processing a query on a scalable, redundant and reliable server network according to various embodiments of the invention.
  • FIG. 8 is a flowchart for one exemplary method of rebalancing network server load either manually or by the use of a program logic controller according to various embodiments of the invention.
  • FIG. 9 is a flowchart for one exemplary method of rebalancing network server load by the installation of an additional server according to various embodiments of the invention.
  • the exemplary network architecture 200 includes an optional program logic controller 202 , an optional master database 204 and a look-up table 300 ( FIG. 3 ).
  • Optional communications linkage 206 , servers 208 through 218 , and communications network 220 also comprise the exemplary network architecture shown in FIG. 2 .
  • servers 208 through 218 each contain one or more clocks and/or other timing devices that are maintained in synchrony with one another and/or based upon each clock or other timing device periodically referencing a shared standard through communications network 220 , such as an atomic clock.
  • timing means e.g., clock, timer, etc.
  • a secondary server for handling (e.g., response).
  • timing devices may be better suited to particular configurations than others (although they are not necessarily prevented from being implemented in those configurations). For example, a clock may be best suited for a shared query resource (as described below) whereas a simple timer may be best suited for responding to individual queries delivered to the exemplary network architecture 200 .
  • queries are generated by users 150 through 170 and communicated over communications network 220 to servers 208 through 218 .
  • communications network 220 uses multicast or broadcast techniques to communicate queries to the servers.
  • all servers receive all queries.
  • a query may comprise a request for the calculation, computation or determination of a particular value, values or other information.
  • Queries may also comprise a request for a return of information, which may further comprise the aforementioned values or other information.
  • other wired and/or wireless mechanisms communicate all queries to all servers.
  • queries and/or notice of queries are communicated to a subset of servers comprising the network, wherein the servers that are not responsible for processing or backing up a particular query do not receive the query.
  • queries may be held in a shared resource, that resource comprising a list of outstanding queries, which can be monitored and/or accessed by the aforementioned network of servers.
  • This shared resource may be an intermediate server, some sort of queuing mechanism (e.g., a router), a memory buffer or other means for maintaining a list of queries or the actual queries themselves.
  • Communications network 220 allows each server in the exemplary network architecture 200 to monitor query processing by the other servers in the network. For example, a reply to a query may be broadcast or multicast over the network 200 . In alternative embodiments, other forms of server peer monitoring are used, such as optional communications linkage 206 . In still further embodiments, a subset of servers comprising the network are monitored by peer servers, wherein the servers that are not responsible for processing a particular query are not monitored.
  • servers 208 through 218 each contain an entire database or database copy.
  • the contents of each database or database copy can be substantially the same or may have certain segments of data omitted.
  • Alternative exemplary embodiments of the present invention include an optional master database 204 , which can be accessed by all of the servers in the network.
  • Optional database 204 may be in lieu of or in addition to the entire database or database copy installed on each server.
  • an entire database or database copy contains the information queried by users 150 through 170 .
  • Database examples include telephone directories, customer databases or catalogs of products and/or services. Categories of other database content are within the scope of the present invention.
  • servers 208 through 218 may be configured to process and/or respond to the aforementioned queries (e.g., be programmed with the necessary logic to respond to a particular calculation request). This configuration may be in addition to or in place of the aforementioned database or database copy.
  • Each database or database copy comprises one or more segments of data or data segments.
  • segments of data are determined based on the nature of the underlying data.
  • the twenty-six letters of the English alphabet may represent twenty-six segments of data forming a telephone directory database.
  • Twenty-six servers may each be assigned a primary responsibility for processing queries corresponding to a particular letter of the alphabet. For example, one server is assigned the primary responsibility for processing queries corresponding to last names beginning with the letter “A,” while a second server is assigned the primary responsibility for processing queries corresponding to last names beginning with the letter “B.” Likewise, a third server is assigned primary responsibility for processing queries corresponding to last names beginning with the letter “C,” and so on.
  • responsibilities for each server in a network may be determined based upon an arbitrary designation of data segments.
  • a database may be segmented into as many equally-sized megabytes of data as there are servers forming the network.
  • Various formulas may also be used to weight segment determinations based on averaging or estimating query frequency for a particular segment of the data or certain processing requirements related thereto.
  • segments of data in the database are manually or automatically cataloged by look-up table 300 ( FIG. 3 ).
  • an optional program logic controller 202 may divide a database into optimized segments of data that are automatically updated and reflected in look-up table 300 .
  • program logic controller 202 monitors, balances and/or rebalances server load, based on factors such as changes in server usage, server storage capacity and/or query frequency.
  • look-up table 300 directs query processing by network servers.
  • the presence of columns and headers in look-up table 300 is for illustrative purposes and not meant to impose any particular data structure or format.
  • look-up table 300 servers 208 through 218 ( FIG. 2 ) are identified in column 310 .
  • each of these servers contains a copy of look-up table 300 .
  • the servers can access a centralized look-up table.
  • Look-up table 300 in column 320 , identifies the data segments installed in each of the servers. In the illustrated network architecture of FIG. 2 , look-up table 300 reflects that an entire database comprising data segments 1 - 6 is installed in servers 208 through 218 .
  • each server in a network is assigned one or more unique data segments.
  • each unique data segment assigned to each of the servers on the network comprises the entire database.
  • the unique portion of the database or data segments represent that particular server's responsibility for processing when a query for information located in the server's unique data segment or segments is communicated to all of the servers on the network.
  • the particular server responsible for the data segment(s) containing the requested information will be allocated a certain amount of time to process the query while the other servers monitor the processing. Accordingly, the server first responsible for processing a query is deemed to have primary responsibility for processing queries for information located in the server's unique data segment(s).
  • server 208 is assigned primary responsibility for data segment 1 ;
  • server 210 is assigned primary responsibility for data segment 2 ;
  • server 212 is assigned primary responsibility for data segment 3 ;
  • server 214 is assigned primary responsibility for data segment 4 ;
  • server 216 is assigned primary responsibility for data segment 5 ;
  • server 218 is assigned primary responsibility for data segment 6 .
  • each server is allocated 100 milliseconds in which to complete its assigned primary responsibility (e.g. responding to a query) as shown in column 340 .
  • Exemplary look-up table 300 also includes an assigned time for secondary query initiation as reflected in column 370 .
  • a server having secondary responsibility is assigned a particular time to initiate the query. For example, should server 208 fail to respond within 100 milliseconds to a query of data segment 1 (for which server 208 has been assigned primary responsibility), server 210 will initiate processing of the same query following the expiration of server 208 's allocated primary response time (e.g. at 101 milliseconds as reflected in column 360 ).
  • the assignment of a second query initiation time may not be necessary whereby a second server simply takes on processing responsibilities with the expiration of the allocated primary query response time (column 340 ) if there has not been a response to the query.
  • server 208 is assigned secondary responsibility for data segment 6 ;
  • server 210 is assigned secondary responsibility for data segment 1 ;
  • server 212 is assigned secondary responsibility for data segment 2 ;
  • server 214 is assigned secondary responsibility for data segment 3 ;
  • server 216 is assigned secondary responsibility for data segment 4 ;
  • server 218 is assigned secondary responsibility for data segment 5 as reflected in column 360 .
  • secondary responsibility for querying a particular segment of data is not assigned to the same server having primary responsibility for the same segment of data, in order to enhance network reliability in the event of a server delay or failure. That is, the delay or failure of one server should not adversely impair the ability of a second server to step-in and respond to a particular query.
  • Look-up table 300 indicates the exemplary server network is operating with double redundancy as reflected in column 350 . If the desired redundancy level indicates the server network is operating with double redundancy, a third server with tertiary responsibility will attempt to process any query missed by the respective primary and secondary servers.
  • tertiary responsibilities and respective query initiation times are assigned to server networks operating with double redundancy.
  • tertiary responsibility for querying a particular segment of data is not assigned to the same server having secondary responsibility for the same segment of data.
  • Double redundancy enhances network reliability and performance in the event two servers experience a failure, because a third server can ‘step-in’ and process a query for a segment of data for which it has tertiary responsibility.
  • the presence of optional master database 204 in addition to the database or database copy stored on each of servers 208 through 218 provides an additional fail-safe mechanism that can be accessed in the event each server with an assigned responsibility (i.e. primary, secondary, tertiary or so on) for a particular query should fail to process its assigned responsibility within the allocated time.
  • each server with an assigned responsibility i.e. primary, secondary, tertiary or so on
  • no query should go unprocessed in such a network, because the server containing optional master database 204 will step-in and process the query or, alternatively, may be accessed by another capable server in the network to obtain, process and deliver the necessary data.
  • FIG. 4 an exemplary look-up table according to an exemplary scenario of server load balancing is shown.
  • Query response times, server usage, volume of stored data and query frequency are just a few of the reasons necessitating the balancing and/or rebalancing of server responsibilities and/or stored databases.
  • An optional program logic controller 202 FIG. 2
  • An optional program logic controller 202 can be used to monitor query response times, server usage, volume of stored data and/or query frequency and automatically rebalance a server network. In some embodiments, these adjustments are made manually. In either case, increased query response times are typically the first indication that a server network might be in need of rebalancing.
  • exemplary look-table 400 One way of balancing and/or rebalancing a server network is illustrated by exemplary look-table 400 .
  • extra server storage capacity is created by the selective installation of data segments that comprise each server's database.
  • look-up table 400 The categories of information contained in look-up table 400 are similar to the categories of information contained in exemplary look-up table 300 ( FIG. 3 ), with the exception that column 420 reflects that each server in the network does not contain an entire copy of the database. Unlike the illustrative network reflected in exemplary look-up table 300 , segments of data are omitted from the data segments installed in the servers shown in column 420 .
  • data segments 1 - 2 and 4 - 6 are installed in server 208 .
  • Data segment 3 is omitted from the database in server 208 .
  • Server 208 is assigned primary responsibility for data segment 1 as shown in column 330 .
  • Server 208 is also assigned secondary responsibility for data segment 6 (column 360 ), and assigned tertiary responsibility for data segment 5 (column 380 ).
  • data segments 1 - 3 and 5 - 6 are installed in server 210 ; data segments 1 - 4 and 6 are installed in server 212 ; data segments 1 - 5 are installed in server 214 ; data segments 2 - 6 are installed in server 216 ; and data segments 1 and 3 - 6 are installed in server 218 .
  • the exemplary scenario of server load balancing and/or rebalancing illustrated in exemplary look-up table 400 can be applied to the exemplary network shown in FIG. 2 and FIG. 3 , resulting in a savings of six data segments or the equivalent of the storage capacity of one server.
  • the exemplary scenario of server load balancing and/or rebalancing retains the double redundancy level of the network.
  • the extra server storage capacity can be used to store data segments transferred from servers experiencing a slow response time, as will be described in connection with FIG. 7 herein.
  • FIG. 5 an exemplary look-up table 500 whereby server load is rebalanced by the addition of a server to a network is shown.
  • the categories of information contained in look-up table 500 are similar to the categories of information contained in exemplary look-up table 300 ( FIG. 3 ), with the exception that before server installation (columns 530 & 570 ) and after server installation (columns 540 & 580 ) comparisons are shown.
  • each server remains the same throughout the installation process while the number of network servers and their responsibilities (i.e. primary, secondary and/or tertiary and so on) are changed.
  • servers 208 through 218 each contain a database comprising data segments 1 - 50 .
  • Data segments 1 - 50 have been installed in the additional server (titled “NEW”) prior to initiating server installation.
  • each server contains a database comprising the same data segments, the network can continue operating while server NEW is added to the network. That is, the continuous operating of the network is not dependent on any one server.
  • server NEW is brought online, the before server installation settings (columns 530 & 570 ) are replaced by the after server installation settings (columns 540 & 580 ), and the network continues to operate uninterrupted.
  • server load rebalancing by the installation of an additional server assume in exemplary look-up table 500 , before the installation of server NEW, server 208 , server 210 , server 212 , and server 214 are each processing queries at an unacceptably slow rate. Also assume that server 216 is processing queries at an acceptable rate, and server 218 is processing queries at a maximum rate. As shown in column 560 , the network is operating at a single rate of redundancy or redundancy level.
  • Server load rebalancing based on the exemplary method described in connection with FIG. 8 herein will result in server 208 transferring primary responsibility for data segments 7 - 8 (column 530 ) to server 210 (column 540 ), server 210 transferring primary responsibility for data segments 14 - 17 (column 530 ) to server 212 (column 540 ), and server 212 transferring primary responsibility for data segments 21 - 26 (column 530 ) to server NEW (column 540 ).
  • server load rebalancing results in server 214 transferring primary responsibility for data segment 27 (column 530 ) to server NEW (column 540 ) and transferring primary responsibility for data segments 35 - 36 (column 530 ) to server 216 (column 540 ). Finally, server 216 transfers primary responsibility for data segments 42 - 43 (column 530 ) to server 218 (column 540 ).
  • primary responsibility for server 208 decreases by two data segments; primary responsibility for server 210 decreases by two data segments; primary responsibility for server 212 decreases by two data segments and primary responsibility for server 214 decreases by three data segments. In total, the workload of these four servers decreases by nine data segments.
  • primary responsibility for server 216 remains unchanged and primary responsibility for server 218 increases by two data segments.
  • primary responsibility for server NEW is initiated with seven data segments (column 540 ). As shown in column 560 , the network remains operating at a single rate of redundancy.
  • FIG. 6 an exemplary flowchart for one method of establishing a scalable, redundant and fault-tolerant server network according to an exemplary embodiment of the present invention is shown.
  • an optional program logic controller 202 ( FIG. 2 ) is installed as part of the exemplary network architecture 200 ( FIG. 2 ).
  • program logic controller 202 monitors and rebalances server load, based in part on changes in server usage, server storage and query frequency.
  • Optional program logic controller 202 reduces the need for network manual server maintenance and associated equipment upgrades and purchases, through automating, for example, the aforementioned functions.
  • a database is installed on each server in the exemplary network architecture 200 .
  • the contents of each installed database or installed database copy can be substantially the same or may have certain segments of data omitted.
  • Database examples include but are not limited to telephone directories, customer databases or catalogs of products and/or services.
  • an optional master database 204 ( FIG. 2 ) is installed in the server network.
  • Optional master database 204 may be accessed by all of the servers in the network should such access ever prove to be necessary.
  • network servers in the exemplary network architecture 200 are configured to receive all queries.
  • communications network 220 uses multicasting or broadcasting to communicate all queries to all servers.
  • all servers receive all queries.
  • other wired and/or wireless mechanisms communicate all queries to all servers.
  • queries and/or notice of queries are communicated to a subset of servers comprising the network, wherein the servers that are not responsible for processing a particular query do not receive the query.
  • a look-up table 300 ( FIG. 3 ) is installed for each of servers 208 through 218 ( FIG. 2 ) that comprise the exemplary network architecture 200 .
  • look-up table 300 directs server query processing.
  • Look-up-table 300 may be local or remote relative to the server 208 through 218 .
  • a server redundancy level may be established for exemplary network architecture 200 .
  • Server redundancy level is a function of the tolerance for server network failure. The lesser the tolerance for server network failure, the higher the server redundancy level. For example, users that can tolerate an occasional network failure might establish a single redundancy level as shown in FIG. 5 , whereas users that can not tolerate an occasional network failure might establish a double redundancy level as shown in FIG. 4 .
  • a single redundancy level signifies that if a server assigned primary responsibility fails to process an assigned query within an allocated period of time, another server with secondary responsibility for the same segment of data will attempt to process the query.
  • a double redundancy level signifies that a third server assigned tertiary responsibility will attempt to process any query missed by the servers assigned primary and secondary responsibilities. Assuming the installation of an entire database on each network server as illustrated in FIG. 2 , FIG. 3 , and FIG. 5 , the redundancy level of a server network is limited only by the number of servers (e.g. servers 208 through 218 ) on the network.
  • servers comprising the exemplary network architecture 200 are configured upon receipt of each query to check the look-up table installed at step 610 .
  • look-up table 300 identifies the data segments installed in each server.
  • servers comprising the exemplary network architecture 200 are configured to process queries per look-up table 300 .
  • Look-up table 300 allocates each server 100 milliseconds in which to complete its assigned primary responsibility.
  • servers comprising the exemplary network architecture 200 are configured to monitor query processing by the other servers in the network.
  • communications network 220 allows each server in the exemplary network architecture 200 to monitor query processing by the other servers in the network by ‘listening,’ for example, for a broadcast or multicast reply to the query.
  • other forms of server peer monitoring are used, such as optional communications linkage 206 .
  • a subset of servers comprising the network are monitored by peer servers, wherein the servers that are not to be responsible for processing a particular query are not monitored.
  • servers comprising the exemplary network architecture 200 are configured to transmit query results to users.
  • servers comprising the exemplary network architecture 200 are configured to reset upon transmission of a query result to a user. That is, present response time is reset to zero.
  • FIG. 7 a flowchart for an exemplary method of processing a query on the exemplary network architecture 200 ( FIG. 2 ) is shown.
  • a query is submitted to exemplary network architecture 200 .
  • user 150 FIG. 2
  • the submitted query is communicated to network servers.
  • the query for an address corresponding to a person having the last name of Jones is multicast through communications network 220 ( FIG. 2 ) to all servers in the exemplary network architecture 200 .
  • the identity of the server having primary responsibility for processing the submitted query is determined based upon referencing a look-up table.
  • the look-up table for the queried telephone directory database reflects that the tenth server of twenty-six servers (each corresponding to a letter of the alphabet) is assigned primary responsibility for processing queries corresponding to last names beginning with the letter “J.” Therefore, server ten has the primary responsibility of querying its data segment for the address of Jones.
  • the submitted query is processed (or attempted to be processed) by the responsible server.
  • server ten processes the query for the address corresponding to Jones.
  • communications network 220 allows each server in the exemplary network architecture 200 to monitor query processing by the other servers in the network through, for example, listening for a multicast or broadcast reply to the query.
  • other forms of server peer monitoring are used, such as through optional communications linkage 206 .
  • a subset of servers comprising the network are monitored by peer servers, wherein the servers that are not to be responsible for processing a particular query are not monitored.
  • twenty-five of the twenty-six servers comprising the server network for the telephone directory database monitor the processing by server ten for the address corresponding to Jones.
  • each server is allocated 100 milliseconds in which to complete its assigned primary responsibility.
  • server ten determined within 100 milliseconds that the address corresponding to Jones is 2200 Geng Road, Palo Alto, Calif.
  • a server with secondary responsibility is determined based on the method described in connection with step 730 .
  • the server with secondary responsibility then processes the query as set forth in steps 740 - 750 .
  • Other backup and/or secondary servers continue to await an indication the query has been timely processed in step 760 .
  • the query result is transmitted to the user who submitted the query via, for example, a multicast or broadcast methodology.
  • user 150 FIG. 2
  • network servers reset for the next query.
  • the twenty-six servers comprising the telephone directory database will reset their present processing time to zero in anticipation of the next query to be submitted by a user. That is, the network servers await the arrival of a new query wherein overall processing time with regard to a particular query begins relative that particular query and its own timestamp (i.e., the overall time since the query was made or was received by the network servers).
  • FIG. 8 a flowchart for one exemplary method of evaluating and rebalancing network server load according to an exemplary embodiment of the invention is shown. All steps in FIG. 8 can be performed either manually or with the assistance of an optional program logic controller 202 ( FIG. 2 ).
  • an overall network query response rate is determined. For example, the average time it takes to process each query submitted to the exemplary network architecture 200 ( FIG. 2 ) can be determined for a twenty-four hour period either manually or by an optional program logic controller 202 . Various other time periods or measures of response time may be used.
  • the overall network query response rate as determined by step 810 is compared to a target overall network query response rate.
  • a target overall network query response rate For example, with respect to the telephone directory database described in connection with FIG. 7 , the particular telephone company responsible for the telephone directory database might determine, on average, it desires to have all queries processed within 100 milliseconds. This comparison represents a measure by which the server network can be evaluated apart from the performance of the constituent servers.
  • individual server query response rates are determined. For example, the average time it takes each of servers 208 through 218 in FIG. 2 to process each query submitted in the exemplary network architecture 200 can be determined for a twenty-four hour period either manually or by an optional program logic controller 202 or through various other periods and measures of time.
  • the response rates for all of the servers are compared.
  • the response rates of server 208 , server 210 , server 212 , and server 214 were slower than the response rates of server 216 and server 218 , which indicated that servers 208 through 214 warranted a reduction in the number of data segments for which they had primary responsibility.
  • the primary responsibilities for particular data segments are transferred from the servers having slower query response rates to the servers having faster query response rates. For example, as described in connection with FIG. 5 , server 208 transferred primary responsibility for data segments 7 - 8 (two data segments) to server 210 , server 210 transferred primary responsibility for data segments 14 - 17 (four data segments) to server 212 .
  • an overall network query response rate is re-determined in the same fashion as described in connection with step 810 .
  • the re-determined overall network query response rate as determined at step 860 is re-compared to the target overall network response rate.
  • step 880 a decision is made as to whether the performance of the rebalanced server network measures favorably against the target network response rate. If the performance of the rebalanced server network is satisfactory, the overall network query response rate can be periodically re-determined, as described in connection with step 810 . If the performance of the rebalanced server network is not satisfactory, then step 890 may need to be undertaken.
  • an additional server is installed in the server network, as described in connection with the exemplary method shown in FIG. 9 .
  • FIG. 9 a flowchart for an exemplary method of rebalancing network server load by the installation of an additional server is shown.
  • a database or database copy corresponding to the database used in the existing server network is installed on the additional server.
  • an optional master database 204 ( FIG. 2 ), is made accessible to the additional server to the network.
  • This optional master database may have previously been made available to all other servers in the network.
  • the additional server is configured to receive all queries submitted to the server network.
  • user queries are directed through communications network 220 ( FIG. 2 ) to all servers in the network.
  • queries and/or notice of queries are communicated to a subset of servers comprising the network.
  • a look-up table such as exemplary look-up table 500 ( FIG. 5 ) is either installed on the additional server or made accessible to the additional server. As shown in column 510 of exemplary look-up table 500 , the look-up table reflects the presence of the additional server.
  • the additional server to the server network is configured to check the modified look-up table described in step 908 .
  • the additional server to the network is configured to process queries.
  • the additional server to the network is configured to monitor query processing by the other servers in the network.
  • the additional server to the network is configured to transmit query results to the user.
  • the additional server to the network is configured to reset its present response time in preparation for the next query to be submitted over the network.

Abstract

Exemplary systems and methods for server management are provided. An exemplary system comprises a plurality of servers, with each server having the ability to access a database or, in some embodiments, be configured to perform a calculation, computation or make a determination of a particular value, values or other information. A communications network delivers queries to each server whereby a look-up table directs query processing by the servers. Further embodiments of the system feature a program logic controller for rebalancing the workload of the network servers. Exemplary methods of processing queries on the system comprise submitting a query to the network, communicating the query to each server in the network and each server looking-up on a look-up table the processing responsibilities of the servers in the network. The query is processed by the server having primary responsibility for processing the query while other the servers monitor query processing. Transmission of a query result to a user typically completes the process.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation and claims the priority benefit of U.S. patent application Ser. No. 11/355,327 filed Feb. 15, 2006 and entitled “Systems and Methods for Server Management.” The disclosure of this commonly owned application is incorporated herein by reference.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates generally to computer networks, and more particularly to network server balancing and redundancy.
2. Description of Related Art
FIG. 1 shows a simplified architecture 100 for a prior art approach to network server management. Three servers are illustrated: server 110 containing a first segment of a database; server 120 containing a second segment of the database; and server 130 containing a third segment of the same database. Also illustrated in FIG. 1 is communications network 140 responsible for transferring information between users 150 through 170 and one of the three servers responsible for responding to a particular query.
The prior art approach to server management illustrated in FIG. 1 suffers from several drawbacks.
First, the entire database in FIG. 1 is divided between three separate servers. No single server contains the entire database nor do segments of the database overlap amongst the servers. For example, in the case of a phone book database, server A (110) may comprise entries A thru H, server B (120) may comprise entries I through Q and server C (130) may comprise entries R-Z. Accordingly, in the event one of the three servers illustrated in FIG. 1 experiences a delay or failure, the other servers in the network are unable to respond on behalf of the failed server, because they lack the requisite data. As a result, certain responses to queries may be delayed or go unprocessed.
Second, even if all of the servers in the illustrated network of FIG. 1 stored the requisite data and received all queries, there is no mechanism for one server to monitor whether another server with responsibility for processing a query is actually processing the query. As a result, one server may have available processing capability going unutilized as that server is under the misconception that another server is processing the request.
Third, the prior art architecture illustrated in FIG. 1 is not scalable. Server limitations (e.g. processor speed or storage capacity) dictate the number of queries a server can process. Installation of additional servers to store additional information and process additional queries often requires shutting down an entire network. Additional downtime is often imposed by rearranging data on existing servers. Consequently, as evidenced by the prior art architecture illustrated in FIG. 1, there is a need for improved systems and methods of server management.
SUMMARY OF THE INVENTION
The present invention provides exemplary systems and methods for server management. An exemplary system comprises a plurality of servers, each server having the ability to access a database. An exemplary communications network allows for queries to be received by all servers in the network, while a look-up table identifies the servers in the network responsible for processing particular queries. Further embodiments of the system feature a program logic controller for tracking and rebalancing the workload of the network servers.
An exemplary method of server management comprises installing a copy of a database (or portion thereof) on each network server, receiving all queries at all servers, and processing queries as directed by a look-up table.
An exemplary method of query processing comprises submitting a query to a network, sending the query to every server in the network and each server looking-up on a look-up table the server in the network having primary responsibility for processing the particular query. Monitoring of the processing of queries in the network is performed by all network servers. A query result may be transmitted to the user thereby completing the process.
An exemplary method of rebalancing server load includes determining an overall query response rate for a network, comparing the overall query response rate for the network to a target overall query response rate, determining a query response rate for each server in the network, and comparing the query response rates for all of the servers in the network. Based on this method, the primary responsibility for one or more data segments can be transferred from one server in the network having a relatively slow query response rate to a server in the network having a relatively fast query response rate. This method can be performed either manually or with the support of an optional program logic controller. In the event server load is not rebalanced, further embodiments of the present invention include methods for introducing an additional server to the network.
Exemplary methods of introducing an additional server to the network include installing a copy of the database (or portions thereof) used on the existing servers in the network on the additional server, configuring the additional server to receive all queries and installing a look-up table that either resides on or is accessible by the additional server to the network. Further embodiments include configuring the additional server to monitor the processing of queries in the network.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a simplified architecture of a prior art, non-scalable, non-redundant and failure-prone approach to network server management;
FIG. 2 is an exemplary network architecture in which a scalable, redundant and reliable server network may be implemented;
FIG. 3 is an exemplary look-up table according to one embodiment of the present invention;
FIG. 4 is an exemplary look-up table according to an exemplary scenario of server load balancing;
FIG. 5 is an exemplary look-up table according to an exemplary scenario of server load rebalancing by the addition of a server to the network;
FIG. 6 is a flowchart for one exemplary method of establishing a scalable, redundant and reliable server network according to various embodiments of the invention;
FIG. 7 is a flowchart for one exemplary method of processing a query on a scalable, redundant and reliable server network according to various embodiments of the invention;
FIG. 8 is a flowchart for one exemplary method of rebalancing network server load either manually or by the use of a program logic controller according to various embodiments of the invention; and
FIG. 9 is a flowchart for one exemplary method of rebalancing network server load by the installation of an additional server according to various embodiments of the invention.
DETAILED DESCRIPTION
Referring to FIG. 2, an exemplary network architecture 200 is illustrated in which various embodiments of the invention may be implemented. The exemplary network architecture 200 includes an optional program logic controller 202, an optional master database 204 and a look-up table 300 (FIG. 3). Optional communications linkage 206, servers 208 through 218, and communications network 220 also comprise the exemplary network architecture shown in FIG. 2. In some embodiments of the present invention, servers 208 through 218 each contain one or more clocks and/or other timing devices that are maintained in synchrony with one another and/or based upon each clock or other timing device periodically referencing a shared standard through communications network 220, such as an atomic clock. Ultimately, some sort of a timing means (e.g., clock, timer, etc.) is necessary—as will be explained in detail herein—to ensure that a query has been timely handled by a primary server before the query is passed to a secondary server for handling (e.g., response).
Certain timing devices may be better suited to particular configurations than others (although they are not necessarily prevented from being implemented in those configurations). For example, a clock may be best suited for a shared query resource (as described below) whereas a simple timer may be best suited for responding to individual queries delivered to the exemplary network architecture 200.
In FIG. 2, queries are generated by users 150 through 170 and communicated over communications network 220 to servers 208 through 218. In one exemplary embodiment, communications network 220 uses multicast or broadcast techniques to communicate queries to the servers. In this exemplary embodiment, all servers receive all queries. Among other things, a query may comprise a request for the calculation, computation or determination of a particular value, values or other information. Queries may also comprise a request for a return of information, which may further comprise the aforementioned values or other information. In alternative embodiments, other wired and/or wireless mechanisms communicate all queries to all servers. In yet further embodiments, queries and/or notice of queries are communicated to a subset of servers comprising the network, wherein the servers that are not responsible for processing or backing up a particular query do not receive the query. In still further embodiments of the present invention, queries may be held in a shared resource, that resource comprising a list of outstanding queries, which can be monitored and/or accessed by the aforementioned network of servers. This shared resource may be an intermediate server, some sort of queuing mechanism (e.g., a router), a memory buffer or other means for maintaining a list of queries or the actual queries themselves.
Communications network 220 allows each server in the exemplary network architecture 200 to monitor query processing by the other servers in the network. For example, a reply to a query may be broadcast or multicast over the network 200. In alternative embodiments, other forms of server peer monitoring are used, such as optional communications linkage 206. In still further embodiments, a subset of servers comprising the network are monitored by peer servers, wherein the servers that are not responsible for processing a particular query are not monitored.
In an exemplary embodiment, servers 208 through 218 each contain an entire database or database copy. The contents of each database or database copy can be substantially the same or may have certain segments of data omitted. Alternative exemplary embodiments of the present invention include an optional master database 204, which can be accessed by all of the servers in the network. Optional database 204 may be in lieu of or in addition to the entire database or database copy installed on each server. In the exemplary network architecture 200, an entire database or database copy contains the information queried by users 150 through 170. Database examples include telephone directories, customer databases or catalogs of products and/or services. Categories of other database content are within the scope of the present invention. In other embodiments of the present invention, servers 208 through 218 may be configured to process and/or respond to the aforementioned queries (e.g., be programmed with the necessary logic to respond to a particular calculation request). This configuration may be in addition to or in place of the aforementioned database or database copy.
Each database or database copy comprises one or more segments of data or data segments. In some exemplary embodiments, segments of data are determined based on the nature of the underlying data. For example, the twenty-six letters of the English alphabet may represent twenty-six segments of data forming a telephone directory database. Twenty-six servers may each be assigned a primary responsibility for processing queries corresponding to a particular letter of the alphabet. For example, one server is assigned the primary responsibility for processing queries corresponding to last names beginning with the letter “A,” while a second server is assigned the primary responsibility for processing queries corresponding to last names beginning with the letter “B.” Likewise, a third server is assigned primary responsibility for processing queries corresponding to last names beginning with the letter “C,” and so on.
In alternate embodiments, responsibilities for each server in a network may be determined based upon an arbitrary designation of data segments. For example, in some embodiments, a database may be segmented into as many equally-sized megabytes of data as there are servers forming the network. Various formulas may also be used to weight segment determinations based on averaging or estimating query frequency for a particular segment of the data or certain processing requirements related thereto.
In an exemplary embodiment, segments of data in the database are manually or automatically cataloged by look-up table 300 (FIG. 3). In an alternative embodiment, an optional program logic controller 202 may divide a database into optimized segments of data that are automatically updated and reflected in look-up table 300. In an exemplary embodiment of the server network, program logic controller 202 monitors, balances and/or rebalances server load, based on factors such as changes in server usage, server storage capacity and/or query frequency.
Turning to FIG. 3, exemplary look-up table 300 is shown. In accordance with some embodiments of the invention, a look-up table such as look-up table 300 directs query processing by network servers. The presence of columns and headers in look-up table 300 is for illustrative purposes and not meant to impose any particular data structure or format.
In look-up table 300, servers 208 through 218 (FIG. 2) are identified in column 310. In some embodiments of the present invention, each of these servers contains a copy of look-up table 300. In alternative embodiments, the servers can access a centralized look-up table.
Look-up table 300, in column 320, identifies the data segments installed in each of the servers. In the illustrated network architecture of FIG. 2, look-up table 300 reflects that an entire database comprising data segments 1-6 is installed in servers 208 through 218.
In exemplary embodiments, each server in a network is assigned one or more unique data segments. Collectively, each unique data segment assigned to each of the servers on the network comprises the entire database. The unique portion of the database or data segments represent that particular server's responsibility for processing when a query for information located in the server's unique data segment or segments is communicated to all of the servers on the network. In response to a query transmitted to all of the servers on the network, the particular server responsible for the data segment(s) containing the requested information will be allocated a certain amount of time to process the query while the other servers monitor the processing. Accordingly, the server first responsible for processing a query is deemed to have primary responsibility for processing queries for information located in the server's unique data segment(s).
The primary responsibilities for each server in the network are identified in column 330 of look-up table 300. As shown in FIG. 3, server 208 is assigned primary responsibility for data segment 1; server 210 is assigned primary responsibility for data segment 2; server 212 is assigned primary responsibility for data segment 3; server 214 is assigned primary responsibility for data segment 4; server 216 is assigned primary responsibility for data segment 5; and server 218 is assigned primary responsibility for data segment 6.
In look-up table 300, each server is allocated 100 milliseconds in which to complete its assigned primary responsibility (e.g. responding to a query) as shown in column 340. Exemplary look-up table 300 also includes an assigned time for secondary query initiation as reflected in column 370. In the event a particular server assigned primary responsibility cannot process or respond to a particular query in its allocated time, a server having secondary responsibility is assigned a particular time to initiate the query. For example, should server 208 fail to respond within 100 milliseconds to a query of data segment 1 (for which server 208 has been assigned primary responsibility), server 210 will initiate processing of the same query following the expiration of server 208's allocated primary response time (e.g. at 101 milliseconds as reflected in column 360). In some embodiments of the present invention, the assignment of a second query initiation time (col. 370) may not be necessary whereby a second server simply takes on processing responsibilities with the expiration of the allocated primary query response time (column 340) if there has not been a response to the query.
In look-up table 300, server 208 is assigned secondary responsibility for data segment 6; server 210 is assigned secondary responsibility for data segment 1; server 212 is assigned secondary responsibility for data segment 2; server 214 is assigned secondary responsibility for data segment 3; server 216 is assigned secondary responsibility for data segment 4; and server 218 is assigned secondary responsibility for data segment 5 as reflected in column 360. In exemplary embodiments, secondary responsibility for querying a particular segment of data is not assigned to the same server having primary responsibility for the same segment of data, in order to enhance network reliability in the event of a server delay or failure. That is, the delay or failure of one server should not adversely impair the ability of a second server to step-in and respond to a particular query.
Look-up table 300 indicates the exemplary server network is operating with double redundancy as reflected in column 350. If the desired redundancy level indicates the server network is operating with double redundancy, a third server with tertiary responsibility will attempt to process any query missed by the respective primary and secondary servers.
As illustrated by the exemplary look-up table 300, tertiary responsibilities and respective query initiation times are assigned to server networks operating with double redundancy. In an exemplary embodiment, tertiary responsibility for querying a particular segment of data is not assigned to the same server having secondary responsibility for the same segment of data. Double redundancy enhances network reliability and performance in the event two servers experience a failure, because a third server can ‘step-in’ and process a query for a segment of data for which it has tertiary responsibility.
According to some exemplary embodiments of the present invention, such as illustrated by exemplary network architecture 200, the presence of optional master database 204 in addition to the database or database copy stored on each of servers 208 through 218 provides an additional fail-safe mechanism that can be accessed in the event each server with an assigned responsibility (i.e. primary, secondary, tertiary or so on) for a particular query should fail to process its assigned responsibility within the allocated time. Provided the server containing optional master database 204 remains functional, no query should go unprocessed in such a network, because the server containing optional master database 204 will step-in and process the query or, alternatively, may be accessed by another capable server in the network to obtain, process and deliver the necessary data.
Turning to FIG. 4, an exemplary look-up table according to an exemplary scenario of server load balancing is shown. Query response times, server usage, volume of stored data and query frequency are just a few of the reasons necessitating the balancing and/or rebalancing of server responsibilities and/or stored databases. An optional program logic controller 202 (FIG. 2) can be used to monitor query response times, server usage, volume of stored data and/or query frequency and automatically rebalance a server network. In some embodiments, these adjustments are made manually. In either case, increased query response times are typically the first indication that a server network might be in need of rebalancing. One way of balancing and/or rebalancing a server network is illustrated by exemplary look-table 400. In exemplary look-up table 400, extra server storage capacity is created by the selective installation of data segments that comprise each server's database.
The categories of information contained in look-up table 400 are similar to the categories of information contained in exemplary look-up table 300 (FIG. 3), with the exception that column 420 reflects that each server in the network does not contain an entire copy of the database. Unlike the illustrative network reflected in exemplary look-up table 300, segments of data are omitted from the data segments installed in the servers shown in column 420.
As illustrated in column 420 of exemplary look-up table 400, data segments 1-2 and 4-6 are installed in server 208. Data segment 3 is omitted from the database in server 208. Server 208 is assigned primary responsibility for data segment 1 as shown in column 330. Server 208 is also assigned secondary responsibility for data segment 6 (column 360), and assigned tertiary responsibility for data segment 5 (column 380).
Additionally, as illustrated in column 420, data segments 1-3 and 5-6 are installed in server 210; data segments 1-4 and 6 are installed in server 212; data segments 1-5 are installed in server 214; data segments 2-6 are installed in server 216; and data segments 1 and 3-6 are installed in server 218.
The exemplary scenario of server load balancing and/or rebalancing illustrated in exemplary look-up table 400 can be applied to the exemplary network shown in FIG. 2 and FIG. 3, resulting in a savings of six data segments or the equivalent of the storage capacity of one server. As shown in column 350 of FIG. 4, the exemplary scenario of server load balancing and/or rebalancing retains the double redundancy level of the network. The extra server storage capacity can be used to store data segments transferred from servers experiencing a slow response time, as will be described in connection with FIG. 7 herein.
Turning to FIG. 5, an exemplary look-up table 500 whereby server load is rebalanced by the addition of a server to a network is shown. The categories of information contained in look-up table 500 are similar to the categories of information contained in exemplary look-up table 300 (FIG. 3), with the exception that before server installation (columns 530 & 570) and after server installation (columns 540 & 580) comparisons are shown.
As shown in FIG. 5, the database installed on each server remains the same throughout the installation process while the number of network servers and their responsibilities (i.e. primary, secondary and/or tertiary and so on) are changed. In column 520 of exemplary look-up table 500, servers 208 through 218 each contain a database comprising data segments 1-50. Data segments 1-50 have been installed in the additional server (titled “NEW”) prior to initiating server installation. Because each server contains a database comprising the same data segments, the network can continue operating while server NEW is added to the network. That is, the continuous operating of the network is not dependent on any one server. When server NEW is brought online, the before server installation settings (columns 530 & 570) are replaced by the after server installation settings (columns 540 & 580), and the network continues to operate uninterrupted.
As an example of server load rebalancing by the installation of an additional server, assume in exemplary look-up table 500, before the installation of server NEW, server 208, server 210, server 212, and server 214 are each processing queries at an unacceptably slow rate. Also assume that server 216 is processing queries at an acceptable rate, and server 218 is processing queries at a maximum rate. As shown in column 560, the network is operating at a single rate of redundancy or redundancy level.
Server load rebalancing based on the exemplary method described in connection with FIG. 8 herein will result in server 208 transferring primary responsibility for data segments 7-8 (column 530) to server 210 (column 540), server 210 transferring primary responsibility for data segments 14-17 (column 530) to server 212 (column 540), and server 212 transferring primary responsibility for data segments 21-26 (column 530) to server NEW (column 540). Likewise, server load rebalancing results in server 214 transferring primary responsibility for data segment 27 (column 530) to server NEW (column 540) and transferring primary responsibility for data segments 35-36 (column 530) to server 216 (column 540). Finally, server 216 transfers primary responsibility for data segments 42-43 (column 530) to server 218 (column 540).
As evidenced by comparing the number of data segments for each network server before installation (column 530) to after installation (column 540), primary responsibility for server 208 decreases by two data segments; primary responsibility for server 210 decreases by two data segments; primary responsibility for server 212 decreases by two data segments and primary responsibility for server 214 decreases by three data segments. In total, the workload of these four servers decreases by nine data segments. After the installation of server NEW, primary responsibility for server 216 remains unchanged and primary responsibility for server 218 increases by two data segments. Finally, primary responsibility for server NEW is initiated with seven data segments (column 540). As shown in column 560, the network remains operating at a single rate of redundancy.
Turning to FIG. 6, an exemplary flowchart for one method of establishing a scalable, redundant and fault-tolerant server network according to an exemplary embodiment of the present invention is shown.
At step 602, an optional program logic controller 202 (FIG. 2) is installed as part of the exemplary network architecture 200 (FIG. 2). In an exemplary embodiment of the server network, program logic controller 202 monitors and rebalances server load, based in part on changes in server usage, server storage and query frequency. Optional program logic controller 202 reduces the need for network manual server maintenance and associated equipment upgrades and purchases, through automating, for example, the aforementioned functions.
At step 604, a database is installed on each server in the exemplary network architecture 200. The contents of each installed database or installed database copy can be substantially the same or may have certain segments of data omitted. Database examples include but are not limited to telephone directories, customer databases or catalogs of products and/or services.
At step 606, an optional master database 204 (FIG. 2) is installed in the server network. Optional master database 204 may be accessed by all of the servers in the network should such access ever prove to be necessary.
At step 608, network servers in the exemplary network architecture 200 are configured to receive all queries. In exemplary embodiments, communications network 220 uses multicasting or broadcasting to communicate all queries to all servers. In these embodiments, all servers receive all queries. In alternative embodiments, other wired and/or wireless mechanisms communicate all queries to all servers. In yet further embodiments, queries and/or notice of queries are communicated to a subset of servers comprising the network, wherein the servers that are not responsible for processing a particular query do not receive the query.
At step 610, a look-up table 300 (FIG. 3) is installed for each of servers 208 through 218 (FIG. 2) that comprise the exemplary network architecture 200. In exemplary embodiments, look-up table 300 directs server query processing. Look-up-table 300 may be local or remote relative to the server 208 through 218.
At step 612, a server redundancy level may be established for exemplary network architecture 200. Server redundancy level is a function of the tolerance for server network failure. The lesser the tolerance for server network failure, the higher the server redundancy level. For example, users that can tolerate an occasional network failure might establish a single redundancy level as shown in FIG. 5, whereas users that can not tolerate an occasional network failure might establish a double redundancy level as shown in FIG. 4. As illustrated by exemplary look-up table 300, a single redundancy level signifies that if a server assigned primary responsibility fails to process an assigned query within an allocated period of time, another server with secondary responsibility for the same segment of data will attempt to process the query. A double redundancy level signifies that a third server assigned tertiary responsibility will attempt to process any query missed by the servers assigned primary and secondary responsibilities. Assuming the installation of an entire database on each network server as illustrated in FIG. 2, FIG. 3, and FIG. 5, the redundancy level of a server network is limited only by the number of servers (e.g. servers 208 through 218) on the network.
At step 614, servers comprising the exemplary network architecture 200 are configured upon receipt of each query to check the look-up table installed at step 610. In exemplary embodiments, look-up table 300 identifies the data segments installed in each server.
At step 616, servers comprising the exemplary network architecture 200 are configured to process queries per look-up table 300. Look-up table 300, in the present example, allocates each server 100 milliseconds in which to complete its assigned primary responsibility.
At step 618, servers comprising the exemplary network architecture 200 are configured to monitor query processing by the other servers in the network. In exemplary embodiments, communications network 220 allows each server in the exemplary network architecture 200 to monitor query processing by the other servers in the network by ‘listening,’ for example, for a broadcast or multicast reply to the query. In alternative embodiments, other forms of server peer monitoring are used, such as optional communications linkage 206. In yet further embodiments, a subset of servers comprising the network are monitored by peer servers, wherein the servers that are not to be responsible for processing a particular query are not monitored.
At step 620, servers comprising the exemplary network architecture 200 are configured to transmit query results to users.
At step 622, servers comprising the exemplary network architecture 200 are configured to reset upon transmission of a query result to a user. That is, present response time is reset to zero.
Turning to FIG. 7, a flowchart for an exemplary method of processing a query on the exemplary network architecture 200 (FIG. 2) is shown.
At step 710, a query is submitted to exemplary network architecture 200. In the case of a server network for a telephone directory database, user 150 (FIG. 2) submits a query for an address corresponding to a person having the last name of Jones.
At step 720, the submitted query is communicated to network servers. Here, the query for an address corresponding to a person having the last name of Jones is multicast through communications network 220 (FIG. 2) to all servers in the exemplary network architecture 200.
At step 730, the identity of the server having primary responsibility for processing the submitted query is determined based upon referencing a look-up table. Here, the look-up table for the queried telephone directory database reflects that the tenth server of twenty-six servers (each corresponding to a letter of the alphabet) is assigned primary responsibility for processing queries corresponding to last names beginning with the letter “J.” Therefore, server ten has the primary responsibility of querying its data segment for the address of Jones.
At step 740, the submitted query is processed (or attempted to be processed) by the responsible server. In this particular case, server ten processes the query for the address corresponding to Jones.
At step 750, the processing of the submitted query by the server having primary responsibility is monitored by the other servers in the network. In exemplary embodiments, communications network 220 allows each server in the exemplary network architecture 200 to monitor query processing by the other servers in the network through, for example, listening for a multicast or broadcast reply to the query. In alternative embodiments, other forms of server peer monitoring are used, such as through optional communications linkage 206. In yet further embodiments, a subset of servers comprising the network are monitored by peer servers, wherein the servers that are not to be responsible for processing a particular query are not monitored. In this particular example, twenty-five of the twenty-six servers comprising the server network for the telephone directory database monitor the processing by server ten for the address corresponding to Jones.
At step 760, it is determined whether the submitted query has been processed within the allocated time. Referring to look-up table 300 (FIG. 3), each server is allocated 100 milliseconds in which to complete its assigned primary responsibility. In this particular example, server ten determined within 100 milliseconds that the address corresponding to Jones is 2200 Geng Road, Palo Alto, Calif.
At step 770, should the query not be processed within the allocated time by the server having primary responsibility for the query (e.g. due to server delay or server failure), a server with secondary responsibility is determined based on the method described in connection with step 730. The server with secondary responsibility then processes the query as set forth in steps 740-750. Other backup and/or secondary servers continue to await an indication the query has been timely processed in step 760.
At step 780, the query result is transmitted to the user who submitted the query via, for example, a multicast or broadcast methodology. In this particular example, user 150 (FIG. 2) will receive the address for Jones via a multicast transmission over communications network 220.
At step 790, network servers reset for the next query. In this particular example, the twenty-six servers comprising the telephone directory database will reset their present processing time to zero in anticipation of the next query to be submitted by a user. That is, the network servers await the arrival of a new query wherein overall processing time with regard to a particular query begins relative that particular query and its own timestamp (i.e., the overall time since the query was made or was received by the network servers).
Turning to FIG. 8, a flowchart for one exemplary method of evaluating and rebalancing network server load according to an exemplary embodiment of the invention is shown. All steps in FIG. 8 can be performed either manually or with the assistance of an optional program logic controller 202 (FIG. 2).
At step 810, an overall network query response rate is determined. For example, the average time it takes to process each query submitted to the exemplary network architecture 200 (FIG. 2) can be determined for a twenty-four hour period either manually or by an optional program logic controller 202. Various other time periods or measures of response time may be used.
At step 820, the overall network query response rate as determined by step 810 is compared to a target overall network query response rate. For example, with respect to the telephone directory database described in connection with FIG. 7, the particular telephone company responsible for the telephone directory database might determine, on average, it desires to have all queries processed within 100 milliseconds. This comparison represents a measure by which the server network can be evaluated apart from the performance of the constituent servers.
At step 830, individual server query response rates are determined. For example, the average time it takes each of servers 208 through 218 in FIG. 2 to process each query submitted in the exemplary network architecture 200 can be determined for a twenty-four hour period either manually or by an optional program logic controller 202 or through various other periods and measures of time.
At step 840, the response rates for all of the servers are compared. For example, in the exemplary embodiment illustrated in FIG. 5 and described herein, the response rates of server 208, server 210, server 212, and server 214 were slower than the response rates of server 216 and server 218, which indicated that servers 208 through 214 warranted a reduction in the number of data segments for which they had primary responsibility.
At step 850, the primary responsibilities for particular data segments are transferred from the servers having slower query response rates to the servers having faster query response rates. For example, as described in connection with FIG. 5, server 208 transferred primary responsibility for data segments 7-8 (two data segments) to server 210, server 210 transferred primary responsibility for data segments 14-17 (four data segments) to server 212.
At step 860, an overall network query response rate is re-determined in the same fashion as described in connection with step 810.
At step 870, the re-determined overall network query response rate as determined at step 860 is re-compared to the target overall network response rate.
At step 880, a decision is made as to whether the performance of the rebalanced server network measures favorably against the target network response rate. If the performance of the rebalanced server network is satisfactory, the overall network query response rate can be periodically re-determined, as described in connection with step 810. If the performance of the rebalanced server network is not satisfactory, then step 890 may need to be undertaken.
At step 890, an additional server is installed in the server network, as described in connection with the exemplary method shown in FIG. 9.
Turning to FIG. 9, a flowchart for an exemplary method of rebalancing network server load by the installation of an additional server is shown.
At step 902, a database or database copy corresponding to the database used in the existing server network is installed on the additional server.
At step 904, an optional master database 204 (FIG. 2), is made accessible to the additional server to the network. This optional master database may have previously been made available to all other servers in the network.
At step 906, the additional server is configured to receive all queries submitted to the server network. In the example shown in FIG. 2, user queries are directed through communications network 220 (FIG. 2) to all servers in the network. In yet further embodiments, queries and/or notice of queries are communicated to a subset of servers comprising the network.
At step 908, a look-up table, such as exemplary look-up table 500 (FIG. 5), is either installed on the additional server or made accessible to the additional server. As shown in column 510 of exemplary look-up table 500, the look-up table reflects the presence of the additional server.
At step 910, the additional server to the server network is configured to check the modified look-up table described in step 908.
At step 912, the additional server to the network is configured to process queries.
At step 914, the additional server to the network is configured to monitor query processing by the other servers in the network.
At step 916, the additional server to the network is configured to transmit query results to the user.
At step 918, the additional server to the network is configured to reset its present response time in preparation for the next query to be submitted over the network.
The present invention is described above with reference to exemplary embodiments. It will be apparent to those skilled in the art that various modifications may be made and other embodiments can be used without departing from the broader scope of the present invention. Therefore, these and other variations upon the exemplary embodiments are intended to be covered by the present invention.

Claims (24)

1. A system of networked, redundant computing devices the system comprising:
a plurality of computing devices, wherein each of the plurality of computing devices are:
configured to execute a task,
assigned responsibility for at least a portion of the task, and
coupled to a communications network and receive a query corresponding to the task, the query received by all of the plurality of computing devices;
a look-up table stored in memory at each of the plurality of computing devices and accessible by each of the plurality of computing devices, the look-up table configured to direct execution of the task by the plurality of computing devices in response to the query based on at least the responsibility for at least the portion of the task assigned to each of the plurality of computing devices; and
control logic stored in memory at each of the plurality of computing devices and executable to monitor completion of query processing, whereupon a second computing device from the plurality of computing devices attempts to execute the task corresponding to the query as directed by the look-up table and the monitoring process when a first computing device designated by the look-up table from the plurality of computing devices fails to execute the task corresponding to the query within a predetermined time period, and the plurality of computing devices reset for processing of a new query upon successful processing of the query corresponding to the task.
2. The system of networked, redundant computing devices of claim 1, wherein the query is delivered over the communications network through a multicast transmission.
3. The system of networked, redundant computing devices of claim 1, wherein the query is delivered over the communications network through a broadcast transmission.
4. The system of networked, redundant computing devices of claim 1, wherein each of the plurality of computing devices is assigned a primary responsibility for executing the task and each of the plurality of computing devices is assigned a secondary responsibility for executing the task.
5. The system of networked, redundant computing devices of claim 4, further comprising a program logic controller, the program logic controller configured to reassign the primary and secondary responsibilities for execution of the task in response to the query.
6. The system of networked, redundant computing devices of claim 4, wherein the primary responsibility for each of the plurality of computing devices differs from the secondary responsibility for each of the plurality of computing devices.
7. The system of networked, redundant computing devices of claim 1, wherein each of the plurality of computing devices is configured to execute the task.
8. The system of networked, redundant computing devices of claim 7, wherein each of the plurality of computing devices is further configured to execute a different task.
9. The system of networked, redundant computing devices of claim 1, wherein the query is a request for information.
10. The system of networked, redundant computing devices of claim 1, wherein the query is a request for a determination of information.
11. The system of networked, redundant computing devices of claim 1, further comprising an intermediate computing device coupled to the communications network, the intermediate computing device not otherwise configured to execute a task, and wherein the query is delivered to each of the plurality of computing devices by the communications network via the intermediate computing device.
12. The system of networked, redundant computing devices of claim 11, wherein the intermediate computing device is configured to maintain a queue of queries for distribution to the plurality of computing devices.
13. A method for balancing processing responsibilities for a plurality of computing devices, comprising:
determining an average query response rate for a first computing device from the plurality of computing devices, each of the plurality of computing devices having been assigned responsibility for at least a portion of a task and the first computing device having being assigned primary processing responsibility according to a look-up table stored in memory at each of the plurality of computing devices for at least a portion of a task, and having been coupled to the network to receive a user generated query;
determining an average query response rate for all other computing devices from the plurality of computing devices;
comparing the average query response rate for the first computing device against the average query response rate for all other computing devices;
receiving an active query at all of a plurality of servers in a network;
monitoring the processing of the active query;
reallocating processing responsibilities for the plurality of computing devices based on at least the responsibility for at least the portion of the task assigned to each of the plurality of computing devices according to the look-up table when the average query response rate for the first computing device is imbalanced relative the average query response rate for all other computing devices from the plurality of computing devices; and
resetting the plurality of servers for processing of a new query upon successful processing of the active query.
14. The method for balancing processing responsibilities of claim 13, wherein the reallocation of processing responsibilities is automated.
15. The method for balancing processing responsibilities of claim 14, wherein the automated relocation occurs in response to an instruction from a program logic controller.
16. The method for balancing processing responsibilities of claim 13, wherein the reallocation of processing responsibilities is manual.
17. A computer-readable storage medium having embodied thereon a program, the program being executable by a computer to provide a method for directing execution of a task amongst a plurality of computing devices, the method comprising:
allocating a primary responsibility for execution of the task to a first computing device according to a look-up table stored in memory at each of the plurality of computing devices;
allocating a secondary responsibility for execution of the task to a second computing device according to a look-up table stored in memory at each of the plurality of computing devices;
receiving at all of the plurality of computers a user generated query,
monitoring the completion of processing of the user generated query;
allocating a time period for the first computing device with the primary responsibility for executing the task to execute the task upon receipt of the user generated query;
instructing the second computing device with the secondary responsibility for executing the task to execute the task when the time period for the first computing device with the primary responsibility for executing the task to execute the task has expired without the task having been executed according to the look-up table based on at least the secondary responsibility for execution of the task allocated to each of the second computing device; and
resetting the plurality of servers for processing of a new user generated query upon successful processing of the user generated query.
18. The computer-readable storage medium of claim 17, wherein the computer -readable medium is installed on each of the plurality of computing devices configured to execute the task.
19. The computer-readable storage medium of claim 17, wherein the computer-readable medium is installed on a master computing device not otherwise configured to execute the task, the master computing device configured to be accessed by each of the plurality of computing devices configured to execute the task.
20. A method for introducing a new computing device to a redundant-network of computing devices, comprising:
installing a database on the new computing device, wherein the database is identical to a database installed on each of already existing computing devices in the redundant-network of computing devices, each of the already existing computing devices having been assigned responsibility for at least a portion of the database;
configuring the new computing device to receive all queries submitted to the redundant-network of computing devices;
allocating a primary and secondary processing responsibility to the new computing device via a look-up table, the look-up table further assigning primary and secondary processing responsibility to each of the already existing computing devices in the redundant-network of computing devices;
receiving an active query at all of the plurality of computing devices,
monitoring the completion of processing of the active query;
processing the active query at a computing device from the plurality of computing devices identified by the look-up table as having primary responsibility for processing the query based on at least the responsibility for at least the portion of the database assigned to each of the already existing computing devices; and
resetting the plurality of computing devices for processing of a new query upon successful processing of the active query.
21. The method of claim 20, wherein the look-up table is installed on the new computing device.
22. The method of claim 20, wherein the look-up table is installed on at least one of the already-existing computing devices.
23. The method of claim 22, wherein the new computing device accesses the look-up table via a communications link with the at least one of the already-existing computing device where the look-up table is installed.
24. The method of claim 20, wherein the look-up table is installed on a master computing device, and the new computing device is configured to access the master computing device.
US11/367,174 2006-02-15 2006-03-03 Systems and methods for server management Active 2026-10-31 US7716238B2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
US11/367,174 US7716238B2 (en) 2006-02-15 2006-03-03 Systems and methods for server management
DE602007004076T DE602007004076D1 (en) 2006-02-15 2007-02-05 SYSTEMS AND METHODS FOR SERVER MANAGEMENT
EP07750128A EP1987454B1 (en) 2006-02-15 2007-02-05 Systems and methods for server management
PCT/US2007/003258 WO2007097915A2 (en) 2006-02-15 2007-02-05 Systems and methods for server management
JP2008555276A JP5015965B2 (en) 2006-02-15 2007-02-05 Server management system and method
AT07750128T ATE453898T1 (en) 2006-02-15 2007-02-05 SERVER MANAGEMENT SYSTEMS AND METHODS

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/355,327 US7979460B2 (en) 2006-02-15 2006-02-15 Systems and methods for server management
US11/367,174 US7716238B2 (en) 2006-02-15 2006-03-03 Systems and methods for server management

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/355,327 Continuation US7979460B2 (en) 2006-02-15 2006-02-15 Systems and methods for server management

Publications (2)

Publication Number Publication Date
US20070198528A1 US20070198528A1 (en) 2007-08-23
US7716238B2 true US7716238B2 (en) 2010-05-11

Family

ID=38314133

Family Applications (1)

Application Number Title Priority Date Filing Date
US11/367,174 Active 2026-10-31 US7716238B2 (en) 2006-02-15 2006-03-03 Systems and methods for server management

Country Status (6)

Country Link
US (1) US7716238B2 (en)
EP (1) EP1987454B1 (en)
JP (1) JP5015965B2 (en)
AT (1) ATE453898T1 (en)
DE (1) DE602007004076D1 (en)
WO (1) WO2007097915A2 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7962509B2 (en) 2006-02-15 2011-06-14 Sony Computer Entertainment America Llc Systems and methods for server management
US20110214007A1 (en) * 2000-03-16 2011-09-01 Silicon Graphics, Inc. Flexible failover policies in high availability computing systems
US8126987B2 (en) 2009-11-16 2012-02-28 Sony Computer Entertainment Inc. Mediation of content-related services
US8433759B2 (en) 2010-05-24 2013-04-30 Sony Computer Entertainment America Llc Direction-conscious information sharing
US8966557B2 (en) 2001-01-22 2015-02-24 Sony Computer Entertainment Inc. Delivery of digital content
US9483405B2 (en) 2007-09-20 2016-11-01 Sony Interactive Entertainment Inc. Simplified run-time program translation for emulating complex processor pipelines

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8641545B2 (en) * 2012-03-06 2014-02-04 Brunswick Bowling & Billiards Corporation Distributed scoring system
US9069833B2 (en) 2013-01-09 2015-06-30 International Business Machines Corporation Detecting data omissions for an intermittently-connected application
US9665633B2 (en) 2014-02-19 2017-05-30 Snowflake Computing, Inc. Data management systems and methods
JP6325995B2 (en) * 2015-02-20 2018-05-16 日本電信電話株式会社 Distributed system, load balancing method and program
CN106559822A (en) * 2015-09-29 2017-04-05 中兴通讯股份有限公司 A kind of network management and emergency system

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5187787A (en) * 1989-07-27 1993-02-16 Teknekron Software Systems, Inc. Apparatus and method for providing decoupling of data exchange details for providing high performance communication between software processes
US5339392A (en) * 1989-07-27 1994-08-16 Risberg Jeffrey S Apparatus and method for creation of a user definable video displayed document showing changes in real time data
US5504894A (en) * 1992-04-30 1996-04-02 International Business Machines Corporation Workload manager for achieving transaction class response time goals in a multiprocessing system
US5778187A (en) * 1996-05-09 1998-07-07 Netcast Communications Corp. Multicasting method and apparatus
WO1998049620A1 (en) 1997-04-25 1998-11-05 Symbios, Inc. Redundant server failover in networked environment
US5864854A (en) * 1996-01-05 1999-01-26 Lsi Logic Corporation System and method for maintaining a shared cache look-up table
US5867494A (en) * 1996-11-18 1999-02-02 Mci Communication Corporation System, method and article of manufacture with integrated video conferencing billing in a communication system architecture
US6003030A (en) 1995-06-07 1999-12-14 Intervu, Inc. System and method for optimized storage and retrieval of data on a distributed computer network
US6108703A (en) 1998-07-14 2000-08-22 Massachusetts Institute Of Technology Global hosting system
US6157955A (en) * 1998-06-15 2000-12-05 Intel Corporation Packet processing system including a policy engine having a classification unit
US6199110B1 (en) 1997-05-30 2001-03-06 Oracle Corporation Planned session termination for clients accessing a resource through a server
US6212521B1 (en) * 1997-09-25 2001-04-03 Fujitsu Limited Data management system, primary server, and secondary server for data registration and retrieval in distributed environment
US6263433B1 (en) * 1998-09-30 2001-07-17 Ncr Corporation Provision of continuous database service and scalable query performance using active redundant copies
US6421726B1 (en) 1997-03-14 2002-07-16 Akamai Technologies, Inc. System and method for selection and retrieval of diverse types of video data on a computer network
US6564336B1 (en) * 1999-12-29 2003-05-13 General Electric Company Fault tolerant database for picture archiving and communication systems
US6625152B1 (en) * 1999-10-13 2003-09-23 Cisco Technology, Inc. Methods and apparatus for transferring data using a filter index

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04271454A (en) * 1991-02-27 1992-09-28 Toshiba Corp Loosely-coupled computer system
JPH05128077A (en) * 1991-11-06 1993-05-25 Hitachi Ltd Method for distributed processing plural computer system
JP2004062603A (en) * 2002-07-30 2004-02-26 Dainippon Printing Co Ltd Parallel processing system, server, parallel processing method, program and recording medium
JP3928505B2 (en) * 2002-07-05 2007-06-13 株式会社デンソー Vehicle communication system

Patent Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5339392A (en) * 1989-07-27 1994-08-16 Risberg Jeffrey S Apparatus and method for creation of a user definable video displayed document showing changes in real time data
US5187787B1 (en) * 1989-07-27 1996-05-07 Teknekron Software Systems Inc Apparatus and method for providing decoupling of data exchange details for providing high performance communication between software processes
US5187787A (en) * 1989-07-27 1993-02-16 Teknekron Software Systems, Inc. Apparatus and method for providing decoupling of data exchange details for providing high performance communication between software processes
US5504894A (en) * 1992-04-30 1996-04-02 International Business Machines Corporation Workload manager for achieving transaction class response time goals in a multiprocessing system
US6003030A (en) 1995-06-07 1999-12-14 Intervu, Inc. System and method for optimized storage and retrieval of data on a distributed computer network
US6665706B2 (en) 1995-06-07 2003-12-16 Akamai Technologies, Inc. System and method for optimized storage and retrieval of data on a distributed computer network
US6502125B1 (en) 1995-06-07 2002-12-31 Akamai Technologies, Inc. System and method for optimized storage and retrieval of data on a distributed computer network
US5864854A (en) * 1996-01-05 1999-01-26 Lsi Logic Corporation System and method for maintaining a shared cache look-up table
US5778187A (en) * 1996-05-09 1998-07-07 Netcast Communications Corp. Multicasting method and apparatus
US5867494A (en) * 1996-11-18 1999-02-02 Mci Communication Corporation System, method and article of manufacture with integrated video conferencing billing in a communication system architecture
US6421726B1 (en) 1997-03-14 2002-07-16 Akamai Technologies, Inc. System and method for selection and retrieval of diverse types of video data on a computer network
WO1998049620A1 (en) 1997-04-25 1998-11-05 Symbios, Inc. Redundant server failover in networked environment
US6199110B1 (en) 1997-05-30 2001-03-06 Oracle Corporation Planned session termination for clients accessing a resource through a server
US6212521B1 (en) * 1997-09-25 2001-04-03 Fujitsu Limited Data management system, primary server, and secondary server for data registration and retrieval in distributed environment
US6157955A (en) * 1998-06-15 2000-12-05 Intel Corporation Packet processing system including a policy engine having a classification unit
US6553413B1 (en) 1998-07-14 2003-04-22 Massachusetts Institute Of Technology Content delivery network using edge-of-network servers for providing content delivery to a set of participating content providers
US6108703A (en) 1998-07-14 2000-08-22 Massachusetts Institute Of Technology Global hosting system
US6263433B1 (en) * 1998-09-30 2001-07-17 Ncr Corporation Provision of continuous database service and scalable query performance using active redundant copies
US6625152B1 (en) * 1999-10-13 2003-09-23 Cisco Technology, Inc. Methods and apparatus for transferring data using a filter index
US6564336B1 (en) * 1999-12-29 2003-05-13 General Electric Company Fault tolerant database for picture archiving and communication systems

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"How Network Load Balancing Technology Works," Microsoft TechNet, Mar. 28, 2003, 2007 Microsoft Corporation, http://technet2.microsoft.com/windowsserver/en/library/1611cae3-5865-4897-a186-7....

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110214007A1 (en) * 2000-03-16 2011-09-01 Silicon Graphics, Inc. Flexible failover policies in high availability computing systems
US8769132B2 (en) 2000-03-16 2014-07-01 Sony Computer Entertainment America Llc Flexible failover policies in high availability computing systems
US9405640B2 (en) 2000-03-16 2016-08-02 Sony Interactive Entertainment America Llc Flexible failover policies in high availability computing systems
US8966557B2 (en) 2001-01-22 2015-02-24 Sony Computer Entertainment Inc. Delivery of digital content
US7962509B2 (en) 2006-02-15 2011-06-14 Sony Computer Entertainment America Llc Systems and methods for server management
US8732162B2 (en) 2006-02-15 2014-05-20 Sony Computer Entertainment America Llc Systems and methods for server management
US9886508B2 (en) 2006-02-15 2018-02-06 Sony Interactive Entertainment America Llc Systems and methods for server management
US9483405B2 (en) 2007-09-20 2016-11-01 Sony Interactive Entertainment Inc. Simplified run-time program translation for emulating complex processor pipelines
US8126987B2 (en) 2009-11-16 2012-02-28 Sony Computer Entertainment Inc. Mediation of content-related services
US8433759B2 (en) 2010-05-24 2013-04-30 Sony Computer Entertainment America Llc Direction-conscious information sharing

Also Published As

Publication number Publication date
ATE453898T1 (en) 2010-01-15
DE602007004076D1 (en) 2010-02-11
WO2007097915A2 (en) 2007-08-30
US20070198528A1 (en) 2007-08-23
EP1987454B1 (en) 2009-12-30
JP2009527056A (en) 2009-07-23
EP1987454A2 (en) 2008-11-05
JP5015965B2 (en) 2012-09-05
WO2007097915A3 (en) 2007-12-06

Similar Documents

Publication Publication Date Title
US9886508B2 (en) Systems and methods for server management
US7716238B2 (en) Systems and methods for server management
US7185096B2 (en) System and method for cluster-sensitive sticky load balancing
US7937437B2 (en) Method and apparatus for processing a request using proxy servers
US7181524B1 (en) Method and apparatus for balancing a load among a plurality of servers in a computer system
US6938031B1 (en) System and method for accessing information in a replicated database
EP2904763B1 (en) Load-balancing access to replicated databases
EP1330907B1 (en) Method and apparatus for real-time parallel delivery of segments of a large payload file
US6185601B1 (en) Dynamic load balancing of a network of client and server computers
US6564252B1 (en) Scalable storage system with unique client assignment to storage server partitions
US6044367A (en) Distributed I/O store
US10255148B2 (en) Primary role reporting service for resource groups
US7434087B1 (en) Graceful failover using augmented stubs
US8447757B1 (en) Latency reduction techniques for partitioned processing
US20030126200A1 (en) Dynamic load balancing of a network of client and server computer
US20090037367A1 (en) System and Methodology Providing Workload Management in Database Cluster
US20050108593A1 (en) Cluster failover from physical node to virtual node
US20020169889A1 (en) Zero-loss web service system and method
JP2003022209A (en) Distributed server system
US20020087612A1 (en) System and method for reliability-based load balancing and dispatching using software rejuvenation
CN102253869A (en) Scaleable fault-tolerant metadata service
US7000016B1 (en) System and method for multi-site clustering in a network
US20160062854A1 (en) Failover system and method
WO1999053415A1 (en) Distributed processing over a network
US7694012B1 (en) System and method for routing data

Legal Events

Date Code Title Description
AS Assignment

Owner name: SONY COMPUTER ENTERTAINMENT AMERICA INC., CALIFORN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:HARRIS, ADAM PIERCE;REEL/FRAME:017652/0455

Effective date: 20060215

Owner name: SONY COMPUTER ENTERTAINMENT AMERICA INC.,CALIFORNI

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:HARRIS, ADAM PIERCE;REEL/FRAME:017652/0455

Effective date: 20060215

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: SONY COMPUTER ENTERTAINMENT AMERICA LLC, CALIFORNI

Free format text: MERGER;ASSIGNOR:SONY COMPUTER ENTERTAINMENT AMERICA INC.;REEL/FRAME:025406/0506

Effective date: 20100401

FPAY Fee payment

Year of fee payment: 4

AS Assignment

Owner name: SONY INTERACTIVE ENTERTAINMENT AMERICA LLC, CALIFO

Free format text: CHANGE OF NAME;ASSIGNOR:SONY COMPUTER ENTERTAINMENT AMERICA LLC;REEL/FRAME:038617/0474

Effective date: 20160331

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552)

Year of fee payment: 8

AS Assignment

Owner name: SONY INTERACTIVE ENTERTAINMENT LLC, CALIFORNIA

Free format text: MERGER;ASSIGNOR:SONY INTERACTIVE ENTERTAINMENT AMERICA LLC;REEL/FRAME:053323/0567

Effective date: 20180315

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 12