WO2002001347A3 - Method and system for automatic re-assignment of software components of a failed host - Google Patents

Method and system for automatic re-assignment of software components of a failed host Download PDF

Info

Publication number
WO2002001347A3
WO2002001347A3 PCT/SE2001/001448 SE0101448W WO0201347A3 WO 2002001347 A3 WO2002001347 A3 WO 2002001347A3 SE 0101448 W SE0101448 W SE 0101448W WO 0201347 A3 WO0201347 A3 WO 0201347A3
Authority
WO
WIPO (PCT)
Prior art keywords
hosts
monitoring
components
host
software components
Prior art date
Application number
PCT/SE2001/001448
Other languages
French (fr)
Other versions
WO2002001347A2 (en
Inventor
Edwin Tse
Nicolas Gosselin
Fergus Kelledy
David O'flanagan
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M filed Critical Ericsson Telefon Ab L M
Priority to AU2001266503A priority Critical patent/AU2001266503A1/en
Publication of WO2002001347A2 publication Critical patent/WO2002001347A2/en
Publication of WO2002001347A3 publication Critical patent/WO2002001347A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2035Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques

Abstract

In a network of co-operating hosts (80, 82, 84, 86, 88), a method and system for automatic re-assignment of software components (110, 112) of a failed host to co-operating monitoring (82, 86) or back-up hosts. In a preferred embodiment, a Central Information Repository (CIR), such as an LDAP server, keeps track of software components (110, 112) running on the network hosts (80, 82, 84, 86, 88) and a Monitoring Partnership Program (MPP), in which some hosts (80, 82, 84, 86, 88) monitor the activity of other hosts (80, 82, 84, 86, 88), is provided. Upon failure of a monitored host (84), a monitoring host (82, 86) detects the failure, and informs the other monitoring hosts (82, 86) or the other back-up hosts, if any, of the failure of the monitored host (84). The monitoring hosts (82, 86), and/or the back-up hosts query the CIR for obtaining the identity of the software components (110, 112) running on the failed host (84) before the failure, and select which such components (110, 112) each will start. The monitoring hosts (82, 86) and/or the back-up hosts then take over and start the failed components (110, 112). Upon recovery, the monitored host (84) queries the CIR and obtains the list of its software components, informs the CIR and the monitoring or back-up hosts (82, 86) that it will take over, and starts its components (110, 112), while the monitoring and/or the back-up hosts (82, 86) shut down the components (110, 112) they temporarily run.
PCT/SE2001/001448 2000-06-30 2001-06-21 Method and system for automatic re-assignment of software components of a failed host WO2002001347A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001266503A AU2001266503A1 (en) 2000-06-30 2001-06-21 Method and system for automatic re-assignment of software components of a failedhost

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US60911100A 2000-06-30 2000-06-30
US09/609,111 2000-06-30

Publications (2)

Publication Number Publication Date
WO2002001347A2 WO2002001347A2 (en) 2002-01-03
WO2002001347A3 true WO2002001347A3 (en) 2002-06-20

Family

ID=24439380

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2001/001448 WO2002001347A2 (en) 2000-06-30 2001-06-21 Method and system for automatic re-assignment of software components of a failed host

Country Status (2)

Country Link
AU (1) AU2001266503A1 (en)
WO (1) WO2002001347A2 (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6938256B2 (en) 2000-01-18 2005-08-30 Galactic Computing Corporation System for balance distribution of requests across multiple servers using dynamic metrics
US8538843B2 (en) 2000-07-17 2013-09-17 Galactic Computing Corporation Bvi/Bc Method and system for operating an E-commerce service provider
US6816905B1 (en) 2000-11-10 2004-11-09 Galactic Computing Corporation Bvi/Bc Method and system for providing dynamic hosted service management across disparate accounts/sites
US7055052B2 (en) 2002-11-21 2006-05-30 International Business Machines Corporation Self healing grid architecture for decentralized component-based systems
US8489741B2 (en) 2002-11-21 2013-07-16 International Business Machines Corporation Policy enabled grid architecture
US8140677B2 (en) 2002-11-21 2012-03-20 International Business Machines Corporation Autonomic web services hosting service
US7200781B2 (en) 2003-05-14 2007-04-03 Hewlett-Packard Development Company, L.P. Detecting and diagnosing a malfunctioning host coupled to a communications bus
CA2435655A1 (en) * 2003-07-21 2005-01-21 Symbium Corporation Embedded system administration
US7676621B2 (en) 2003-09-12 2010-03-09 Hewlett-Packard Development Company, L.P. Communications bus transceiver
DE102004050350B4 (en) 2004-10-15 2006-11-23 Siemens Ag Method and device for redundancy control of electrical devices
CA2504333A1 (en) 2005-04-15 2006-10-15 Symbium Corporation Programming and development infrastructure for an autonomic element
US8856585B2 (en) * 2011-08-01 2014-10-07 Alcatel Lucent Hardware failure mitigation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000010822A (en) * 1998-06-25 2000-01-14 Yokogawa Electric Corp Down detecting device for decentralized object
EP0981089A2 (en) * 1998-07-20 2000-02-23 Lucent Technologies Inc. Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network
EP0990986A2 (en) * 1998-09-30 2000-04-05 Ncr International Inc. Failure recovery of partitioned computer systems including a database schema

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000010822A (en) * 1998-06-25 2000-01-14 Yokogawa Electric Corp Down detecting device for decentralized object
EP0981089A2 (en) * 1998-07-20 2000-02-23 Lucent Technologies Inc. Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network
EP0990986A2 (en) * 1998-09-30 2000-04-05 Ncr International Inc. Failure recovery of partitioned computer systems including a database schema

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PATENT ABSTRACTS OF JAPAN vol. 2000, no. 04 31 August 2000 (2000-08-31) *

Also Published As

Publication number Publication date
AU2001266503A1 (en) 2002-01-08
WO2002001347A2 (en) 2002-01-03

Similar Documents

Publication Publication Date Title
WO2002001347A3 (en) Method and system for automatic re-assignment of software components of a failed host
CA2288016A1 (en) Method and system for recovery in a partitioned shared nothing database system using virtual shared disks
US6421688B1 (en) Method and apparatus for database fault tolerance with instant transaction replication using off-the-shelf database servers and low bandwidth networks
KR100442884B1 (en) Method for updating firmware
WO1999057632A3 (en) Initializing and restarting operating systems
CA2106280A1 (en) Apparatus and methods for fault-tolerant computing employing a daemon monitoring process and fault-tolerant library to provide varying degrees of fault tolerance
WO1999000720A3 (en) Method and arrangement for detecting a non-authorised user access to a communications network
WO2001084313A3 (en) Method and system for achieving high availability in a networked computer system
EP0974903A3 (en) Method and apparatus for providing failure detection and recovery with predetermined replication style for distributed applications in a network
EP0981089A3 (en) Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network
WO2002050678A8 (en) Method of 'split-brain' prevention in computer cluster systems
AU7684598A (en) Automatic regeneration of user data from a network
DK0954779T3 (en) Procedure for reconstructing a calculation mode
WO2002012987A3 (en) Systems and methods for authenticating a user to a web server
KR20000063313A (en) Synthesis of People Search (Online) and Direct Search Agent (Offline) Using Internet
CN112506702B (en) Disaster recovery method, device, equipment and storage medium for data center
WO2000054149A3 (en) Methods and systems for reduced configuration dependency in thin client applications
WO2006073847A3 (en) Systems and methods for dynamic data backup
JPH11120012A (en) Client-server type data base management system and recording medium where program thereof is recorded
WO2002073398A3 (en) Method, system, and program for determining system configuration information
JP2003524255A (en) Internet based remote data and file recovery system and method
CN109274761A (en) A kind of NAS clustered node, system and data access method
CN111104282A (en) Node processing method and device based on block chain
WO2004070521A3 (en) Alternate server system
WO2001026355A3 (en) A method and apparatus in a communication network for updating and maintaining record data

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP