This document is OBSOLETE, and has been superseded by information in the DataONE types schema. It will be deleted after review.
A NodeList is a synchronized register for all of the nodes in the DataONE environment. It contains the information needed by DataONE to orchestrate activities across the distributed coordinating and member nodes of the network. While some information is provided by the Member Nodes themselves, the node list is maintained dynamically by the Coordinating Nodes. The node list is mutable in that it reflects the latest state of the nodes that are part of the system. Replicated copies of the node list are maintained at each of the Coordinating nodes.
Registry
ContactGroup
groupid
name
description
members
Contact
contactid
role (administrator, manager, ...)
givenName (first name)
sn (surname)
notification
type (phone, email, IRC, ...)
connection (phone number, email address, IRC channel)
Network (1..n, replaces "environment")
networkid
name
description
adminGroup
notifyGroup
Node
nodeid
name
description
location
adminGroup
notifyGroup
created (date created / registered)
modified (time stamp for modification)
lastSynchronization (time stamp)
objectFormatsSupported (list of object formats known to support)
synchronize
replicate
replicationTarget
service
version (schema version supported, MN)
baseURL (MN)
name (human readable name for service, e.g. "DataONE-0.6.1", MN)
activeNetwork (id of network this interface is active for, MN)
lastChecked (last time service was examined, CN)
method
name (MN)
isactive (set by CN)
The node list is a complex data type, with three main sub-structures: services, synchronization, and health. Some data is provided at node registration time, while other items are generated by DataONE itself in the course of managing objects.
The nodelist schema is expressed in XMLSchema and is available at:
The following list of fields represents the set of information collected and maintained by Coordinating Nodes for every node in the system.
Table 1. Quick reference to the NodeList fields described in more detail below.
Group |
Field |
Type |
Cardinality |
Generate By |
Version |
---|---|---|---|---|---|
General |
|||||
NodeReference |
1 |
CN |
0.5 |
||
NonEmptyString |
1 |
CN |
0.5 |
||
NonEmptyString |
1 |
CN |
0.5 |
||
anyURI |
1 |
MN |
0.5 |
||
|
Service |
0..n |
MN |
0.5 |
|
|
Synchronization |
0..1 |
CN |
0.5 |
|
|
NodeHealth |
0..1 |
CN |
0.5 |
|
boolean |
1 |
MN |
0.5 |
||
boolean |
1 |
MN |
0.5 |
||
NodeType |
1 |
CN |
0.5 |
||
Environment |
1 |
CN |
0.5 |
||
Services |
|||||
ServiceName |
1 |
MN |
0.5 |
||
string |
1 |
MN |
0.5 |
||
boolean |
0..1 |
MN |
0.5 |
||
|
ServiceMethod |
0..n |
MN |
0.5 |
|
NMToken |
0..1 |
CN |
0.5 |
||
xs:token |
1 |
MN |
0.5 |
||
boolean |
1 |
MN |
0.5 |
||
Synchronization |
|||||
dateTime |
1 |
CN |
0.5 |
||
dateTime |
1 |
CN |
0.5 |
||
Schedule |
1 |
CN |
0.5 |
||
|
crontabEntryType |
1 |
CN |
0.5 |
|
|
crontabEntryType |
1 |
CN |
0.5 |
|
|
crontabEntryType |
1 |
CN |
0.5 |
|
|
crontabEntryType |
1 |
CN |
0.5 |
|
|
crontabEntryType |
1 |
CN |
0.5 |
|
|
crontabEntryType |
1 |
CN |
0.5 |
|
|
crontabEntryType |
1 |
CN |
0.5 |
|
Health |
|||||
|
Ping |
1 |
CN |
0.5 |
|
|
Status |
1 |
CN |
0.5 |
|
State |
1 |
CN |
0.5 |
||
boolean |
0..1 |
CN |
0.5 |
||
dateTime |
0..1 |
CN |
0.5 |
||
boolean |
0..1 |
CN |
0.5 |
||
dateTime |
0..1 |
CN |
0.5 |
NodeList.
identifier
¶A unique identifier for the node of type NodeReference. This may initially be the same as the baseURL, however this value should not change for future implementations of the same node, whereas the baseURL may change in the future.
1
NodeReference
CN
0.5
NodeList.
name
¶A human readable name for the node. (The name of the node is being used in Mercury currently to assign a path, so the format should be consistent with dataone directory naming conventions).
1
NonEmptyString
CN
0.5
NodeList.
description
¶Description of content maintained by this node and any other free style notes.
1
NonEmptyString
CN
0.5
NodeList.
baseURL
¶Of type anyURI, it is the base URL that is complete enough with the service.method.rest attribute to create a valid call.
1
anyURI
CN
0.5
NodeList.
replicate
¶A flag to tell the CN whether or not to replicate MN data.
1
boolean
CN
0.5
NodeList.
synchronize
¶A flag to tell the CN to synchronize or not. Applies to CNs and MNs (although CNs are presumed to synchronize)
1
boolean
CN
0.5
NodeList.
type
¶The type of node in the dataONE world this one is. Legal values are “MN” and “CN”.
1
NodeType
CN
0.5
NodeList.
environment
¶The systems environment the node belongs to. Legal values are “dev”, “test”, “staging”, and “prod”.
1
Environment
CN
0.5
services.
name
¶The name of the service exposed by the node
1
ServiceName
CN
0.5
services.
version
¶The version of the service implemented. Since not all member nodes can be orchestrated to migrate versions simultaneously, the version is needed to ensure business continuity in the eventuality of dataone-service-api upgrades.
1
string
CN
0.5
services.
available
¶A flag to indicate whether or not the service is available. Determined by the CN.
0..1
boolean
CN
0.5
services.method.
name
¶the name of the method implemented by the service
0..1
NMToken
CN
0.5
services.method.
rest
¶the rest path, relative to the baseURL of the node, that calls the method
1
xs:token
CN
0.5
services.method.
implemented
¶A flag to indicate if this method is implemented on the node. Determined by the MN through the addCapabilities method.
1
boolean
CN
0.5
synchronization.
lastHarvested
¶Set by a CN, contains the time of last MN-synchronization with a CN. The dateTime is taken from the frame of reference of the member node, that is to say, it uses the latest modification date from the objects harvested.
1
dateTime
CN
0.5
synchronization.
lastCompleteHarvest
¶Set by a CN, contains the time of the last complete harvest from a MN. A complete harvest is a full re-harvesting from a member node not relying on last harvest time. This value of this field should always be the same or earlier than the lastHarvested field.
1
dateTime
CN
0.5
synchronization.
schedule
¶a set of numerical list or range values used to set the synchronization schedule with a MN, following crontab formatting rules. See wikipedia entry for a popular, if not technical, explanation of crobtab http://en.wikipedia.org/wiki/Cron.
1
Schedule
CN
0.5
health.
state
¶The state of health of the node, based on ping and status calls. Legal values are “up”, “down”, “unknown”.
1
State
CN
0.5
health.ping.
success
¶A flag showing whether the last mn_health.ping was successful or not.
0..1
boolean
CN
0.5
health.ping.
lastSuccess
¶The time of last successful mn_health.ping to the node.
0..1
dateTime
CN
0.5
health.status.
success
¶A flag showing whether the last mn_health.status method call was successful or not.
0..1
boolean
CN
0.5
health.status.
dateChecked
¶The time of the last mn_health.status call to the node.
0..1
dateTime
CN
0.5
The object format in protocol buffer format A set of values that describe a node, its Internet location, the services it supports and its replication policy.
message Node
{
required NodeReference identifier = 1;
required NonEmptyString name = 2;
required NonEmptyString description = 3;
required anyURI baseURL = 4;
repeated Service services = 5;
optional Synchronization synchronization = 6;
optional NodeHealth health = 7;
required boolean replicate = 8;
required boolean synchronize = 9;
required NMToken(string) type = 10;
message Service
{
required ServiceName name = 0;
required string version = 1;
boolean available = 2;
repeated ServiceMethod method = 3;
message ServiceMethod
{
optional NMToken name = 0;
required xs:token rest = 1;
required boolean implemented = 2;
}
}
message Synchronization
{
required dateTime lastHarvested = 0;
required dateTime lastCompleteHarvest = 1;
required Schedule schedule = 2;
message Schedule
{
required crontabEntryType sec = 0;
required crontabEntryType min = 1;
required crontabEntryType hour = 2;
required crontabEntryType mday = 3;
required crontabEntryType mon = 4;
required crontabEntryType year = 5;
required crontabEntryType wday = 6;
}
}
message NodeHealth
{
required Ping ping = 0;
required Status status = 1;
required State state = 2;
message Ping
{
optional boolean success = 0;
optional dateTime lastSuccess = 1;
}
message Status
{
optional boolean success = 0;
optional dateTime dateChecked = 1;
}
enum State
{
UP = 0;
DOWN = 1;
UNKNOWN = 2;
}
}
}