Configuring Platform Agents on Microsoft Cluster Service

on-site-related topic

Note: Scheduling workload or using file events on Microsoft Windows Process Servers is only possible when your license includes a non-zero value for the license key ProcessServerService.OS.limit. The value of this key determines how many active Platform Agents (across all platforms) you can run. You may create an unlimited number of monitoring-only Platform Agents.

Using Microsoft Cluster nodes, you can create a Platform Agent that has the same availability characteristics as the Microsoft Cluster. When the node running the Platform Agent fails or becomes unreachable, Microsoft Cluster Service moves its service to a backup node.

Cluster Fail Over

  • Processes that are executing when the cluster node moves will be reported as Unknown; the system does not know what happened to them.
  • A process that finishes immediately before the cluster node moves gets a proper status update to Completed, Error, or any other final status (except Unknown), provided the job processor still had access to the physical disk when the process finished. If the physical disk resource was already disabled or moved to the winning node in the cluster, the process status is lost and the process gets status Unknown.
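
The two rules above can be summarized as a small decision function. This is only an illustrative sketch of the documented behavior, not the product's implementation; the function name, its parameters, and the Status enumeration are hypothetical.

```python
from enum import Enum


class Status(Enum):
    # Only a few statuses are listed here for illustration.
    COMPLETED = "Completed"
    ERROR = "Error"
    UNKNOWN = "Unknown"


def status_after_failover(finished_before_move: bool,
                          disk_was_accessible: bool,
                          final_status: Status) -> Status:
    """Return the status a process is reported with after a cluster move.

    finished_before_move: the process ended before the node moved.
    disk_was_accessible:  the physical disk was still reachable when it ended.
    final_status:         the status the process actually ended with.
    """
    if finished_before_move and disk_was_accessible:
        # The final status could still be written to the shared disk.
        return final_status
    # Still executing, or the status update was lost with the disk.
    return Status.UNKNOWN
```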

Platform Agent Network Timeout

A network timeout has been introduced to solve networking issues. When no message has been received within a period of at most twice the MessageInterval, the Platform Agent attempts to reconnect.
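
The timeout rule can be pictured with the following minimal sketch. It assumes that MessageInterval is expressed in seconds and that the reconnect is triggered once the silence reaches twice that interval; the function and parameter names are hypothetical, and the exact comparison used by the Platform Agent is not documented here.

```python
def should_reconnect(seconds_since_last_message: float,
                     message_interval: float) -> bool:
    """Return True when the silence has lasted twice the MessageInterval.

    Assumption: both values are in seconds, and reaching the limit
    exactly already counts as a timeout.
    """
    return seconds_since_last_message >= 2 * message_interval
```

For example, with a MessageInterval of 5 seconds, this sketch signals a reconnect after 10 seconds without a message.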

Cluster Services and Auto Update of the Platform Agent

Auto-update will update one cluster node at a time. Suppose that cluster node A is in operation when the system is updated. Only the service on node A will be updated. Only when the cluster fails over to node B will the service on node B be updated.

Installation

  • Install the Platform Agent in the same directory on the Windows disk drive or another "fixed" path that is identical on all nodes. For example, if you install the Platform Agent in C:\Program Files\Redwood\Agent then you must install it in this path on every node.
  • On-site: Create an instance. Give this instance the same name and port number on all nodes.
  • Cloud: Create an instance. Give this instance the same name on all nodes.
  • Do not register the Platform Agent.

After you have installed the Platform Agents, ensure that the configuration data is identical on all nodes, in particular the secret. The standard installation generates a different secret on each node, so you need to override these. The configuration data can be found in the net subdirectory of the directory where you installed the Platform Agent.

It is probably easiest to stop all Platform Agent instances and copy the ${InstallDir}\net\instance\${Instance} directory from one node to the other nodes. The DataRootDirectory, where configuration and output files are stored, and the installation directory must not reside on a NAS (NFS or SMB share). SAN file systems may be considered local (if they are mounted as iSCSI, for example).
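
To check that the instance directories really are identical after copying, you could compare file digests. The following is a minimal sketch, assuming you have a readable copy of each node's instance directory available as a local path; the function names are hypothetical and nothing is assumed about the file names inside the directory.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def directory_digest(root: Path) -> Dict[str, str]:
    """Map each file path (relative to root) to the SHA-256 of its contents."""
    digests = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            relative = path.relative_to(root).as_posix()
            digests[relative] = hashlib.sha256(path.read_bytes()).hexdigest()
    return digests


def differing_files(node_a: Path, node_b: Path) -> List[str]:
    """List files that are missing on one side or differ in content."""
    a = directory_digest(node_a)
    b = directory_digest(node_b)
    return sorted(name for name in a.keys() | b.keys()
                  if a.get(name) != b.get(name))
```

An empty result means both copies contain the same files with the same contents; any file listed, such as the one holding the secret, still needs to be copied.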

Note: Redwood strongly recommends against installing the software on a networked file system. If this recommendation is ignored, and you have random errors that Redwood believes are caused by the NAS (NFS or SMB share) and cannot reproduce on local storage, you will be required to demonstrate that the issue can be reproduced when installed on local storage. The resolution to this issue may require that you reinstall on local storage.

Prerequisites

To create a service that is managed by the Microsoft Cluster Service you need:

  • A Microsoft Windows Server cluster consisting of at least two nodes.
  • A shared physical drive that is available on those nodes.

Procedure

  1. Install the Redwood Server Platform Agent on all nodes in the cluster, following the guidelines in the Installation section above; see Creating a Microsoft Windows Process Server (Advanced) for details. Ensure you create a Process Server in the central Redwood Server that uses the cluster IP address and has DataRootDirectory set to a directory on the shared physical drive.
  2. Use the Microsoft Cluster Administrator software to add a new resource that is in the same group as the cluster IP address and shared physical disk. Ensure that the Platform Agent is made dependent on both resources.
  3. Start the Platform Agent (scheduler) resource. The cluster group will show that the scheduler is offline. Bring the Platform Agent online. Move the cluster group to a different node to verify that the Platform Agent service can run on all nodes.
  4. Start the Process Server. Using the Redwood Server UI, start the Process Server. After a few moments the scheduler should go to the Running status. If not, check the Operator Messages. To verify that everything works, submit and run a process. Move the cluster group to a different node. Run another process. Both processes should run successfully; check that they ran on different nodes.

Values

Here is an example of the data to fill in:

  • New Resource
  Name: Scheduler Platform Agent <instance>
  Description: Redwood Server Scheduler Platform Agent
  Resource Type: Generic Service
  Group: Suggested: same group as the IP address and physical disk
  • Possible Owners
  Add the nodes in the cluster that have the Platform Agent installed.
  • Dependencies
  Add the Cluster IP Address and the physical disk.
  • Generic Service Parameters
  Service name: jcs_<instance>   (must already be present on the nodes set in window 2, Possible Owners)
  Start parameters: Empty.
  • Registry Replication is not needed; choose Finish.
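
For a concrete instance, the values above can be collected in a simple record. This sketch only fills in the fields listed in this section for a given instance name; it does not call any cluster API, and the function name and the example instance and node names are hypothetical.

```python
from typing import Dict, List


def generic_service_resource(instance: str, nodes: List[str]) -> Dict[str, object]:
    """Build the wizard values for one Platform Agent instance."""
    return {
        "Name": f"Scheduler Platform Agent {instance}",
        "Description": "Redwood Server Scheduler Platform Agent",
        "Resource Type": "Generic Service",
        # Nodes in the cluster that have the Platform Agent installed.
        "Possible Owners": list(nodes),
        "Dependencies": ["Cluster IP Address", "Physical Disk"],
        # The service must already exist on every possible owner.
        "Service name": f"jcs_{instance}",
        "Start parameters": "",
    }


# Hypothetical instance and node names, for illustration only.
values = generic_service_resource("prod", ["NODE1", "NODE2"])
```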

See Also