Always execute metadata commands asynchronously in controller
Open, HighPublic
Actions

Assigned To

None

Authored By

	kgaillot
	Jan 31 2024, 4:31 PM

Description

Currently, the controller initiates metadata actions on its own (not being told by the scheduler, and not executed via the executor, as all other resource actions are). Also, it executes metadata actions asynchronously when possible, but there are situations where it has to execute them synchronously. This has significant drawbacks:

Metadata actions are the only actions executed as the hacluster user instead of root.
Metadata actions are executed with a hardcoded 30s timeout, and ignore any timeout ignored in the CIB.
If any asynchronous action is pending when a synchronous metadata call is made, the asynchronous action could complete while waiting for the synchronous call, causing its SIGCHLD to be ignored and leaving it as a zombie process.

The scheduler should schedule a metadata action, as a normal resource action, when any other resource action requires metadata (see crm_op_needs_metadata()), and order the metadata action before the other one(s). There only needs to be a single metadata action per agent (not per resource). The action would be added to the graph normally, and the DC would farm it out to controllers normally.

Considerations:

Metadata actions should always assume requires="none" (that is, not require quorum or fencing).
Start and probe actions always require fresh metadata (not cached), so metadata actions needed for those should be marked in some way.
For this task, metadata actions needed for actions on a Pacemaker Remote node should be scheduled on the cluster node hosting the connection, not the remote node. (Remote metadata poses enough problems to merit its own project, T359.)

When a controller processes a metadata action, and it isn't marked as above, the controller should consider the action successful immediately (like a pseudo-op) if the metadata is already cached. Otherwise, it would send the metadata action to its local executor as usual, and cache the metadata on success.

Once done, update Pacemaker Explained re: meta-data "is not performed as root".

Related Objects
Search...

Status	Assigned	Task
		Restricted Maniphest Task
Open	None	T770 Always execute metadata commands asynchronously in controller
		Restricted Maniphest Task

Event Timeline

kgaillot triaged this task as Normal priority.Jan 31 2024, 4:31 PM

kgaillot created this task.

kgaillot created this object with edit policy "Restricted Project (Project)".

kgaillot added a parent task: Restricted Maniphest Task.

kgaillot added a subtask: Restricted Maniphest Task.

Why is this a subtask of a task that's already complete (T469)?

The "support or drop" part -- if we had decided to support it, this would have been a subtask. But we decided to drop it.

This will also have the benefit of getting fresh metadata only once per transition for a given resource agent. Currently, the agent meta-data command is invoked for every resource start, even if multiple resources share the same agent.

kgaillot updated the task description. (Show Details)Aug 28 2024, 4:40 PM

kgaillot added a project: Restricted Project.

kgaillot updated the task description. (Show Details)Sep 23 2024, 1:20 PM

kgaillot updated the task description. (Show Details)Dec 23 2024, 4:35 PM

kgaillot updated the task description. (Show Details)

kgaillot updated the task description. (Show Details)Dec 23 2024, 5:02 PM

kgaillot updated the task description. (Show Details)

kgaillot raised the priority of this task from Normal to High.Jan 2 2025, 4:27 PM

Always execute metadata commands asynchronously in controllerOpen, HighPublicActions

Description

Related ObjectsSearch...

Event Timeline

Always execute metadata commands asynchronously in controller
Open, HighPublic
Actions

Related Objects
Search...