Task scheduling API #25

luca-della-vedova · 2022-06-21T12:10:04Z

New feature implementation

Implemented feature

Closes #16.

This PR adds an API to register / remove task schedules.
The implementation on the backend side will be using the rmf_scheduler node, which has (internal) API in the rmf_scheduler_msgs.

Implementation description

Following both the issue and the rmf_scheduler implementation, this API uses cron strings to describe the schedule for a straightforward integration with the scheduler.
An alternative approach could involve having a more readable API, at the expense of having to add a conversion layer to convert it to a cron string.

Services for adding and deleting task schedules have been provided. There should also probably be a "list" service that can be used by frontend / users to read what the current schedules are. This is especially important since schedules are recurring tasks that are persistent and could last indefinitely (i.e. years). They can only be deleted or updated by knowing their unique name so if the API doesn't let the user fetch a list of schedules it might be impossible to update a schedule if the user lost its original name.

Signed-off-by: Luca Della Vedova <[email protected]>

koonpeng

We need at least 3 copies of the same data, in rmf_scheduler, in task scheduler and in api server, so some kind of synchronization is a must.
- I realize the endpoints here and in rmf_scheduler are insufficient to do proper synchronization. rmf_scheduler is missing delete events (which would be simple to add) and the endpoints here is missing query capabilities, this means that the task scheduler and api server must download the entire database periodically.
Since rmf_scheduler is an optional component, the task scheduler must have a way to determine if these endpoints are available.
This would also be a good time to think about authorization, if we do it at the api server, we can only do coarse authz based on roles (i.e. a user with create task role will be able to create/schedule any task regardless of the location, type, fleet etc), fine grained authorization based on the task details can only be done by the task scheduler so we need to have authz data in these schemas.

rmf_api_msgs/schemas/task_schedule.json

rmf_api_msgs/schemas/task_schedule_remove_request.json

Signed-off-by: Luca Della Vedova <[email protected]>

luca-della-vedova · 2022-06-27T10:12:07Z

Thanks for the feedback!

We need at least 3 copies of the same data, in rmf_scheduler, in task scheduler and in api server, so some kind of synchronization is a must.

I realize the endpoints here and in rmf_scheduler are insufficient to do proper synchronization. rmf_scheduler is missing delete events (which would be simple to add) and the endpoints here is missing query capabilities, this means that the task scheduler and api server must download the entire database periodically.

I guess my concern for downstream users is that they would have to subscribe to an "event" topic (i.e. for delete events as you mentioned) and make sure no messages are lost, otherwise the databases will not be in sync.
What would the use case for multiple copies of the same data? From what I understood there could be:

Frontend user for UI: When the page is loaded do a call to get the latest state of task schedules so it can be displayed in the UI. I do agree that it would be nice to have a "schedule event" subscription so we could have live updates of the webpage, without having to redownload the whole schedule database.
RMF traffic schedule: It should be agnostic to the database (at least for now) and only see the task when it gets published (again at least for now). In the future we could publish it "a bit in advance" to allow the traffic scheduler to plan for it better and if necessary keep its own database that could be synced in a way similar to above.
Any external user that wants to see the state of the task schedules, a "get task state API" should suffice for them.
RMF scheduler itself: It keeps the actual database.

I thought api server wouldn't need to keep the data and it can just call the matching ROS2 services when an API call is made

Since rmf_scheduler is an optional component, the task scheduler must have a way to determine if these endpoints are available.

I was thinking of using internally the ROS2 service system, where the API server can check if the rmf_scheduler service servers are available and only offer the endpoints if they are.

This would also be a good time to think about authorization, if we do it at the api server, we can only do coarse authz based on roles (i.e. a user with create task role will be able to create/schedule any task regardless of the location, type, fleet etc), fine grained authorization based on the task details can only be done by the task scheduler so we need to have authz data in these schemas.

I have no idea about this sadly, I agree that authorization is an important factor, my guess is that it would apply to everything, not only task schedules, so the authorization data would need to be everywhere, probably as a key in each of the schemas that the api server offers?

koonpeng · 2022-06-28T03:00:52Z

Regarding multiple copies of data, it is a common side effect of the CQRS pattern (somewhat similar to the architecture we have). Because each service needs different sets of data, they usually end up maintaining their own copies. An example is authorization data, rmf scheduler has no support for authorization so it doesn't store the relevant data, so then some other services on top must store and link the authz data to the schedule data. The api server also need to support querying and paging, so unless the backend service also supports them, it must maintain its own database.

Only having one database can easily cause a cascading effect where the lowest layer must store and maintain data for all upper layers. It couples the lower layer api/data contract with the upper layers, making it impossible to do changes without also changing the whole stack, which also creates many versioning, compatibiliy, testing etc problems (because the lower layer never uses these data, but it must maintain api / database compatibility for them).

I was thinking of using internally the ROS2 service system, where the API server can check if the rmf_scheduler service servers are available and only offer the endpoints if they are.

A problem with this is the good old startup race condition issue which can be solved by polling, but we also need to make a distinction between "unavailable" and "not enabled" which conflicts with the polling algorithm. Another option is to always assume rmf scheduler is enabled and just have a time out, minor downside is that we can't tell from the logs the difference between "temporary down" and "never deployed".

I feel that scheduling tasks sounds like quite a "core" feature, it sounds like something that should be available "by default". But then it would also make rmf scheduler non-optional. It wouldn't be a problem if we only add task scheduling to a future major release without foxy support, so I'm not sure if we should bother with optional task scheduling.

Signed-off-by: Luca Della Vedova <[email protected]>

luca-della-vedova · 2024-04-22T07:05:28Z

I believe there is a different upstream implementation so will close this

luca-della-vedova added 3 commits June 21, 2022 20:01

Draft of task scheduling API

844a0d4

Signed-off-by: Luca Della Vedova <[email protected]>

Add task schedule list request and response

6acfd3e

Signed-off-by: Luca Della Vedova <[email protected]>

Add start and finish time for schedule

7ff45bb

Signed-off-by: Luca Della Vedova <[email protected]>

koonpeng reviewed Jun 27, 2022

View reviewed changes

rmf_api_msgs/schemas/task_schedule.json Outdated Show resolved Hide resolved

rmf_api_msgs/schemas/task_schedule_remove_request.json Outdated Show resolved Hide resolved

Update description and required fields

0fa0d79

Signed-off-by: Luca Della Vedova <[email protected]>

Copy paste error in task schedule description

62b0608

Signed-off-by: Luca Della Vedova <[email protected]>

luca-della-vedova closed this Apr 22, 2024

luca-della-vedova deleted the feature/task_schedule_api branch April 22, 2024 07:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Task scheduling API #25

Task scheduling API #25

luca-della-vedova commented Jun 21, 2022

koonpeng left a comment •

edited

Loading

luca-della-vedova commented Jun 27, 2022

koonpeng commented Jun 28, 2022

luca-della-vedova commented Apr 22, 2024

Task scheduling API #25

Task scheduling API #25

Conversation

luca-della-vedova commented Jun 21, 2022

New feature implementation

Implemented feature

Implementation description

koonpeng left a comment • edited Loading

Choose a reason for hiding this comment

luca-della-vedova commented Jun 27, 2022

koonpeng commented Jun 28, 2022

luca-della-vedova commented Apr 22, 2024

koonpeng left a comment •

edited

Loading