Adds observation term history support to Observation Manager #1439

jtigue-bdai · 2024-11-19T19:20:05Z

Description

This PR adds observation history by adding configuration parameters to ObservationTerms and having the ObservationManager handling the collection and storage of the histories via CircularBuffers.

Fixes #1208

Type of change

New feature (non-breaking change which adds functionality)

Checklist

I have run the pre-commit checks with ./isaaclab.sh --format
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file
I have added my name to the CONTRIBUTORS.md or my name already exists there

* add history cfg option to observation manager * Revert "add history cfg option to observation manager" This reverts commit 4b00994b8c9ce6e58779150dc42a707b0bc6701f. * add flatten history option to observation manager * run formatter * fix docstring

KyleM73 · 2024-11-19T23:10:46Z

From the discussion in #1208, this PR introduces observation history tracking per-term. It was discussed that per-group tracking should also be implemented. In theory, the same effect can be had by giving all obs terms the same history length with the config provided in this PR. However, I want to reiterate what I think are the two best arguments for implementing per-group obs histories as well, and would be happy to hear feedback:

When writing deployment code, it is often more straightforward and natural to save the entire observation to a 2D buffer of size (H,D) for H history and D observation dimension. In the current implementation, separate buffers would need to be kept for each observation term as defined by Isaac Lab. However, it may not be trivial to do so, whereas to evaluate RL policies on hardware it is guaranteed that the observations will be stacked into the full length-D array at some point, making saving them as a per-group history easier to replicate on hardware.
Providing an implementation at the group level is a cleaner interface for developers than having to change the history length for each obs term. Especially when subclassing observation group configs from parent classes, it would be much nicer to define the history in the observation group post init (the interface currently used for enable_corruption and concatenate_terms, etc.) as opposed to having to change it for every term. Having used only my own version of the code in this PR for the last ~6 months, I have inadvertently missed changing the history length for one or two obs terms many times, requiring restarting the run. This bug can be avoided by optionally choosing to only change the history length in a single location, namely in the obs group post init.

The PR as-is resolves per-term observation history tracking. If it is decided that Isaac Lab should only support per-term observation histories then I am happy to close my comment/proposal with this PR. Otherwise, I'm happy to help implement per-group history tracking as well.

Thanks for putting this together James!

jtigue-bdai · 2024-11-20T02:36:37Z

I agree @KyleM73,the current implementation only covers term level that we talked about (in a slightly different implementation). I think having the option for group level is still worthwhile. I also agree we should be able to apply group level history via an overall group history_length. I was thinking, if set, the group level history length would override the term history length within that group.

aravindev · 2024-11-28T16:38:04Z

source/extensions/omni.isaac.lab/omni/isaac/lab/utils/buffers/circular_buffer.py

@@ -109,7 +121,7 @@ def append(self, data: torch.Tensor):
        # at the fist call, initialize the buffer
        if self._buffer is None:
            self._pointer = -1
-            self._buffer = torch.empty((self.max_length, *data.shape), dtype=data.dtype, device=self._device)
+            self._buffer = torch.zeros((self.max_length, *data.shape), dtype=data.dtype, device=self._device)


@jtigue-bdai Isn't it better to initialize the buffer to the latest data on the first append or right after a reset?

Otherwise, we assume that zero is a valid value for the observation, which may not always be true.
For example, if we are gathering a history of, lets say, the gravitational force, which may be defined as a strictly negative value, having a buffer filled with zeros as previous observations may not be within the expected distribution.

I would suggest that, during reset or at first init, all the indices in the history is initialized to the most recent data, possibly the one being passed into append()

Thats a good point. I think you are right about filling on the first append.

aravindev · 2024-11-28T18:43:18Z

source/extensions/omni.isaac.lab/omni/isaac/lab/managers/observation_manager.py

@@ -14,6 +14,7 @@
 from typing import TYPE_CHECKING

 from omni.isaac.lab.utils import modifiers
+from omni.isaac.lab.utils.buffers import CircularBuffer

 from .manager_base import ManagerBase, ManagerTermBase
 from .manager_term_cfg import ObservationGroupCfg, ObservationTermCfg


I cannot find how to provide a suggestion to lines that are not part of the changelist, so here we go.

In the method def __str__(self) -> str:,

You may need to re-compute the observation dimensions depending on whether flattening is enabled or not.
I see two deficiencies which can cause confusions:

Currently, the printed summary does not handle history length while computing the observation dimension.

The self._group_obs_dim is wrong and does not correspond to the actual observation dimension if history is used. This is quite critical to be fixed IMO.

The shape of the observation group printed in the summary is wrong.

Hope this helps!

Thanks for the catch I will address this.

aravindev · 2024-11-28T20:38:20Z

source/extensions/omni.isaac.lab/omni/isaac/lab/managers/observation_manager.py

+                # create history buffers
+                if term_cfg.history_length > 0:
+                    group_entry_history_buffer[term_name] = CircularBuffer(
+                        max_len=term_cfg.history_length, batch_size=self._env.num_envs, device=self._env.device
+                    )
                # call function the first time to fill up dimensions
                obs_dims = tuple(term_cfg.func(self._env, **term_cfg.params).shape)


Suggested change

# create history buffers

if term_cfg.history_length > 0:

group_entry_history_buffer[term_name] = CircularBuffer(

max_len=term_cfg.history_length, batch_size=self._env.num_envs, device=self._env.device

)

# call function the first time to fill up dimensions

obs_dims = tuple(term_cfg.func(self._env, **term_cfg.params).shape)

# call function the first time to fill up dimensions

obs_dims = tuple(term_cfg.func(self._env, **term_cfg.params).shape)

# create history buffers

if term_cfg.history_length > 0:

group_entry_history_buffer[term_name] = CircularBuffer(

max_len=term_cfg.history_length,

batch_size=self._env.num_envs,

device=self._env.device,

)

obs_dims = (obs_dims[0], term_cfg.history_length * obs_dims[1], *obs_dims[2:])

if term_cfg.flatten_history_dim:

obs_dims = (obs_dims[0], np.prod(obs_dims[1:]),)

This populates the correct _group_obs_term_dim into the dictionary. Later, this is used to compute the _group_obs_dim however, I assume that its computation does not need any changes.

kousheekc · 2024-12-13T08:00:19Z

Hello, thanks for this PR, having a group level history length is exactly what I needed for my use case. I had implemented a hacky solution myself that did the job but did not notice this PR, so will be switching to this implementation now. However, I was wondering if it may be worth adding a specific task as a demo example to demonstrate this feature. Or would it make more sense to have a separate PR for that? I have some ideas in mind and I would be glad to support for this part if required.

fan-ziqi · 2024-12-14T02:33:15Z

I also need to use historical observations. Currently, I've implemented historical observations myself in rsl_rl, which is quite cumbersome. Looking forward to the merge of this PR!

Signed-off-by: Kelly Guo <[email protected]>

jtigue-bdai self-assigned this Nov 19, 2024

jtigue-bdai added 2 commits November 19, 2024 14:38

add full buffer return to CircularBuffer

f7ede99

format

9342d25

jtigue-bdai added enhancement New feature or request dev team Issue or pull request created by the dev team labels Nov 19, 2024

fix reset observation history

62ea6a6

jtigue-bdai added 3 commits November 21, 2024 10:41

fix buffer reset if buffer is None

8646503

add group history overrides

a4d4818

update changelog and extension version

da68a2b

jtigue-bdai marked this pull request as ready for review November 22, 2024 16:52

jtigue-bdai requested review from Mayankm96, jsmith-bdai, Dhoeller19 and kellyguo11 as code owners November 22, 2024 16:52

aravindev suggested changes Nov 28, 2024

View reviewed changes

aravindev reviewed Nov 28, 2024

View reviewed changes

jtigue-bdai added 3 commits December 5, 2024 10:44

fill buffer on first append after reset

d52992f

fix observation dimension with history and verify __str__ is correct

0f4a3a0

update docstrings

8b66c97

kellyguo11 changed the title ~~Add observation term history support to Observation Manager~~ Adds observation term history support to Observation Manager Dec 15, 2024

Merge branch 'main' into jat/feat/obs_term_history

0cd10af

Signed-off-by: Kelly Guo <[email protected]>

kellyguo11 merged commit f7b59b3 into main Dec 16, 2024
4 of 5 checks passed

kellyguo11 deleted the jat/feat/obs_term_history branch December 16, 2024 03:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds observation term history support to Observation Manager #1439

Adds observation term history support to Observation Manager #1439

jtigue-bdai commented Nov 19, 2024 •

edited

Loading

KyleM73 commented Nov 19, 2024

jtigue-bdai commented Nov 20, 2024

aravindev Nov 28, 2024

jtigue-bdai Dec 5, 2024

aravindev Nov 28, 2024

jtigue-bdai Dec 5, 2024

aravindev Nov 28, 2024 •

edited

Loading

kousheekc commented Dec 13, 2024 •

edited

Loading

fan-ziqi commented Dec 14, 2024

Adds observation term history support to Observation Manager #1439

Adds observation term history support to Observation Manager #1439

Conversation

jtigue-bdai commented Nov 19, 2024 • edited Loading

Description

Type of change

Checklist

KyleM73 commented Nov 19, 2024

jtigue-bdai commented Nov 20, 2024

aravindev Nov 28, 2024

Choose a reason for hiding this comment

jtigue-bdai Dec 5, 2024

Choose a reason for hiding this comment

aravindev Nov 28, 2024

Choose a reason for hiding this comment

jtigue-bdai Dec 5, 2024

Choose a reason for hiding this comment

aravindev Nov 28, 2024 • edited Loading

Choose a reason for hiding this comment

kousheekc commented Dec 13, 2024 • edited Loading

fan-ziqi commented Dec 14, 2024

jtigue-bdai commented Nov 19, 2024 •

edited

Loading

aravindev Nov 28, 2024 •

edited

Loading

kousheekc commented Dec 13, 2024 •

edited

Loading