Update agent methods to use the selected R interpreter consistently #2501

sagerb · 2024-12-17T20:57:48Z

Intent

The changes within this PR are designed to implement a consistent usage of the active version of R, when running within Positron. The changes apply to VS Code as well, however, until we implement some of the other issues within the associated epic to gather configured versions, Positron will be the place which exposes the functionality the cleanest.

The associated issue outlines this logic change for determining the active R version to use:

Use the passed-in path for the R interpreter if supplied. It must exist and be validated)
- If it doesn't exist, fall through to the next option
- If it does exist but is not a valid R interpreter (use --version output to validate), then return an error
Use the first R executable found on PATH; it must exist and be validated.
- If not valid or doesn't exist, return an error

Once the interpreter is identified, it should be used for creating/retrieving the lock file, as well as when running commands.

Resolves #2500

Type of Change

- Bug Fix
- New Feature
- Breaking Change
- Documentation
- Refactor
- Tooling

Approach

The existing codebase needed to be more consistent in determining which path was to be used when invoking the R interpreter. There was no clear, specific location where the logic associated with the determination algorithm was isolated. As this task (which is part of the epic hUse the active versions of interpreters instead of the first version on PATH) required changes to this logic, a refactor seemed the best approach to get consistent behavior in place and have a logical location to modify it if we needed to.

The main refactoring involved with this change involved removing the R interpreter-related functionality from the inspect package and adding it to a newly created package, interpreters. I intend to do the same type of refactoring for the other interpreters within this epic (Python and Quato), so this package will be expanded upon soon.

Once the existing code was refactored, it was updated to implement the logic listed in the issue (#2500).

In addition, the functionality of determining if there is a dependency on R was moved within the interpreters package. It had previously lived within the initialization code, but that appeared to have been implemented as a "guard" before usage of the inspect package. Once the changes were made, I moved the requiresR method into the inspect package.

With this in place, all other usages within the code base for an R interpreter were refactored to use the new interpreters package. The need for these other areas of the codebase to access this functionality was a major factor in why I split the functionality out into a separate package. It just didn't seem "right" to have the other areas depend upon the inspect package when they simply wanted an interpreter.

Many unit tests were either moved or updated to work with the new package. I've been careful to maintain the intention of the previous tests and add new ones.

User Impact

The concept we're trying to maintain here is that when a user has set up their environment within an IDE for a particular version of R, they expect that the same version will be used for all operations of the publisher extension. While we have not yet implemented all of the hooks to gather the "active" version of R, the API requests are in place so that we can set that value from the extension code.

While many of the other changes will not be visible to the user, a user will see the consistent behavior when creating a deployment for a project; the configuration file created will have an R section which properly reflects the new approach.

Automated Tests

Directions for Reviewers

Unfortunately, this PR covers more changes than we usually try to have within a single PR. Unfortunately, the impact to the unit tests greatly exploded the changes required. I'd suggest looking at the changes by first examining the main functionality which was refactored. This can be seen by viewing the changes to inspect/r.go as functionality was moved into interpreters/r.go. Changes to the other files fall into two buckets:

calling this new functionality to get the active R interpreter
unit test updates

I recommend using Positron for the validation process, with a configuration that includes two or more versions of R on the system. With an existing R project open, you should be able to create a deployment for a project and have its configuration file populated with an R section reflecting the values of the selected version. Switching the version via Positron and then creating a new deployment should produce a configuration that contains an R section reflecting the active version selected.

Checklist

I have updated CHANGELOG.md to cover notable changes.

…nting new usage priority of active R environment

…rpreter-with-new-active-as-priority

…il needed)

sagerb · 2024-12-19T01:39:10Z

Unit tests have been added and CI has been fixed. Ready for further review and testing.

marcosnav

I'm still reviewing, great work so far!

I have a couple suggestions that I figured to share those first since that could require some adjustments.

marcosnav · 2024-12-19T12:57:18Z

internal/interpreters/r.go

+var _ RInterpreter = &defaultRInterpreter{}
+
+var NewRInterpreter = RInterpreterFactory
+
+func RInterpreterFactory(base util.AbsolutePath,


Is there a reason to have these three instances? Also, having an exported factory-constructor introduces the risk of overriding it and seems unnecessary.

Suggestion, only have a single contructor NewRInterpreter...

func NewRInterpreter(base util.AbsolutePath, rExecutableParam util.Path, log logging.Logger) RInterpreter {

I understand that the current implementation helps on testing and setting temporary mocks, but we should avoid having global instances that can be overriden so easily.

In this case, consider dependency injection, that can be done in constructors too. For example, in the case of PostPackagesRScanHandler, it is possible to have an RInterpreter factory as dependency and property of it, instead of calling it directly, and it still can be mocked on tests.

type PostPackagesRScanHandler struct { base util.AbsolutePath log logging.Logger interpreterFactory func(base util.AbsolutePath, rExecutableParam util.Path, log logging.Logger) RInterpreter } func NewPostPackagesRScanHandler(base util.AbsolutePath, log logging.Logger) *PostPackagesRScanHandler { return &PostPackagesRScanHandler{ base: base, log: log, interpreterFactory: func(dir util.AbsolutePath, rPath util.Path, log logging.Logger) RInterpreter { return interpreters.NewRInterpreter(projectDir, rPath, log) }, } } func (h *PostPackagesRScanHandler) ServeHTTP(w http.ResponseWriter, req *http.Request) { // ... rInterpreter := h.interpreterFactory(projectDir, rPath, h.log) // ...

Looks like I tripped across the wrong pattern in Mike's code (internal/initialize/initialize.go). I didn't much care for it either, but I was trying to be consistent with the prior art. Looking through the rest of the code, it looks like this was the only place he used this technique.

I'll start refactoring this into dependency injection and getting everything to work using it.

Oh, I see. The initialize.go file was set up using exported methods, which did not use the builder pattern. That's why Mike used the global variable approach.

marcosnav · 2024-12-19T13:07:14Z

internal/interpreters/r.go

+//
+// Errors are taken care of internally and determine the setting or non-setting
+// of the attributes.
+func (i *defaultRInterpreter) Init() error {


IMO, this should be part of the constructor, it can still be it's own method, but by running this logic in the constructor you'll get a couple goodies:

You won't have to check for i.initialized in every method.

Won't need to remember to run Init() after every new instance.

E.g:

func NewRInterpreter(base util.AbsolutePath, rExecutableParam util.Path, log logging.Logger) (RInterpreter, error) { interpreter := &defaultRInterpreter{ executor: executor.NewExecutor(), // ... } err := interpreter.init() if err != nil { return nil, err } return interpreter, nil }

Yes, I had this in the original approach. Something that I can't recall right now drove me in this direction, but I'm not sure it is applicable anymore. I'll take another look and see if I have any concerns.

OK, figured out that the reason I did this was so that we could update the structure assignment values ahead of the init function being run. This was allowing the unit testing to call the factory function, modify whatever they needed to do and then invoke init.

But, I've gone ahead and modified it to now use additional dependency injections for the functionality used internally. This caused some complications, but I do allow nil to be passed in and provide default providers for the functionality. So while there may be a few nils added to the inspect and interpreter factory methods (and perhaps a few more dependent factories), I think it is consistent.

internal/interpreters/r_test.go

marcosnav

I finished reviewing, no more observations aside from what was already mentioned.

This is great work and I really like the approach you took on this one! 🏅

marcosnav

Great work! There seem to be something to tweak for windows tests, feel free to merge once that passes.

Validated with Positron.

Started with R 4.3, configuration reflected the selected interpreter

Changed to 4.4, new deployment configuration reflected the change

sagerb · 2024-12-21T07:49:42Z

The last commit places an await when calling the click action, where we were seeing the race condition and lack of selection for the deployment.

The runs within https://github.com/posit-dev/publisher/actions/runs/12440697322 show a 24 out of 25 success rate of running the PR's codebase through CI. The one failure does not look like the same kind of failure we were diagnosing, and at initial examination, it looks more like a failure within the test environment itself.

I'm merging this PR with confidence.

sagerb added 16 commits December 17, 2024 12:38

refactor R executable into separate package for re-use, while impleme…

d683678

…nting new usage priority of active R environment

Merge remote-tracking branch 'origin/main' into sagerb-extract-r-inte…

9fc2df1

…rpreter-with-new-active-as-priority

fix linting errors

3b57cb7

fix unit tests in CI

9be66c0

trial fix for CI differences

b933e77

trial fix for CI differences

20ecfcf

trial fix for CI differences

18929be

trial fix for CI differences

7d15f48

trial fix for CI differences

c66b066

wait longer for title prompt in CI e2e test

811e4af

lazy load the renv lock file location (and wait to execute renv unt…

78e4468

…il needed)

handle R executable not found - without unit tests

1b1ee2f

return CI test back to original to see if passes

ef9d391

remove temp debug code and adjust unit tests

2af2ace

additional unit tests

d77b562

reset CI test file

4e65fff

sagerb marked this pull request as ready for review December 19, 2024 01:38

sagerb requested review from dotNomad and marcosnav as code owners December 19, 2024 01:38

marcosnav reviewed Dec 19, 2024

View reviewed changes

internal/interpreters/r_test.go Outdated Show resolved Hide resolved

remove unneeded comment

c56914d

marcosnav reviewed Dec 19, 2024

View reviewed changes

sagerb added 3 commits December 19, 2024 22:41

adjust testability to use dependency injection

4b79ef2

remove unneeded comments

4389520

remove unneeded comments

cddc156

marcosnav approved these changes Dec 20, 2024

View reviewed changes

sagerb added 2 commits December 20, 2024 09:05

fix unit test missing mock call

7775a57

fix unit test missing mock call

12947da

sagerb added 7 commits December 20, 2024 09:27

fix unit test missing mock call

8964322

fix unit test missing mock call

5ee4fa0

debugging occasional vscode-ui test failure in CI

33a4886

debugging occasional vscode-ui test failure in CI

0feddff

debugging occasional vscode-ui test failure in CI

a28a3d1

debugging occasional vscode-ui test failure in CI

ed931fb

debugging occasional vscode-ui test failure in CI

818652c

sagerb merged commit 69048f2 into main Dec 21, 2024
14 checks passed

sagerb deleted the sagerb-extract-r-interpreter-with-new-active-as-priority branch December 21, 2024 07:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update agent methods to use the selected R interpreter consistently #2501

Update agent methods to use the selected R interpreter consistently #2501

sagerb commented Dec 17, 2024 •

edited

Loading

sagerb commented Dec 19, 2024

marcosnav left a comment

marcosnav Dec 19, 2024

sagerb Dec 19, 2024

sagerb Dec 19, 2024

marcosnav Dec 19, 2024

sagerb Dec 19, 2024

sagerb Dec 20, 2024

marcosnav left a comment

marcosnav left a comment

sagerb commented Dec 21, 2024

Update agent methods to use the selected R interpreter consistently #2501

Update agent methods to use the selected R interpreter consistently #2501

Conversation

sagerb commented Dec 17, 2024 • edited Loading

Intent

Type of Change

Approach

User Impact

Automated Tests

Directions for Reviewers

Checklist

sagerb commented Dec 19, 2024

marcosnav left a comment

Choose a reason for hiding this comment

marcosnav Dec 19, 2024

Choose a reason for hiding this comment

sagerb Dec 19, 2024

Choose a reason for hiding this comment

sagerb Dec 19, 2024

Choose a reason for hiding this comment

marcosnav Dec 19, 2024

Choose a reason for hiding this comment

sagerb Dec 19, 2024

Choose a reason for hiding this comment

sagerb Dec 20, 2024

Choose a reason for hiding this comment

marcosnav left a comment

Choose a reason for hiding this comment

marcosnav left a comment

Choose a reason for hiding this comment

Started with R 4.3, configuration reflected the selected interpreter

Changed to 4.4, new deployment configuration reflected the change

sagerb commented Dec 21, 2024

sagerb commented Dec 17, 2024 •

edited

Loading