Add remaining runtime tests #1824
Conversation
Signed-off-by: Saad Khan <saakhan@ibm.com>
Reviewer's Guide
Refactors runtime recommendation test setup into a reusable helper module and adds several focused runtime behavior tests, while simplifying the original end-to-end test to use the new helpers.
Hey - I've found 2 issues and left some high-level feedback:
- The `_generate_and_list_recommendations_for_tfb` helper tears down the experiment and metric profile but leaves the metadata profile and created layers in place; consider deleting these as well to avoid cross-test interference and accumulated state in the test environment.
- In `test_non_runtime_supported_datasource_logs_message_on_generate`, relying on a fixed `time.sleep(2)` before reading logs may make the test flaky under slow conditions; consider polling for the `RUNTIMES_RECOMMENDATIONS_NOT_AVAILABLE` log entry with a timeout instead of a hard sleep.
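The suggested polling approach could look like the following minimal sketch. `wait_for_log_entry` and its `read_logs` callable are hypothetical names, standing in for however the test harness actually reads the Kruize pod logs:

```python
import time


def wait_for_log_entry(read_logs, pattern, timeout=30.0, interval=0.5):
    """Poll read_logs() until `pattern` appears in its output or `timeout` elapses.

    `read_logs` is a zero-argument callable returning the current log text
    (e.g. a wrapper around `kubectl logs`). Returns True as soon as the
    pattern is found, False if the deadline passes first.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if pattern in read_logs():
            return True
        time.sleep(interval)
    return False
```

A test would then assert `wait_for_log_entry(read_pod_logs, "RUNTIMES_RECOMMENDATIONS_NOT_AVAILABLE", timeout=30)` instead of sleeping for a fixed two seconds, so it passes as soon as the entry appears and only fails after the full timeout.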
Prompt for AI Agents
Please address the comments from this code review:
## Overall Comments
- The `_generate_and_list_recommendations_for_tfb` helper tears down the experiment and metric profile but leaves the metadata profile and created layers in place; consider deleting these as well to avoid cross-test interference and accumulated state in the test environment.
- In `test_non_runtime_supported_datasource_logs_message_on_generate`, relying on a fixed `time.sleep(2)` before reading logs may make the test flaky under slow conditions; consider polling for the `RUNTIMES_RECOMMENDATIONS_NOT_AVAILABLE` log entry with a timeout instead of a hard sleep.
## Individual Comments
### Comment 1
<location path="tests/scripts/local_monitoring_tests/rest_apis/test_generate_recommendation.py" line_range="68-74" />
<code_context>
+ (e.g., -Xgcpolicy:gencon, -Xgcpolicy:balanced, -Xgcpolicy:optthruput).
+ """
+ list_reco_json = _generate_and_list_recommendations_for_tfb(cluster_type)
+ env_values = _env_values(list_reco_json)
- response = delete_metadata_profile(metadata_profile_name)
- print("delete metadata profile = ", response.status_code)
+ if not _contains_any_pattern(env_values, SEMERU_GC_PATTERNS):
+ pytest.skip("Semeru/OpenJ9 GC policy not detected for current workload – skipping Semeru-specific assertion")
- response = create_metadata_profile(metadata_profile_json_file)
- data = response.json()
- print(data["message"])
- assert response.status_code == SUCCESS_STATUS_CODE
- assert data["status"] == SUCCESS_STATUS
+ assert _contains_any_pattern(env_values, SEMERU_GC_PATTERNS), (
+ f"Expected Semeru GC policy flags {SEMERU_GC_PATTERNS} in JAVA_OPTIONS, got: {env_values}"
+ )
+
</code_context>
<issue_to_address>
**issue (testing):** The Semeru GC policy test contains a redundant assertion that can never fail once the skip condition has passed.
Because the test skips when `env_values` does not contain `SEMERU_GC_PATTERNS`, the final assertion on the same condition can never fail and adds no value. Consider either:
- Dropping the `pytest.skip` and relying on the assertion, or
- Keeping the skip but asserting a stronger condition (for example, that a specific expected flag or env var name is present).
This will ensure the Semeru-specific behavior is actually validated when the test runs.
</issue_to_address>
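One way to keep the skip but assert a stronger condition is to report exactly which patterns matched, so the test validates a concrete flag rather than re-checking the skip condition. This is a minimal sketch: `matched_patterns` is a hypothetical helper, and the `SEMERU_GC_PATTERNS` values here merely mirror the flags mentioned in the diff (`-Xgcpolicy:gencon`, `-Xgcpolicy:balanced`, `-Xgcpolicy:optthruput`):

```python
import re

# Patterns mirroring the SEMERU_GC_PATTERNS referenced in the test module.
SEMERU_GC_PATTERNS = [
    r"-Xgcpolicy:gencon",
    r"-Xgcpolicy:balanced",
    r"-Xgcpolicy:optthruput",
]


def matched_patterns(env_values, patterns):
    """Return the subset of `patterns` found in any of the env value strings."""
    return [p for p in patterns if any(re.search(p, v) for v in env_values)]


# Instead of skipping and then re-asserting the same condition, assert on
# the concrete match, e.g.:
#   matched = matched_patterns(env_values, SEMERU_GC_PATTERNS)
#   assert matched, f"Expected one of {SEMERU_GC_PATTERNS} in JAVA_OPTIONS, got: {env_values}"
```

With this shape, a failure message names the missing flags directly, and the redundant second check on the same boolean disappears.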
### Comment 2
<location path="tests/scripts/local_monitoring_tests/rest_apis/test_generate_recommendation.py" line_range="191-200" />
<code_context>
+def test_non_runtime_supported_datasource_logs_message_on_generate(cluster_type):
</code_context>
<issue_to_address>
**suggestion (testing):** The non-runtime-supported datasource test only checks logs; it does not assert the absence of runtime recommendations in the API response.
In `test_non_runtime_supported_datasource_logs_message_on_generate`, you only assert that `RUNTIMES_RECOMMENDATIONS_NOT_AVAILABLE` is logged. To fully verify behavior, also call `list_recommendations(exp_name)` and assert that no GC-related runtime env entries are present (via `_env_values` + `_contains_any_pattern`, or a negative use of `validate_runtime_recommendations_if_present`). This will validate both logging and the recommendation payload for non-runtime-supported datasources.
Suggested implementation:
```python
@pytest.mark.runtimes
def test_non_runtime_supported_datasource_logs_message_on_generate(cluster_type):
"""
Test Description:
For datasources that exist but do NOT support runtime recommendations, generateRecommendations
should still succeed but the server should log RUNTIMES_RECOMMENDATIONS_NOT_AVAILABLE and
the recommendations payload should not contain any runtime GC-related entries.
"""
input_json_file = "../json_files/create_tfb_exp.json"
form_kruize_url(cluster_type)
# Use bulk metadata profile that defines datasources with and without runtime support
delete_and_create_metadata_profile()
# Existing behavior: generate recommendations and assert that the
# RUNTIMES_RECOMMENDATIONS_NOT_AVAILABLE marker is logged.
# NOTE: This assumes the test already creates an experiment and triggers
# generateRecommendations, capturing logs via caplog.
# New behavior: also validate the recommendation payload for this non-runtime-supported datasource.
#
# We expect that there are no GC-related runtime recommendations for this experiment.
# Use the same helper utilities as the other runtime tests to extract env values
# and ensure they don't contain any GC-related patterns.
recommendations = list_recommendations(exp_name)
runtime_env_values = _env_values(recommendations)
assert not _contains_any_pattern(
runtime_env_values,
HOTSPOT_GC_PATTERNS + SEMERU_GC_PATTERNS,
), (
"Non-runtime-supported datasources should not have GC-related runtime recommendations "
"in the listRecommendations payload."
)
```
The edit above assumes a few things about the existing test and helpers:
1. **`exp_name` availability**
- Ensure that `exp_name` is defined earlier in `test_non_runtime_supported_datasource_logs_message_on_generate`, typically when creating the experiment or calling the generate API.
- If `exp_name` is not currently defined, either:
- Parse it from `input_json_file` with an existing helper in this file, or
- Capture it from the response of the experiment creation/generation helper already used in the other tests.
2. **`list_recommendations` signature**
- The code assumes `list_recommendations(exp_name)` exists and returns the recommendation payload for that experiment.
- If your helper has a different name or signature (e.g., takes namespace and experiment name, or returns a list of recs), adapt the call accordingly:
- e.g. `recommendations = list_recommendations([exp_name])` or similar.
3. **`_env_values` helper**
- The snippet assumes that `_env_values(recommendations)` is a valid call that produces a flat list of runtime environment values to be checked against the GC patterns.
- If `_env_values` requires additional parameters (for example, layer names or a key path), adjust the call to mirror how it is used in the other runtime tests in this file.
4. **Location relative to the log assertion**
- If the existing test already has a block that:
- Calls generateRecommendations
- Asserts that `RUNTIMES_RECOMMENDATIONS_NOT_AVAILABLE` is in the logs via `caplog`
- Make sure the newly added recommendation-payload assertions are placed **after** the generate call and log assertion block but before the end of the test, so that:
- The experiment has recommendations available, and
- The log assertion still runs as before.
If any of these assumptions differ from your codebase, keep the overall shape of the added block (call `list_recommendations(exp_name)`, derive env values via `_env_values`, then assert `not _contains_any_pattern(...)`) but adapt it to your actual helper names and data structures.
</issue_to_address>
Signed-off-by: Saad Khan <saakhan@ibm.com>
@khansaad Have you run these tests? I see these failures - https://ci.app-svc-perf.corp.redhat.com/job/ExternalTeams/job/Autotune/job/kruize_functional_tests/447/
Signed-off-by: Saad Khan <saakhan@ibm.com>
@khansaad Can you please post the latest results?
```
pushd benchmarks >/dev/null
echo "5. Installing TechEmpower (Quarkus REST EASY) benchmark into cluster"
pushd techempower >/dev/null
kubectl apply -f manifests/default_manifests -n ${APP_NAMESPACE}
```
For runtime-specific scripts, we want to use the manifests from kruize-demos, which trigger the load by default without an external command, as part of kruize/kruize-demos#177
@khansaad Why is the Semeru test skipped here? Can you submit it on Jenkins, as we have seen failures on Jenkins in the past?
```
@@ -0,0 +1,368 @@
"""
Copyright (c) 2022, 2024 Red Hat, IBM Corporation and others.
```
Update the copyright year
```
@pytest.mark.runtimes
def test_no_gc_recommendation_when_jvm_version_missing(cluster_type):
    """
    Test Description: If jvm_info metrics are present but the version label is not part
```
Update the test docs with these scenarios
Signed-off-by: Saad Khan <saakhan@ibm.com>
@kusumachalasani Are you fine with the changes?
I don't see the changes related to using manifests from "manifests/kruize-demos" while deploying benchmarks.
Signed-off-by: Saad Khan <saakhan@ibm.com>
Added now.
Signed-off-by: Saad Khan <saakhan@ibm.com>
```
@pytest.mark.skip(reason="This will be enabled once the conditional check PR is merged(PR #1866)")
@pytest.mark.runtimes
def test_no_gc_recommendation_when_jvm_version_missing(cluster_type):
```
Please make this test case a positive one, since the output you are getting is expected and is the default behaviour.
Signed-off-by: Saad Khan <saakhan@ibm.com>
@khansaad I see unexpected test results, but the tests still seem to pass. Can you check and fix these?
- test_runtime_recommendation - ENV doesn't have Quarkus recommendations, but the test passes (this could be due to setup, but the test has to fail when the Quarkus reco is not present)
- test_no_recommendation_for_layer_runtime_mismatch - Heap & GC recommendations are seen, but the test passes
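One way to make these tests fail loudly instead of silently passing is to assert on presence and absence unconditionally, rather than guarding the assertion behind a skip or an `if`. This is a minimal sketch under assumed names; `assert_env_contains` and `assert_env_excludes` are hypothetical helpers operating on the flat list of env value strings the other runtime tests extract:

```python
def assert_env_contains(env_values, required_substrings):
    """Fail with a clear message if any required substring is absent."""
    missing = [s for s in required_substrings
               if not any(s in v for v in env_values)]
    assert not missing, f"Missing expected entries {missing} in env: {env_values}"


def assert_env_excludes(env_values, forbidden_substrings):
    """Fail with a clear message if any forbidden substring is present."""
    found = [s for s in forbidden_substrings
             if any(s in v for v in env_values)]
    assert not found, f"Unexpected entries {found} in env: {env_values}"
```

With these, test_runtime_recommendation would call `assert_env_contains(env_values, ["quarkus"])` and fail when the Quarkus reco is absent, while the layer/runtime-mismatch test would call `assert_env_excludes(env_values, ["-Xmx", "-Xgcpolicy"])` (or whichever heap/GC markers apply) and fail when those recommendations unexpectedly appear.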
Signed-off-by: Saad Khan <saakhan@ibm.com>
Can you share the latest result links?
Signed-off-by: Saad Khan <saakhan@ibm.com>
@saad I have tested your changes in PR 1824 - https://ci.app-svc-perf.corp.redhat.com/job/ExternalTeams/job/Autotune/job/kruize_functional_tests/483/. I see 2 issues: the Semeru test is skipped on minikube, and the runtime reco doesn't have a Quarkus reco in env - are you enabling it in kube-state-metrics?
Signed-off-by: Saad Khan <saakhan@ibm.com>
@khansaad Please create an issue to investigate the runtimes failures on minikube.
Created 1884 to track this |
Description
This PR updates the generate_recommendations test to include more runtime recommendations scenarios.
Fixes # (issue)
Summary by Sourcery
Expand runtime recommendations coverage in local monitoring tests by adding scenario-specific test cases and extracting shared runtime helpers into a dedicated utility module.