chore: Enhance logging, validation handling, and HR workflow tests by Harmanpreet-Microsoft · Pull Request #873 · microsoft/Multi-Agent-Custom-Automation-Engine-Solution-Accelerator

Harmanpreet-Microsoft · 2026-03-24T13:05:04Z

Purpose

This pull request significantly enhances the robustness and reliability of the E2E test suite for the HomePage by improving error handling, validation logic, and reporting. The changes include smarter handling of clarification steps, more resilient validation of AI-driven responses, comprehensive error detection, and improved HTML reporting with screenshot support.

Key improvements include:

Test robustness and error handling

Updated task plan approval methods (approve_retail_task_plan, approve_rfp_task_plan, approve_contract_compliance_task_plan) to treat clarification input as a non-error condition (expected for some workflows), returning a boolean instead of raising exceptions. This prevents unnecessary test failures for valid scenarios. [1] [2] [3]
Enhanced error detection in validate_rai_error_message to check for a wider range of error messages and fallback states, including silent rejections, and to capture screenshots for easier debugging.

AI response validation and synchronization

All major response validation methods (retail, product marketing, HR, RFP, contract compliance) now explicitly wait for the "AI Thinking Process" indicator to disappear before proceeding, reducing flakiness due to asynchronous UI updates. [1] [2] [3] [4] [5]
Response validation now includes checks for expected content patterns (e.g., headings and key phrases) to ensure the AI-generated responses are not only present but also semantically correct. [1] [2] [3]

Reporting and test infrastructure

Enabled HTML reporting for test runs by adding the --html=report.html option in pytest.ini.
Added support for capturing screenshots for all tests by default, with configurable behavior, and introduced a utility for generating clean screenshot filenames.

Minor improvements

Added a locator for the "AI Thinking Process" element to facilitate synchronization in tests.

These changes collectively make the test suite more stable, easier to debug, and provide richer feedback for failures.

...

Does this introduce a breaking change?

Yes
No

How to Test

Get the code

git clone [repo-address]
cd [repo-name]
git checkout [branch-name]
npm install

Test the code

What to Check

Verify that the following are valid

...

Other Information

test: dev-v4 to main PR

Added note for azd version 1.23.9 regarding preflight configuration.

docs: Add note for azd version 1.23.9 deployment

fix: upgraded the agent-framework to latest version

Updated logging for clarification input checks and added AI Thinking Process handling in various validation methods.

Added a test for the HR workflow to validate the Human Resources process steps. Updated function calls to use the correct naming convention for input methods.

…ame sanitization

tests/e2e-test/pages/HomePage.py

Copilot

Pull request overview

This PR strengthens the Playwright-based E2E test harness under tests/e2e-test/ by adding HTML reporting + screenshot capture, improving UI synchronization around AI response completion, and adding a focused HR workflow smoke test to reduce debugging friction.

Changes:

Added an HR-only E2E smoke test and updated RAI prompt sender method usage in the existing smoke suite.
Enhanced BIABPage response validation to wait for an “AI Thinking Process” indicator and broadened RAI error-state detection.
Enabled pytest-html reporting and added screenshot capture/attachment behavior via conftest.py.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 7 comments.

File	Description
tests/e2e-test/tests/test_MACAE_Smoke_test.py	Adds an HR-only workflow test and updates calls to the renamed RAI prompt send method.
tests/e2e-test/tests/conftest.py	Adds screenshot capture/attachment to pytest-html reports and filename sanitization utilities.
tests/e2e-test/pytest.ini	Enables HTML reporting via `--html=report.html`.
tests/e2e-test/pages/HomePage.py	Adds AI “thinking” synchronization, expands response content checks, changes task-plan approval clarification handling, and updates RAI error validation behavior.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/e2e-test/tests/conftest.py

tests/e2e-test/pages/HomePage.py

…racy

Copilot

Pull request overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 5 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/e2e-test/pages/HomePage.py

tests/e2e-test/tests/conftest.py

tests/e2e-test/pages/HomePage.py

…treams usage in conftest.py

…d handling of ambiguous states

…d raise exceptions for required clarifications

Roopan-Microsoft and others added 7 commits March 16, 2026 19:23

Merge pull request #846 from microsoft/dev-v4

c9558a4

test: dev-v4 to main PR

docs: Add note for azd version 1.23.9 deployment

aa18b75

Added note for azd version 1.23.9 regarding preflight configuration.

Merge pull request #859 from microsoft/azd-note

946a475

docs: Add note for azd version 1.23.9 deployment

Merge pull request #860 from microsoft/dev-v4

bb5a11f

fix: upgraded the agent-framework to latest version

Enhance logging and AI process handling in validations

722c280

Updated logging for clarification input checks and added AI Thinking Process handling in various validation methods.

Add HR workflow test and update input method names

7086eb3

Added a test for the HR workflow to validate the Human Resources process steps. Updated function calls to use the correct naming convention for input methods.

Enhance screenshot handling in pytest configuration and improve filen…

7b086ef

…ame sanitization

Harmanpreet-Microsoft requested review from Avijit-Microsoft, Fr4nc3, Prajwal-Microsoft, Roopan-Microsoft, Vinay-Microsoft, aniaroramsft, dgp10801, marktayl1, nchandhi and toherman-msft as code owners March 24, 2026 13:05

github-code-quality bot found potential problems Mar 24, 2026

View reviewed changes

Harmanpreet-Microsoft requested a review from Copilot March 24, 2026 13:07

Copilot started reviewing on behalf of Harmanpreet-Microsoft March 24, 2026 13:08 View session

Harmanpreet-Microsoft temporarily deployed to production March 24, 2026 13:09 — with GitHub Actions Inactive

Copilot AI reviewed Mar 24, 2026

View reviewed changes

Refactor screenshot path handling in pytest reports for improved accu…

fe406b4

…racy

Harmanpreet-Microsoft requested a review from Copilot March 24, 2026 15:09

Copilot started reviewing on behalf of Harmanpreet-Microsoft March 24, 2026 15:09 View session

Copilot AI reviewed Mar 24, 2026

View reviewed changes

Harmanpreet-Microsoft added 3 commits March 24, 2026 20:48

Refactor clarification handling and logging in BIABPage; update log_s…

78d2196

…treams usage in conftest.py

Refactor RAI error message validation in BIABPage; improve logging an…

62c4d1a

…d handling of ambiguous states

Refactor clarification handling in BIABPage; improve error logging an…

fc4bf5e

…d raise exceptions for required clarifications

Harmanpreet-Microsoft temporarily deployed to production March 24, 2026 17:42 — with GitHub Actions Inactive

Harmanpreet-Microsoft temporarily deployed to production March 24, 2026 17:44 — with GitHub Actions Inactive

Harmanpreet-Microsoft temporarily deployed to production March 24, 2026 17:54 — with GitHub Actions Inactive

Harmanpreet-Microsoft temporarily deployed to production March 24, 2026 18:40 — with GitHub Actions Inactive

Harmanpreet-Microsoft changed the base branch from main to dev-v4 March 25, 2026 05:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: Enhance logging, validation handling, and HR workflow tests#873

chore: Enhance logging, validation handling, and HR workflow tests#873
Harmanpreet-Microsoft wants to merge 11 commits intodev-v4from
psl-macaee2e

Harmanpreet-Microsoft commented Mar 24, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

Harmanpreet-Microsoft commented Mar 24, 2026

Purpose

Test robustness and error handling

AI response validation and synchronization

Reporting and test infrastructure

Minor improvements

Does this introduce a breaking change?

How to Test

What to Check

Other Information

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants