feat: add Windows platform support by sepcnt · Pull Request #1418 · mmtk/mmtk-core

sepcnt · 2025-11-26T06:26:43Z

This pull request fix #961, adds initial Windows support to the project, focusing on enabling CI, memory management, thread affinity, and malloc functionality for Windows targets. The changes include updates to CI scripts and workflows to handle Windows-specific cases, platform-specific Rust code for memory and thread management, and a new implementation for malloc using Windows APIs.

Windows platform support

Added Windows-specific dependencies to Cargo.toml and implemented platform-specific code for thread affinity and CPU detection in src/scheduler/affinity.rs, using Windows APIs for processor count and thread affinity. [1] [2] [3] [4]
Implemented a Windows-specific malloc backend in src/util/malloc/library.rs, using HeapAlloc, HeapFree, and related APIs; updated allocation logic in src/util/malloc/malloc_ms_util.rs to support Windows. [1] [2] [3] [4] [5] [6]

CI and workflow updates

Modified .github/scripts/ci-common.sh and .github/workflows/minimal-tests-core.yml to add Windows jobs, set up bash shell defaults, and skip incompatible setup steps for Windows runners; also excluded malloc_jemalloc on Windows in feature initialization. [1] [2] [3] [4] [5] [6]

Memory management abstraction

Refactored src/util/memory.rs to abstract memory protection, mapping, and error handling for Windows, including platform-specific flag translation and error codes, and using Windows APIs for memory operations. [1] [2] [3] [4] [5] [6] [7] [8]

General codebase cleanup

Removed direct usage of libc::mprotect from src/policy/copyspace.rs, replacing it with platform-abstracted memory protection functions. [1] [2] [3]

These changes lay the foundation for building, testing, and running the project on Windows, with further improvements expected for full platform parity.

qinsoon

The implementation looks reasonable to me. The major issue is the implementation mmap_fixed (see details below).

qinsoon · 2025-11-27T00:50:21Z

@sepcnt Thanks for opening this PR. Just to better understand the context: do you plan to use MMTk on Windows for a specific project or runtime, or is this PR mainly to add general Windows support to mmtk-core?

Either way is fine -- I’d just like to understand your use case so we can make better long-term decisions around Windows support.

sepcnt · 2025-11-27T01:56:56Z

@sepcnt Thanks for opening this PR. Just to better understand the context: do you plan to use MMTk on Windows for a specific project or runtime, or is this PR mainly to add general Windows support to mmtk-core?

Either way is fine -- I’d just like to understand your use case so we can make better long-term decisions around Windows support.

While I am planning to build a runtime on the top of mmtk, this PR is mainly focus on making a 1:1 port.

`mmap_fixed`

qinsoon · 2025-11-27T06:00:38Z

The failed test for the Julia binding is unrelated with this PR.

wks · 2025-11-27T06:21:17Z

@sepcnt Thank you for your PR. While it is nice to support a popular platform, I think this change should be made after we refactor the mmtk-core code base and isolate OS-specific parts into dedicated modules, as suggested in #1420. This PR makes changes that are scattered over many modules, and often uses the pattern #[cfg(target_os = "windows")] and then #[cfg(not(target_os = "windows"))]. This makes the code hard to maintain.

Given that supporting Windows is not a priority of ours, it will be even better if it is possible to maintain Windows support outside our organization, as I mentioned in #1420 (comment). A refactoring is required to make this possible.

qinsoon · 2025-11-27T06:46:40Z

@sepcnt Thank you for your PR. While it is nice to support a popular platform, I think this change should be made after we refactor the mmtk-core code base and isolate OS-specific parts into dedicated modules, as suggested in #1420.

I feel it is a good idea to merge this PR before the refactoring. Supporting for an OS that is significantly different from Unix-like systems could actually help guide the refactoring process for an OS interface for us.

wks · 2025-11-27T07:07:16Z

@sepcnt Thank you for your PR. While it is nice to support a popular platform, I think this change should be made after we refactor the mmtk-core code base and isolate OS-specific parts into dedicated modules, as suggested in #1420.

I feel it is a good idea to merge this PR before the refactoring. Supporting for an OS that is significantly different from Unix-like systems could actually help guide the refactoring process for an OS interface for us.

A porting effort like this can definitely guide our progress of porting to another OS. But given that none of our core members can test this PR locally, and this PR is making a massive amount of changes to the code base using AI-assisted tools, I consider it a risk to merge it into the master branch directly. We may keep this pull request, and do our refactoring with this PR as a reference.

sepcnt · 2025-11-27T07:39:57Z

@sepcnt Thank you for your PR. While it is nice to support a popular platform, I think this change should be made after we refactor the mmtk-core code base and isolate OS-specific parts into dedicated modules, as suggested in #1420.

I feel it is a good idea to merge this PR before the refactoring. Supporting for an OS that is significantly different from Unix-like systems could actually help guide the refactoring process for an OS interface for us.

A porting effort like this can definitely guide our progress of porting to another OS. But given that none of our core members can test this PR locally, and this PR is making a massive amount of changes to the code base using AI-assisted tools, I consider it a risk to merge it into the master branch directly. We may keep this pull request, and do our refactoring with this PR as a reference.

Most of the changes are gated with cfg!. While this PR targets at Windows, I also tested on Ubuntu with below configurations:

sudo dpkg --add-architecture i386
sudo apt-get update && sudo apt-get install -y mingw-w64 wine64 wine32 xvfb
rustup target add x86_64-pc-windows-gnu

And in .cargo/config.toml:

[target.x86_64-pc-windows-gnu]
runner = "xvfb-run -a wine"
linker = "x86_64-w64-mingw32-gcc"

qinsoon · 2025-11-27T07:43:01Z

given that none of our core members can test this PR locally

Honestly, that does not seem to be a prerequisite for a PR to be merged. To me, for a PR to be merged, it needs to:

pass all the tests/checks, and better have the changes tested.
pass code review.
align with the project's goal and design.

this PR is making a massive amount of changes to the code base using AI-assisted tools, I consider it a risk to merge it into the master branch directly.

We have code reviews and CI. Those are intended to mitigate the risk of merging to master. You can do code review and list the risky parts or code that may cause issues, and expect them to be addressed properly before merging.

As for AI/LLM coding, there is a lot of discussion recently. I don't want to go deep on this. To my own experience, human can write bad code, and AI (at this stage) under human supervision can write good code. So my general attitude is neutral -- I judge a PR by its content, rather than who wrote it (AI or human).

wks · 2025-11-27T08:14:04Z

Honestly, that does not seem to be a prerequisite for a PR to be merged. To me, for a PR to be merged, it needs to:
1. pass all the tests/checks, and better have the changes tested.

2. pass code review.

3. align with the project's goal and design.

If it's not yet a prerequisite, it should be.

Code reviewing and testing are ways to know if this version works. But if we accept this pull request, not only does it mean that we are accepting this change, but it also means we will have the responsibility to maintain it. It means for every change we make, we need to ensure we don't break Windows. And if some changes accidentally breaks Windows, we need to debug it. Given the result of this PR, i.e. #[cfg] scattered all over the place, it will be a nightmare for maintenance. So unless someone volunteers to do the architectural refactoring immediately, this is not going to be friendly to developers. However, all of our core members are currently quite occupied.

We have code reviews and CI. Those are intended to mitigate the risk of merging to master. You can do code review and list the risky parts or code that may cause issues, and expect them to be addressed properly before merging.

As for AI/LLM coding, there is a lot of discussion recently. I don't want to go deep on this. To my own experience, human can write bad code, and AI (at this stage) under human supervision can write good code. So my general attitude is neutral -- I judge a PR by its content, rather than who wrote it (AI or human).

There are also legal concerns about AI-generated code. AI-trained coding agents may be trained from and/or directly produce snippets that are incompatible with our license (Apache2 + MIT). Particularly, we cannot accept snippets that are GPL-licensed.

qinsoon · 2025-12-05T02:45:50Z

@sepcnt We’ve decided to implement #1420 before merging this PR. I’m planning to work on it in January, and once that is in place we can rebase this PR on top of the new interface. You’re very welcome to contribute to #1420 if you’re interested or motivated to help with that work.

Again thanks for this PR.

This PR addresses #1420. This PR is based on top of #1418, and includes all the changes for Windows (for testing purpose). It is likely that Windows support will be removed from this PR, and will be merged separately. This PR does not try to refactor our malloc interface -- I am not sure if malloc should be included in the OS interface or not. This PR consolidates the current multiple mmap functions (such as dzmmap, mmap_fixed, mmap_noreplace, mmap_noreserve, etc), and use `MmapStrategy` to specify the expected mmap behavior. --------- Co-authored-by: sepcnt <30561671+sepcnt@users.noreply.github.com>

This reverts commit 4ab4f52.

This reverts commit 474d60a.

qinsoon · 2026-02-18T01:00:06Z

+            while addr < end {
+                let mut mbi: MEMORY_BASIC_INFORMATION = std::mem::zeroed();
+                let q = VirtualQuery(
+                    addr as *const _,
+                    &mut mbi,
+                    std::mem::size_of::<MEMORY_BASIC_INFORMATION>(),
+                );
+                if q == 0 {
+                    return Err(io::Error::last_os_error());
+                }
+
+                let region_base = mbi.BaseAddress as *mut u8;
+                let region_size = mbi.RegionSize;
+                let region_end = region_base.add(region_size);
+
+                // Calculate the intersection of [addr, end) and [region_base, region_end)
+                let _sub_begin = if addr > region_base {
+                    addr
+                } else {
+                    region_base
+                };
+                let _sub_end = if end < region_end { end } else { region_end };
+
+                match mbi.State {
+                    MEM_FREE => saw_free = true,
+                    MEM_RESERVE => saw_reserved = true,
+                    MEM_COMMIT => saw_committed = true,
+                    _ => {
+                        return Err(io::Error::other("Unexpected memory state in mmap_fixed"));
+                    }
+                }
+
+                // Jump to the next region (VirtualQuery always returns "continuous regions with the same attributes")
+                addr = region_end;
+            }
+
+            // 1. All FREE: make a new mapping in the region
+            // 2. All RESERVE/COMMIT: treat as an existing mapping, can just COMMIT or succeed directly
+            // 3. MIX of FREE + others: not allowed (semantically similar to MAP_FIXED_NOREPLACE)
+            if saw_free && (saw_reserved || saw_committed) {
+                return Err(io::Error::from_raw_os_error(
+                    windows_sys::Win32::Foundation::ERROR_INVALID_ADDRESS as i32,
+                ));
+            }
+
+            if saw_free && !saw_reserved && !saw_committed {
+                // All FREE: make a new mapping in the region
+                let mut allocation_type = MEM_RESERVE;
+                if commit {
+                    allocation_type |= MEM_COMMIT;
+                }
+
+                let res = VirtualAlloc(
+                    ptr as *mut _,
+                    size,
+                    allocation_type,
+                    strategy.prot.get_native_flags(),
+                );
+                if res.is_null() {
+                    return Err(io::Error::last_os_error());
+                }
+
+                Ok(start)
+            } else {


@wks commented in #1439:

Despite all the VirtualQuery checks we do, the subsequent VirtualAlloc may still fail in multi-threaded programs due to TOCTOU. And it takes time if the given region contains multiple mmap entries. But it is useful for sanity check.

I suggest we guard all the checks with a feature or debug_assertion so that we don't do them in production. (It's even better if we can extract all the sanity check parts to a separate function so that we can reuse it.) In production, we only call VirtualAlloc. And we can port the sanity check to Unix-like systems by parsing /proc/self/maps (or using a third-party crate for parsing it), but that still has the TICTOU problem and can only serve debug purposes.

Ideally, we should reserve the region of memory for metadata and the heap so that we won't need such checks.

qinsoon · 2026-02-18T01:01:18Z

+            // If decommit fails, we try to release the memory. This might happen if the memory was
+            // only reserved.
+            let res_release = unsafe { VirtualFree(start.to_mut_ptr(), 0, MEM_RELEASE) };
+            if res_release == 0 {
+                Err(std::io::Error::last_os_error())
+            } else {
+                Ok(())
+            }


@wks commented in #1439 (comment):

We shouldn't use MEM_RELEASE.

MEM_RELEASE is supposed to be paired with VirualAlloc and be used as if they were malloc-free pairs. The doc says "If you specify this value (MEM_RELEASE), dwSize must be 0 (zero), and lpAddress must point to the base address returned by the VirtualAlloc function when the region is reserved. The function fails if either of these conditions is not met." Obviously this is completely different from how we use munmap.

It will remove the reserved state. We don't do it, just like we don't "un-quarantine" memory in Unix-like systems.

Instead if MEM_DECOMMIT fails, we should return failure.

wks · 2026-02-24T07:44:36Z

One high-level comment is that line-end handling should be done automatically.

If the current CI scripts result in errors related to line ends on Windows, it is likely that the CI scripts are using command line tools (bash, sed, etc.) from different sources (including CygWin, MSYS, Git's command line tools, etc.) that follow different conventions.

We either

let Git check out text files and automatically convert LF to CR LF (which is controlled by the core.autocrlf Git configuration and it should be true on Windows by default), and use command line tools that assume CR LF line ending, or
let Git check out text files unchanged on Windows, and use command line tools that use Unix convention (LF).

In either of the two cases, there is no need to manually convert line ends (strip_cr) in scripts. Instead, we should let GitHub Actions select a set of tools that consistently use one of the two conventions above.

qinsoon · 2026-02-24T09:39:45Z

At this point, we try to reuse the existing CI scripts for Windows. In those scripts, we do text parsing on source code files, so the scripts need to handle Windows' CR LF. Imagine if we used a different set of CI scripts for Windows -- those scripts would have to deal with CR LF as well, as they would be running on Windows, and dealing with Windows files.

wks · 2026-02-24T10:43:47Z

... In those scripts, we do text parsing on source code files, so the scripts need to handle Windows' CR LF. ...

I disagree. Scripts can be written in a style that is agnostic of line ending styles. The scripts should see lines. That is, a file consists of multiple lines. Every time it reads a line from a text file, it gets all the text until the LF or CR LF. read is one such command.

If the source code is already converted to Windows style (CR LF) by Git, then we should use a version of bash shell so that its built-in read command assumes lines end with CR LF and automatically strips both CR and LF.
If the source code is checked out as is (LF), then we should use a version of bash where read strips LF.

From the code

strip_cr() {
  local s=$1
  printf '%s' "${s%$'\r'}"
}

we infer that the input $1 contains a CR but not LF. This can only happen if (1) Git did the conversion so that the file on disk has CR LF, and (2) the bash it is using assumes lines end with LF, so the read command stripped LF, but not CR. In other words, the git program is inconsistent with bash.

The documentation of the windows-latest image (https://github.com/actions/runner-images/blob/main/images/windows/Windows2025-Readme.md) mentions that there are several Bash shells installed, namely gitbash.exe, msys2bash.cmd and wslbash.exe.

On https://docs.github.com/en/actions/reference/workflows-and-actions/workflow-syntax, it mentioned that

bash: The default shell on non-Windows platforms with a fallback to sh. When specifying a bash shell on Windows, the bash shell included with Git for Windows is used.

The git-bash is provided by https://gitforwindows.org/ But there is no documentation about its Bash port. It is quite a surprise that Git and Git-Bash have different assumptions about line ends with the default configuration.

Maybe we can find a way to let Git-Bash use CR LF line ending in the read command. If that's impossible, the easiest solution is setting core.autocrlf to false in Git.

qinsoon · 2026-02-25T00:15:48Z

Maybe we can find a way to let Git-Bash use CR LF line ending in the read command. If that's impossible, the easiest solution is setting core.autocrlf to false in Git.

We can do something like this: https://github.com/rust-lang/rust/blob/859951e3c7c9d0322c39bad49221937455bdffcd/.gitattributes#L3

feat: add Windows platform support

7895c88

qinsoon reviewed Nov 27, 2025

View reviewed changes

Comment thread .github/workflows/minimal-tests-core.yml Outdated

Comment thread src/util/malloc/malloc_ms_util.rs Outdated

Comment thread src/util/memory.rs Outdated

Comment thread src/util/memory.rs Outdated

Comment thread src/util/memory.rs Outdated

qinsoon mentioned this pull request Nov 27, 2025

Interface for OS- or CPU-specific implementations #1420

Open

2 tasks

fix: use constants, CI placeholder, detailed comment and better

b11ef21

`mmap_fixed`

sepcnt force-pushed the feat-windows-support branch from 8226c63 to b11ef21 Compare November 27, 2025 03:06

qinsoon added the PR-extended-testing Run extended tests for the pull request label Nov 27, 2025

Merge branch 'master' into feat-windows-support

1ad2f65

wks mentioned this pull request Dec 5, 2025

Platform tiers #1430

Closed

qinsoon mentioned this pull request Jan 7, 2026

OS interface #1439

Merged

qinsoon added 3 commits February 17, 2026 02:40

Merge branch 'master' into feat-windows-support

c621b8f

Revert "Revert windows-related changes."

22d9118

This reverts commit 4ab4f52.

Revert "Further remove some changes due to windows support"

a182a9f

This reverts commit 474d60a.

qinsoon removed the PR-extended-testing Run extended tests for the pull request label Feb 17, 2026

qinsoon added 2 commits February 18, 2026 13:23

Fix windows impl

3b43963

Cleaner interface for malloc and the lack of posix_memalign on windows

dd4cf66

qinsoon reviewed Feb 18, 2026

View reviewed changes

qinsoon requested a review from wks February 18, 2026 02:20

wks mentioned this pull request Feb 27, 2026

Add CI script to check line ends #1452

Merged

Conversation

sepcnt commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qinsoon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qinsoon commented Nov 27, 2025

Uh oh!

sepcnt commented Nov 27, 2025

Uh oh!

qinsoon commented Nov 27, 2025

Uh oh!

wks commented Nov 27, 2025

Uh oh!

qinsoon commented Nov 27, 2025

Uh oh!

wks commented Nov 27, 2025

Uh oh!

sepcnt commented Nov 27, 2025

Uh oh!

qinsoon commented Nov 27, 2025

Uh oh!

wks commented Nov 27, 2025

Uh oh!

qinsoon commented Dec 5, 2025

Uh oh!

qinsoon Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

qinsoon Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

wks commented Feb 24, 2026

Uh oh!

qinsoon commented Feb 24, 2026

Uh oh!

wks commented Feb 24, 2026

Uh oh!

qinsoon commented Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sepcnt commented Nov 26, 2025 •

edited

Loading