Skip to content

chore: bundle v4 catch-up — config redesign + EventManager + StorageClient + ProxyConfiguration#597

Open
B4nan wants to merge 4 commits into
v4from
chore/v4-catchup-bundle
Open

chore: bundle v4 catch-up — config redesign + EventManager + StorageClient + ProxyConfiguration#597
B4nan wants to merge 4 commits into
v4from
chore/v4-catchup-bundle

Conversation

@B4nan
Copy link
Copy Markdown
Member

@B4nan B4nan commented Apr 30, 2026

What this is

A non-mergeable demo branch that bundles all four v4 catch-up PRs so reviewers can see green CI on the combined state.

Bundled PR Concern
#583 Configuration redesign integration
#594 PlatformEventManager constructor adapt
#595 StorageClient adapter + KeyValueStore.getPublicUrl async URL signing
#596 ProxyConfiguration v4 API

Pinned at crawlee@^4.0.0-beta.51.

Recommended merge order: #583#594#595#596

The four focused PRs are rebased into a linear stack:

v4
 └─ #583 (config-redesign)
     └─ #594 (event-manager)
         └─ #595 (storage)
             └─ #596 (proxy)

Each downstream branch already contains its predecessors as ancestors, so the merge order matters mostly for review/CI clarity:

  • Stack order (recommended): each PR's GitHub diff shows only its own commits, CI runs are scoped, reviews stay narrow.
  • Any other order also merges cleanly (the rebased stack guarantees zero conflicts), but downstream PRs merged first will pull their predecessors along — e.g. merging fix: adapt SDK ProxyConfiguration to crawlee v4 API #596 first would land all four sets of commits in v4 at once, and the still-open upstream PRs would become trivial no-op merges with inflated-looking diffs.

Verified: a sequential merge of #583#594#595#596 into origin/v4 produces zero conflicts at each step. Locally the resulting state passes 75/75 active tests on Node 22 and Node 24. The tree is functionally equivalent to this bundle (sole diff is a stale ~69-line node_modules/@crawlee/linkedom/node_modules/cheerio block left in this bundle's lockfile from the cheerio-workaround era — pure regen artifact, not real divergence).

Do not merge this PR

Merge the four focused PRs above instead. This branch will be deleted once they land.

@B4nan B4nan force-pushed the chore/v4-catchup-bundle branch 7 times, most recently from 856d554 to 5b6598a Compare April 30, 2026 17:51
B4nan added a commit to apify/crawlee that referenced this pull request Apr 30, 2026
## Summary

`packages/linkedom-crawler/src/internals/linkedom-crawler.ts` imports
`cheerio` (`import * as cheerio from 'cheerio'`) but
`@crawlee/linkedom`'s `package.json` doesn't list it as a dependency.

It works inside the monorepo because cheerio is hoisted to the workspace
root via other packages (`@crawlee/cheerio`, `@crawlee/utils`,
`@crawlee/http`, …), so Node always finds it. **Downstream installs that
depend only on `crawlee`** (which re-exports `@crawlee/linkedom`) **and
don't pull any cheerio-using sibling** fail at runtime:

```
Error: Cannot find package 'cheerio' imported from .../node_modules/@crawlee/linkedom/internals/linkedom-crawler.js
```

This bit the apify-sdk-js v4 catch-up PRs (apify/apify-sdk-js#597) on a
clean CI install — without this fix, every consumer has to ship a
`cheerio` dev-dep workaround.

The fix is one-line: declare `cheerio: "^1.0.0"` (matching what
`@crawlee/cheerio` already pins).
@B4nan B4nan force-pushed the chore/v4-catchup-bundle branch 3 times, most recently from d649a33 to f0ced4f Compare April 30, 2026 18:56
@B4nan B4nan force-pushed the chore/v4-catchup-bundle branch 9 times, most recently from 24c1250 to 5908f2a Compare May 14, 2026 12:58
B4nan added 4 commits May 14, 2026 15:26
… redesign

Squashes the full content of #583 into a single commit
so the bundle PR shows a clean four-commit summary of the v4 catch-up
stack. See PR #583 for the per-commit history.
…anager API

Squashes the full content of #594 into a single commit
so the bundle PR shows a clean four-commit summary of the v4 catch-up
stack. See PR #594 for the per-commit history.
…terface

Squashes the full content of #595 into a single commit
so the bundle PR shows a clean four-commit summary of the v4 catch-up
stack. See PR #595 for the per-commit history.
Squashes the full content of #596 into a single commit
so the bundle PR shows a clean four-commit summary of the v4 catch-up
stack. See PR #596 for the per-commit history.
@B4nan B4nan force-pushed the chore/v4-catchup-bundle branch from 5908f2a to b6057ef Compare May 14, 2026 13:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants