-
Notifications
You must be signed in to change notification settings - Fork 73
Pull requests: scaleapi/llm-engine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: avoid global k8s Configuration singleton race in read_config_map
#764
opened Feb 23, 2026 by
lilyz-ai
Loading…
2 tasks
Adding SGP-related extensible configuration to the model engine helm chart
#762
opened Feb 20, 2026 by
arniechops
Loading…
feat: add ModelWeightsManager to auto-sync HF weights on endpoint creation
#761
opened Feb 20, 2026 by
lilyz-ai
Loading…
6 tasks done
perf improvements incl. client reuse, more efficient middleware
#759
opened Feb 15, 2026 by
olliestanley
•
Draft
Config: support IPv4 bind host for inference servers
#757
opened Feb 13, 2026 by
dustinrubin5050
Loading…
Add task_expires_seconds for async endpoint task expiration
#754
opened Feb 11, 2026 by
lukasewecker
Loading…
🚧 Add queue_message_timeout_duration parameter to endpoints (from older TASMU base commit)
#731
opened Dec 1, 2025 by
ValentineDragan
•
Draft
Add configurable queue_message_timeout_duration parameter in endpoint service configs
#709
opened Sep 16, 2025 by
ValentineDragan
•
Draft
changed host flag to be 0.0.0.0 to be compatible with both ipv4 and ipv6
#707
opened Sep 12, 2025 by
andytang-scale
Loading…
Enhance model engine with max concurrency support for sync endpoints
#701
opened Aug 24, 2025 by
saeidbarati-scale
Loading…
Update celery forwarder to use greenlets instead of processes
#689
opened Mar 1, 2025 by
dmchoiboi
Loading…
[GCP] Aagamdalal/sgp 3575 model engine update pubsub model engine code
#686
opened Feb 18, 2025 by
AaDalal
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.