SOLE-R1

SOLE-R1 is a dense reward model for robotic manipulation, built on a vision-language model (VLM). Given a natural-language task description and a robot video, it predicts a task-progress percentage for each timestep; these predictions can be used as reward signals for reinforcement learning.
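
As a rough illustration of how per-timestep progress predictions might feed an RL loop, the sketch below converts a list of progress percentages into per-step rewards by taking the change in progress between consecutive frames. This is only one plausible choice and is not taken from this repository: the function name `progress_to_rewards`, the delta-based reward, the clamping to 0-100, and the sample values are all assumptions made for the example.

```python
from typing import List, Sequence


def progress_to_rewards(progress_pct: Sequence[float]) -> List[float]:
    """Turn per-timestep progress percentages (0-100) into per-step rewards.

    Assumption (not from the repository): the reward at step t is the change
    in normalized progress since step t-1, with progress before the first
    frame taken to be 0.
    """
    rewards = []
    previous = 0.0
    for pct in progress_pct:
        # Clamp to [0, 100] in case a prediction falls outside the range.
        current = min(max(float(pct), 0.0), 100.0) / 100.0
        rewards.append(current - previous)
        previous = current
    return rewards


# Hypothetical predictions for a five-frame video (illustrative values only).
example_progress = [0, 10, 25, 20, 60]
example_rewards = progress_to_rewards(example_progress)
```

With this choice, a step that moves the task backwards (25% to 20% above) receives a negative reward, and the rewards over an episode sum to the final normalized progress. Using the raw percentage at each step as the reward would be an equally reasonable alternative.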

Tested with Python 3.10 on Ubuntu 22.04.

Repository layout

reward_server/   VLM inference server (SOLE-R1, GPT, or Gemini)
reward_client/   CLI client that queries the server and annotates videos
videos/          Sample robot videos

Reward Server

See reward_server/README.md for full setup, usage, and API documentation.

Reward Client

See reward_client/README.md for full setup and usage documentation.

Contribution and Maintenance

This repository is released as-is to accompany a paper submission.

If you find bugs or other issues that would affect anyone trying to reproduce the results in this repository, please file an issue and we will look into it as soon as we can.

For other improvements, including new features, we recommend creating your own fork of the repository.
