TaskVine at the LPC #4364
Replies: 2 comments 2 replies
-
|
Hello @cmoore24-24, we're glad to hear that your recent run at the LPC with TaskVine was successful, and we appreciate you sharing the detailed description of your experience along with these very helpful materials. Here are some observations after taking a look at the debug log file -
334 destination and 876 source workers were removed during the run:
Here is the file which records the succeeded puturl transfers: https://drive.google.com/file/d/14et5oNW6YY3RkIM1fPtN1G1JjQkPuojA/view?usp=drive_link That said, my hypothesis is that cross-node transfers are blocked on the LPC cluster, or that the node |
Beta Was this translation helpful? Give feedback.
-
|
We have learned that internal communications in the Fermi LPC also need to be limited to ports 10000-11000. @JinZhou5042 can you arrange to add a ‘—transfer-port’ option to the worker (and then the factory) to make this work? |
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
-
Hello! I'm opening a topic that is about getting something to run smoothly at the FNAL LPC.
I was able to successfully run a small-scale application at the LPC with taskvine, which read input files from Notre Dame via XRootD. Here's some information about that run:
I've attachd a tar file with the vine logs from the run. I've also includes the print out of the warning I mentioned during the meeting (below). I've also included a text file that has the output of the factory associated with the run. Perhaps notable is the text error that appears which I've never seen on Notre Dame, but it didn't stop anything from working. I've also included a screenshot of the manager from the catalog, which does show a fair few failed tasks.
successful_lpc_logs.tar.gz

Beta Was this translation helpful? Give feedback.
All reactions