bulk requests stuck on STARTED state #7744

elenamplanas · 2025-02-05T17:10:32Z

Runnig dCache version 9.2.25 and Enstore.

When a recall is failed due to a problem reading the tape, not due a missing file, checksum error, etc. the bulk request remain STARTED, with RUNNING state on the file, and without any "rh" request on any pool.

Example:

[dccore12] (local) admin > \sn pnfsidof /pnfs/pic.es/data/cms/store/data/Run2024G/ZeroBias/AOD/PromptReco-v1/000/384/202/00000/f58968b7-5890-4970-abcc-b5ace5d645e5.root
0000AFE37C9682A641AE99358012553B0CE8

$ echo "\s dc* rh ls"| ssh -p 22224 dccore.pic.es|grep 0000AFE37C9682A641AE99358012553B0CE8

In this example after running \bulk request reset the rh process doesn't appear, but when we faced the problem the first time, the new rh for the stuck file, was launched.

Don't hesitate on request any information you need.

Cheers,
Elena

DmitryLitvintsev · 2025-02-05T18:34:11Z

Hi Elena.

Make sure you set:

rc onerror fail
rc set max retries 3

(max retries 3 kind of means "smaller number", you do not want to have this number to be large)

DmitryLitvintsev · 2025-02-05T18:39:49Z

As for currenr request - I suggest to cancel it via bulk admin api.

elenamplanas · 2025-02-06T11:45:39Z

Hi Dmitry,
the parameters you suggested are related the recall processes on poolmanager, but the ones stuck have entered through bulk and have no entries on the poolmanager, they are managed directly by the bulk service, sending the requests to the pool, bypassing the poolmanager. Or maybe I'm wrong?

DmitryLitvintsev · 2025-02-06T16:09:55Z

Hi Dmitry, the parameters you suggested are related the recall processes on poolmanager, but the ones stuck have entered through bulk and have no entries on the poolmanager, they are managed directly by the bulk service, sending the requests to the pool, bypassing the poolmanager. Or maybe I'm wrong?

This is how it works:

bulk -> PinManager -> PoolManager -> pool

All staging requests in dCache are handled in PoolManager

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bulk requests stuck on STARTED state #7744

bulk requests stuck on STARTED state #7744

elenamplanas commented Feb 5, 2025

DmitryLitvintsev commented Feb 5, 2025

DmitryLitvintsev commented Feb 5, 2025

elenamplanas commented Feb 6, 2025

DmitryLitvintsev commented Feb 6, 2025

bulk requests stuck on STARTED state #7744

bulk requests stuck on STARTED state #7744

Comments

elenamplanas commented Feb 5, 2025

DmitryLitvintsev commented Feb 5, 2025

DmitryLitvintsev commented Feb 5, 2025

elenamplanas commented Feb 6, 2025

DmitryLitvintsev commented Feb 6, 2025