Hi Ashley
Our experiences in solving this problem are based on CA 11.1.3 and we haven't tested this on 11.1.5 yet.
There are two aspects at play here: 1) the tuning of the server according to the hardware and 2) Buffering of the reports you have scheduled to run at the same time.
Even when we tuned the server for normal concurrent users, running the scheduled reports could still cause this error. Check that you have tuned your system properly first, according to the guide:
https://www.ibm.com/support/knowledgecenter/en/SSEP7J_11.1.0/com.ibm.swg.ba.cognos.ug_cra.doc/c_setthemaximumnumberofprocessesandconnections.html#SettheMaximumNumberofProcessesandConnectionsWe were able to generate these errors by overloading the server with nested schedules, that would execute concurrently. The resolution was to change the shape of the load, that we are putting on the server at any single moment in time. We feed the reports gradually to be produced instead of running them all at once.
This error seems random because the load is not exactly the same every time. The error thus happens when it enters the overloaded state and try to recover from it. That is why the problem is not with the actual reports. It could be that you have to restart the CA service to recover after this error event, to ensure a reliable recovery.
I assume you are checking that there are no deadlocks on the content store database.
Also remember that the burst reports are by definition more server intensive reports than the normal single reports. We don't like to run burst reports concurrently, while the single reports are much easier to run concurrently. As a convention, we reserve around half the server resources for the burst activity and then load single reports according to the other half of the capacity.
Test what your system can realistically handle, without producing these errors and then stay below that threshold.
Hope it helps.
------------------------------
Jan Vermeer
------------------------------
Original Message:
Sent: Mon February 03, 2020 08:15 AM
From: Ashley Kent
Subject: Schedules failing DPR-ERR-2072 Unable to load balance a request with absolute affinity, since upgrade to 11.1.5
Hi community,
Since we upgraded one of our clients to 11.1.5, they have having an issue with schedules failing with the following error:
DPR-ERR-2072 Unable to load balance a request with absolute affinity, most likely due to a failure to connect to the remote dispatcher. See the remote dispatcher detailed logs for more information. Check the health status of the installed system by using the dispatcher diagnostics URIs. CAF-WRN-2082 An error has occurred. Please contact your administrator. The complete error has been logged by CAF with SecureErrorID:2020-01-30-06:00:25.135-#466
- As mentioned, they has only been since upgrading to 11.1.5 from 11.0.13, it has been working for several months.
- It is always seemingly happening at 6:00am, when a lot of schedules are being kicked off (80).
- We have attempted to tweak some of the settings to the batch service, such as batch report processes from 2 to 4; peak time to end at 8am; report queue timeout from 240 seconds to 280 seconds.
- Schedules are running fine before, between (in some cases), and after. The same reports that fail are running fine previous days also.
Does anybody have any suggestions? We are probably going to try a reinstall but wondered if there other tuning changes that could resolve the above.
Thanks,
Ashley
------------------------------
Ashley Kent
------------------------------
#CognosAnalyticswithWatson