I have a system setup where we have some millions of customers and we run daily batches for each of them to calculate some daily time series for each account.
- Each Account has account number and
- We have a workflow for this task, which has min_account_number), max_account_number parameters.
- Thus I can start a workflow to go thru accounts 1..10000, 10001..20000 and so on.
- We can have multiple back end servers
Now my question is if we run e.g. 100 workflows at the same time:
1) FRONT END SEVERS: what parameters should I check in front end machines (is there a parameter, which limits how many workflows I can start at same time) ?
2) BACK END SERVERS: how to load performance for multiple back ends so that each of them is loaded equally (which parameters I should check there ) ?
3) Other considerations - for threading, locking etc ?
Cheers,
PEKKA