Parallel Processing, Queueing and Scheduling

submersible_toaster has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
•Re: Parallel Processing, Queueing and Scheduling by merlyn (Sage) on Mar 12, 2003 at 00:21 UTC
POE could manage the multiple tasks nicely, and you could put a Tk, or Curses, or Web interface on it rather straightforwardly. -- Randal L. Schwartz, Perl hacker Be sure to read my standard disclaimer if this is a reply.	[reply]
Re: Parallel Processing, Queueing and Scheduling by perrin (Chancellor) on Mar 11, 2003 at 23:44 UTC
Parallel::MPI::Simple, Parallel::MPI, Parallel::Pvm, and Spread::Queue might be worth a look.	[reply]
Re: Parallel Processing, Queueing and Scheduling by kschwab (Vicar) on Mar 12, 2003 at 03:12 UTC
Not really perl related, but we've had great success with openpbs, aka "Open Portable Batch Scheduler". open source provides a tcl ( I know, I know ) interface where you can even write your own scheduler stdout/stderr watching, as you mentioned an open api for reporting status back dependency chain ( only run job b if job a succeeds, etc) I am only using it for running sql batch jobs, as I needed the job dependency chains. Would have gotten by with cron otherwise. So, since I'm not using it like you would, I'm not sure if it covers your other needs. I will note that it expects to run scripts, and not binaries ( it reads the script into stdin for later feeding to whatever is on the shebang line ). You would have to create wrapper scripts for your render jobs... Here's the blurb from their website: The Portable Batch System (PBS) is a flexible batch queueing and workload management system originally developed for NASA. It operates on networked, multi-platform UNIX environments, including heterogeneous clusters of workstations, supercomputers, and massively parallel systems. Development of PBS is provided by Altair Grid Technologies.	[reply]
(jeffa) Re: Parallel Processing, Queueing and Scheduling by jeffa (Bishop) on Mar 12, 2003 at 00:17 UTC
So what is the question? :) Are you wanting to perform real parallel processing in Perl? If that is the case, then may i recommend C instead? You could either opt for shared memory (pthreads) or message passing (MPI). Using Perl 'threads' to provide the end user with the illusion of multi-processing is one thing, but Real™ parallel processing is best done with a tool like C, not Perl (and no, i don't even think Java cuts the mustard either ;)). UPDATE: ahhh, after seeing merlyn's recommendation of POE i changed my mind. Definitely give POE a try for this problem. jeffa L-LL-L--L-LL-L--L-LL-L-- -R--R-RR-R--R-RR-R--R-RR B--B--B--B--B--B--B--B-- H---H---H---H---H---H--- (the triplet paradiddle with high-hat)	[reply]
Re: Parallel Processing, Queueing and Scheduling by zengargoyle (Deacon) on Mar 12, 2003 at 00:44 UTC
maybe Overkill, and no MS-Win support (but it might work with cygwin type environment) is PBS. it's widely used in I2/Globus/Grid projects. you would still need to do your template part, but PBS can schedule/manage the jobs across available nodes. i don't use it myself, but we have a (ew, 600ish nodes, 2-4 CPUs/node) cluster that's booked for the next year or so managed by the PBS stuff. IRIX and Linux are on the supported list (with other *NIX flavors as well).	[reply]
Re: Parallel Processing, Queueing and Scheduling by submersible_toaster (Chaplain) on Mar 12, 2003 at 00:25 UTC
Update:Dammit , I have think I've screwed up my terminology again and confused the issue. I will try to describe it better. A number of machines exist, running a client that offers that machine's CPU as available to use. On seeing an available client, the server determines which task in the queue has first crack at the CPU resource, and sends the client a segment of that task. The server is listening constantly to other clients, regarding their progress. Users have a seperate client tool to drop tasks on the queue, and monitor their progress More to the point , tasks are being processed by plain old executables that know nothing nor need to know - ie host A does not care that while it processes parts 1-10, Host B is processing parts 11-20 of the same task. I apologise again for the confusion, the -- storm has already begun :( Update:Firstly I would just like to say thankyou to everyone for your ideas and I will of course be following up and researching many of these suggestions. If you have not been ++'d by me in this thread yet, when the vote-fairy returns it will be so. Secondly, I will be taking a clue-by-four to my silly self, as my requirements are going to have to change. Supporting all those platforms is going to create more work to achieve than it will leverage in processing grunt. Powers that be are already making linux noises RE 3d Animation applications, so supporting win32 AND n(u\|i)x flavors is biting off more than I can chew. Did I point out that PHB are no prepared to buy any more hardware yet to aid processing, and are less inclined to spend money on software to manage the queue - particulary because there isn't much hardware for it to manage (it gets even more circular after that so I'll stop now). I am once again reminded why Perlmonks is the first page I open when I arrive at work. Thanks again. -toaster. I can't believe it's not psellchecked*	[reply]
Use Condor by ibanix (Hermit) on Mar 12, 2003 at 01:19 UTC
Hi, I used to work for a fiber optics lab at a University. We had many users vying for control of the computational cluster, and needed a way to divide time among them based on time, project priority, etc, etc. I ended up going with Condor. Condor will support most of what you're asking for. It will also run MPI and PVM jobs, so you can integrate parallel-processing jobs into the system. It's not difficult to setup (does require a decent sysadmin), runs on Unix and Windows, and seems to have a decent userbase. I was able to get answers from some of the developers when I emailed. Good luck, ibanix `$ echo '$0 & $0 &' > foo; chmod a+x foo; foo;`	[reply] [d/l]
Re: Re: Parallel Processing, Queueing and Scheduling by Anonymous Monk on Mar 12, 2003 at 00:53 UTC
If the jobs are significant pieces of time, the trick that I have used for this is to store information about what jobs are needed in a database. Then let each machine open up a database connection, open a transaction, figure out which job to do, and then mark it as started. It should issue regular updates if desired. Then when it finishes it marks the job as done. Users have a tool that allows them to add jobs to the database. I didn't develop this into anything complex, but it wasn't hard to get to a usable state. And since coordination is handled in a lightly loaded database, this should scale to a very large number of machines. And it can coordinate processes that need cross-platform resources. Including human intervention! Other solutions worth considering are standard clustering technologies like http://www.mosix.org/, and various solutions that fall under the name grid computing.	[reply]
Re: Re: Re: Parallel Processing, Queueing and Scheduling by maksl (Pilgrim) on Mar 12, 2003 at 22:02 UTC
Openmosix contains several improvements by Moshe Bar on the original work of Prof. Barak (it starteted as fork to continue this open project as gpl) try it if you want to use a mosix type linux kernelpatch for clustering :)	[reply]
Re^2: Parallel Processing, Queueing and Scheduling by atcroft (Abbot) on Mar 12, 2003 at 05:18 UTC
I don't know if this helps as an idea, but on a recent work-related project, I wrote a wrapper that checked a database queue for waiting tasks (in my case, account provisioning items), changed the status to pending, processed the item (either itself or through the use of a helper application), then changed it again to a completed or failed status. I had the luxury of having each wrapper only look for one type of item, though, where you would have to do a query probably based on some priority rating or something. Lots of good suggestions already, but good luck, and hope the idea helps.	[reply]
Re: Parallel Processing, Queueing and Scheduling by pg (Canon) on Mar 12, 2003 at 03:18 UTC
This really depends on how heavy the processing would be, and how many parallel tasks would be there. Of course, as a FUNCTIONALITY, Perl can provide the paralell processing you want, and there are actually even more than one solutions, but this does not necessary to make Perl the right choice. Perl is not there for heavy processing. If speed is a must, and high-end paralell processing is a must, go c. Also, Perl modules intend to use more memory, which is a big drawback for parallel processing. If you don't want to re-develop your application later, go straight, and pick the best tool, in this case, it is c. Of course, you would lose all the good stuffs Perl can provide, for example regexp, rapid dev, etc., but trade off is everywhere in the IT world, so that is expected.	[reply]
Re: Parallel Processing, Queueing and Scheduling by talexb (Chancellor) on Mar 12, 2003 at 14:50 UTC
We use Sun's Grid Engine on Red Hat 7.2 and 8. It works very well. --t. alex Life is short: get busy!	[reply]
Re: Re: Parallel Processing, Queueing and Scheduling by hawson (Monk) on Mar 13, 2003 at 13:58 UTC
I'll second this one. Like most offerings from Sun, It's amazing general, and the docs skimp on a few things, but it works quite well. It also has clients for several different OSes, and source code is available here at http://gridengine.sunsource.net/ --Hawson	[reply]
Re: Parallel Processing, Queueing and Scheduling by AssFace (Pilgrim) on Mar 12, 2003 at 16:15 UTC
There are three ways that come to mind. COW (cluster of workstation - like the grid computing that one of the responses on here mentions), Beowulf type clustering, and then MOSIX/OpenMOSIX type clustering (which is sort of a mix of the two). As far as I know, none of those allow for cross platform. From what I know of parallel work - which from what I have done is graphics and financial data, but I wouldn't really say I'm an expert - you want to have your processing done in C. Which it sounds like you are doing - it sounds as if you want something to handle sending off data to each node on the cluster. People already talked of beowulf (the pvm and/or mpi stuff is used on that implementation), and they have talked of grids - but I didn't see Mosix on here. When I was starting up learning all of this I was interested in doing a Beowulf cluster because... well, they sound cool. But then I was starting to see a trend where for the things that I wanted to do, it was actually adding a level of complexity that wasn't needed. So I dumped on the Beowulf idea and went with OpenMosix. You can obviously read up more on your own of course, but the general idea is that you have N nodes in a cluster - all running Linux, with the kernel mods that OpenMosix requires. Then from there, you have a few options - the one I am more familiar with is having one head node that keeps track of what the others are up to and what loads they are under. You then can put your perl script on the head node and run it on there, and every time that you want to feed off the processor intensive part, then you fork off your call to the C program (using the Perl ForkManager module) and OpenMosix will pass that off to the node that is the least busy at that moment. There is also a varient that has more of the grid idea where everyone's computers run it and they are basically workstations, but can also compute stuff in the background if they are freed up. The basic concept of it is that they have a shared network filesystem (PFS - different than NFS, but similar too <g>). They don't have the shared memory of an SMP system, so the bottleneck tends to reside in the network speed. A general example for my work would be that I have my cluster, and I ssh into the head node and run the perl script, and then that spawns off the C program, passing it parameters so that it can do its thing, and then it saves out to the disk - then once the bulk processing is done, in my case the perl collects the data and makes one collective document. In your case it would vary depending on what rendering you are doing (if you are doing it where each node processes a pixel, then it is very different than if you are doing it so that each node is assigned to render out the movie frames N through N+10 and then later put those frames together into a movie). I'm not sure how well I explained all of that - but the shortest answer is to read up on OpenMosix and it is likely that someone out there is doing something very similar to what you are wanting to do.	[reply]


P is for Practical
	PerlMonks