The UC3 group here held a UC3 user's jamboree today. Part of the preparation involved preparing exercies for users to do. It's surprising how many issues pop up in code that you thought was solid when you start exercising everything. Part of this can be corrected using a test suite and/or continuous integration methods but that won't get everything and users will still find the corner cases that you miss. It's unfortunate if you want to shield users from potentially confusing errors but that's the way it is.
The other interesting part of the Jamboree was getting the opportunity to do a little bit of Octave programming. I've never had to use Matlab or Octave so it was a bit new however it's sufficiently C like that wasn't too hard to pick up. However, lack of time prevented me from using a monte carlo analysis of time series data and I had to go with some simple matrix operations (matrix multiplication, calculating eigenvectors and eigenvalues). However, the original monte carlo code is something I'll probably come back to possibly as a part of a larger example using Swift to handle the generation of samples and coordinating analysis of the samples....
Thursday, May 23, 2013
Tuesday, May 7, 2013
User Engagement with SPT and Double Chooz
At a certain point in the development cycle, the best way to improve and move a project forward is to get user feedback in order to catch bugs in situations that you didn't anticipate and to feedback from people using software in a variety of situations. For SkeletonKey, this is taking place through engagement with the Double Chooz and South Pole Telescope (SPT) groups here in order to allow them to run computations on UC3 and eventually OSG.
After discussions with users from both Double Chooz and SPT, it's quickly become apparent that the biggest issues preventing their use of UC3 and other resources are those of software and data access. In regards to software access, both groups have moderately large software stacks (~2-5GB) that need to be available in order for them to run computation. Although 5GB of software is something that can be easily transferred using scp or something similar for one-off computations, this quickly becomes unmanageable when scaling up to hundreds or thousands of jobs using this software. Even with 10Gbit network connectivity, transferring a terabyte of software is time consuming and terribly inefficient use of bandwidth! CVMFS comes to the rescue here by allowing software to only transfer the portions of the software that they access and by utilizing a squid proxy to minimize the data transfer and push the bandwidth utilization to local networks rather than to backbones. We're currently in the process of installing their software on a CVMFS repository and updating their workflows to use SkeletonKey to remotely
There are also some overarching themes in the data access problems both Double Chooz and SPT have. They both need a way to stage data in and out to the systems that are doing the computations. There are some slight differences between the requirements of each group (Double Chooz's workflow would be able to work with the non-POSIX semantics of the UC3 HDFS filesystem, SPT's workflow can't), but the primary issues are the same. We're planning on using SkeletonKey to solve this as well. By using Chirp and Parrot under the hood, both projects will be able to use data that's residing on their own systems in jobs running on other clusters removing the need to stage data in and out for jobs.
After discussions with users from both Double Chooz and SPT, it's quickly become apparent that the biggest issues preventing their use of UC3 and other resources are those of software and data access. In regards to software access, both groups have moderately large software stacks (~2-5GB) that need to be available in order for them to run computation. Although 5GB of software is something that can be easily transferred using scp or something similar for one-off computations, this quickly becomes unmanageable when scaling up to hundreds or thousands of jobs using this software. Even with 10Gbit network connectivity, transferring a terabyte of software is time consuming and terribly inefficient use of bandwidth! CVMFS comes to the rescue here by allowing software to only transfer the portions of the software that they access and by utilizing a squid proxy to minimize the data transfer and push the bandwidth utilization to local networks rather than to backbones. We're currently in the process of installing their software on a CVMFS repository and updating their workflows to use SkeletonKey to remotely
There are also some overarching themes in the data access problems both Double Chooz and SPT have. They both need a way to stage data in and out to the systems that are doing the computations. There are some slight differences between the requirements of each group (Double Chooz's workflow would be able to work with the non-POSIX semantics of the UC3 HDFS filesystem, SPT's workflow can't), but the primary issues are the same. We're planning on using SkeletonKey to solve this as well. By using Chirp and Parrot under the hood, both projects will be able to use data that's residing on their own systems in jobs running on other clusters removing the need to stage data in and out for jobs.
Subscribe to:
Posts (Atom)