We anticipate the arrival of 8 single-node KNL systems in May.Â Those systems will be made available to all ESP project team members, with no requirement that your home institution has signed the MP-NDA agreement with Intel. Email will go out to everyone when the systems are available, with instructions for access and usage.
You may find this website useful for finding and discussing information about the Intel KNL chip:
NERSC is offering an Advanced OpenMP training day this Thursday. Itâs still possible to sign up to attend virtually (Zoom), if youâre interested. If you don’t have a NERSC login id, indicate on the registration form that you heard about the training via the ALCF Early Science Program:
The Theta Early Science Program Confluence space is now available. This is meant to function like a Wiki, but using the Confluence technology. Anyone from Theta ESP projects may put content in the space, and for the most part its structure should evolve organically. ALCF will put content here as well, such as slides and videos from ESP meetings, instructions for accessing early hardware when it becomes available, etc. Do not include any NDA information (Intel or Cray) in content you put in this space; we will have another venue for that if/when needed.
The video recordings of the Kick-Off and Training Session 1 videoconferences are available on the Confluence site: Expand âMeetings/Workshopsâ under the navigation panel at the left and click to bring up the relevant meeting page.
To get access to the site, follow these instructions:
- Sign up for a Confluence account:
- Go to https://collab.cels.anl.gov and in the top-right corner of the webpage, click “Sign upâ.
- On the Sign Up screen, enter your contact information, including yourÂ preferred email address and click the “Sign upâ button.
- Obtain access to the Theta ESP Wiki space:
- Send an email to firstname.lastname@example.org with your Confluence username requesting access to the Theta ESP wiki.
- You will be notified via email within two business days when you have been added to the Theta ESP wiki along with the link to the login.
- Log into the Theta ESP Wiki space:
- Browse to https://collab.cels.anl.gov/display/ESP/Theta+ESP+Wiki .
- Log in to Confluence with your credentials from step 1.
There will beÂ a training-session videoconference on Wednesday 9 September, from 10:00 AM – 5:30 PM CDT.
The purpose of the meeting is to introduce Theta hardware, particularly the Knights Landing CPU, and the software tools to program it. Presentations will be given by Intel and Cray speakers. See agenda on the meeting website.
The intended audience is ALCF Theta ESP project members who will be working on code development and testing for Theta. You will receive an invitation to participate, either directly or through your project’s PI/co-PI. Session content will include Intel NDA material, so attendeesâ institutions must have an appropriate nondisclosure agreement signed with Intel.
To register for the videoconference, please visit the Videoconference Registration Site.
There will be a 90-minute Kick-Off videoconference for the Theta ESP projects. Presentations will cover structure of the program, timeline, expectations of the projects, and events.
The videoconference is by invitation only. All ESP project PIs and co-PIs were notified.
NOTE ON BLOG ENTRIES: This is the first entry associated with the Theta ESP. All older entries were for the ESP for what is now our production system, Mira.
As has been explained in a recent email to Mira users, the minimum partition you can use on the machine is 512 nodes. If you request fewer nodes, you still pay from your allocation for all 512, and the unused nodes are idle. On Cetus, the minimum partition size is 128 nodes.
As some of you exhaust your ESP allocations on Mira, you will notice your jobs going into the “backfill” queue. These are queued with low priority relative to positive-allocation-balance jobs, but will run if resources are available and no normal jobs are available to fit the space. The maximim size job allowed in backfill mode is 8192 nodes.
This coming Monday and Tuesday (4-5 Feb. 2013), Vesta will be down for extended maintenance, to install the latest BG/Q system driver from IBM (V1R2M0). Eventually, this driver version will be installed on Mira. Please help ALCF and yourselves by building and testing your Early Science codes on Vesta after the upgrade, if you can. Let us know if something breaks.
There will be an official notice going out soon, but be aware that Cetus is down and will be down for a number of days. This is related to the Vesta downtimeâthe BG/Q rack that’s currently designated as Cetus is being combined with Vesta to make Vesta a 2-rack system. We have a new rack that will be designated as Cetus. My best estimate is 5 days of downtime for Cetus (yesterday’s notice to vesta-notify and mira-notify lists estimated 5 days downtime for Vesta).
The Early Science period is officially underway. Mira came back online after acceptance testing on the evening of Monday 17 December. After an initial glitch in setting up the computer time allocations for the ESP projects, the correct allocations are now in place. These are what you were awarded as target allocation when your project was selected for the ESP. On Mira, the command
Â Â Â Â cbank-list-allocations -u yourUserName -r mira
will show you the amout and usage of your allocation.
Our one-rack test and development machine, Cetus (cetus.alcf.anl.gov) is now also available to ESP users.
The Early Science period should last through mid-March. When there is concrete information about the exact transition date, I’ll send out an email with the date and information about how the transition to production usage will impact the Early Science projects. You should have used up your ESP project allocations by then.