Installing, Running and Maintaining Large Linux Clusters
Found via 
SlashDork is this 
piece on  buiding up Linux clusters to more than 1000 nodes... experience confronting some of the LHC scale computing challenges: scalability, automation, hardware diversity, security, and rolling OS upgrades.  Looks like a must read (must try to understand!).  1K nodes would be a good start in 
SPTRTW