Cluster Hardware information

Current control

Most of the outlets in 1145 are numbered, corresponding to the circuit breaker. We have images of some of the circuit breaker box ( here and here).

7A and 7B are on a 20 A breaker at 120 V.

8A and 8B are on a 30 A breaker at 120 V. CAUTION: The outlet, and perhaps wires (!) appears to be rated for 20 A, so DO NOT overload this, because it may not trip the breaker!

16 is on a 20A breaker at 120 V.

1/3 is 30 A at 208V.

We spoke to Bobby, who said that in general the outlets above lab benches are almost definitely separate from those next to wall outlets, and the circuits in the two rooms are separate. In 1147, the outlet near the door to the hallway and the outlet near the door to 1145 (wall outlets) are probably on the same circuit.

Here is a helpful website regarding how much power a computer needs: http://www.extreme.outervision.com/PSUEngine

Its numbers are in “expected power”; measured wattage is in the next column over.

Case

expected power

max measured power

Our single-core 3770(K)s

x12

130 W

1560 W

106 W (0.88 mA)

1272 W

Our single-core 3930s

180 W

246 W (2.05 A @ 120 ! ) (with overclocking: 4.35 A!)

  • Our dual-core 2670v2s x2

  • Our dual-core 2690v2s x2

    17*2670+2690

  • 273 W

  • 312 W

4953 W

  • 312 W (2.6 A @ 120)

  • 417 W (3.47 A @ 120)

  • 5721 W

  • Air conditioners (specs)

  • Managed switch (specs)

  • Router (label: 0.6 A)

  • Unmanaged 24-prt (label)

  • 1250 W

  • 156 W

  • 72 W (0.6 A)

  • 72 W (0.6 A)

20 A @ 120 V
  • 75% Capacity

  • 80% Capacity

2400 W
  • 1800 W

  • 1920 W

30 A @ 208 V
  • 75% Capacity

  • 80% Capacity

6240 W
  • 4680 W

  • 4992 W

Current as of 7 Jul 2014:

Type

Num

Power (W)

7A

7B

8A

8B

16

1/3

2/4

6/8

1147: lab1

1147: lab2

3770(K)

44

106

9

12+3

8

12

uws,queens

3930

2

246

2(768)

2670v2x2

17

312

15

2

2690v2x2

1

417

1

AC unit

3

1250

1

1

1

Ind. fan

1

288

1

Switch, m

1

156

1

Switch, um

3

72

1

2

Router

1

72

1

Other

GC

monitors+

Total

1538

1250

1794

1746

144+

4680

1961

1272

1250

75% Capac.

1800

1800

1800

1800

1800

4680

3120

3120

1350 ?

1350 ?

100% Capac.

2400

2400

2400

2400

2400

6240

3328

3328

1350 ?

1350 ?

Max current

20

20

20!

20!

20

30

20(16)

20(16)

15?

15?

Voltage

120

120

120

120

120

208

208

208

120

120

Boomerang with MPI

Recommendations:

On dual:

ncores

npar=1

npar=2

npar=4

npar=5

10

20

10

62.40

62.11

62.08

20

44.59

38.77

40.11

42.06

64.28

It looks like you can pack it better if you do -n 10 NPAR=5 or 10, but you may want fast with -n 20 NPAR=4.

Average time for multiple concurrent runs

ncores

npar

ninst=1

ninst=2

ninst=3

ninst=4

10

5

62.14

107

120

152

20

4

38.51

71.5

190

So if you want to get 12 jobs done, your best options are

ncores=10,npar=5,ninst=4,nruns=3

3*152=456

ncores=10,npar=5,ninst=3,nruns=4

480

ncores=10,npar=5,ninst=2,nruns=6

642

ncores=10,npar=5,ninst=1,nruns=12

746

ncores=20,npar=4,ninst=3,nruns=4

760

ncores=20,npar=4,ninst=2,nruns=6

429

ncores=20,npar=4,ninst=1,nruns=12

462.12

Therefore, I recommend using echo “NPAR=4” >> INCAR mpirun -n 20 ./vaspexe.XXXX

and boomerang should account for the hyperthreading ability.

PDU (powerstrip)

  1. Log in on the web at 192.168.0.110. username and password are localadmin/localadmin. It needs Java to run. You may need to add a security exception to Java to allow it to run.

  2. Or log in via serial console, once you hook up the serial port on a computer to the port labeled “config” on the PDU. (You need a converter.)

    Then run:

    sudo apt-get install minicom
    minicom -s
    

Settings are 9600 BPS, 8N1, no flow control.

Make sure you get the right tty; usually /dev/ttyS1::

grep tty /var/log/dmesg

If the power strip is running, press the recessed reset button (right next to the config port).

Default root password is TrippLite.

Managed Switch (pennstation)

Username and password for the switch (pennstation) is available in the documentation repository, under linux/pennstation_password.

It runs FTOS. This is an OS that is not straightforward to learn. Documentation is available (with lots of bookmarks!) is available:

To log in via serial console (hardware is set up for bruckner):

  1. Attach the appropriate converter (to ethernet) to the SERIAL port on a machine

  2. Connect an ethernet cable between that converter and the jack labeled “console” on the switch

  3. Log in with minicom:

    sudo minicom -s
    
  4. Check your settings in Serial port setup: Rate 9600, 8N1, no flow control

    Make sure you get the right tty; usually /dev/ttyS1; check here:

    grep tty /var/log/dmesg
    
  5. Select Exit to exit the configuration and enter the console

The command-line interface (CLI) can be confusing. If you don’t know what to do, try tab completion, or, when there is more than one option, the question mark gives you the options:

>?
>terminal ?
>hostname ?

To get through different options, you may need to access the following modes:

  1. Root user (EXEC mode; prompt is now #); see above for password info:

    >enable
    #
    
  2. Configuration (prompt is now (conf)#):

    #conf
    (conf)#
    

In conf mode, prefix “do” to perform an action as if it is in exec mode

  1. Interface: #interface ; type #show ip interface brief “ for a summary of interfaces, and some examples:

    #show ip interface brief
    (conf)#interface vlan 2
    (conf)#int r gi 0/1 - 48
    

Some brief how-to tips:

  • Update FTOS: It came updated

  • Upload MAC / IP table (DHCP reservation): It is NOT p. 318 “Create Manual Binding Entries”. Instead, we need to make a DHCP pool. Here are the commands:

    # conf
    # ip dhcp server
    # show conf
    # pool grandcentral
    # host 192.168.0.100
    # hardware-address a4:ba:db:03:17:d9
    # default-router 192.168.0.1
    # dns-server 128.59.1.3
    # show conf
    # exit
    

    Then on the server, do (as root, where ethN is the appropriate result from ifconfig):

    # dhclient -r
    # dhclient
    # ifdown ethN
    # ifup ethN
    # ifconfig
    
  • Edit MAC / IP table: If you did manual binding (don’t), remove the binding with DHCP clear commands (p. 320). If you did the pool (like above), then you need to remove the pool:

    # conf
    # ip dhcp server
    # no pool erroneous-name
    
  • Set up port forwarding: Not possible (this is not Port Monitoring, which is physical ports / ethernet jacks). This is a switch, not a router.

  • Track usage of each IP at a given time (to see who’s throttling us): View Basic Interface Information (p. 416) to sort by physical port;

  • See which machines are actually connected at a given time:

    # show ip dhcp binding
    # ping 192.168.0.100
    # show ip dhcp server statistics
    
  • remote login with telnet / ssh (see above for username/password):

    $ ssh admin@192.168.0.2
    
  • Set up hostname, username, and password if necessary (see above for username/password). Commands to set up:

    # username admin password PWDGOESHERE privilege 15
    # enable password PWDGOESHERE  # I think this is to log in
    
  • logs

  • show / save configuration file : VERY IMPORTANT, every time you change something

    # show config
    # copy run start
    
  • VLAN: Technically VLAN2 is enabled, but we don’t need another vlan; see below

Network choices

VLAN or flat network:

  • 10.10 and 192.168 are segregated by VLAN for routing. Michael says it’s a big undertaking to go from flat to segregated network bc need re-IP.

Port forwarding vs VPN:

It looks like port forwarding is easier to set up, and I don’t know if we can do ssh through the VPN so easily.

  • We can set up a linux server for the router (run clearos or the like) and set up IPtables, which won’t have the limit of 32 addresses

DHCP reservation vs. static IP

DHCP reservation means the router assigns IP addresses but knows to reserve particular ones for particular MAC addresses. Static IP means the node requests the IP address.

http://www.reddit.com/r/sysadmin/comments/156pbh/static_ips_or_dhcp_reservations/

http://anselome.hubpages.com/hub/The-difference-between-static-and-dynamic-IP-addresses

Options:

  • Set up a static IP address on each node, and do away with the reservation table (i.e. the node requests its IP address; the power strip does this now).

    Advantages: (i) It’s easy to configure (ii) reconnection can be a bit faster (iii) If we get a new router or switch, we don’t have to put in a new DHCP table Disadvantages: (i) You have to make sure you’re not repeating IPs (a DHCP exclusion table + documentation of each IP taken) (ii) If we get a new router with new network parameters, the settings have to be changed manually on each node (iii) You have to set it up individually on each node

  • Keep the DHCP table on the router

  • Set up the DHCP table on the switch ourselves: See above