Skip to content

Commit c8b048e

Browse files
authored
Merge pull request #301 from NYU-RTS/mdweisner-torch-spec-sheet-1
Update 10_spec_sheet.md
2 parents 53978f9 + 70a690e commit c8b048e

1 file changed

Lines changed: 11 additions & 11 deletions

File tree

docs/hpc/10_spec_sheet.md

Lines changed: 11 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -3,17 +3,17 @@
33

44
The Torch cluster has 518 [Intel "Xeon Platinum 8592+ 64C"](https://www.intel.com/content/www/us/en/products/sku/237261/intel-xeon-platinum-8592-processor-320m-cache-1-90-ghz/specifications.html) CPUs, 29 NVIDIA [H200](https://nvdam.widen.net/s/nb5zzzsjdf/hpc-datasheet-sc23-h200-datasheet-3002446) GPUs & 68 NVIDIA [L40S](https://resources.nvidia.com/en-us-l40s/l40s-datasheet-28413) GPUs connected together via Infiniband NDR400 interconnect. Further details on each kind of node is provided in the table below.
55

6-
| Type | Nodes | CPU Cores | GPUs | Memory (GB) |
7-
|---|---|---|---|---|
8-
| Standard Memory | 186 | 23,808 | N/A | 95,232 |
9-
| Large memory | 7 | 896 | N/A | 21,504 |
10-
| H200 GPU | 29 | 3,712 | 232 | 59,392 |
11-
| L40S GPU | 68 | 8,704 | 272 | 34,816 |
12-
| Login | 4 | 512 | N/A | 1024 |
13-
| Data Transfer | 2 | 64 | N/A | 512 |
14-
| Provisioning | 4 | 320 | N/A | 1024 |
15-
| Scheduler | 2 | 64 | N/A | 1024 |
16-
| Total | N/A | 38,080 | 504 | 209.5(TB) |
6+
| Type | Nodes | CPU Cores | GPUs | Memory (GB) | CPUs per Node | GPUs per Node | Memory per Node (GB) |
7+
|---|---|---|---|---|---|---|---|
8+
| Standard Memory | 186 | 23,808 | N/A | 95,232 | 128 | 0 | 512 |
9+
| Large memory | 7 | 896 | N/A | 21,504 | 128 | N/A | 3,072 |
10+
| H200 GPU | 29 | 3,712 | 232 | 59,392 | 128 | 8 | 2,048 |
11+
| L40S GPU | 68 | 8,704 | 272 | 34,816 | 128 | 4 | 512 |
12+
| Login | 4 | 512 | N/A | 1024 | 128 | N/A | 256 |
13+
| Data Transfer | 2 | 64 | N/A | 512 | 32 | N/A | 256 |
14+
| Provisioning | 4 | 320 | N/A | 1024 | 80 | N/A | 256 |
15+
| Scheduler | 2 | 64 | N/A | 1024 | 32 | N/A | 512 |
16+
| Total | N/A | 38,080 | 504 | 209.5(TB) | NA | NA | NA |
1717

1818

1919
Torch was tested in June 2025 using the [LINPACK benchmark system](https://top500.org/project/linpack/), which is the basis for all HPC systems ranked on the Top500 list. It had a theoretical maximum performance of 12.25 PF/s thanks to its powerful GPU resources, of which LINPACK was able to use 10.79 PF/s, thus placing it at [#133 on the listed](https://top500.org/system/180363/).

0 commit comments

Comments
 (0)