-
Notifications
You must be signed in to change notification settings - Fork 28
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'main' into alvarovm-patch-1
- Loading branch information
Showing
15 changed files
with
280 additions
and
209 deletions.
There are no files selected for viewing
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -5,6 +5,48 @@ | |
Cerebras documentation for porting code to run on a Cerebras CS-2 system:<br> | ||
[Ways to port your model](https://docs.cerebras.net/en/latest/wsc/port/index.html) | ||
|
||
## Grafana WsJob Dashboard for Cerebras jobs | ||
A Grafana dashboard provides support for visualizing, querying, and exploring the CS2 system’s metrics and enables to access system logs and traces. | ||
See the Cerebras documentation for the [Job Information Dashboard](https://docs.cerebras.net/en/latest/wsc/getting-started/grafana.html#wsjob-dashboard) | ||
|
||
Here is a summary (tested to work on Ubuntu and MacOS)<br> | ||
|
||
On your work machine with a web browser, e.g. your laptop,<br> | ||
edit /etc/hosts, using your editor of choice | ||
```console | ||
sudo nano /etc/hosts | ||
``` | ||
Add this line | ||
```console | ||
127.0.0.1 grafana.cerebras1.lab.alcf.anl.gov | ||
``` | ||
Save, and exit the editor | ||
|
||
Download the Grafana certificate present on the Cerebras node at /opt/cerebras/certs/grafana_tls.crt to your local machine. To add this certificate to your browser keychain, | ||
|
||
1. On chrome, go to Settings->Privacy and security->Security->Manage device certificates | ||
2. Select System under "System Keychains" on the left hand side of your screen. Also select the "Certificate" tab. | ||
3. Drag and drop the downloaded certificate. Once it is added, it is visible as "lab.alcf.anl.gov" | ||
![Cerebras Wafer-Scale Cluster connection diagram](files/grafana_ctl.png) | ||
4. Select the certificate, and ensure that the "Trust" section is set to "Always Trust" | ||
![Cerebras Wafer-Scale Cluster connection diagram](files/Trust_ctl.png) | ||
|
||
|
||
On your work machine with a web browser, e.g. your laptop,<br> | ||
tunnel the grafana https port on the cerebras grafana host through to localhost | ||
``` | ||
ssh -L 8443:grafana.cerebras1.lab.alcf.anl.gov:443 [email protected] | ||
``` | ||
|
||
Point a browser at grafana. (Tested with Firefox and Chrome/Brave)<br> | ||
Open browser to a job grafana url shown in csctl get jobs, adding :8443 to hostname, e.g.<br> | ||
```console | ||
https://grafana.cerebras1.lab.alcf.anl.gov:8443/d/WebHNShVz/wsjob-dashboard?orgId=1&var-wsjob=wsjob-49b7uuojdelvtrcxu3cwbw&from=1684859330000&to=noww | ||
``` | ||
|
||
Login to the dashboard with user admin, and password prom-operator | ||
|
||
|
||
<!--- | ||
## Determining the CS-2 version | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.