Spark UI
SPYT operations launch the Spark UI web interface. It usually cannot be opened directly, for example when cluster nodes are in a closed internal network (as in Kubernetes) or when job addresses change as jobs move to other machines.
Task proxy provides access to the interface. It creates a stable link that keeps working when jobs move between nodes, and it checks access rights when the link is opened.
Note
Before Task proxy can be used, the cluster administrator must deploy this component.
Configuring Spark cluster launch
For Spark UI to work through Task proxy, enable the spark.ui.reverseProxy configuration flag when launching a standalone cluster. To do this, add the following launch parameter:
spark-launch-yt ... --params '{"spark_conf"={"spark.ui.reverseProxy"="true"}}'
If the cluster is already running without this flag, restart it with the required parameter so that the UI works.
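When other Spark settings are passed alongside the flag, the --params value can be assembled programmatically. The following is a minimal sketch, not part of SPYT: the helper names to_yson and reverse_proxy_params are hypothetical, and the formatter assumes that keys and values are plain printable ASCII strings, so that JSON string quoting coincides with the quoting used in the command above.

```python
import json
import shlex


def to_yson(value):
    """Render nested dicts of strings in the text form {"k"="v";...}."""
    if isinstance(value, dict):
        items = (f"{to_yson(str(k))}={to_yson(v)}" for k, v in value.items())
        return "{" + ";".join(items) + "}"
    # Assumption: plain printable ASCII, so JSON quoting is sufficient here.
    return json.dumps(str(value))


def reverse_proxy_params(extra_spark_conf=None):
    """Build a --params value with spark.ui.reverseProxy switched on."""
    spark_conf = dict(extra_spark_conf or {})
    spark_conf["spark.ui.reverseProxy"] = "true"
    return to_yson({"spark_conf": spark_conf})


params = reverse_proxy_params()
# shlex.quote wraps the value in single quotes for safe use in a shell.
print("--params", shlex.quote(params))
```

The printed argument can be appended to the spark-launch-yt command line together with the other launch options, which are not shown here.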
How to access the UI
To open Spark UI, follow these steps:
- Find out the operation ID.
  It can be found on the Operations page in the YTsaurus interface or in the output (stdout) of the spark-launch-yt / spark-submit console commands.
- Find the service address.
  Task proxy publishes information about all active web services in the Cypress system table //sys/task_proxies/services. Find the rows in this table that correspond to your operation_id.
  Example table contents:

  | domain | operation_id (example) | task_name | service | protocol |
  | --- | --- | --- | --- | --- |
  | 2ef4261c.my-cluster.ytsaurus.example.net | a6e04b98... | master | ui | http |
  | 51a6d485.my-cluster.ytsaurus.example.net | a6e04b98... | history | ui | http |
  | 37a5f11c.my-cluster.ytsaurus.example.net | 6699a5a9... | driver | ui | http |

  - Row 1 (master): UI of the master node of a standalone cluster.
  - Row 2 (history): UI of the history server.
  - Row 3 (driver): UI of the driver when launched via direct submit.
- Open the link.
  Copy the value from the domain column (for example, 2ef4261c.my-cluster.ytsaurus.example.net) and paste it into the browser address bar.
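The lookup in the steps above can also be scripted. The sketch below is illustrative only: it works on rows that have already been read from //sys/task_proxies/services (reading the table is not shown), the function name service_urls is hypothetical, and prefixing the domain with the value of the protocol column is an assumption rather than documented behavior. The operation IDs are the truncated placeholders from the example table.

```python
def service_urls(rows, operation_id, task_name=None):
    """Return 'protocol://domain' for every row of the given operation."""
    urls = []
    for row in rows:
        if row["operation_id"] != operation_id:
            continue
        if task_name is not None and row["task_name"] != task_name:
            continue
        urls.append(f"{row['protocol']}://{row['domain']}")
    return urls


# Rows copied from the example table; the IDs are truncated there as well.
rows = [
    {"domain": "2ef4261c.my-cluster.ytsaurus.example.net",
     "operation_id": "a6e04b98...", "task_name": "master",
     "service": "ui", "protocol": "http"},
    {"domain": "51a6d485.my-cluster.ytsaurus.example.net",
     "operation_id": "a6e04b98...", "task_name": "history",
     "service": "ui", "protocol": "http"},
    {"domain": "37a5f11c.my-cluster.ytsaurus.example.net",
     "operation_id": "6699a5a9...", "task_name": "driver",
     "service": "ui", "protocol": "http"},
]

for url in service_urls(rows, "a6e04b98...", task_name="master"):
    print(url)
```

Filtering by task_name is optional. Without it, the function returns the addresses of all services published for the operation, such as both the master and history UIs of a standalone cluster.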