|
@@ -73,7 +73,6 @@ For example, you can get the master metrics by `curl http://localhost:5679/actua
|
|
- ds.task.dispatch.failure.count: (counter) the number of tasks failed to dispatch, retry failure included
|
|
- ds.task.dispatch.failure.count: (counter) the number of tasks failed to dispatch, retry failure included
|
|
- ds.task.dispatch.error.count: (counter) the number of task dispatch errors
|
|
- ds.task.dispatch.error.count: (counter) the number of task dispatch errors
|
|
- ds.task.execution.count.by.type: (counter) the number of task executions grouped by tag `task_type`
|
|
- ds.task.execution.count.by.type: (counter) the number of task executions grouped by tag `task_type`
|
|
-- ds.task.running: (gauge) the number of running tasks
|
|
|
|
- ds.task.prepared: (gauge) the number of tasks prepared for task queue
|
|
- ds.task.prepared: (gauge) the number of tasks prepared for task queue
|
|
- ds.task.execution.count: (counter) the number of executed tasks
|
|
- ds.task.execution.count: (counter) the number of executed tasks
|
|
- ds.task.execution.duration: (histogram) duration of task executions
|
|
- ds.task.execution.duration: (histogram) duration of task executions
|
|
@@ -104,6 +103,12 @@ For example, you can get the master metrics by `curl http://localhost:5679/actua
|
|
### Worker Server Metrics
|
|
### Worker Server Metrics
|
|
|
|
|
|
- ds.worker.overload.count: (counter) the number of times the worker overloaded
|
|
- ds.worker.overload.count: (counter) the number of times the worker overloaded
|
|
|
|
+- ds.worker.task: (gauge) the number of tasks on the worker, including pending and running ones
|
|
|
|
+- ds.worker.execute.queue.size: (gauge) the number of pending tasks on the worker
|
|
|
|
+- ds.worker.active.execute.thread: (gauge) the number of running tasks on the worker
|
|
|
|
+- ds.worker.memory.available: (gauge) the available physical memory of the worker (GB)
|
|
|
|
+- ds.worker.cpu.usage: (gauge) the cpu usage percentage of the worker
|
|
|
|
+- ds.worker.memory.usage: (gauge) the memory usage percentage of the worker
|
|
- ds.worker.full.submit.queue.count: (counter) the number of times the worker's submit queue being full
|
|
- ds.worker.full.submit.queue.count: (counter) the number of times the worker's submit queue being full
|
|
- ds.worker.resource.download.count: (counter) the number of downloaded resource files on workers, sliced by tag `status`
|
|
- ds.worker.resource.download.count: (counter) the number of downloaded resource files on workers, sliced by tag `status`
|
|
- ds.worker.resource.download.duration: (histogram) the time cost of resource download on workers
|
|
- ds.worker.resource.download.duration: (histogram) the time cost of resource download on workers
|