在集群中運行和監控作業的工具 (Tools for running and monitoring jobs in cluster)


問題描述

在集群中運行和監控作業的工具 (Tools for running and monitoring jobs in cluster)

We got these two clusters with eight nodes each, and we are looking for a good cluster framework that would allow us to launch jobs, has an inbuilt scheduler with different scheduling policies and a monitoring system with web frontend. Each of the nodes are running on Ubuntu 11.04. Both commercial and opensource are OK.

Some of them i saw were,  TORQUE and MAUI.(Not sure if it has a web frontend for monitoring) SLURM and MAUI. GEXEC and GANGLIA.(Doesn't have a scheduler)

Which product(s) would you recommend? Also is there any advantage using cluster operating systems like MOSIX instead of tools?


參考解法

方法 1:

The paid version of Maui is called Moab (it normally uses TORQUE as the RM). It also can be sold with monitoring tools as well. I think Moab is a really good product, but I am strongly biased towards it (I work for the company that develops TORQUE/Moab).

(by excraydbeer)

參考文件

  1. Tools for running and monitoring jobs in cluster (CC BY-SA 3.0/4.0)

#monitoring #cluster-computing #pbs






相關問題

在集群中運行和監控作業的工具 (Tools for running and monitoring jobs in cluster)

對於 ASP.NET 應用程序,我應該注意哪些關鍵性能監視器 (What key performance monitors should I watch for ASP.NET application)

無法讓節點時間正常運行 (Can't get nodetime run properly)

Một cách thanh lịch để theo dõi thời gian phản hồi trung bình trong IIS 7.5 là gì? (What is an elegant way to monitor average response times in IIS 7.5?)

Dropbox如何監控? (How does dropbox monitoring?)

為遠程 influxdb 節點配置 kapacitor 時無法獲得警報 (Not able to get alerts when kapacitor configured for remote influxdb node)

用於分佈式監控和跟踪網絡延遲/丟包的良好設置 (A good setup for distributed monitoring and tracking latency/drops in a network)

使用 Oracle 10 監視哪些語句更新(以及何時)某個表行 (Monitoring which statement updates (and when) a certain table row with Oracle 10)

IIS監控工具 (IIS monitoring tool)

基於 Java 的監控應用程序 (Java-based monitoring application)

Snowflake Snowpipe - 電子郵件警報機制 (Snowflake Snowpipe - Email Alert Mechanism)

如何配置 IIB 10 以將 monitoring_event 消息作為持久性 MQ 隊列發布? (How to configure IIB 10 to publish monitoring_event messages as persisitent to persistent MQ queue?)







留言討論