Skip to main content

Monitoring Metrics

This chapter describes the observable monitoring metrics of Curvine, used for monitoring cluster status, performance, and resource usage.

Master Metrics​

Metric NameDescription
inode_dir_numNumber of directories
inode_file_numNumber of files
num_blocksTotal number of blocks
blocks_size_avgAverage block size
capacityTotal storage capacity
availableAvailable storage space
fs_usedFile system used space
used_memory_bytesUsed memory in bytes
rocksdb_used_memory_bytesRocksDB memory usage
worker_numNumber of workers (classified by status)
rpc_request_total_countTotal RPC request count
rpc_request_total_timeTotal RPC request time
replication_staging_numberNumber of blocks waiting for replication
replication_inflight_numberNumber of blocks currently being replicated
replication_failure_countTotal cumulative replication failures
operation_durationOperation duration (classified by type, excluding heartbeats)

Journal Node Metrics​

Metric NameDescription
journal_queue_lenJournal queue length
journal_flush_countJournal flush count
journal_flush_timeJournal flush time

Client Metrics​

Metric NameDescription
client_mount_cache_hitsMount cache hits
client_mount_cache_missesMount cache misses
client_metadata_operation_durationMetadata operation duration
client_write_bytesBytes written
client_write_time_usWrite time in microseconds
client_read_bytesBytes read
client_read_time_usRead time in microseconds

Worker Metrics​

Metric NameDescription
write_bytesBytes written
write_time_usWrite time in microseconds
write_countWrite count
write_blocksBlocks written (classified by type)
read_bytesBytes read
read_time_usRead time in microseconds
read_countRead count
read_blocksBlocks read (classified by type)
capacityTotal storage capacity
availableAvailable storage space
fs_usedFile system used space
failed_disksNumber of failed storage devices
total_disksTotal number of storage disks
num_blocksTotal number of blocks
num_blocks_to_deleteNumber of blocks pending deletion
used_memory_bytesUsed memory in bytes