Anuket Project

Memory RAS

Metrics List & Descriptions:

Technology/CategoryMetric/Feature NameDate TypeFormat ExampleCollectd ReleaseInternal Collectd VersionCollectd PluginDescriptionDependenciesLimitationsComments
Memory RASMemory corrected errorsInt 515225.8NonemcelogNumber of Corrected memory errors since the system boot

gets metrics from mcelog daemon.
Memory RASMemory corrected errors in 24 HoursInt515225.8NonemcelogNumber of Corrected memory errors since previous 24 hours

gets metrics from mcelog daemon.
Memory RASMemory Uncorrected errorsInt515225.8NonemcelogNumber of Corrected memory errors since the system boot

gets metrics from mcelog daemon.
Memory RASMemory Uncorrected errors in 24 HoursInt515225.8NonemcelogNumber of Corrected memory errors since previous 24 hours

gets metrics from mcelog daemon.
Memory RASSocketInt05.8NonemcelogSocker number error occurred on

gets metrics from mcelog daemon.
Memory RASChannelChar05.8NonemcelogMemory channel each channel represents a DIMM module

gets metrics from mcelog daemon.
Memory RASMemory DIMMCharB15.8NonemcelogMemory DIMM corresponding the memory used by the cores errors occurred on

gets metrics from mcelog daemon.
Memory RASMemory SlotChar15.8NonemcelogMemory slot corresponding the memory used by the cores errors occurred on

gets metrics from mcelog daemon.
Memory RASCPU IDInt0FutureFutureEDACCPU ID of the cores errors occurred on. Will be added to new EDAC plugin


Memory RASMemory PageHex0x12345FutureFutureEDACMemory page corresponding the memory used by the cores errors occurred on. Will be added to new EDAC plugin

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASMemory OffsetHex0x0FutureFutureEDACMemory offset in the page. Will be added to new EDAC plugin

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASMemory RowHex0x12345





Not part of Collectd. Currently available with kernel EDAC logs
Memory RASMemory GrainInt8FutureFutureEDACThe byte granularity or the error grain. Will be added to new EDAC plugin

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASError SyndromeHex0x6ce3FutureFutureEDACMemory syndrome corresponding the memory used by the cores errors occurred on. Will be added to new EDAC plugin

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASError TypeText
FutureFutureEDACError type. Will be added to new EDAC plugin

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASError codeInteger0101:0090FutureFutureEDACError code put out by EDAC. Will be added to new EDAC plugin

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASLoggingLog path
?
EDACConfigurable logging path

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASdimmX or rankX directory infoVarying
FutureFutureEDACExpose interface files provided by sysfs through mcX/dimmX or rankX directories

Not part of Collectd. Currently available with kernel EDAC logs
Memory RAScsrowX directory infoVarying
FutureFutureEDACExpose interface files provided by sysfs through mcX/csrowX directories

Not part of Collectd. Currently available with kernel EDAC logs
Memory RASRAS interruptsCount on each core[CoreID]:[InterruptCont]FutureFutureEDACExpose the RAS related interrupts on cores of interest via Collectd

Discussion open to see if this info can be exposed through the plugin.

Sub-sections:

RAS/mcelog Plugin High Level Design 

Memory RAS Plugin Executed Tests 

RAS Other Executed Tests